17 | 17 | 1. Save the URLs of the pages with the zoomed area into a text file (keyboard shortcuts come in handy here: {{{cmd-l-c-w}}} copies the link in the address bar and closes the tab, {{{cmd-TAB}}} switches to a text editor, {{{cmd-v}}} inserts the link. To note in the file which images have to be removed from the XML, copy the URL to the text file, but insert a {{{#}}} before that. You can also write other comments into this file, but be sure to begin the line with a {{{#}}}. The resulting list is to be saved in a new directory on the same level as the {{{raw}}} and the {{{xml}}} directory (see [source:/trunk/texts/WO_1/Stevin_1605] as an example). When trained, the average speed for cutting out figures is 2.5 figures per minute (completed Stevin_1605 in 2 hours) |
18 | 18 | 1. On the basis of this text file, the Python script [source:/trunk/schema/scripts/cut_figures/cut_figures.py cut_figures.py] takes care of cutting out the images from the original TIFF files and saves them in the desired format {{{page-imagenumber}}} (e. g., if pageimage {{{0056.tif}}} has three figures, these figures will be saved as {{{0056-01.tif}}}, {{{0056-02.tif}}} and {{{0056-03.tif}}}), by calling Imagemagick's commands {{{identify}}} and {{{convert}}}. They are stored in a folder called {{{figures}}} |