Changes between Version 14 and Version 15 of Cutting out images


Ignore:
Timestamp:
Dec 16, 2010, 1:15:38 PM (13 years ago)
Author:
Klaus Thoden
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Cutting out images

    v14 v15  
    1313}}}
    1414 Depending on how much context you like (with caption or not).
    15  A list of all figures in ECHO has been extracted ([https://it-dev.mpiwg-berlin.mpg.de/tracs/mpdl-project-software/attachment/wiki/WikiStart/echo-figures.html download]) and converted to html which contains links to each page containing figures.
     15 A list of all figures in ECHO has been extracted ([https://it-dev.mpiwg-berlin.mpg.de/tracs/mpdl-project-software/attachment/wiki/WikiStart/echo-figures.html download]) and converted to html which contains links to each page containing figures. Note that the link might take you to the page after the picture (happens if link is in a float-div).
    1616 1. Now, in the viewing environment, mark the images using digilib's "zoom area" tool. Be careful not to cut out surrounding text, including the catchword at the bottom of the page. However, captions are to be cut out, as well. It is sometimes advisable to first cut out a bigger section around the figure. Some small images are only there for decorative purposes. The policy is not to cut out these ones. Lateron, these have to be deleted from the xml file.
    1717 1. Save the URLs of the pages with the zoomed area into a text file (keyboard shortcuts come in handy here: {{{cmd-l-c-w}}} copies the link in the address bar and closes the tab, {{{cmd-TAB}}} switches to a text editor, {{{cmd-v}}} inserts the link. To note in the file which images have to be removed from the XML, copy the URL to the text file, but insert a {{{#}}} before that. You can also write other comments into this file, but be sure to begin the line with a {{{#}}}. The resulting list is to be saved in a new directory on the same level as the {{{raw}}} and the {{{xml}}} directory (see [source:/trunk/texts/WO_1/Stevin_1605] as an example). When trained, the average speed for cutting out figures is 2.5 figures per minute (completed Stevin_1605 in 2 hours)
     
    2626
    2727== Discussion ==
    28  1. Should an [http://mpdl-dev.mpiwg-berlin.mpg.de/ECHOdocuView?url=/mpiwg/online/permanent/library/9NN63YC9&pn=2&viewMode=images Ex libris] be cut out?
     28 1. Should an [http://mpdl-dev.mpiwg-berlin.mpg.de/ECHOdocuView?url=/mpiwg/online/permanent/library/9NN63YC9&pn=2&viewMode=images Ex libris] be cut out?
     29  - Answer: No
    2930 1. Should this be treated as one image: [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView?pn=9&ws=1&wx=0.022&wy=0.0628&ww=0.8453&wh=0.4184&url=/mpiwg/online/permanent/library/PUBSU9QD&viewMode=images&tocMode=thumbs&tocPN=1&searchPN=1&characterNormalization=regPlusNorm Apian 1550]? Otherwise, spatial information might get lost (see text version of that page)
    3031 1. Data entry tagged every single figure [http://mpdl-dev.mpiwg-berlin.mpg.de/ECHOdocuView?url=/mpiwg/online/permanent/library/S7ECRGW8&pn=295&viewMode=images on this page]. Should this be preserved? Probably yes, as some figures have variables attached (see text version)