Version 4 (modified by 14 years ago) (diff) | ,
---|
Instructions for cutting out images
- The figures should be cut out of the TIFF-images, rather than the compressed JPG-images in
online_permanent/library
on foxridge. The Digigroup should know where the relevant images are. - Do not cut out drop caps or embellishments, e. g. decorative images on the title page
- If there is already an
xml
-version of the text, it is handy to extract a list of all figures that are to be cut out. You can do this for example by using XQuery in the display system://echo:figure
resp.//echo:image
Depending on how much context you like (with caption or not). - Now, in the viewing environment, you can browse through the book, visiting the respective pages and mark the images using digilib's "zoom area" tool. Be careful not to cut out surrounding text, including the catchword at the bottom of the page. However, captions are to be cut out, as well. Some small images are only there for decorative purposes. The policy is not to cut out these ones. Lateron, these have to be deleted out of the xml file.
- Save the URLs of the pages with the zoomed area into a text file. Also, note in the file which images have to be removed from the XML.
Some steps missing (include creating a script to extract the figures from the raw TIFF files based on the list with the URLs). Also the list should be stored somewhere for future reference (e. g. in case the figures are not produced by cutting the files, but by extracting the figures on the fly). The first list was generated for Apian 1550.
- Save the cropped images again as TIFF in the format
page-imagenumber
, e. g., if pageimage0056.tif
has three figures, these figures will be saved as0056-01.tif
,0056-02.tif
and0056-03.tif
, respectively. - Put the figures into a folder named
figures
and upload it to the foxridge server alongside thepageimg
-folder of the respective book: folderonline_permanent/library/XXXXXXXX
will then contain both apageimg
and afigures
folder.
Attachments (5)
-
echo-figures.xml (3.1 MB) - added by 14 years ago.
An xml document containing the xquery of "echo:figure" of all 78 ECHO documents
-
echo-figures.html (4.1 MB) - added by 14 years ago.
HTMLized Xquery results for all figures in ECHO documents
-
arch_cut_images.py (5.5 KB) - added by 14 years ago.
Same tool for Archimedes files, one day, it will be one tool for all
-
cut_images.py (5.5 KB) - added by 14 years ago.
A bit more comfortable
-
Alvarus_1509_YHKVZ7B4.fig (2.4 KB) - added by 14 years ago.
Figure coordinates for Alvarus