Overview page of pages related to the XML workflow
This page tries to gather all the relevant documentation for producing an XML text from digitized images (a "book").
The steps are:
- Obtaining digital images, adhering to standards.
- Checking, if the textual features of that "book" are covered by the Data Entry Specs. If necessary, Special Instructions need to be written for such a book.
- Sending the images and the DESpecs to the Data Entry firm.
- If questions are raised during the Data Entry, they have to be logged. Each "book" should have its own wiki page containing all relevant information.
- Checking the returned text file for errors which the Data Entry firm is responsible for.
- Alongside the text, another file might be sent back, containing the list of unknown characters. Where possible, these characters have to be resolved to their respective Unicode representation.
- XML Workflow
- Page synchronization
- Checking the structure of the text
- Insertion of metadata and creation of wellformed XML text
- Producing an XML text that is valid according to the echo-schema
- Additional work
- Numbering of divs, assigning ids, regularisations
- Obtaining figures
- Scholarly workflow
Main Workflow
Other workflows
- Exchanging page images using Eclipse
- Variation of the above: using Oxygen
- Shorter explanation of the above
- A bit longer explanation of the above
Legacy texts
- Workflows for converting legacy texts to Archimedes or Echo format.
- Missing texts and pictures in Archimedes
- Correction of end-sigma in Archimedes
Editors
- Setting schema location for on-the-fly validation
- Evaluation of XML editors
Policy documents
- Standards for digital images from external vendors
Last modified 11 years ago
Last modified on Oct 16, 2013, 3:59:58 PM