wiki:Meditiunculae

Workflow documentation

The current version of the meditatiunculae text was generated from the med.xml text with the following steps:

  1. Preparation
    1. The colour scans of the book were used and they had to be synchronized to the pagebreaks (see also section below)
    2. Paragraphs were tagged
    3. The header was changed
    4. Unicode symbols were inserted where there were entities for astronomical symbols and Greek letters
    5. General cleanup concerning whitespace and empty lines
    6. There are no linebreaks in this text.
    7. This results in an altered version of the original med.xml. For a TEI version, this version was changed slightly.
  2. XSLT: An elaborate XSL script was used to transform the XML tags of the source. Great care had to be taken to transform the insertions and deletions in the text. Mostly, different emphasis styles were used:
    • Supralineam -> super
    • PostCorrectionem -> bf
    • Delevit -> st
    • Diverso atramento -> it
    • SignoPositoInMargine -> <mgl>
    • bis -> sic
    • CorrexitEx -> reg Also, MathML was used for Variables
  3. Used workflow scripts
    • move floats
    • div structure
    • insert semantic units
  4. Manual work
    • Remove empty semantic units
    • correct emphasis tags which were made inconsistent by insertion of semantic units
    • some floats were also incorrect
    • Insertion of language attributes
    • repositioned some figures
    • Some more cleanup
  5. Workflow scripts
    • Number divs
    • Insert ids

Correction of images

There are two versions of the scans: one is b/w showing a double-page spread of the book (MS3XWYFW), the other are colour scans of the single pages (QFWN4Q67). However, the latter are not in the right order. For the current edition, the pages were rearranged with the help of MS3XWYFW (which itself lacks the double-page spreads 59/60, 221/222 and 227/228):

  • 0146 -> 0146_0.jpg
  • 0146_2r is missing in QFWN4Q67 and was substituted by b/w version from MS3XWYFW
  • 0145 -> 0146_2v.jpg
  • 0012 -> 0146_3bisr.jpg
  • 0013 -> 0146_3bisv.jpg
  • 0014 -> 0146_3r.jpg
  • 0015 -> 0146_3v.jpg
  • 0017 -> 0146_4bisr.jpg
  • 0018 -> 0146_4bisv.jpg
  • 0019 -> 0146_4r.jpg
  • 0020 -> 0146_4v.jpg
  • 0022 -> 0146_5r.jpg
  • 0023 -> 0146_5v1.jpg
  • 0024 -> 0146_5v2.jpg
  • 0026 -> 0146_6r.jpg
  • 0027 -> 0146_6v.jpg
  • 0029 -> 0146_7r.jpg
  • 0030 -> 0146_7v.jpg

The resorted images (reviewed 2013-05-15 and 2013-05-16) were now put in the QFWN4Q67/pageimg directory, the original colour scans were put into QFWN4Q67/pageimg-orig.

Last modified 11 years ago Last modified on Jul 24, 2013, 9:29:54 AM

Attachments (1)

Download all attachments as: .zip