wiki:Meditiunculae

Version 1 (modified by Klaus Thoden, 12 years ago) (diff)

--

Workflow documentation

The current version of the meditatiunculae text was generated from the med.xml text with the following steps:

  1. Preparation
    1. The colour scans of the book were used and they had to be synchronized to the pagebreaks (see also section below)
    2. Paragraphs were tagged
    3. The header was changed
    4. Unicode symbols were inserted where there were entities for astronomical symbols and Greek letters
    5. General cleanup concerning whitespace and empty lines
    6. There are no linebreaks in this text.
  2. XSLT: An elaborate XSL script was used to transform the XML tags of the source. Great care had to be taken to transform the insertions and deletions in the text. Mostly, different emphasis styles were used:
    • Supralineam -> super
    • PostCorrectionem -> bf
    • Delevit -> st
    • Diverso atramento -> it
    • SignoPositoInMargine -> <mgl>
    • bis -> sic
    • CorrexitEx -> reg Also, MathML was used for Variables
  3. Used workflow scripts
    • move floats
    • div structure
    • insert semantic units
  4. Manual work
    • Remove empty semantic units
    • correct emphasis tags which were made inconsistent by insertion of semantic units
    • some floats were also incorrect
    • Insertion of language attributes
    • repositioned some figures
    • Some more cleanup
  5. Workflow scripts
    • Number divs
    • Insert ids

Correction of images

There are two versions of the scans: one is b/w showing a double-page spread of the book (MS3XWYFW), the other are colour scans of the single pages (QFWN4Q67). However, the latter are not in the right order. For the current edition, the pages were rearranged with the help of MS3XWYFW (which itself lacks the double-page spreads 59/60, 221/222 and 227/228):

  • 0146 -> 0146_0.jpg
  • 0146_2r is missing in QFWN4Q67 and was substituted by b/w version from MS3XWYFW
  • 0145 -> 0146_2v.jpg
  • 0012 -> 0146_3bisr.jpg
  • 0013 -> 0146_3bisv.jpg
  • 0014 -> 0146_3r.jpg
  • 0015 -> 0146_3v.jpg
  • 0017 -> 0146_4bisr.jpg
  • 0018 -> 0146_4bisv.jpg
  • 0019 -> 0146_4r.jpg
  • 0020 -> 0146_4v.jpg
  • 0022 -> 0146_5r.jpg
  • 0023 -> 0146_5v1.jpg
  • 0024 -> 0146_5v2.jpg
  • 0026 -> 0146_6r.jpg
  • 0027 -> 0146_6v.jpg
  • 0029 -> 0146_7r.jpg
  • 0030 -> 0146_7v.jpg

The resorted images were put into /online/experimental/klaus/mediatiunculae_resorted and this is also where the XML file expects to find the images.

Attachments (1)

Download all attachments as: .zip