The first fifty pages of [http://libcoll.mpiwg-berlin.mpg.de/libview?mode=imagepath&url=/mpiwg/online/permanent/library/D9V0Q862/pageimg Diversae "Conimbricenses In Universam dialecticam" (1606)], [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?mode=imagepath&url=/mpiwg/online/permanent/library/163127KK/pageimg Benedetti, Giovanni Battista de "Diversarvm specvlationvm mathematicarum, et physicarum liber" (1585)] and [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?mode=imagepath&url=/mpiwg/online/permanent/library/2QTVUHDT/pageimg Euclid "Elementorum Libri XV" (1607)] were digitized and sent back for evaluation. In general, the results are very good. Unfortunately, the work sample does not contain a page of the [http://libcoll.mpiwg-berlin.mpg.de/libview?mode=imagepath&url=/mpiwg/online/permanent/library/D9V0Q862/pageimg Conimbricenses] where the [attachment:wiki:DataEntrySpecs:DESpecs_special_Conimbricenses.pdf Special Instructions] apply. PDF versions of the work samples are attached. In these PDF versions, the font is Helvetica 12pt (10pt for Benedetti), blank lines have been inserted before tags, and < > { } _ are in bold face. Offsets ECHO - page numbers in the book: Diversae 2, Benedetti 12 = What does work = * Letters with swashes are recognized, except for this [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=15&ws=1&ww=1&wh=1&mk=0.1666/0.5916&mode=imagepath&url=/mpiwg/online/permanent/library/2QTVUHDT/pageimg Quod] which was transcribed as Luod. Character recognition is surprisingly high, e. g. [http://libcoll.mpiwg-berlin.mpg.de/libcoll_zogilib?fn=/permanent/library/D9V0Q862/pageimg&pn=3&ws=1&wx=0.7282&wy=0.5497&ww=0.2504&wh=0.1974&mk=0.9353/0.6498 Conimbricenses, p. 3] * [attachment:"Unknown Characters List.pdf" List of unknown characters] is used (two characters so far), unreadable text is marked up accurately. * Multiline headings are recognized, possibly because of punctuation * Both methods of marking up italics in headings is used: {{{ TRACTATVS QVI IN HOC volumine continentur. }}} ([http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=6&ws=1&wx=0.2393&wy=0.2861&ww=0.5816&wh=0.25&mk=0.3179/0.3342&mode=imagepath&url=/mpiwg/online/permanent/library/163127KK/pageimg Benedetti, p. 6]) {{{ _Theoremata Arithmetica._ }}} ([http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=13&ws=1&wx=0.0605&wy=0.265&ww=0.8677&wh=0.1551&mk=0.3145/0.3316&mode=imagepath&url=/mpiwg/online/permanent/library/163127KK/pageimg Benedetti, p. 13]) * Library stamps are either [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=6&ws=1&wx=0.1335&wy=0.3318&ww=0.7027&wh=0.3876&mk=0.2684/0.5102&mode=imagepath&url=/mpiwg/online/permanent/library/2QTVUHDT/pageimg typed]: {{{ MAX-PLANCK-INBTITUT $UR WISSENSCNAFT@@@@CHICHTE Bibliothek }}} or coded as [http://libcoll.mpiwg-berlin.mpg.de/libcoll_zogilib?fn=/permanent/library/D9V0Q862/pageimg&pn=2&ws=1&wx=0.0348&wy=0.2491&ww=0.906&wh=0.274&mk=0.6712/0.4059 ]: {{{ E SOCIETATE IESV, _IN VNIVERSAMDIA_ _Iecticam Ari$totelis Stagiritæ_ }}} * Parentheses work well, only one example with spaces within parentheses ([http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=9&ws=1&wx=0.0784&wy=0.2861&ww=0.7511&wh=0.1039&mk=0.185/0.3401&mode=imagepath&url=%2Fmpiwg%2Fonline%2Fpermanent%2Flibrary%2F163127KK%2Fpageimg Benedetti, p. 9]). Original has spaces. * The [attachment:"Unknown Characters List.pdf" List of unknown characters] works good and is obviously frequently updated. Unknown character <010>, however, is represented by a wrong image. The unknown character in question is [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=26&ws=1&wx=0.8&wy=0.3859&ww=0.1642&wh=0.1205&mk=0.8942/0.4563&mode=imagepath&url=/mpiwg/online/permanent/library/YS05QMU8/pageimg this one]. Unknown character <006> and <011> do not occur in the work samples, characters [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=27&ws=1&wx=0.5074&wy=0.7187&ww=0.1477&wh=0.0473&mk=0.5726/0.7516&mode=imagepath&url=/mpiwg/online/permanent/library/YS05QMU8/pageimg <012>] and [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=32&ws=1&wx=0.6519&wy=0.723&ww=0.2599&wh=0.0789&mk=0.7771/0.7772&mode=imagepath&url=/mpiwg/online/permanent/library/YS05QMU8/pageimg <014>] occur in the text, but are not on the list (yet?). Small problem with the list: there is only one list for all documents. Was this intended? = What does not work = * The tag is always closed by the tag. * Some ornamental figures are not tagged, e. g. [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=14&ws=1&ww=1&wh=1&mk=0.253/0.1341&mode=imagepath&url=/mpiwg/online/permanent/library/2QTVUHDT/pageimg this one]. * Various mistypings: * $in rather than [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=9&ws=1&wx=0.0901&wy=0.715&ww=0.766&wh=0.101&mk=0.3658/0.7575&mode=imagepath&url=/mpiwg/online/permanent/library/163127KK/pageimg $m] * f rather than [http://libcoll.mpiwg-berlin.mpg.de/libcoll_zogilib?fn=/permanent/library/D9V0Q862/pageimg&pn=36&ws=1&wx=0.2615&wy=0.0778&ww=0.2407&wh=0.0604&mk=0.3421/0.1034 $] (∫) (frequently) * ÿ rather than [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=7&ws=1&wx=0.1937&wy=0.4171&ww=0.4176&wh=0.0896&mk=0.4053/0.4789&mode=imagepath&url=/mpiwg/online/permanent/library/163127KK/pageimg {ij}] (italics only, frequently) * b rather than [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=8&ws=1&wx=0.2321&wy=0.3117&ww=0.6272&wh=0.1671&mk=0.5318/0.373&mode=imagepath&url=/mpiwg/online/permanent/library/163127KK/pageimg h] (italics only, frequently) * œ rather than [http://libcoll.mpiwg-berlin.mpg.de/libcoll_zogilib?fn=/permanent/library/D9V0Q862/pageimg&pn=32&ws=1&wx=0.0339&wy=0.2864&ww=0.497&wh=0.1523&mk=0.1289/0.3948 æ] (italics only) * Number 10 becomes {{{IO}}} in [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=13&ws=1&wx=0.1042&wy=0.7213&ww=0.7022&wh=0.0918&mk=0.471/0.7847&mode=imagepath&url=%2Fmpiwg%2Fonline%2Fpermanent%2Flibrary%2F2QTVUHDT%2Fpageimg Euclid, p. 13]. A date on the same page is recognised correctly. * Greek Ligatures * Letter variation of τ was recognized, but τ (in the same word!) was typed as [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=9&ws=1&wx=0.0645&wy=0.2575&ww=0.7245&wh=0.1084&mk=0.4172/0.2927&mode=imagepath&url=/mpiwg/online/permanent/library/2QTVUHDT/pageimg T] (as in {{{άγεωμέΤρητ@}}} (Euclid, p. 9)) and correctly in the next word * This [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=7&mk=0.5125/0.8909&mode=imagepath&url=/mpiwg/online/permanent/library/YS05QMU8/pageimg Vale.] has been taken for a catchword. * Tartaglia (1565), [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?url=/mpiwg/online/permanent/library/YS05QMU8/pageimg&pn=16&mode=imagepath p. 16]: first two figures are coded as one figure, the two at the bottom are separate. Caption works. * What happens with spaces in the text like [http://echo.mpiwg-berlin.mpg.de/ECHOdocuView/ECHOzogiLib?pn=24&ws=1&wx=0.1613&wy=0.8292&ww=0.7855&wh=0.0984&mk=0.5068/0.8679&mode=imagepath&url=/mpiwg/online/permanent/library/YS05QMU8/pageimg this one]? Are they meaningful? = Adjustments to be made = * In the [attachment:wiki:DataEntrySpecs:DESpecs_1_1_2_overview.pdf DESpecs 1.1.2] it is not said that the tag may contain the ''it'' argument. Thus, the _ _ markup is used consistently. The Specs should allow this.