| 304 | === echo-gap |
| 305 | |
| 306 | * @@ --> <gap extent="2"/> |
| 307 | * <gap> --> <gap/> |
| 308 | * x< ? > --> x<unsure/> or <unsure>x</unsure> (this can not be fully automated) |
| 309 | |
| 310 | === echo-de |
| 311 | |
| 312 | This module contains tags from the DESpecs that will be removed in the course of processing. We use the namespace "de" for the corresponding elements in the xml: |
| 313 | * <001> --> <de:unknown code="001"/> (bzw. wir haben eine Tabelle, was gemeint ist) |
| 314 | * <!> --> <de:wrong/> --> remove or <sic> |
| 315 | |
| 316 | === echo-figure |
| 317 | |
| 318 | * <fig> --> <figure>, eventuell mit <anchor/> |
| 319 | * <cap> --> <caption> |
| 320 | * <desc> --> <description> |
| 321 | * <var> --> <variables> |
| 322 | |
| 323 | === echo-handwritten |
| 324 | |
| 325 | In its simplest form, <handwritten> is just an empty tag. Nonetheless, within <s> it is replaced by <anchor> and moved to <div type="float"> to cater for scholarly additions, i.e. it is part of echo.float.class and not of echo.inline.class |
| 326 | |
| 327 | * <hd> --> <handwritten/>, eventuell mit <anchor/> |
| 328 | |
| 329 | === echo-float |
| 330 | |
| 331 | === echo-milestone |
| 332 | |
| 333 | ==== line breaks |
| 334 | |
| 335 | [Dieser Abschnitt ist sicher veraltet!] |
| 336 | |
| 337 | <lb/> can be in plaintext (<s>, <head>, some <note>, all members of echo.inline.class) and <p> |
| 338 | |
| 339 | in <p>: since a paragraph is split into <s>, most line breaks are actually in <s>. However: |
| 340 | * <lb/></s><s> and </s><s><lb/> shouldn't occur (--> </s><lb/><s> [and space before </s>?]) |
| 341 | * <lb/></s></p> shouldn't occur at all |
| 342 | |
| 343 | in <s> (and similarly for <head> and the members of echo.inline.class): |
| 344 | * line break --> <lb/>; no space before <lb/>; no line break after <lb/>; space after <lb/> if there is a hyphen before <lb/> (no automated space if the hyphen is missing) |
| 345 | |
| 346 | examples: |
| 347 | * <s>亦<lb/>能使人無疑。</s> |
| 348 | * <note>Plutar <lb/>chus in <lb/>commẽ <lb/>tario de <lb/>dæmo-<lb/>nio So-<lb/>cratis.</note> |
| 349 | |
| 350 | We use the normal hyphen U+002D instead of the soft hyphen U+00AD because the soft hyphen is not displayed in the xhtml. --> ? |
| 351 | |
| 352 | ==== column breaks |
| 353 | |
| 354 | * <col 1>...</col><col 2>...</col> --> ...<cb/>... |
| 355 | |
| 356 | ==== page breaks |
| 357 | |
| 358 | [Dieser Abschnitt ist sicher veraltet!] |
| 359 | |
| 360 | <pb/> can occur wherever <lb/> occurs (although it will be rare in <head>), and <div> |
| 361 | * <pb vii><rh>xyz</rh> --> <pb n="10" o="vii" o-norm="7" rhead="xyz" xlink:href="URI"/> |
| 362 | * <pb 一六七a> --> <pb n="..." o="一六七a" o-norm="167a" xlink:href="URI"/> |
| 363 | |
| 364 | === echo-attribute |
| 365 | |
| 366 | === echo-datatype |
| 367 | |
| 368 | === echo-mathematics |
| 369 | |
| 370 | * number <num>: |
| 371 | * "vii" --> <num value="7">vii</num> |
| 372 | * "½" --> <num value="0.5">½</num> |
| 373 | * variable <var>: |
| 374 | * "AB" --> <var type="line">AB</var> (type ist optional) |
| 375 | |
| 376 | Eine Funktion von <num> und <var> ist es, den Inhalt vor der morphologischen Analyse zu verstecken. |
| 377 | |
| 378 | Note: The scope of echo.num and echo.var is very limited. More complex mathematics is expressed with MathML --> echo-import-mathml |
| 379 | |
| 380 | === echo-chinese-text |
| 381 | |
| 382 | * <ti> --> <head type="ti"> |
| 383 | * indentations in Chinese text: |
| 384 | * <p ii> --> <p indent="2char"> oder nur "2"? |
| 385 | * <p xx> --> <p indent="-2char"> |
| 386 | * (indent is deliberately not defined as style="valid css" because it may be semantically meaningful) |
| 387 | * Linien: |
| 388 | * <sl> --> <emph style="sl"> |
| 389 | * <dl> --> <emph style="dl"> |
| 390 | * <wl> --> <emph style="wl"> |
| 391 | * <cl> --> <emph style="cl"> |
| 392 | |
| 393 | Small text: |
| 394 | * in <p>: <sm> --> <small> |
| 395 | * everywhere else: <emph style="sm"> (<h>, rhead, <ti>, <toc>, etc.) |
| 396 | * `\\` --> <smlb/> |
| 397 | |
| 398 | (plus some manual corrections where this simple distinction doesn't fit, e.g. <sm>chen</sm>) |
| 399 | |
| 400 | |