| 1 | {{{ |
| 2 | #!html |
| 3 | <?xml version="1.0" encoding="UTF-8"?> |
| 4 | <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" |
| 5 | "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> |
| 6 | <html xml:lang="en"> |
| 7 | <head> |
| 8 | <title>ECHO data entry specifications</title> |
| 9 | </head> |
| 10 | <body> |
| 11 | <h2>Characters to be typed directly</h2> |
| 12 | |
| 13 | <p>ASCII characters should be used with their normal values, |
| 14 | except as indicated below. Note that <i>tilde</i> (<b>by |
| 15 | itself</b>) should be entered directly as <tt>~</tt>.</p> |
| 16 | |
| 17 | <p>The following characters with diacritics are to be typed |
| 18 | directly:</p> |
| 19 | |
| 20 | <h3>Characters with acute accent</h3> |
| 21 | |
| 22 | <blockquote> |
| 23 | áéíóú |
| 24 | ÁÉÍÓÚ |
| 25 | </blockquote> |
| 26 | |
| 27 | <h3>Characters with grave accent</h3> |
| 28 | |
| 29 | <blockquote> |
| 30 | àèìòù |
| 31 | ÀÈÌÒÙ |
| 32 | </blockquote> |
| 33 | |
| 34 | <h3>Characters with circumflex accent</h3> |
| 35 | |
| 36 | <blockquote> |
| 37 | âêîôû |
| 38 | ÂÊÎÔÛ |
| 39 | </blockquote> |
| 40 | |
| 41 | <h3>Characters with umlaut/diaeresis</h3> |
| 42 | |
| 43 | <blockquote> |
| 44 | äëïöüÿ |
| 45 | ÄËÏÖÜŸ |
| 46 | </blockquote> |
| 47 | |
| 48 | <h3>Characters with tilde</h3> |
| 49 | |
| 50 | <blockquote> |
| 51 | ãõñ |
| 52 | ÃÕÑ |
| 53 | </blockquote> |
| 54 | |
| 55 | <h3>Characters with cedilla</h3> |
| 56 | |
| 57 | <blockquote> |
| 58 | ç Ç |
| 59 | </blockquote> |
| 60 | |
| 61 | <h3>Common ligatures</h3> |
| 62 | |
| 63 | <blockquote> |
| 64 | æ œ Æ Œ |
| 65 | </blockquote> |
| 66 | |
| 67 | <h2>Special conventions</h2> |
| 68 | |
| 69 | <p>Owing to the high frequency of “long s” <ʃ>, this |
| 70 | character should be typed as <tt>$</tt>. (We do not expect to |
| 71 | digitize works that contain both the currency symbol and the s |
| 72 | variant.)</p> |
| 73 | |
| 74 | <h2>XML Entity notation</h2> |
| 75 | |
| 76 | <p>Characters that cannot be conveniently be typed may be |
| 77 | indicated by means of XML entities. The entities specified |
| 78 | in <a |
| 79 | href="http://www.w3.org/2003/entities/iso8879doc/overview.html">ISO |
| 80 | 8879</a> (see |
| 81 | especially <b><a |
| 82 | href="http://www.w3.org/2003/entities/iso8879doc/isolat1.html">isolat1</a></b> |
| 83 | and <b><a |
| 84 | href="http://www.w3.org/2003/entities/iso8879doc/isolat2.html">isolat2</a></b>) |
| 85 | should be used. More generally, some characters may be entered |
| 86 | using conventions specified below:</p> |
| 87 | |
| 88 | <p>For characters with accents enumerated above, if the text input |
| 89 | method does not support the composition of the accent with a |
| 90 | certain character, entities may be used thus. For instance, the |
| 91 | operating system typically makes no provision for allowing the |
| 92 | entry of a modified <q> — yet such characters are |
| 93 | frequent in Latin materials. They may be typed as (e.g.):</p> |
| 94 | |
| 95 | <blockquote> |
| 96 | <tt>&qacute; &qgrave; &quml; &qtilde;</tt> |
| 97 | </blockquote> |
| 98 | |
| 99 | <p>The ampersand (<tt>&</tt>) must be entered as the |
| 100 | entity <tt>&amp;</tt>, to avoid confusion with its use in |
| 101 | entitites.</p> |
| 102 | </body> |
| 103 | </html> |
| 104 | }}} |