LegacySpecs: Zhejianglu-journal-SPECS.txt

File Zhejianglu-journal-SPECS.txt, 3.4 KB (added by hyman, 16 years ago)
Line 
1DATA ENTRY SPECIFICATIONS FOR ZHEJIANGLU IN JOURNAL EDITION
2
3GENERAL
4
51. ENCODING. The text shall be encoded as Unicode
6
72. PAGE BREAKS. Indicate the end of each page by typing <br>
8
93. COLUMN BREAKS. Use one input line for each column. A return should
10be typed after each column
11
124. TITLES. Titles are printed in large, bold characters and always
13begin "zhe2 jiang4 lu4". These titles should be tagged with <title> at
14the beginning and </title> at the end
15
165. HEADINGS. This text contains three levels of heading. All headings
17occur in a column by themselves. The first level heading always begins
18an ordinal number ("di4" + number). These headings should be tagged
19with a <h1> at the beginning and </h1> at the end. The second level
20heading is always the name of a dynasty. These headings should be
21tagged with a <h2> at the beginning and </h2> at the end. The third
22level heading is always the name of a craftsman. These headings should
23be tagged with a <h3> at the beginning and </h3> at the end
24
256. TABLE OF CONTENTS. Items in the table of contents are separated by
26vertical space. Make sure that one or more IDEOGRAPHIC SPACES (Unicode
27character U+3000) are typed between the items. Names of dynasties in
28the table of contents are in boldface. Dynasty names should be tagged
29with a <b> at the beginning and </b> at the end
30
317. LINES AND DOTS. Certain sequences of characters in the text are
32indicated with a line to the left or dot to the right of the
33characters. There are three types of lines: single straight line,
34double straight line, and wavy line. Sequences of characters indicated
35in this way should be tagged with a code at the beginning and end:
36<sl> and </sl> around a sequence with the single straight line, <dl>
37and </dl> around a sequence with the double straight line, and <wl>
38and </wl> around a sequence with the wavy line. Sequences of
39characters with a dot to the right should be tagged with <dot> at the
40beginning and </dot> at the end
41
428. SMALL CHARACTERS. Some parts of the text are written in small
43characters. Sequences of small characters should be tagged with a
44<small> at the beginning and </small> at the end of the sequence
45
469. ILLEGIBLE CHARACTERS. If a character is illegible because of bad
47printing, indicate it as <x> -- use one <x> for each illegible character
48
4910. UNKNOWN CHARACTERS. If a character is not recognized or can not be
50encoded in Unicode, indicate it by a numeric tag, so that the first
51unrecognized character is indicated as <01>, the second as <02> and so
52on; keep a list of these characters and reuse the same numeric code if
53the character reoccurs later in the text. Please provide a list
54showing the numeric codes used and a reproduction of the character for
55which they stand
56
5711. PAGE NUMBERS AND RUNNING HEADS. Running heads and page numbers
58occur on the outer edge of pages. Running heads on the left-hand page
59contain the book title, chapter, and dynasty. Running heads on the
60right-hand page contain the journal title, volume number, and issue
61number. Both types of running head should be tagged with <run> at the
62beginning and </run> at the end. Page numbers, which occur in the same
63column as the running head, should be typed with <pn> at the beginning
64and </pn> at the end
65
66WORK SAMPLE AND ESTIMATE
67
681. Please transcribe a sample of the first 8 pages according to the
69specifications given above
70
712. FORMAX already possesses the scans for this text
72
733. Please provide a cost estimate for transcription of the entire text