LegacySpecs: Zhejianglu1929-SPECS.txt

File Zhejianglu1929-SPECS.txt, 3.0 KB (added by hyman, 16 years ago)
DATA ENTRY SPECIFICATIONS FOR ZHEJIANGLU OF 1929

GENERAL

1. ENCODING. The text shall be encoded as Unicode.

2. PAGE BREAKS. Type <br> to indicate the end of each page.

3. COLUMN BREAKS. Use one input line for each column. A return should
be typed after each column.

4. MAIN HEADINGS. Headings in this text always appear in a separate
column and are indented vertically. Headings begin with the name of a
craft and then a chapter number. When there is a heading, type <h> at
the start of the heading and </h> at the end.

5. SUBHEADINGS. Subheadings in this text always appear in a separate
column and are indented vertically. Subheadings begin with "yi3
shang4". Where there is a subheading, type <h2> at the start of the
subheading and </h2> at the end.

6. LISTS OF NAMES. Lists of names of craftsmen appear in this
text. These names are separated by vertical space. Make sure that one
or more IDEOGRAPHIC SPACE characters (Unicode character U+3000) are
typed between the names.

7. SMALL CHARACTERS. Some parts of the text are written in small
characters. Sequences of small characters should be tagged with
<small> at the beginning and </small> at the end of the sequence.

8. ILLEGIBLE CHARACTERS. If a character is illegible because of bad
printing, indicate it as <x>; use one <x> for each illegible character.

9. UNKNOWN CHARACTERS. If a character is not recognized or cannot be
encoded in Unicode, indicate it by a numeric tag, so that the first
unrecognized character is indicated as <01>, the second as <02> and so
on; keep a list of these characters and reuse the same numeric code if
the character recurs later in the text. Please provide a list
showing the numeric codes used and a reproduction of the character for
which each stands.

10. CORRECTIONS IN TEXT. At certain places in the text, a scribe has
corrected a series of characters. When a sequence of characters has
been corrected (with an alternate text written at the right), the
corrected sequence should be tagged with <s> at the beginning and </s>
at the end. The characters written at the right should be tagged with
<s2> at the beginning and </s2> at the end.

11. PAGE NUMBERS. On some scans, page numbers will be legible at the
unbound edge of the manuscript page. Where these are legible, they
should be typed and tagged with <pn> at the beginning and </pn> at the
end.

12. SQUARES IN NAMES. In names, a square (which resembles the
character kou3) occurs to indicate that the author does not know the
proper character. This square should be entered as Unicode character
U+25FB (WHITE MEDIUM SQUARE).

13. CHARACTERS IN BRACKETS. Sequences of characters in brackets should
be typed with Unicode character U+3014 (LEFT TORTOISE SHELL BRACKET)
at the beginning and U+3015 (RIGHT TORTOISE SHELL BRACKET) at the end.
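
The tagging rules above lend themselves to a mechanical check before a
transcription is delivered. The following Python sketch is illustrative
only and is not part of the specification. It assumes that tags are
never nested out of order (for example, <s> is closed before <s2>
opens) and that angle brackets occur only in tags. The function names
check_text, unknown_codes and codes_in_sequence are hypothetical.

```python
import re

# Paired tags named in items 4, 5, 7, 10 and 11 of the specification.
PAIRED_TAGS = ("h", "h2", "small", "s", "s2", "pn")

# Every tag the specification allows: the paired tags, the <br> and <x>
# markers, and numeric unknown-character tags such as <01> (item 9).
TAG_RE = re.compile(r"<(/?)(h2|h|small|s2|s|pn|br|x|\d+)>")


def check_text(text):
    """Return a list of tagging problems found in the transcribed text.

    Assumption (not stated in the specification): paired tags are
    closed in the reverse order in which they were opened.
    """
    problems = []
    stack = []  # paired tags that are currently open
    for match in TAG_RE.finditer(text):
        closing = match.group(1) == "/"
        name = match.group(2)
        if name not in PAIRED_TAGS:
            # <br>, <x> and numeric tags stand alone and have no close.
            if closing:
                problems.append(f"</{name}> is not a paired tag")
            continue
        if not closing:
            stack.append(name)
        elif stack and stack[-1] == name:
            stack.pop()
        else:
            problems.append(f"unexpected </{name}>")
    problems.extend(f"<{name}> is never closed" for name in stack)
    # Whatever still looks like a tag once the known tags are removed
    # is not defined by the specification.
    leftover = TAG_RE.sub("", text)
    problems.extend(f"unknown tag {t}" for t in re.findall(r"<[^<>]*>", leftover))
    return problems


def unknown_codes(text):
    """List the numeric codes of item 9 in order of first appearance."""
    seen = []
    for code in re.findall(r"<(\d+)>", text):
        if code not in seen:
            seen.append(code)
    return seen


def codes_in_sequence(text):
    """Check that new codes are introduced as <01>, <02>, ... in order."""
    numbers = [int(code) for code in unknown_codes(text)]
    return numbers == list(range(1, len(numbers) + 1))
```

Running check_text over the whole transcription returns an empty list
when every paired tag is closed and no undefined tag appears. The
output of unknown_codes can serve as the starting point for the list of
numeric codes requested in item 9.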

WORK SAMPLE AND ESTIMATE

1. Please transcribe a sample of 5 scans according to the
specifications given above.

2. The sample scans will be provided as physical pages.

3. Please provide a cost estimate for transcription of the entire book
(30 scans).