DATA ENTRY SPECIFICATIONS FOR ZHEJIANGLU OF 1929
GENERAL
1. ENCODING. The text shall be encoded as Unicode.
2. PAGE BREAKS. Indicate the end of each page by typing
3. COLUMN BREAKS. Use one input line for each column. Type a return
after each column.
4. MAIN HEADINGS. Headings in this text always appear in a separate
column and are indented vertically. Headings begin with the name of a
craft and then a chapter number. When there is a heading, type at
the start of the heading and at the end
5. SUBHEADINGS. Subheadings in this text always appear in a separate
column and are indented vertically. Subheadings begin with "yi3
shang4". Where there is a subheading, type
at the start of the
heading and
at the end
6. LISTS OF NAMES. Lists of names of craftsmen appear in this
text. These names are separated by vertical space. Type one or more
IDEOGRAPHIC SPACES (Unicode character U+3000) between each pair of
adjacent names.
7. SMALL CHARACTERS. Some parts of the text are written in small
characters. Sequences of small characters should be tagged with a
at the beginning and at the end of the sequence
8. ILLEGIBLE CHARACTERS. If a character is illegible because of bad
printing, indicate it as -- use one for each illegible character
9. UNKNOWN CHARACTERS. If a character is not recognized or cannot be
encoded in Unicode, indicate it by a numeric tag, so that the first
unrecognized character is indicated as <01>, the second as <02> and so
on. Keep a list of these characters and reuse the same numeric code if
the character recurs later in the text. Please provide a list
showing the numeric codes used and a reproduction of the character for
which each code stands.
10. CORRECTIONS IN TEXT. At certain places in the text, a scribe has
corrected a series of characters. When a sequence of characters has
been corrected (with an alternate text written at the right), the
corrected sequence should be tagged with at the beginning and
at the end. The characters written at the right should be tagged with
at the beginning and at the end
11. PAGE NUMBERS. On some scans, page numbers will be legible at the
unbound edge of the manuscript page. Where these are legible, they
should be typed and tagged with at the beginning and at the
end
12. SQUARES IN NAMES. In some names, a square (which resembles the
character kou3) appears in place of a character, indicating that the
author did not know the proper character. This square should be entered
as Unicode character U+25FB (WHITE MEDIUM SQUARE).
13. CHARACTERS IN BRACKETS. Sequences of characters in brackets should
be typed with Unicode character U+3014 (LEFT TORTOISE SHELL BRACKET)
at the beginning and U+3015 (RIGHT TORTOISE SHELL BRACKET) at the end.
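The bookkeeping asked for in items 6, 9 and 13 could be checked with a short script. The following Python sketch is only an illustration, not part of the specification. The class UnknownCharRegistry, the functions split_names and brackets_balanced, and the idea of identifying each unknown character by a typist-chosen label are all hypothetical. The bracket check assumes that a bracketed sequence opens and closes within the text passed in.

```python
import re

IDEOGRAPHIC_SPACE = "\u3000"  # item 6: separator between names
LEFT_BRACKET = "\u3014"       # item 13: LEFT TORTOISE SHELL BRACKET
RIGHT_BRACKET = "\u3015"      # item 13: RIGHT TORTOISE SHELL BRACKET


class UnknownCharRegistry:
    """Hands out <01>, <02>, ... and reuses a code when a character recurs (item 9).

    Each unknown character is identified by a label chosen by the typist
    (a hypothetical convention), because the character itself cannot be typed.
    """

    def __init__(self):
        self._codes = {}  # label -> sequence number

    def tag_for(self, label):
        # First sighting gets the next free number; later sightings reuse it.
        if label not in self._codes:
            self._codes[label] = len(self._codes) + 1
        return "<%02d>" % self._codes[label]

    def listing(self):
        # (tag, label) pairs in order of first appearance, for the list
        # of codes that accompanies the transcription.
        ordered = sorted(self._codes.items(), key=lambda item: item[1])
        return [("<%02d>" % number, label) for label, number in ordered]


def split_names(column):
    """Split one input line on runs of one or more IDEOGRAPHIC SPACES."""
    return [name for name in re.split(IDEOGRAPHIC_SPACE + "+", column) if name]


def brackets_balanced(text):
    """Return True if every U+3014 in text is closed by a later U+3015."""
    depth = 0
    for ch in text:
        if ch == LEFT_BRACKET:
            depth += 1
        elif ch == RIGHT_BRACKET:
            depth -= 1
            if depth < 0:
                return False
    return depth == 0
```

Calling tag_for twice with the same label returns the same tag, which is the reuse behaviour item 9 requires. The output of listing still has to be paired by hand with a reproduction of each character.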
WORK SAMPLE AND ESTIMATE
1. Please transcribe a sample of 5 scans according to the
specifications given above.
2. The sample scans will be provided as physical pages.
3. Please provide a cost estimate for transcription of the entire book
(30 scans).