DATA ENTRY SPECIFICATIONS FOR YUANSHIBISHU
GENERAL
1. ENCODING. The text shall be encoded as Unicode
2. PAGE BREAKS. Indicate the end of each Chinese page (there are four
pages per scan) by typing
3. COLUMN BREAKS. Use one input line for each column. A return should
be typed after each column
4. MAIN HEADINGS. Headings in this text are indicated by large
characters. When there is a sequence of large characters, type at
the start of the sequence and at the end
5. ENTRY HEADINGS. If a character is above the level of the normal
text, all characters up to the first space are considered an entry
heading; type
at the beginning of the sequence and
at the
end
6. TABLE OF CONTENTS. For the tables of contents (which comes after the
preface and at the beginning of each juan), make sure that one or more
IDEOGRAPHIC SPACES (Unicode character U+3000) are typed between the entries
7. ILLEGIBLE CHARACTERS. If a character is illegible because of bad
printing, indicate it as -- use one for each illegible character
8. UNKNOWN CHARACTERS. If a character is not recognized or can not be
encoded in Unicode, indicate it by a numeric tag, so that the first
unrecognized character is indicated as <01>, the second as <02> and so
on; keep a list of these characters and reuse the same numeric code if
the character reoccurs later in the text. Please provide a list
showing the numeric codes used and a reproduction of the character for
which they stand
9 (NEW). VOLUME AND PAGE NUMBERS. These occur in the center between pages, preceded by an abbreviation for the title of the book. Before the start of each page, please enter the complete information in the center preceded by and followed by . For the right-hand page, indicate "a" after the page number, and for the the left-hand page, indicate "b" after the page number.
WORK SAMPLE AND ESTIMATE
1. Please transcribe a sample of 5 scans according to the
specifications given above
2. The sample scans will be provided as individual numbered image
files
3. Please provide a cost estimate for transcription of the entire book
(251 scans)