DATA ENTRY SPECIFICATIONS FOR YUANSHIBISHU GENERAL 1. ENCODING. The text shall be encoded as Unicode 2. PAGE BREAKS. Indicate the end of each Chinese page (there are four pages per scan) by typing
3. COLUMN BREAKS. Use one input line for each column. A return should be typed after each column 4. MAIN HEADINGS. Headings in this text are indicated by large characters. When there is a sequence of large characters, type at the start of the sequence and at the end 5. ENTRY HEADINGS. If a character is above the level of the normal text, all characters up to the first space are considered an entry heading; type

at the beginning of the sequence and

at the end 6. TABLE OF CONTENTS. For the tables of contents (which comes after the preface and at the beginning of each juan), make sure that one or more IDEOGRAPHIC SPACES (Unicode character U+3000) are typed between the entries 7. ILLEGIBLE CHARACTERS. If a character is illegible because of bad printing, indicate it as -- use one for each illegible character 8. UNKNOWN CHARACTERS. If a character is not recognized or can not be encoded in Unicode, indicate it by a numeric tag, so that the first unrecognized character is indicated as <01>, the second as <02> and so on; keep a list of these characters and reuse the same numeric code if the character reoccurs later in the text. Please provide a list showing the numeric codes used and a reproduction of the character for which they stand 9 (NEW). VOLUME AND PAGE NUMBERS. These occur in the center between pages, preceded by an abbreviation for the title of the book. Before the start of each page, please enter the complete information in the center preceded by and followed by . For the right-hand page, indicate "a" after the page number, and for the the left-hand page, indicate "b" after the page number. WORK SAMPLE AND ESTIMATE 1. Please transcribe a sample of 5 scans according to the specifications given above 2. The sample scans will be provided as individual numbered image files 3. Please provide a cost estimate for transcription of the entire book (251 scans)