DATA ENTRY SPECIFICATIONS FOR ZHEJIANGLU IN JOURNAL EDITION
GENERAL
1. ENCODING. The text shall be encoded as Unicode
2. PAGE BREAKS. Indicate the end of each page by typing
3. COLUMN BREAKS. Use one input line for each column. A return should
be typed after each column
4. TITLES. Titles are printed in large, bold characters and always
begin "zhe2 jiang4 lu4". These titles should be tagged with
at
the beginning and at the end
5. HEADINGS. This text contains three levels of heading. All headings
occur in a column by themselves. The first level heading always begins
an ordinal number ("di4" + number). These headings should be tagged
with a at the beginning and
at the end. The second level
heading is always the name of a dynasty. These headings should be
tagged with a at the beginning and
at the end. The third
level heading is always the name of a craftsman. These headings should
be tagged with a at the beginning and
at the end
6. TABLE OF CONTENTS. Items in the table of contents are separated by
vertical space. Make sure that one or more IDEOGRAPHIC SPACES (Unicode
character U+3000) are typed between the items. Names of dynasties in
the table of contents are in boldface. Dynasty names should be tagged
with a at the beginning and at the end
7. LINES AND DOTS. Certain sequences of characters in the text are
indicated with a line to the left or dot to the right of the
characters. There are three types of lines: single straight line,
double straight line, and wavy line. Sequences of characters indicated
in this way should be tagged with a code at the beginning and end:
and around a sequence with the single straight line,
and
around a sequence with the double straight line, and
and around a sequence with the wavy line. Sequences of
characters with a dot to the right should be tagged with at the
beginning and at the end
8. SMALL CHARACTERS. Some parts of the text are written in small
characters. Sequences of small characters should be tagged with a
at the beginning and at the end of the sequence
9. ILLEGIBLE CHARACTERS. If a character is illegible because of bad
printing, indicate it as -- use one for each illegible character
10. UNKNOWN CHARACTERS. If a character is not recognized or can not be
encoded in Unicode, indicate it by a numeric tag, so that the first
unrecognized character is indicated as <01>, the second as <02> and so
on; keep a list of these characters and reuse the same numeric code if
the character reoccurs later in the text. Please provide a list
showing the numeric codes used and a reproduction of the character for
which they stand
11. PAGE NUMBERS AND RUNNING HEADS. Running heads and page numbers
occur on the outer edge of pages. Running heads on the left-hand page
contain the book title, chapter, and dynasty. Running heads on the
right-hand page contain the journal title, volume number, and issue
number. Both types of running head should be tagged with at the
beginning and at the end. Page numbers, which occur in the same
column as the running head, should be typed with at the beginning
and at the end
WORK SAMPLE AND ESTIMATE
1. Please transcribe a sample of the first 8 pages according to the
specifications given above
2. FORMAX already possesses the scans for this text
3. Please provide a cost estimate for transcription of the entire text