Changes between Version 4 and Version 5 of CWOB Junqi zaji


Timestamp:
Mar 23, 2009, 5:09:13 PM (15 years ago)
Author:
Wolfgang Schmidle
Comment:

--

  • CWOB Junqi zaji

    v4 v5  
    2323== 2.  Questions From Formax ==
    2424
    25 Q:
     25Q1.  If the three books contain outdented paragraphs, could you please give us a sample of <p x>?
     26
     27A: In these three books there are indeed no outdented paragraphs. (However, there are indented paragraphs, for example Junqi zaji p.0009, line 3. In the Euclid text Jihe yuanben 幾何原本 there were outdented paragraphs.)
     28
     29
     30
     31About Junqi zaji
     32Q2. For this book, ics will only be used for the text on page 0001.jpg, i.e. <ti ics>軍器雜記</ti>.  Please confirm.
     33
     34A: Yes.
     35
     36
     37
     38Q3. In 0005.jpg and 0006.jpg there are some circled characters, such as 一, 二, 三, 甲, 乙, 丙, etc.  Could you please advise whether we should key them as (一), (二), (三), (甲), (乙), (丙), or use unknown characters instead?
     39
     40A: Please type it as (一), etc.
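
The convention in this answer can be illustrated with a small sketch. It assumes, purely for illustration, that a circled character is present in a text as a precomposed Unicode character such as ㊀ (U+3280); the function name `parenthesize_circled` is hypothetical. Characters that have no `<circle>` decomposition in the Unicode database are left unchanged.

```python
import unicodedata

def parenthesize_circled(text):
    """Replace precomposed circled characters by their base character in parentheses."""
    out = []
    for ch in text:
        # e.g. U+3280 has the decomposition "<circle> 4E00"
        decomp = unicodedata.decomposition(ch)
        if decomp.startswith("<circle> "):
            base = "".join(chr(int(cp, 16)) for cp in decomp.split()[1:])
            out.append("(" + base + ")")
        else:
            out.append(ch)
    return "".join(out)

print(parenthesize_circled("\u3280\u3281"))
```
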
     41
     42
     43
     44Q4. Could you please confirm the markup below about figures in 0008.jpg?
     45{{{
     4601 <p>(二)純鋼造者<sm>如左第\\二圖</sm>其用在攻城砲得力而形式同前惟彈 殼甚薄內膛較大取其
     4702 多裝炸藥使至敵處崩炸極猛尤易催堅令敵驚怯以制勝則前砲亦可用之
     4803 <pb>
     4904 按開花彈論之乃係碰物開炸故凡初開砲時查敵距遠近必先用斯彈以試之
     5005 或攻城壘等用</p>
     5106 <fig>
     5207 <cap>第一啚</cap>
     5308 <desc>甲</desc>
     5409 <desc>乙</desc>
     5510 <desc>炸藥膛</desc>
     5611 <fig>
     5712 <cap>第二啚</cap>
     5813 <desc>炸藥膛</desc>
     5914 <h 2>(乙)子母彈</h>
     6015 <p>此種子彈 <sm>如左第\\二圖</sm>其彈殺係 [text omitted]</p>
     61}}}
     62
     6302: missing </p>
     64
     6504: missing <p i>
     66
     6707-10: According to our Specs, this is correct. However, we would appreciate it if you could put all variables in a single <var> </var> tag, just as in the Euclid text Jihe yuanben. It would then look like this:
     68{{{
     69<fig>
     70<cap>第一啚</cap>
     71<desc>炸藥膛</desc>
     72<var>甲乙</var>
     73}}}
     74
     7511-13: okay
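
Unpaired tags like those noted for lines 02 and 04 can be caught mechanically. The following is only a minimal sketch: the set of paired tags is an assumption read off the samples on this page (where <fig> and <pb> appear without a closing tag), and the name `unbalanced_tags` is hypothetical. It reports opening tags that are never closed and closing tags that have no matching opening tag; it does not decide where a paragraph should end.

```python
import re

# Assumption: these tags come in open/close pairs; <fig> and <pb> are standalone.
PAIRED = {"p", "h", "sm", "cap", "desc", "var", "ti", "toc"}

# Matches <p>, <p i>, <h 2>, </p>, ... but not unknown-character codes like <001>.
TAG = re.compile(r"<(/?)([a-z]+)(?:\s[^>]*)?>")

def unbalanced_tags(text):
    """Return a list of (problem, tag name) tuples for unpaired tags in text."""
    problems = []
    stack = []
    for m in TAG.finditer(text):
        closing = m.group(1) == "/"
        name = m.group(2)
        if name not in PAIRED:
            continue
        if not closing:
            stack.append(name)
        elif stack and stack[-1] == name:
            stack.pop()
        else:
            problems.append(("unexpected close", name))
    # Anything still open at the end was never closed.
    problems.extend(("missing close", name) for name in stack)
    return problems

print(unbalanced_tags("<p>abc<pb>def"))
```
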
     76
     77
     78
     79Q5. About headings:
     80(1) Headings from the TOC in the main text will be keyed as <h 1>. Please confirm.
     81
     82A: In the TOC of Junqi zaji, mark the first line as <h> and do not mark the other lines at all. It should look like this:
     83{{{
     84<toc>
     85<h> ... </h>
     86...
     87...
     88...
     89</toc>
     90}}}
     91
     92For the TOC of Taixi shiwu qiyuan please see the example in the Specs, page 8.
     93
     94
     95
     96(2) If the paragraphs beginning with circled 甲, circled 乙, circled 丙, etc. have the same indentation as normal paragraphs, we will mark them as <p>, as in 0003.jpg and 0006.jpg. If they are indented further than normal paragraphs, we will mark them as <h 2>, as in 0007.jpg, 0008.jpg, and 0009.jpg. However, circled 甲, circled 乙, and circled 丙 in 0021.jpg will also be marked with <p>.  Is this right?  Please confirm the markup below.
     97
     98Markup samples for <p>
     99{{{
     1000003.jpg
     101<p>(甲)小粒黑藥</p>
     102<p>(乙)大粒黑藥</p>
     103
     1040006.jpg
     105<p>(甲)開花彈</p>
     106<p>(乙)子母彈</p>
     107
     1080021.jpg
     109<p i>(甲)拉火</p>
     110<p i>(乙)擊火</p>
     111<p i>(丙)電火</p>
     112}}}
     113
     114Markup samples for <h 2>
     115{{{
     1160007.jpg
     117<h 2>(甲)開花彈</h>
     118
     1190008.jpg
     120<h 2>(乙)子母彈</h>
     121}}}
    26122
    27123A:
     124
     1250003, 0006: okay
     126
     1270021: okay
     128
     1290007, 0008: please use <p i> instead of <h 2>
     130
     131
     132
     133Q6.  Please see the paragraphs beginning with circled 一, 二, 三, etc. in 0004.jpg, 0005.jpg, 0007.jpg, 0013.jpg, 0016.jpg, 0032.jpg, etc.  Could you please confirm whether they should be marked with <p> or <list>?
     134
     135A: Some lines beginning with circled characters could indeed be interpreted as list items. However, since most of these lines are relatively long, we would like you to use <p> for all these lines (or <p i>, of course).
     136
     137
     138
     139Q7. Please see the attached Codes.pdf. Column 2 contains the source characters, while column 3 contains the corresponding characters that we want to key.
     140
     141(1) Could you please confirm if lines 1-8 and 10-12 are correct?
     142
     143(2) For line 9, should we key this character as 隷, 隸, or as an unknown character, i.e. <001>?
     144
     145A: How to proceed with character variants:
     146
     147As always, we would like you to provide us with plain text files in Unicode UTF-8 encoding. We wish the texts to be transcribed making use of the full character repertoire of Unicode 5.1. That means, if a variant is encoded as a separate Unicode character (with a unique Unicode codepoint), we wish the variant to be encoded in the transcribed text by the corresponding Unicode character.
     148
     149If Unicode 5.1 does not provide a distinct codepoint for a variant character, please assign an unknown character code and provide us with the standard variant in the list of unknown characters.
     150
     151In an e-mail about the Euclid text Jihe yuanben 幾何原本, you said that you cannot type the Unicode character U+2F88D in the CJK Compatibility Ideographs Supplement block (U+2F800 - U+2FA1F) and used <002> instead. The font Sun-ExtB should cover this Unicode block, but some applications may have problems with Unicode characters above U+FFFF. Please tell us if your problems persist.
     152
     153Taken together, we want you to do this:
     154
     1551. Please use Sun-ExtA and Sun-ExtB if possible.
     156
     1572a. If a character variant exists as a reference glyph with unique codepoint in Unicode 5.1, type it.
     158
     1592b. If the character variant does not exist in Unicode 5.1, assign an unknown character code and provide us with the standard variant in the list of unknown characters.
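
When checking a transcription against these rules, it can help to list the codepoint facts for each character in question. The sketch below is an illustration only, with hypothetical function names. It reports whether a character lies above U+FFFF (where some applications may have problems) and whether it falls in one of the two CJK compatibility ideograph blocks (U+F900 - U+FAFF and U+2F800 - U+2FA1F). It does not check whether a character is part of Unicode 5.1 specifically.

```python
def classify(ch):
    """Return basic codepoint facts for a single character."""
    cp = ord(ch)
    return {
        "codepoint": "U+%04X" % cp,
        # Characters above U+FFFF need surrogate pairs in UTF-16.
        "beyond_bmp": cp > 0xFFFF,
        # CJK Compatibility Ideographs and its Supplement block.
        "in_compatibility_block": 0xF900 <= cp <= 0xFAFF or 0x2F800 <= cp <= 0x2FA1F,
    }

def supplementary_chars(text):
    """List the distinct characters in text that lie above U+FFFF."""
    return sorted({ch for ch in text if ord(ch) > 0xFFFF})

for ch in ["\u75f2", "\ufa19", "\U00028F7B", "\U0002F88D"]:
    print(classify(ch))
```
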
     160
     161
     162Regarding the characters in Codes.pdf:
     163
     1641. OK
     165
     1662. <001>   unknown characters list: (砲)
     167
     1683. OK (assuming that it is a slip of the pen)
     169
     1704. <002>   unknown characters list: (飾)
     171
     1725. <003>   unknown characters list: (絨)
     173
     1746. <004>   unknown characters list: (墺 U+58BA)
     175
     1767. <005>   unknown characters list: (曜)
     177
     1788. <006>   unknown characters list: (紀)
     179
     1809. 𨽻 (U+28F7B; it's a variant of 隸, but has a unique codepoint)
     181
     18210. 痲 (U+75F2), not 痳 (U+75F3)
     183
     18411. 神 (U+FA19), not 神 (U+795E)
     185
     18612. We cannot identify the character from the image. Please provide us either with a better image or with the name of the text (Junqi zaji?), the page number and the line number.
     187
    28188
    29189== 3.  Analysis of the Result ==