= Schema support = The MPDL document storing and querying system supports the document schemas [#archimedes Archimedes], [#echo Echo] and [#tei TEI(-Lite)]. For URI's [/wiki/schema/xpointer a subset of XPointer] will be supported in the near future. The element "pb" (page break) and "s" (sentence) are supported especially: a special fast method is used to retrieve a certain page (the fragment between two "pb" elements) in a document. And all fulltext queries within documents are executed against the element "s" and result in hits which each contains the position in the document (page number and sentence number). [=#archimedes ] == Archimedes == === Schema === The Archimedes schema is developed by the [http://archimedes.mpiwg-berlin.mpg.de Archimedes project] and can be found [https://it-dev.mpiwg-berlin.mpg.de/svn/mpdl-project-content/trunk/texts/eXist/archimedesOldCVSRepository/archimedes/xsd/archimedes.xsd here]. === Example === In the following simple example the metadata part consists of 4 elements („author“, „title“, „lang“, „date“) and the text part consists of 2 pages („pb“) with 2 paragraphs („p“) which contains 3 sentences („s“). {{{ Name, Prename Title en 1789

This is the first sentence of the first paragraph. This is the second sentence of the first paragraph. This is the third sentence of the first paragraph.

This is the first sentence of the second paragraph. This is the second sentence of the second paragraph. This is the third sentence of the second paragraph.

This is the first sentence of the first paragraph with line break. This is the second sentence of the first paragraph with line break. This is the third sentence of the first paragraph with line break.

This is the first sentence of the second paragraph. This is the second sentence of the second paragraph. This is the third sentence of the second paragraph.

}}} [=#echo ] == Echo == === Schema === The MPDL Echo schema is developed by the schema group of this project and can be found [https://it-dev.mpiwg-berlin.mpg.de/tracs/mpdl-project-content/wiki/echo-schema here]. === Elements === An Echo document (element „echo“ with namespace „echo“) consists of a metadata part (element „metadata“) which contains the Dublin Core metadata of the document and a fulltext part (element „text“) which contains the content of the document. Dublin Core metadata elements (namespace dcterms): * identifier * creator * title * date * rights * license * accessRights Fulltext elements (with attributes in paranthesis): * text elements: head (style), div (type, level, style), p (style), pb (file), lb, cb, gap (extent), s (style) * figure elements: figure, image (file), caption (style), description (style), variables (style), handwritten (xlink:href) * note elements: note (position, xlink:label) * quotation elements: q (style), quote (style), blockquote (style), set-off (style) * translation elements: foreign (lang, xml:lang), reg (orig, style) * mathematical elements: var (type, style), num (style), mml:* * geographical elements: place, event, time * person elements: person * xhtml elements: xhtml:* : e.g. table, ul * other elements: expan (style), emph (class, style), ref (target), anchor (type, xlink:label, xlink:href) === Example === In the following simple example the metadata part consists of 4 Dublin Core elements („creator“, „title“, „language“, „date“) and the text part consists of 2 pages („pb“) with 2 paragraphs („p“) which contains 3 sentences („s“). {{{ Name, Prename Title en 1789

This is the first sentence of the first paragraph. This is the second sentence of the first paragraph. This is the third sentence of the first paragraph.

This is the first sentence of the second paragraph. This is the second sentence of the second paragraph. This is the third sentence of the second paragraph.

This is the first sentence of the first paragraph with line break. This is the second sentence of the first paragraph with line break. This is the third sentence of the first paragraph with line break.

This is the first sentence of the second paragraph. This is the second sentence of the second paragraph. This is the third sentence of the second paragraph.

}}} [=#tei ] == TEI == === Schema === [http://www.tei-c.org/ TEI] provides [http://www.tei-c.org/Guidelines/Customization/ customizations for different purposes]. The MPDL project supports a subset of [http://www.tei-c.org/Guidelines/Customization/Lite/ TEI Lite] which is widely used and includes basic elements for simple documents. Also some useful elements (which are part of the [http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng TEI schema with all modules]) and [http://www.w3.org/Math/ MathML] and [http://de.wikipedia.org/wiki/Scalable_Vector_Graphics SVG] (which are part of the [http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_allPlus.rng TEI schema with all modules plus external additions]) are supported. A description of TEI pointers can be found [/wiki/schema/tei-pointers here]. A description of notes/annotations can be found [/wiki/schema/tei-notes here]. === Elements === TEI Lite (with attributes in paranthesis): * metadata elements: fileDesc, titleStmt, author, name, title, publicationStmt, date, idno, availability, sourceDesc, bibl, profileDesc, langUsage, language * text elements: div, gap (extent), head, lb, lg, l, p, pb, s, seg * link elements: ref (target), ptr (target), anchor (xml:id) * figure elements: figure * note elements: note (place) * quotation elements: cit, q, quote * translation elements: foreign (xml:lang), reg * terminology elements: mentioned, term * name elements: name (type="place"), (type="person"), name (type="organization") * time elements: date * mathematical elements: num * other elements: hi (rend), expan, emph, choice, orig, abbr, sic, corr TEI * text elements: pb (facs) * dictionary elements: entry, form, orth, sense, etym, def * figure elements: figure (facs) * name elements: placeName (type), persName * other elements: ex, am Additional * MathML elements: all elements of MathML 2.0 * SVG elements: all elements of SVG 1.1 === Example 1 === In the following simple example the metadata part ("teiHeader") consists of 5 elements („author“, „title“, „language“, „date“, "idno") and the text part consists of 2 pages („pb“) with 2 paragraphs („p“) which contains 3 sentences („s“). {{{ Title Name, Prename 1789 /experimental/yourDirectory

This text is available under Creative Commons license CC-BY

English

This is the first sentence of the first paragraph. And here is a link. This is the second sentence of the first paragraph. This is the third sentence of the first paragraph.

This is the first sentence of the second paragraph. This is the second sentence of the second paragraph. This is the third sentence of the second paragraph.

This is the first sentence of the first paragraph with line break. This is the second sentence of the first paragraph with line break. This is the third sentence of the first paragraph with line break.

This is the first sentence of the second paragraph. This is the second sentence of the second paragraph. This is the third sentence of the second paragraph.

}}} [=#teiMathML ] === Example 2: TEI document with MathML and SVG === In the following simple example the text part consists of a MathML formula and a SVG graphics. {{{ Title Name, Prename 1789 /experimental/yourDirectory

This text is available under Creative Commons license CC-BY

English
MathML example: a x 2 + b x + c SVG example:
}}}