Annotation of storage/meta/meta-format.tex, revision 1.23
1.1 casties 1: \documentclass[a4paper]{article}
2:
3: \usepackage[latin1]{inputenc}
4: \usepackage[T1]{fontenc}
5: \usepackage{ae}
6: %\usepackage{times}
7: %\usepackage{courier}
8:
9: % create in-text links black (with PDF)
1.6 casties 10: \usepackage[colorlinks=true,linkcolor=black]{hyperref}
1.1 casties 11: % Format URLs nicely (without PDF)
1.6 casties 12: %\usepackage{url}
1.1 casties 13:
14:
15: \title{A simple metadata format for resource bundles}
16:
1.4 casties 17: \author{Robert Casties, Dirk Wintergrün, Hans-Christoph Liess}
1.1 casties 18:
1.23 ! dwinter 19: \date{V1.3.7 of 27.8.2010}
1.1 casties 20:
21: \begin{document}
22:
23: \maketitle
24:
25: \tableofcontents
26:
27:
28: \section{File and directory names}
29: \label{sec:file-directory-names}
30:
31: File and directory names should not contain spaces. Allowed characters
32: in filenames are only the alphanumeric set a-z, A-Z, 0-9, hyphen
33: ``-'', underscore ``\_'' and dot ``.''.
34:
1.12 casties 35: Files and directories with names that contain illegal characters must
36: be transformed to allowed names. A proposition for a simple
37: transformation rule is
38:
39: \begin{itemize}
40: \item whitespace characters (e.g. blank, tab, cr, lf) are replaced by
41: hyphens ``-''
42:
43: \item other illegal characters are replaced by underscores ``\_''.
44: \end{itemize}
45:
46: This rule does not provide a reversible mapping to the original
47: illegal file name and it does not provide a collision-free mapping,
48: i.e. two different illegal file names might be mapped to the same
49: allowed file name. Additional precautions for these cases must be
50: taken.
1.1 casties 51:
1.4 casties 52:
53: \section{Metadata files}
54: \label{sec:metadata-files}
55:
56: The metadata information is stored in the XML format documented below
57: in special files in the resource directory. Two forms of metadata
58: files are possible:
59: \begin{itemize}
60: \item a file named \texttt{index.meta} in a directory.
61:
1.16 casties 62: \item a file with the same name as the data file it describes and an
1.4 casties 63: additional extension \texttt{.meta}. For example metadata for the
1.16 casties 64: file \texttt{p0001.tif} would be in a file \texttt{p0001.tif.meta}.
1.4 casties 65: \end{itemize}
66:
67: The resource directory must contain an \texttt{index.meta} file with
1.16 casties 68: information about the resource as a whole. Subdirectories can
69: contain additional \texttt{index.meta} files.
1.4 casties 70:
71: Additional information about single data files that are part of the
72: resource can either be put in \texttt{file} tags in the
73: \texttt{index.meta} file or in separate \emph{filename}\texttt{.meta}
74: files for each data file. Information from the directory level file is
1.16 casties 75: inherited at the file level when it is not overwritten.
1.4 casties 76:
77:
1.1 casties 78: \section{Resource format}
79: \label{sec:mpiwg-doc}
80:
81: In this description elements marked ``optional'' need not be supplied
82: by the provider of the resource and may be absent in all versions of
83: the metadata file. Elements marked ``required'' must be supplied by
84: the provider of the resource. Elements marked ``deduced'' can be
85: supplied by the provider of the resource but can also be provided by
1.4 casties 86: automatic scripts later in the process, these elements must be present
1.1 casties 87: in the final file.
88:
1.12 casties 89: File and directory paths in the metadata file use the conventional
90: Unix file separator slash ``/''.
91:
1.11 casties 92: The outer container element is \texttt{resource}. It has the following
93: \textbf{attributes}:
94:
95: \begin{description}
1.12 casties 96: \item[type] sub-type of resource (e.g. ``ECHO'', ``MPIWG'') --
97: optional.
1.11 casties 98:
1.16 casties 99: \item[version] version number of metadata format (currently 1.2) --
1.11 casties 100: required.
101: \end{description}
102:
103: \noindent The allowed \textbf{elements} inside \texttt{resource} are:
1.1 casties 104:
105: \begin{description}
1.14 casties 106: \item[description] An informal textual description of the resource --
107: optional\footnote{At least one description of the resource's content
108: is required. The description can be an informal
109: \texttt{description} element or a descriptive element (like
110: \texttt{bib}) in a \texttt{meta} container.}.
1.1 casties 111:
112: \item[name] The filename of the resource (name of the directory this
113: file is contained in) -- required.
114:
115: \item[creator] The name of the project or person that created the
116: resource -- optional.
1.4 casties 117:
118: \item[archive-creation-date] The time and date the archive collection
119: was created -- deduced.
1.1 casties 120:
1.4 casties 121: \item[archive-storage-date] The time and date the archive was written
122: to permanent storage -- deduced (must not be set by the user).
1.1 casties 123:
124: \item[archive-path] The full path to the resource directory inside the
1.5 casties 125: whole archive collection, including the resource directory -- deduced.
1.12 casties 126:
127: \item[archive-id] The ID for this document in the archive --
1.16 casties 128: optional.
1.1 casties 129:
130: \item[derived-from] Container for the description of the original
131: resource if this resource is a modified version of another resource
132: -- optional.
133:
134: \begin{description}
1.12 casties 135: \item[archive-id] The ID of the original resource
1.16 casties 136: -- required (or archive-path).
1.12 casties 137:
1.1 casties 138: \item[archive-path] The full path to the original resource
1.16 casties 139: -- required (or archive-id).
140:
141: \item[description] An informal textual description of the relation
142: of this resource to the original resource -- optional.
143: \end{description}
144:
145: \item[used-by] Container for the description of modified resources
146: if this resource is the source of another resource
147: -- optional.
148:
149: \begin{description}
150: \item[archive-id] The ID of the derived resource
151: -- required (or archive-path).
152:
153: \item[archive-path] The full path to the derived resource
154: -- required (or archive-id).
1.1 casties 155:
156: \item[description] An informal textual description of the relation
157: of this resource to the original resource -- optional.
158: \end{description}
159:
160: \item[linked-with] Container for the description of another
161: resource when this resource is a linked copy of another resource
162: -- optional.
163:
164: \begin{description}
1.12 casties 165: \item[archive-id] The ID of the linked resource
1.16 casties 166: -- required (or archive-path).
1.12 casties 167:
1.1 casties 168: \item[archive-path] The full path to the linked resource
1.16 casties 169: -- required (or archive-id).
1.1 casties 170:
171: \item[description] An informal textual description of the relation
172: of this resource to the linked resource -- optional.
173: \end{description}
174:
1.12 casties 175: \item[media-type] \label{tag-media-type} The main media type of this
176: resource -- required.\\ The main media type can be overridden by
177: \texttt{media-type}s in subdirectories. Possible types are
178: \begin{itemize}
179: \item \texttt{image}
180:
181: \item \texttt{text}
182:
183: \item \texttt{audio}
184:
185: \item \texttt{video}
186:
187: \item \texttt{data} for other type of data
188: \end{itemize}
1.1 casties 189:
190: \item[meta] Additional metadata information about the resource --
191: optional.\\ For a description of additional metadata see below.
192:
193: \item[dir] Container for the description of a subdirectory -- required
194: (when there are subdirectories).\\ \texttt{dir} tags should not be
195: nested. Directories at lower levels are identified by their
196: \texttt{path}.
197:
198: \begin{description}
199: \item[description] An informal textual description of the
200: subdirectory -- optional.
201:
202: \item[name] The name of the subdirectory -- required.
203:
1.12 casties 204: \item[original-name] A text string associated with the directory as
205: original name -- optional. (E.g. if the data in this directory
206: came from an external source and had a name that had to be changed
207: according to section~\ref{sec:file-directory-names} but it should
208: be possible to reference the original name.)
209:
1.1 casties 210: \item[path] The directory path of this subdirectory relative to the
1.5 casties 211: resource's root directory (excluding the directory itself) --
212: required (may be empty or omitted if the directory is a direct
213: child of the resource's root directory).
1.1 casties 214:
215: \item[meta] Additional metadata information about the directory --
216: optional.\\ For a description of additional metadata see below.
217: \end{description}
218:
219: \item[file] Container for the description of a file -- deduced.\\
220: \texttt{file} tags should not be nested in \texttt{dir} tags. Files
221: at lower directory levels are identified by their \texttt{path}.
222:
223: \begin{description}
224: \item[description] An informal textual description of the
225: file -- optional.
226:
227: \item[name] The name of the file -- required.
228:
1.12 casties 229: \item[original-name] A text string associated with the file as
1.16 casties 230: original name -- optional. (e.g. if this file came from an
1.12 casties 231: external source and had a name that had to be changed according to
1.16 casties 232: section~\ref{sec:file-directory-names} it is possible
233: to preserve the original name.)
1.12 casties 234:
1.1 casties 235: \item[path] The directory path of this file relative to the
1.5 casties 236: resource's root directory (excluding the file itself) -- required
237: (may be empty or omitted if the file is in the resource's root
238: directory).
1.7 casties 239:
240: \item[date] The file's modification or creation date\footnote{The
241: preferred time and date format is ``YYYY/MM/DD HH:MM:SS''},
242: whichever is more recent -- optional.
1.1 casties 243:
244: \item[modification-date] The file's modification date -- optional.
245:
246: \item[creation-date] The file's creation date -- optional.
1.7 casties 247:
1.1 casties 248: \item[size] The file size -- deduced.
249:
250: \item[mime-type] The file's mime-type -- optional.
251:
252: \item[md5cs] MD5 checksum of the file content -- optional.
253:
254: \item[meta] Additional metadata information about the file --
255: optional. For a description of additional metadata see below.
256: \end{description}
257:
258: \end{description}
259:
260:
261:
262: \section{Additional metadata}
263: \label{sec:additional-metadata}
264:
265: All elements with \texttt{meta} tags can contain an arbitrary number
1.12 casties 266: of the following additional metadata elements.
267:
1.16 casties 268: \subsection{Workflow state}
1.12 casties 269: \label{sec:workflow-state}
270:
271: All additional metadata elements can have a \texttt{workflow-state}
272: \textbf{attribute}. This attribute reflects the state of the
273: corresponding metadata element. The possible values for the
274: \texttt{workflow-state} attribute are
275: \begin{itemize}
276: \item \texttt{preliminary} this information is preliminary. It must
277: be checked in further workflow steps.
278:
279: \item \texttt{inwork}
280:
281: \item \texttt{final}
282: \end{itemize}
283:
284: workflow states other than \texttt{preliminary} are part of the
285: workflow handling of the respective projects.
286:
287: Metadata elements can appear multiple times with different
288: \texttt{workflow-state} attributes. This enables metadata versioning.
289:
290:
291:
292: \subsection{Content type}
293: \label{sec:content-type}
294:
295: \begin{description}
296: \item[content-type] \label{tag-content-type} The content type of this
297: resource -- required.\\
298: The content type enables the choice of tools to manipulate and
299: display the resource. There should be a common list of content
300: types. For digital documents (books, manuscripts) this would be
301: "scanned document", for other image data "scanned
302: images".\footnote{The criterion for documents is a ordered
303: succession of image files (pages) and equal image size and
304: resolution throughout the images of a resource.}
305: \end{description}
306:
307:
1.1 casties 308:
1.4 casties 309: \subsection{Language}
310: \label{sec:lang}
311:
312: The language of a resource (e.g. a text) can be specified with a
313: \texttt{lang} tag. Languages have to be described using the
314: international codes for the representation of names of languages
315: either in two-letter form (ISO 639-1) or in three-letter form (ISO
316: 639-2). The entire catalogue of languages is documented on the page
317:
318: \url{http://www.loc.gov/standards/iso639-2/englangn.html}
319:
1.1 casties 320:
321: \subsection{DRI}
322: \label{sec:dri}
323:
324: The \emph{digital resource identifier} for the resource is specified
1.4 casties 325: in a \texttt{dri} element. Digital resource identifiers are documented
1.1 casties 326: on the page
327:
328: \url{http://pythia.mpiwg-berlin.mpg.de/projects/standards/dri}.
329:
330:
1.4 casties 331:
332: \subsection{Collection context}
333: \label{sec:collection-context}
334:
1.15 casties 335: The context of a resource as part of a collection or part of a project
336: can be specified in the \texttt{context} element. The context element
337: can appear multiple times if the resource is part of multiple
338: collections or projects.
1.4 casties 339:
340: \begin{description}
1.5 casties 341: \item[context] information on collection or project context.
1.4 casties 342:
1.5 casties 343: \begin{description}
1.15 casties 344: \item[link] URL to additional context information -- optional.
1.5 casties 345:
1.15 casties 346: \item[name] Textual description of project or collection -- optional.
347:
348: \item[meta-datalink] description of external sources of canonical meta
349: information -- optional
350: \begin{description}
351: \item[db] \textbf{attribute} to identify different sets of meta data
352: links to the same resource -- optional
353:
354: \item[object] \textbf{attribute} to identify different objects or
355: parts of the same resource -- optional
356:
357: \item[label] textual label for the link -- optional
358:
359: \item[url] URL to present to the client -- optional
360:
361: \item[metadata-url] URL to an external server to be queried -- optional
362: \end{description}
363:
364: \item[meta-baselink] description of external server for canonical meta
365: information -- optional
366: \begin{description}
367: \item[db] \textbf{attribute} to identify different sets of meta data
368: links to the same resource -- optional
369:
370: \item[label] textual label for the link -- optional
371:
372: \item[url] URL to present to the client -- optional
373:
374: \item[metadata-url] URL to an external server to be queried --
375: required (the parameter \texttt{object=} with an object id has
376: to be appended to this URL)
377: \end{description}
1.5 casties 378: \end{description}
1.4 casties 379: \end{description}
1.5 casties 380:
1.4 casties 381:
382:
383:
1.1 casties 384: \subsection{Bibliographic information}
385: \label{sec:bibliographic-data}
386:
1.5 casties 387: Bibliographic information is presented in a \texttt{bib} container with
1.1 casties 388: a \texttt{type} parameter, giving the type of bibliographic resource.
1.4 casties 389: The \texttt{type} field can be repeated as a tag in the container.
390:
1.5 casties 391: The format is based on the ECHO scheme for bibliographic data (cf.
392: content workflow), the MPIWG ``Projektbibliografie'' and the format of
393: the commonly used program ``EndNote''.
394:
1.4 casties 395:
396: \subsubsection{Book}
397:
398: \begin{description}
399:
400: \item [bib type="book"] a published book.
401:
402: \begin{description}
403: \item [author] The author of the book.
404: \item [year] The year of publication.
405: \item [title] Title of the book.
406: \item [series-editor] Name of the series editor, if the book appears
407: in a series.
408: \item [series-title] Title of the serie, if the book appears in a
409: series.
410: \item [series-volume] Volume number, if the book appears in a
411: series.
412: \item [number-of-pages] Number of pages of the entire book.
413: \item [city] City where the book was published.
414: \item [publisher] Name of the publishing company
415: \item [edition] Edition of the book (e.g. third edition)
416: \item [number-of-volumes] Number of volumes, if the the book is
417: published in multiple volumes.
418: \item [translator] Name of the translator.
419: \item [isbn-issn]
1.18 casties 420: \item[call-number] Call number in holding library
421: \item[holding-library] Holding library
1.4 casties 422: \end{description}
423: \end{description}
424:
425: \subsubsection{In Book}
426:
427: \begin{description}
428: \item [bib type="inbook"] an article as part of a book.
429:
430: \begin{description}
431: \item [author] The author of the book.
432: \item [year] The year of publication.
433: \item [title] Title of the article.
434: \item [editor] Name of the book's editor.
435: \item [book-title] Title of the book.
436: \item [series-volume] Volume number, if the book appears in a
437: series.
438: \item [pages] Number of pages of the article.
439: \item [city] City where the book was published.
440: \item [publisher] Name of the publishing company
441: \item [edition] Edition of the book (e. g. third edition)
442: \item [series-author] Name of the series editor, if the book appears
443: in a series.
444: \item [series-title] Title of the series, if the book appears in a
445: series.
446: \item [number-of-volumes] Number of volumes, if the the book is
447: published in multiple volumes.
448: \item [translator] Name of the translator
449: \item [isbn-issn]
1.18 casties 450: \item[call-number] Call number in holding library
451: \item[holding-library] Holding library
1.4 casties 452: \end{description}
453: \end{description}
454:
455: \subsubsection{Proceedings}
456:
457: \begin{description}
458: \item [bib type="proceedings"] a conference proceedings publication.
459:
460: \begin{description}
461: \item [author] The author of the article.
462: \item [year] The year of publication.
463: \item [title] Title of the article.
464: \item [editor] Name of the book's editor.
465: \item [conference-name] Name of the conference the proceedings are
466: related to.
467: \item [volume] Volume number.
468: \item [pages] Number of pages of the article.
469: \item [date] Date of the conference the proceedings are related to.
470: \item [conference]-location City where the conference was held.
471: \item [publisher] Name of the publishing company
472: \item [edition] Edition of the book (e. g. third edition)
473: \item [series-editor] Name of the series editor, if the book appears
474: in a series.
475: \item [series-title] Title of the series, if the book appears in a
476: series.
477: \item [number-of-volumes] Number of volumes, if the the book is
478: published as multiple volumes.
479: \item [isbn-issn]
1.18 casties 480: \item[call-number] Call number in holding library
481: \item[holding-library] Holding library
1.4 casties 482: \end{description}
483: \end{description}
484:
485: \subsubsection{Edited Book}
486:
487: \begin{description}
488: \item[bib type="edited-book"] a book that is the edition of another
489: work.
490:
491: \begin{description}
492: \item [editor] Name of the editor of the book.
493: \item [year] The year of publication.
494: \item [title] Title of the book.
495: \item [series-editor] Name of the editor of the series the book is
496: part of.
497: \item [series-title] Title of the series, if the book is part of a
498: series.
499: \item [series-volume] Volume number, if the book appears in a series.
500: \item [number-of-pages] Number of pages of the article.
501: \item [city] City where the book was published.
502: \item [publisher] Name of the publishing company
503: \item [edition] Information about the edition (e.g. ``Repr. of the London ed. 1652'')
504: \item [number-of-volumes] Number of volumes, if the the book is
505: published as multiple volumes.
506: \item [isbn-issn]
1.18 casties 507: \item[call-number] Call number in holding library
508: \item[holding-library] Holding library
1.4 casties 509: \end{description}
510: \end{description}
511:
1.17 casties 512: \subsubsection{Journal Volume}
513:
514: \begin{description}
515: \item [bib type="journal-volume"] a volume of a scientific journal.
516: \begin{description}
517: \item [title] Name of the journal.
518: \item [editor] The editor of the journal.
519: \item [publisher] Name of the publishing company.
520: \item [city] City where the journal is published.
521: \item [year] The year of publication.
522: \item [volume] Volume number.
523: \item [numer-of-pages] Number of pages of the volume.
524: \item [isbn-issn]
1.18 casties 525: \item[call-number] Call number in holding library
526: \item[holding-library] Holding library
1.17 casties 527: \end{description}
528: \end{description}
529:
1.4 casties 530: \subsubsection{Journal Article}
531:
532: \begin{description}
533: \item [bib type="journal-article"] an article in a scientific journal.
534: \begin{description}
535: \item [author] The author of the article.
536: \item [year] The year of publication.
537: \item [title] Title of the article.
538: \item [journal] Name of the journal.
539: \item [volume] Volume number, if the journal appears in a series.
540: \item [issue] Number of the issue the article is part of.
541: \item [pages] Number of pages of the article.
542: \item [alternate-journal] Alternate Journal
543: \item [isbn-issn]
1.18 casties 544: \item[call-number] Call number in holding library
545: \item[holding-library] Holding library
1.4 casties 546: \end{description}
547: \end{description}
548:
549: \subsubsection{Magazine Article}
550:
551: \begin{description}
552: \item [bib type="magazine-article"] an article in a popular magazine.
553: \begin{description}
554: \item [author] The author of the book.
555: \item [year] The year of publication.
556: \item [title] Title of the article.
557: \item [magazine] Name of the magazine.
558: \item [volume] Volume number, if the book appears in a series.
559: \item [issue-number] Number of the issue the article is part of.
560: \item [pages Number] of pages of the article.
561: \item [date] Date when the article appeared.
1.18 casties 562: \item[call-number] Call number in holding library
563: \item[holding-library] Holding library
1.4 casties 564: \end{description}
565: \end{description}
566:
567: \subsubsection{Newspaper Article}
568:
569: \begin{description}
570: \item [bib type="newspaper-article"] an article in a newspaper.
571: \begin{description}
572: \item [author] The author of the article.
573: \item [year] The year of publication.
574: \item [title] Title of the article.
575: \item [Newspaper] Name of the newspaper the article appeared in.
576: \item [pages] Number of pages of the article.
577: \item [issue-date] Date of the issue the article is part of.
578: \item [city] City of the newspaper.
1.18 casties 579: \item[call-number] Call number in holding library
580: \item[holding-library] Holding library
1.4 casties 581: \end{description}
582: \end{description}
583:
584: \subsubsection{Thesis}
585:
586: \begin{description}
587: \item [bib type="thesis"] a master/doctorate/etc. thesis.
588: \begin{description}
589: \item [author] The author of the thesis.
590: \item [year] The year of publication.
591: \item [title] Title of the thesis.
592: \item [academic-department] Name of the academic department where
593: the thesis was handed in.
594: \item [number-of-pages] Number of pages of the thesis.
595: \item [city] City where the thesis was published.
596: \item [University] Name of the university where the thesis was
597: handed in.
598: \item [isbn-issn]
1.18 casties 599: \item[call-number] Call number in holding library
600: \item[holding-library] Holding library
1.4 casties 601: \end{description}
602: \end{description}
603:
604: \subsubsection{Report}
605:
606: \begin{description}
607: \item [bib type="report"] a scientific report.
608: \begin{description}
609: \item [author] The author of the report.
610: \item [year] The year of publication.
611: \item [title] Title of the report.
612: \item [pages] Number of pages of the report.
613: \item [date] Date when the report appeared.
614: \item [city] City where the book was published.
615: \item [institution] Institution where the report was produced.
616: \item [type] Type of report.
617: \item [report-number] Report number.
1.18 casties 618: \item[call-number] Call number in holding library
619: \item[holding-library] Holding library
1.4 casties 620: \end{description}
621: \end{description}
622:
1.5 casties 623: \subsubsection{Manuscript}
624:
625: \begin{description}
626: \item [bib type="manuscript"] a handwritten/typewritten manuscript.
627:
628: \begin{description}
629: \item [title] Title of the manuscript.
630: \item [author] The author of the text.
631: \item [location] Name of the library where the manuscript is
632: currently located.
633: \item [year] The year or century of publication.
634: \item [pages] Number of pages of the manuscript.
635: \item [signature] Signature of the manuscript.
636: \item [editorial-remarks] Remarks related to the online
637: publication of the manuscript. This could be notes about
638: annotations etc.
639: \item [description] This can be any kind of description.
640: \item [keywords] Keywords related to the manuscript.
1.18 casties 641: \item[call-number] Call number in holding library
642: \item[holding-library] Holding library
1.5 casties 643: \end{description}
644: \end{description}
645:
1.23 ! dwinter 646: \subsubsection{Extended Manuscript}
! 647:
! 648: \begin{description}
! 649: \item [bib type="extended-manuscript"] a handwritten/typewritten manuscript
! 650: with detailed information about the manuscripts appearance.
! 651:
! 652: \begin{description}
! 653: \item [title] Title of the manuscript.
! 654: \item [author] The author of the text.
! 655: \item[holding-library] Holding library
! 656: \item[call-number] Call number/Shelf mark in holding library
! 657: \item[location] Place/City/Country where the manuscript is
! 658: currently located.
! 659: \item[date calendar="type"] The date of publication with attribute which
! 660: calendar used. If no attribute used, CE is the default. Can also be
! 661: descriptive.
! 662: \item[year calendar="type"] approximate year or century
! 663: \item[number-of-folios] Number of folios/pages of the manuscript.
! 664: \item[signature] Signature(s) of the manuscript, under which a manuscript is
! 665: known.
! 666: \item[abstract] Interpretative abstract of the text's content
! 667: \item[incipit] Incipit
! 668: \item[explicit] Explicit
! 669: \item[contents] Formal description of the text structure (e.g. table of
! 670: contents)
! 671: \item[writing-surface] material of the writing surface (e.g. non-european
! 672: paper, palm leaf,\ldots)
! 673: \item[foliation] Text giving list or range of folios
! 674: \item[page-dimensions] height and width in cm
! 675: \item[written-area-dimensions] height and width in cm
! 676: \item[lines-per-page] number of lines and columns
! 677: \item[catchwords] Quire signatures and catchwords
! 678: \item[scripts] Description of the script
! 679: \item[copyist] Copyist
! 680: \item[collation-corrections] Notes on collation and corrections
! 681: \item[binding] Binding
! 682: \item[notes] Additional notes
! 683: \item[secondary-literature] Notes on secondary literature related to the
! 684: manuscript
! 685: \item [editorial-remarks] Remarks related to the online
! 686: publication of the manuscript.
! 687:
! 688: \item [keywords] Keywords related to the manuscript.
! 689:
! 690:
! 691: \end{description}
! 692: \end{description}
! 693:
! 694: \subsubsection{Codex}
! 695:
! 696: \begin{description}
! 697: \item [bib type="codex"] Codex
! 698:
! 699: \begin{description}
! 700:
! 701: \item[holding-library] Holding library
! 702: \item[call-number] Call number/Shelf mark in holding library
! 703: \item[location] Place/City/Country where the codex is
! 704: currently located.
! 705: \item[date calendar="type"] Date of the collation of the codex.
! 706: \item[year calendar="type"] approximate year or century
! 707: \item[number-of-folios] Number of folios/pages of the manuscript.
! 708: \item[signature] Signature(s) of the manuscript, under which a manuscript is
! 709: known.
! 710: \item[contents] Formal description of the text structure (e.g. table of
! 711: contents)
! 712: \item[dimensions] height + width in cm
! 713: \item[binding] Binding
! 714: \item[notes] Additional notes
! 715: \item[notes-on-ownership] Notes on ownership
! 716: \end{description}
! 717: \end{description}
! 718:
1.5 casties 719:
1.19 casties 720: \subsubsection{Correspondence}
721:
722: \begin{description}
723: \item [bib type="correspondence"] a piece of correspondence e.g. letter, telegram, in the following called ``letter''
724:
725: \begin{description}
726: \item[type] The type of correspondence, e.g. ``letter'', ``postcard'', ``telegram'', ``letter draft''
727: \item [author] The author/sender of the letter.
728: \item [recipient] The recipient of the letter.
729: \item [date] normalised date of the letter.
730: \item [date-range-end] end of range of uncertain dating -- optional.
731: \item [date-original] the date in its original form as noted on the letter -- optional.
732: \item [place] place where the letter was written/sent.
733: \item [title] Title of the letter -- optional.
734: \item[incipit] The opening phrase of the letter -- optional.
735: \item[excipit] The closing phrase of the letter -- optional.
736: \item [pages] Number of pages of the manuscript.
737: \item [signature] Canonical signature/call number of the manuscript.
738: \item [description] This can be any kind of description.
739: \item [keywords] Keywords related to the manuscript.
740: \item[call-number] Call number in the current holding library
741: \item[holding-library] current holding library
742: \end{description}
743: \end{description}
744:
745:
1.4 casties 746: \subsubsection{Generic}
747:
748: \begin{description}
749: \item [bib type="generic"] a generic bibliographic type. This type
750: should only be used in rare cases.
751: \begin{description}
752: \item [author]
753: \item [year]
754: \item [title]
755: \item [secondary-author]
756: \item [secondary-title]
757: \item [volume]
758: \item [number]
759: \item [pages]
760: \item [date]
761: \item [place-published]
762: \item [publisher]
763: \item [edition]
764: \item [tertiary author]
765: \item [tertiary-title]
766: \item [number-of-volumes]
767: \item [type-of-work]
768: \item [subsidiary author]
769: \item [alternate-title]
770: \item [isbn-issn]
771: \item [call-number]
772: \item [label]
773: \item [keywords]
774: \item [abstract]
775: \item [notes]
776: \item [url]
1.5 casties 777: \end{description}
1.4 casties 778: \end{description}
779:
780:
781: \subsection{Architectural drawings}
782: \label{sec:doc}
783:
784: Specific information for architectural drawings is presented in a
1.5 casties 785: \texttt{doc} container with an additional \texttt{type} attribute
786: giving the type of drawing. All elements inside the container can
787: appear multiple times.
1.4 casties 788:
789: \begin{description}
1.5 casties 790:
791: \item[doc type="Architectural Drawing"] architectural drawing.
792:
793: \begin{description}
794: \item [person] last name and first name of a person, separated by a
795: comma. A further common name for the person can be put infront,
796: separated by a semicolon.
797: \item [location] Name of a place in its common notation. This can be
798: a city or a institution.
799: \item [date] This can be a year (or several years, separated by
800: commas) or a period (1706-1714). Years are noted with four digits.
801: \item [object] Short description of an object or signatures.
802: \item [keywords] Keywords related to the object.
803: \end{description}
1.4 casties 804: \end{description}
1.1 casties 805:
806:
1.10 casties 807: \subsection{Document structure (table of contents)}
1.1 casties 808: \label{sec:toc}
809:
1.4 casties 810: Information on the structure of a document like the division into
811: parts and chapters in the way of a table of contents is presented in a
812: \texttt{toc} container.
813:
814: The scheme allows multiple logical pages on a single page image
815: as it is often the case with scanned books or manuscripts. The scheme
816: also allows for ``loose'' numbering schemes with roman, arabic or
817: other page numbers consecutively or mixed and changes in the numbering
818: within the document.
819:
820: The flexibility comes from the fact that no additional assumptions
821: about the mapping between logical pages and page images are made in
822: the format. All mapping information is specified by the user.
823:
824: The logical page numbering or naming that can be presented to the user
825: is specified in the \texttt{name} tags while the physical numbering of
826: the page images is specified in the \texttt{index} or \texttt{url}
827: tags.
1.1 casties 828:
1.4 casties 829: \begin{description}
1.5 casties 830: \item[toc] container for document structure
831:
1.4 casties 832: \begin{description}
1.5 casties 833: \item[page] describes a single logical page
834:
835: \begin{description}
836: \item[name] the ``name'' of the logical page. This can be any string
837: like a page number (arabic, roman, etc.) or a special designation
838: like ``Table 5''.
839:
840: \item[index] the \texttt{digilib} index number\footnote{The index
841: number for digilib is the index in the alphabetical order of the
842: scan file names.} of the scan image of the page.
843:
844: \item[url] alternatively to the \texttt{digilib} index number the
845: full URL of the scan image of the page can be used.
846: \end{description}
1.4 casties 847:
1.5 casties 848: \item[chapter] describes a section or chapter of the text.
849: \texttt{chapter} elements can be nested.
1.1 casties 850:
1.4 casties 851: \begin{description}
1.5 casties 852: \item[name] the title of the chapter or section.
853:
854: \item[start] the beginning of a page range (usually the first page
855: of the chapter). The \texttt{start} element has an optional
856: \texttt{increment} attribute to indicate the number of logical
857: pages on a scan image.\footnote{This information is only needed by
858: additional tools that try to generate lists of all page and
859: image numbers.}
860:
861: \begin{description}
862: \item[name] the ``name'' of the first page (see \texttt{page}).
863:
864: \item[index] the index of the first page (see \texttt{page}).
865:
866: \item[url] the URL of the first page (see \texttt{page}).
867: \end{description}
868:
869: \item[end] the end of a page range (usually the last page of the
870: chapter).
871:
872: \begin{description}
873: \item[name] the ``name'' of the last page (see \texttt{page}).
874:
875: \item[index] the index of the last page (see \texttt{page}).
876:
877: \item[url] the URL of the last page (see \texttt{page}).
878: \end{description}
879:
880: \item[page] alternative (and additional) to
881: \texttt{start}/\texttt{end} page ranges single \texttt{page}
882: elements can be used inside \texttt{chapter}.
1.4 casties 883: \end{description}
884: \end{description}
885: \end{description}
886:
887: %%\url{http://pythia.mpiwg-berlin.mpg.de/toolserver/TS_lise}
1.1 casties 888:
889:
1.12 casties 890: \subsection{Digital images}
1.1 casties 891: \label{sec:inform-scann-imag}
892:
893: Image files representing scanned images can have an \texttt{img}
894: container tag with information about the scan resolution and the size
895: of the original image. This information is used by the
896: \texttt{digilib} image viewing tool.
897:
898: Required is one of three possible sets of tags:
899:
900: \begin{description}
1.5 casties 901: \item[img] digital image information.
1.1 casties 902:
1.5 casties 903: \begin{description}
1.12 casties 904: \item[original-size-x] The width of the original
905: image -- required. \\
906: The unit of measure can be contained as parameter \texttt{unit},
907: the default is meter ``m''. The width to be considered is the
908: total width of the scanned area.
1.5 casties 909:
1.12 casties 910: \item[original-size-y] The height of the original image -- required.
1.5 casties 911:
1.12 casties 912: \item[original-pixel-x] The width of the hi-res scan in pixels -- deduced.
1.5 casties 913:
1.12 casties 914: \item[original-pixel-y] The height of the hi-res scan in pixels -- deduced.
1.5 casties 915: \end{description}
1.1 casties 916: \end{description}
917:
918: or
919:
920: \begin{description}
1.5 casties 921: \item[img] digital image information.
922:
923: \begin{description}
924: \item[original-dpi-x] The resolution of the hi-res scan in its width
1.12 casties 925: in pixels per inch -- required.
1.1 casties 926:
1.5 casties 927: \item[original-dpi-y] The resolution of the hi-res scan in its height
1.12 casties 928: in pixels per inch -- required.
929:
930: \item[original-pixel-x] The width of the hi-res scan in pixels -- deduced.
931:
932: \item[original-pixel-y] The height of the hi-res scan in pixels -- deduced.
1.5 casties 933: \end{description}
1.1 casties 934: \end{description}
935:
936: or
937:
938: \begin{description}
1.5 casties 939: \item[img] digital image information.
940:
941: \begin{description}
942: \item[original-dpi] The resolution of the hi-res scan in pixels per
1.12 casties 943: inch if the resolutions in width and height are the same -- required.
944:
945: \item[original-pixel-x] The width of the hi-res scan in pixels -- deduced.
946:
947: \item[original-pixel-y] The height of the hi-res scan in pixels -- deduced.
1.5 casties 948: \end{description}
1.1 casties 949: \end{description}
1.7 casties 950:
951:
1.10 casties 952:
1.12 casties 953: \subsection{Digital image acquisition}
1.10 casties 954: \label{sec:inform-about-image}
955:
956: A description of the technology used in the process of producing a
957: digital image.
958:
959: \begin{description}
960: \item[image-acquisition] description of the image production process
961: \begin{description}
1.12 casties 962: \item[device] acquisition device (e.g. ``flatbed scanner'')
1.10 casties 963:
1.12 casties 964: \item[image-type] type and color-depth of the image -- required (e.g. ``RGB 24
1.10 casties 965: bit'')
966:
967: \item[production-comment] additional textual information about the
968: production process
969: \end{description}
970: \end{description}
971:
972:
1.12 casties 973:
1.7 casties 974: \subsection{Full text with images}
975: \label{sec:full-text-with}
976:
1.12 casties 977: Full text in a XML format should be specified with a
978: \texttt{content-type}\footnote{see section~\ref{tag-content-type}
979: on page\pageref{tag-content-type}} ``fulltext''.
1.8 casties 980:
981: The relation between the full text and optional images of
982: whole pages or parts of pages must be specified in a
1.20 casties 983: \texttt{texttool} container.
1.8 casties 984:
985: \begin{description}
1.20 casties 986: \item[texttool] representation of full text with images
987:
1.8 casties 988: \begin{description}
1.22 casties 989: \item[text] the file name of the full text file (path
1.8 casties 990: inside document directory)
1.12 casties 991:
1.20 casties 992: \item[text-url-path] a characteristic part of the URL with which the
993: full text can be retrieved (the form and content of this element
994: is dependent on the specific text retrieval mechanism)
995:
996: \item[image] the directory name of the directory containig the
1.22 casties 997: page image files (path inside document directory)
998:
999: \item[figure] the directory name of the directory containig the
1000: in-page figure image files (path inside document directory)
1.8 casties 1001:
1.20 casties 1002: \item[xslt] the file name of an additional XSL transformation
1.8 casties 1003: file
1004:
1.20 casties 1005: \item[pagebreak] the name of the element that indicates page breaks
1006: (default ``pb'')
1.8 casties 1007: \end{description}
1008: \end{description}
1.7 casties 1009:
1.1 casties 1010:
1011:
1.12 casties 1012: \subsection{Copyright and access conditions}
1013: \label{sec:access-conditions}
1014:
1015: If the access to a resource is bound to conditions for technical or legal
1016: reasons then the conditions can be put in a \texttt{access-conditions}
1.16 casties 1017: container. Other usage conditions like copyright can also be
1.12 casties 1018: documented in this container.
1019:
1020: \begin{description}
1021: \item[access-conditions] legal and technical conditions for access to
1022: this resource
1023:
1024: \begin{description}
1025: \item[attribution] The name or institution this resource should be
1026: attributed to when it's publicly presented
1027:
1028: \begin{description}
1029: \item[name] a name (free text)
1030:
1031: \item[url] a URL (with an optional \texttt{label} attribute to show
1032: as text)
1.18 casties 1033:
1034: \item[description] more information (free text, e.g. holding
1035: library call number)
1.12 casties 1036: \end{description}
1037:
1.16 casties 1038: \item[copyright] the copyright holder and it's conditions
1.12 casties 1039: \begin{description}
1.16 casties 1040: \item[owner] the name of the copyright holder
1.12 casties 1041: \begin{description}
1042: \item[name] a name (free text)
1043:
1044: \item[url] a URL (with an optional \texttt{label} attribute to show
1045: as text)
1046: \end{description}
1047:
1048: \item[date] the date when the copyright was issued
1049:
1.16 casties 1050: \item[duration] the duration of the copyright term (if known)
1.12 casties 1051:
1052: \item[description] free-text field for special or additional
1053: conditions
1054: \end{description}
1.14 casties 1055:
1056:
1057: \item[publish-metadata] metadata about this resource can be made
1.16 casties 1058: freely available when this tag is present (otherwise metadata has
1059: the same access conditions as the rest of the resource). Access to
1060: the resource itself is regulated separately by the \texttt{access}
1061: element.
1.12 casties 1062:
1.16 casties 1063: \item[access] conditions of access to this resource. Different
1064: access types are specified by a \texttt{type} attribute:
1.12 casties 1065: \begin{description}
1.16 casties 1066: \item[type=group] access restricted to the members of this named
1067: group. The method to identify a user belonging to a named group
1068: is not specified in this document.
1069: \begin{description}
1070: \item[name] name of the group.
1071:
1072: \item[only-before] the access condition is only valid before the
1073: given date (format: ``YYYY/MM/DD'').
1074:
1075: \item[only-after] the access condition is only valid after the
1076: given date (format: ``YYYY/MM/DD'').
1077: \end{description}
1078:
1079: \item[type=institution] access restricted to the members of this
1080: institution. The method to identify a user to belong to the
1081: institution is not specified in this document.
1.12 casties 1082: \begin{description}
1.16 casties 1083: \item[name] name of the group.
1084:
1085: \item[only-before] the access condition is only valid before the
1086: given date (format: ``YYYY/MM/DD'').
1087:
1088: \item[only-after] the access condition is only valid after the
1089: given date (format: ``YYYY/MM/DD'').
1090: \end{description}
1091:
1092:
1093: \item[type=subnet] access restricted to all computers with an
1094: IP-address in this subnet.
1095: \begin{description}
1096: \item[range] subnet range defined in
1097: truncated-quad (e.g. ``141.14''), network-netmask
1098: (e.g. ``141.14.0.0/255.255.0.0''), or network-range
1099: (e.g. ``141.14.0.0/16'') notation.
1100:
1101: \item[only-before] the access condition is only valid before the
1102: given date (format: ``YYYY/MM/DD'').
1103:
1104: \item[only-after] the access condition is only valid after the
1105: given date (format: ``YYYY/MM/DD'').
1106: \end{description}
1107:
1.12 casties 1108:
1.16 casties 1109: \item[type=scientific] access to this resource should be restricted to
1110: scientific work
1111: \begin{description}
1112: \item[only-before] the access condition is only valid before the
1113: given date (format: ``YYYY/MM/DD'').
1114:
1115: \item[only-after] the access condition is only valid after the
1116: given date (format: ``YYYY/MM/DD'').
1.12 casties 1117: \end{description}
1.16 casties 1118:
1.12 casties 1119:
1.16 casties 1120: \item[type=free] access to this resource is not restricted
1121: \begin{description}
1122: \item[only-before] the access condition is only valid before the
1123: given date (format: ``YYYY/MM/DD'').
1.12 casties 1124:
1.16 casties 1125: \item[only-after] the access condition is only valid after the
1126: given date (format: ``YYYY/MM/DD'').
1127: \end{description}
1128:
1.12 casties 1129:
1.16 casties 1130: \item[type=special] if none of the above conditions seems appropriate,
1.12 casties 1131: a free-form text can be specified here.
1.16 casties 1132: \begin{description}
1133: \item[description] description of special access conditions.
1134:
1135: \item[only-before] the access condition is only valid before the
1136: given date (format: ``YYYY/MM/DD'').
1137:
1138: \item[only-after] the access condition is only valid after the
1139: given date (format: ``YYYY/MM/DD'').
1140: \end{description}
1141:
1.12 casties 1142: \end{description}
1143: \end{description}
1144: \end{description}
1145:
1146: \noindent
1.16 casties 1147: It should be noted that control over access to the resource has to be
1148: provided by additional technical measures. Access conditions in the
1149: metadata file only state that conditions \emph{should} be observed, it
1150: is not implied that they \emph{are} necessarily observed, as the
1151: enforcement of conditions depends on additional measures.
1.12 casties 1152:
1153:
1154:
1155: \subsection{Acquisition of raw-data}
1156: \label{sec:acqu-inform}
1157:
1158: Information about the acquisition source for raw data resources can be
1159: provided in an \texttt{acquisition} container.
1160:
1161: \begin{description}
1162: \item[acquisition] the acquisition source of this resource -- required
1163: for raw data.
1164: \begin{description}
1165: \item[provider] where this resource came from -- required
1166: \begin{description}
1167: \item[name] free-text name of the provider (institution or
1168: individual)
1169:
1170: \item[address] address of the provider
1171:
1172: \item[contact] contact person at the provider (i.e. name and email)
1173:
1174: \item[url] URL related to the provider
1.13 casties 1175:
1176: \item[provider-id] id of the provider (internally used) -- deduced
1.12 casties 1177: \end{description}
1178:
1179: \item[date] date of acquisition -- required
1180:
1181: \item[description] free-text description of the acquisition source or
1182: additional information
1183: \end{description}
1184: \end{description}
1185:
1186:
1187:
1188: \subsection{Documentary Films}
1189: \label{sec:documentary-films}
1190:
1191: Documentary films can be described using a \texttt{film-acquisition}
1192: container.
1193:
1194: \begin{description}
1195: \item[film-acquisition] description of a (documentary) film --
1196: required for documentary film
1197: \begin{description}
1198: \item[recording] specification of the recording process
1199: \begin{description}
1200: \item[author] the person or persons doing the recording
1201:
1202: \item[date] the date or time span when the film was recorded
1203:
1204: \item[location] the place where the film was recorded
1205:
1206: \item[device] recording device used (e.g. ``Sony CP-DV8 Camcorder'')
1207:
1208: \item[format] format of the recorded film -- required (e.g. ``DV
1209: 720x524 25fps interlaced'')
1210: \end{description}
1211:
1212: \item[description] free-form description of the recording and the
1213: content of the film
1214: \end{description}
1215: \end{description}
1216:
1217: (More information about the digitization step could be added in a
1218: \texttt{digitization} tag similar to the \texttt{recording} tag.)
1219:
1.1 casties 1220:
1221:
1222:
1.4 casties 1223: \section{Sample metadata files for ECHO resources}
1.1 casties 1224:
1.5 casties 1225: The following is a sample metadata index file for a directory containig a
1226: scanned document.
1227:
1228: \begin{small}
1.1 casties 1229: \begin{verbatim}
1.11 casties 1230: <resource type="ECHO" version="1.0">
1.5 casties 1231: <description>Fleck, 1980</description>
1232: <name>fleck.1980</name>
1233: <creator>University of Bern</creator>
1234: <archive-path>ubern/wiss-theorie</archive-path>
1235: <content-type>scanned images</content-type>
1236: <meta>
1237: <dri>echo23a45e2329x</dri>
1238: <lang>ger</lang>
1239: <bib type="book">
1240: <author>Fleck, Ludwik</author>
1241: <year>1980</year>
1242: <title>Entstehung und Entwicklung einer
1243: wissenschaftlichen Tatsache</title>
1244: <series-editor></series-editor>
1245: <series-title></series-title>
1246: <series-volume></series-volume>
1247: <number-of-pages></number-of-pages>
1248: <city>Frankfurt am Main</city>
1249: <publisher>Suhrkamp</publisher>
1250: <edition></edition>
1251: <number-of-volumes></number-of-volumes>
1252: <translator></translator>
1253: <isbn-issn></isbn-issn>
1254: <keywords>Wissenschaftstheorie, Fleck, Tatsache</keywords>
1255: <abstract></abstract>
1256: </bib>
1257: </meta>
1258: <dir>
1259: <description>Scanned images (300dpi)</description>
1260: <name>img</name>
1261: </dir>
1.4 casties 1262: </resource>
1263: \end{verbatim}
1.5 casties 1264: \end{small}
1.4 casties 1265:
1.5 casties 1266: The following is a sample metadata file for a single image of an
1267: architectural drawing.
1.4 casties 1268:
1.5 casties 1269: \begin{small}
1.4 casties 1270: \begin{verbatim}
1.11 casties 1271: <resource type="ECHO" version="1.0">
1.5 casties 1272: <creator>Bibliotheca Hertziana</creator>
1273: <content-type>scanned images</content-type>
1274: <file>
1275: <name>00000271-asl-160-r-full.tif</name>
1276: <meta>
1277: <img>
1278: <original-dpi>315</original-dpi>
1279: </img>
1280: <dri>echo45a67bc4367d</dri>
1281: <lang>ita</lang>
1282: <doc type="Architectural Drawing">
1283: <person>Ciolli, Giacomo</person>
1284: <person>Urban VIII; Barberini, Maffeo</person>
1285: <location>Accademia di San Luca</location>
1286: <location>Roma</location>
1287: <date>1706</date>
1288: <object>Concorso Clementino</object>
1289: <object>Fontana Pubblica</object>
1290: <object>Brunnen</object>
1291: <object>ASL 160</object>
1292: <keywords></keywords>
1293: </doc>
1294: <context>
1295: <url>http://colosseum.biblhertz.it:8080/Lineamenta/
1296: 1033478408.39/1035196181.35/1035196204.09/1035394121.83
1297: </url>
1298: </context>
1299: </meta>
1300: </file>
1.2 casties 1301: </resource>
1.1 casties 1302: \end{verbatim}
1.5 casties 1303: \end{small}
1.1 casties 1304:
1305: \end{document}
1306:
1307: %%% Local Variables:
1308: %%% mode: latex
1309: %%% TeX-master: t
1310: %%% End:
FreeBSD-CVSweb <freebsd-cvsweb@FreeBSD.org>