Changes between Version 3 and Version 4 of mpdl2.0-design


Ignore:
Timestamp:
Sep 6, 2011, 11:14:23 AM (13 years ago)
Author:
jwillenborg
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • mpdl2.0-design

    v3 v4  
    1 = MPDL 2.0 =
    2 
    3 The MPDL backend software release 2.0 is redesigned so that important functions (language technology, XML functions) are available as web applications independent from the eXist software - usable as HTTP servlets and fully implemented in Java.
     1= MPDL 2.0 (backend system) =
     2
     3The MPDL backend software is redesigned so that as much software as possible is separated from the eXist database system and could be used independently from eXist (MPDL 2.0). We offer two MPDL librares:
     4* language technology (available as a web archive file "mpiwg-mpdl-lt.war")
     5* XML technology (available as a web archive file "mpiwg-mpdl-xml.war")
     6
     7The other MPDL backend software which needs the functionality of eXist and Lucene (XML document storage and retrieval) is redesigned so that all important functions are available as HTTP servlets with a specified API.
    48
    59== Language technology ==
    610
    7 The language technology module consists of:
    8 * language technology data (XML data files, Java Berkely DB's)
     11The language technology web archive ""mpiwg-mpdl-lt.war"" consists of:
     12* data (data files, Java Berkely DB's)
    913  * morphology data (Perseus, CELEX, Lexique with languages: ara, eng, fre, ger, gre, ita, lat, nld, zho)
    1014  * dictionary data (dictionaries: autenrieth, baretti, bonitz, cooper, florio, lewis-short, lidell-scott-jones, salmone, webster)
    1115* Java source code
    1216* used Java libraries
    13 * web application configuration file (web.xml)
    14 
    15 It is available as the web archive file "mpiwg-mpdl-lt.war".
    16 
    17 Following servlets are available:
     17* web application configuration file ("web.xml")
     18
     19Following HTTP servlets are available:
    1820
    1921=== Morphology ===
     
    6668  * URL: /mpdl/getDictionaryEntries
    6769  * Request parameters:
    68     * forms or lemmas
     70    * forms or lemmas or range
    6971      * one form or lemma (string)
    7072      * list of forms or lemmas (XML)
     73      * range
     74        * entries beginning with, e.g. "a*"
     75        * entries from position x to y: e.g. "200-300"
    7176    * inputType (optional; default: "form")
    7277      * "form", "lemma"
    7378    * dictionary (optional; default: all dictionaries)
    74       * dictionary name (e.g. "webster")
     79      * dictionary name, e.g. "webster"
    7580    * language (optional, default: all languages)
    7681      * ISO 639-3 specifier
    7782    * outputType (optional)
    78       * full, compact
     83      * "full", "compact"
     84    * outputFormat (optional)
     85      * "xml", "html"
    7986    * normalization (optional; default: without normalization)
    8087      * "reg", "norm", "reg norm"
    8188  * Response output:
    8289    * dictionary entries
    83       * dictionary entries (XML)
     90      * dictionary entries (XML/HTML format)
     91        * if result is big then it is devided into result pages
     92        * with external links
    8493
    8594* !DictionaryEnrichServlet
     
    124133== XML technology ==
    125134
    126 The XML technology module consists of:
     135The XML technology web archive ""mpiwg-mpdl-xml.war"" consists of:
    127136* Java source code
    128137* used Java libraries
    129 * web application configuration file (web.xml)
    130 
    131 It is available as the web archive file "mpiwg-mpdl-xml.war".
    132 
    133 Following servlets are available:
     138* web application configuration file ("web.xml")
     139
     140Following HTTP servlets are available:
    134141
    135142=== XPath/XQuery ===
     
    189196    * XML fragment
    190197
     198== XML document storage and retrieval ==
     199
     200* !GetDocumentServlet
     201  * URL: /mpdl/getDoc
     202  * Request parameters:
     203    * docId
     204      * document identifier (e.g. "/echo/la/benedetti_1585.xml")
     205  * Response output:
     206    * document
     207
     208* !DocumentOperationServlet
     209  * URL: /mpdl/documentOperation
     210  * Request parameters:
     211    * operation
     212      * "create", "update", "delete"
     213    * srcUrl
     214      * source URL of XML document e.g. "http://yourServer/Benedetti_1585.xml"
     215    * destFileName (optional: default: file name in URL)
     216      * destination file name, e.g. "benedetti_1585.xml"
     217    * destLanguage (optional, default: xml:lang in document or "eng")
     218      * destination language (ISO 639-3 specifier), e.g. "lat"
     219  * Response output:
     220    * job id of scheduled operation
     221
     222* !QueryServlet
     223  * URL: /mpdl/query
     224  * Request parameters:
     225    * queryType (optional: default: "morphological normalized")
     226      * "exact", "morphological", "normalized"
     227    * query
     228      * attribute query (e.g. "author = 'Benedetti' and language = 'lat'")
     229      * fulltext query (e.g. "quantitas")
     230    * docbases (optional, default: all document bases)
     231      * document bases (e.g. "mpdl", "archimedes-project")
     232    * orderBy (optional)
     233      * order query result by fieldname: e.g. "author" or "score" (fulltext queries)
     234    * resultPageNumber (optional, default: 1)
     235      * query result hits: page number
     236    * resultPageSize (optional, default: 100)
     237      * query result hits: page size
     238  * Response output:
     239    * query result (XML format)
     240
     241* !QueryDocumentServlet
     242  * URL: /mpdl/queryDoc
     243  * Request parameters:
     244    * docId
     245      * document identifier (e.g. "/echo/la/benedetti_1585.xml")
     246    * query
     247      * fulltext query (e.g. "quantitas")
     248      * morphological fulltext query (e.g. "quantitas")
     249    * resultPageNumber (optional, default: 1)
     250      * query result hits: page number
     251    * resultPageSize (optional, default: 100)
     252      * query result hits: page size
     253  * Response output:
     254    * query result (XML format)