wiki:mpdl2.0-design

Version 1 (modified by jwillenborg, 13 years ago) (diff)

--

MPDL 2.0

The MPDL 1.0 software is tightly coupled with the XML system eXist. The next MPDL release 2.0 will be redesigned so that many functions (language technology, some basic XML functions) are usable as open services independently of the eXist software. The new functions will be designed in a layer architecture so that they could be used in different workflows and in a more standardized way (API and XML standard output format). All main functions are available as servlets and are fully implemented in Java.

Language technology

Word recognition

Input:

  • text (URL)
    • unstructured text
    • XML fragment/document
  • language (ISO 639-3 specifier)

Output

  • list of word tokens
    • words seperated by a blank
    • XML format

Morphology

Dictionary

XML functions

XPath/XQuery

get fragment