= Test of RLP = == RLP == * RLP: version 6.5.2 (platform dependant) * RLP-Lucene: version 6.0.0 (Java library: platform independant) == Document base == * 113 documents, sized each 1 KB - 18 MB * languages: latin, italian, english, german, french, dutch, greek, arabic, chinese == Hardware, operating system == * Mac Pro, Dual Core Intel Xeon 2,66 Ghz, 4GB RAM * MacOS 10.5.4 == Indexing == * done on [http://exist-db.org/ eXist with Lucene (eXist 1.3dev)] * needed 1,3 hours (83 minutes) * took most of the time full processor time (100%) * less RAM consumption (< 500 MB) == Result == * application: see [http://xserve07.mpiwg-berlin.mpg.de:30010/mpdl/query.xql MPDL prototype with RLP analyzer (access only within MPIWG network)] * do a morphological index lookup in a document, e.g. [http://xserve07.mpiwg-berlin.mpg.de:30010/mpdl/page-query-result.xql?document=/archimedes/la/delfi_fluxu_024_la_1559.xml&pn=1&mode=text&query-type=ftIndexMorph&query=a&query-result-pn=1 for "a" in Delfino, Federico. De fluxu et refluxu aquae maris. Venice, 1559] == Quality of indexing (random sample) == * base form reduction * latin: x of y not ok * english: * ... * overall: * in relation to Donatus/Snowball: * orthographic normalization: == Other observations == * double entries: same word forms seems to lead to different base forms (e.g. )