version 1.18, 2004/06/01 13:07:27
|
version 1.20, 2004/08/16 22:34:04
|
Line 185 No parameter--update all lemmatization i
|
Line 185 No parameter--update all lemmatization i
|
\paragraph{makefast.pl ARCHIMEDES} |
\paragraph{makefast.pl ARCHIMEDES} |
|
|
Updates the toc.cgi morphology indices |
Updates the toc.cgi morphology indices |
Parameters |
Parameters: |
No parameter--update all lemmatization indices |
No parameter--update all lemmatization indices |
[latin | ital | greek | en | nl | de]-- update this language |
[latin | ital | greek | en | nl | de]-- update this language |
|
|
\subsubsection{summary of differences btwn the archimedes toc.cgi |
The indices are produced from the corpus word index 'xml:raw:norm', |
implementation and the echo toc.cgi impelementation (toc.x.cgi)} |
which correlates raw forms to normalized forms, and |
|
'\$lang:inc_lemma', which correlates incidentia to lemmata. The basic |
|
rule is, if exists \$raw->\$norm->\$inc_lemma, then \$raw is included |
|
in the 'fast' index for that language. |
|
|
|
Currently stores the indices with the name xml:hit:\$lang, where |
|
\$lang is one of [ital,greek,latin,de,en,fr,nl] in the directory |
|
/usr/share/perlobjects/wordindex in Archim::Object::Depot format |
|
(Storable). Access to these indices is provided by |
|
Archim::Toc::Utils->get_hits_hash(\$lang) . |
|
|
|
The functionality of makefast.pl is duplicated by Archim::Toc::Index->make_fast_lemma(\$lang); |
|
|
|
|
|
\subsubsection{summary of differences btwn the archimedes toc.cgi implementation and the echo toc.cgi impelementation (toc.x.cgi)} |
|
|
\paragraph{missing in archimedes} |
\paragraph{missing in archimedes} |
\begin{enumerate} |
\begin{enumerate} |
Line 205 No parameter--update all lemmatization i
|
Line 219 No parameter--update all lemmatization i
|
\item remote text method may work differently |
\item remote text method may work differently |
|
|
|
|
|
|
\end{enumerate} |
\end{enumerate} |
\paragraph{differences} |
\paragraph{differences} |
\begin{enumerate} |
\begin{enumerate} |