texttool-architecture/soft-cgi.tex - diff

Return to soft-cgi.tex CVS log

Up to [Repository] / texttool-architecture

Diff for /texttool-architecture/soft-cgi.tex between versions 1.15 and 1.20

version 1.15, 2004/01/18 22:01:10	version 1.20, 2004/08/16 22:34:04
Line 23 submitted metadata file.	Line 23 submitted metadata file.

The input metadata file must have the following form	The input metadata file must have the following form

\paragraph	\begin{verbatim}

<resource>	<resource>
...	...
<meta>	<meta>
Line 40 The input metadata file must have the fo	Line 39 The input metadata file must have the fo
<pagebreak>pb</pagebreak><presentation>01-presentation/info.xml</presentation></texttool></meta>	<pagebreak>pb</pagebreak><presentation>01-presentation/info.xml</presentation></texttool></meta>

</meta>	</meta>
	\end{verbatim}

\paragraph{archimedes object registration}	\paragraph{archimedes object registration}

Line 48 The input metadata file must have the fo	Line 48 The input metadata file must have the fo

\paragraph{plan of this section }	\paragraph{plan of this section }

\begin{enumeration}	\begin{enumerate}
\item An overview of toc.cgi architecture	\item An overview of toc.cgi architecture
\item A walk-through of typical cgi queries for toc.cgi	\item A walk-through of typical cgi queries for toc.cgi
\item An index of cgi parameters and values with short descriptions of function	\item An index of cgi parameters and values with short descriptions of function
\end{enumeration}	\item The TOC Perl modules
	\end{enumerate}

\paragraph{Overview of toc.cgi architecture}	\paragraph{Overview of toc.cgi architecture}

Line 87 xml format at all.	Line 88 xml format at all.

\paragraph{cgi parameters -- standard queries}	\paragraph{cgi parameters -- standard queries}

\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=corpus }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=corpus }	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=corpus }
\newline	\newline
\newline	\newline
get a listing of corpora	get a listing of corpora


\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpusmanifest }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpusmanifest }	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpusmanifest }
\newline	\newline
\newline	\newline
get an xml listing of corpora	get an xml listing of corpora

	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi }
\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi }
\newline	\newline
\newline	\newline
get a listing of works in default corpus	get a listing of works in default corpus

\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?corpus=1 }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?corpus=1 }	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?corpus=1 }
\newline	\newline
\newline	\newline
get a listing of works in corpus 1 [default corpus = 0]	get a listing of works in corpus 1 [default corpus = 0]

\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpuslist }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpuslist }	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpuslist }
\newline	\newline
\newline	\newline
get an xml listing of works in default corpus	get an xml listing of works in default corpus

\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpuslist;corpus=1 }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpuslist;corpus=1 }	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?step=xmlcorpuslist;corpus=1 }
\newline	\newline
\newline	\newline
get an xml listing of works in corpus 1	get an xml listing of works in corpus 1

\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=baifl_renav_006_la_1537;step=thumb }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=baifl_renav_006_la_1537;step=thumb }	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=baifl_renav_006_la_1537;step=thumb }
\newline	\newline
\newline	\newline
get a work from default corpus with thumbnail navbar displayed left	get a work from default corpus with thumbnail navbar displayed left


\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=jorda_ponde_050_la_1533;step=thumb;ftype=thumbright }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=jorda_ponde_050_la_1533;step=thumb;ftype=thumbright }	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=jorda_ponde_050_la_1533;step=thumb;ftype=thumbright }
\newline	\newline
\newline	\newline
get a work from default corpus with thumbnail navbar displayed right	get a work from default corpus with thumbnail navbar displayed right

\htmladdnormallink{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=jorda_ponde_050_la_1533;step=textonly;corpus=;page=22 }{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=jorda_ponde_050_la_1533;step=textonly;corpus=;page=22 }	\url{ http://archimedes.mpiwg-berlin.mpg.de/cgi-bin/toc/toc.cgi?dir=jorda_ponde_050_la_1533;step=textonly;corpus=;page=22 }
\newline	\newline
\newline	\newline
get a page of text from a work from default corpus	get a page of text from a work from default corpus


	\paragraph{TOC Perl Modules}
	\subparagraph{general}The documentation for the Toc Perl Modules is
	located in the modules themselves in POD format. The POD is the
	definitive documentation for the modules.

	The modules are available to archimedes staff from cvs on the archimedes server at
	141.14.236.86:/perseus/cvsroot in the module
	/perseus/cvsroot/mpitexts/perl/perllib. To get them, log on to the
	archimedes server and use the commandline command:
	\begin{verbatim}
	cvs -d /perseus/cvsroot co /perseus/cvsroot/mpitexts/perl/perllib
	\end{verbatim}

	or from a remote location

	\begin{verbatim}
	bash; export CVS_RSH=ssh; cvs -d :ext:myusername@141.14.236.86:/perseus/cvsroot co /perseus/cvsroot/mpitexts/perl/perllib
	\end{verbatim}

	\input{soft-search}
\subsubsection{Indexing}
\label{sec:indexing}

\paragraph{Status quo ECHO}
Currently indexing is not implemented on the ECHO server.

\paragraph{Plan ECHO}

\begin{enumeration}
\item construct remote (141.14.236.86) index for each file at
per-change or daily intervals
\item store indices locally in
archimedes/data/db/PROJECT_NAME/CORPUS_NAME/WORK
\item 2 progs on server 1. cgi: indexer 2. backend da_remote
\item 2 progs on client 1. cgi: sendindex 2. backend getindex
\item indexing transaction handled by two cgi scripts, one on the
server the other on the client [this is the 1st implementation bcs
its easiest and there are no port issues, but probably it'd be
better to have a separate port].
\item client cgi: getindex -- sends 1. list of files to index
2. uri to which xml notification of completion is to be sent. Upon
notification, activates backend prog that fetches and installs the
indices.
\item server cgi: indexer receives filelist and notification
addess. Activates backend that fetches files, indexes, places
completed indexes in a networked location, then sends xml
notification back to client.
\item single script provides backend access to indices
\item leave front-end issues like display, collection and navigation
to web-design programmers. Do only a sample for now.
\end{enumeration}

\subsubsection{Morphology}	\subsubsection{Morphology}
\label{sec:morphology}	\label{sec:morphology}
Line 198 No parameter--update all lemmatization i	Line 185 No parameter--update all lemmatization i
\paragraph{makefast.pl ARCHIMEDES}	\paragraph{makefast.pl ARCHIMEDES}

Updates the toc.cgi morphology indices	Updates the toc.cgi morphology indices
Parameters	Parameters:
No parameter--update all lemmatization indices	No parameter--update all lemmatization indices
[latin \| ital \| greek \| en \| nl \| de]-- update this language	[latin \| ital \| greek \| en \| nl \| de]-- update this language

\subsubsection{summary of differences btwn the archimedes toc.cgi	The indices are produced from the corpus word index 'xml:raw:norm',
implementation and the echo toc.cgi impelementation (toc.x.cgi)}	which correlates raw forms to normalized forms, and
	'\$lang:inc_lemma', which correlates incidentia to lemmata. The basic
	rule is, if exists \$raw->\$norm->\$inc_lemma, then \$raw is included
	in the 'fast' index for that language.

	Currently stores the indices with the name xml:hit:\$lang, where
	\$lang is one of [ital,greek,latin,de,en,fr,nl] in the directory
	/usr/share/perlobjects/wordindex in Archim::Object::Depot format
	(Storable). Access to these indices is provided by
	Archim::Toc::Utils->get_hits_hash(\$lang) .

	The functionality of makefast.pl is duplicated by Archim::Toc::Index->make_fast_lemma(\$lang);


	\subsubsection{summary of differences btwn the archimedes toc.cgi implementation and the echo toc.cgi impelementation (toc.x.cgi)}

\paragraph{missing in archimedes}	\paragraph{missing in archimedes}
\begin{enumeration}	\begin{enumerate}

\item html templates (coded but phased out of cvs branch)	\item html templates (coded but phased out of cvs branch)
\end{enumeration}	\end{enumerate}

\paragraph{missing in echo}	\paragraph{missing in echo}
\begin{enumeration}	\begin{enumerate}

\item word-coloring?	\item word-coloring?
\item remote text method may work differently	\item remote text method may work differently


	\end{enumerate}
\end{enumeration}
\paragraph{differences}	\paragraph{differences}
\begin{enumeration}	\begin{enumerate}
\item structure of info.xml	\item structure of info.xml
\item resource-discovery algorithm for info.xml	\item resource-discovery algorithm for info.xml
\end{enumeration}	\end{enumerate}

FreeBSD-CVSweb <freebsd-cvsweb@FreeBSD.org>

Removed from v.1.15
changed lines
	Added in v.1.20