| 3 | | This projects is aimed to develop tools creating fulltext indices for documents ocred with Tesseract and Octropus. |
| | 3 | This projects is aimed to develop tools creating fulltext indices for documents ocred with Tesseract and Octropus. |
| | 4 | This tools cover the following features: |
| | 5 | |
| | 6 | * Integrating [http://archimedes.fas.harvard.edu/docs/donatus-api/ Donatus Language Technologies] for creating and searching a Lucene Index |
| | 7 | * Indexing ocropus generated documents, so the hits can be displaxed on the original image using [http://digilib.berlios.de/digilb digilib]. |
| | 8 | |