3 | | This projects is aimed to develop tools creating fulltext indices for documents ocred with Tesseract and Octropus. |
| 3 | This projects is aimed to develop tools creating fulltext indices for documents ocred with Tesseract and Octropus. |
| 4 | This tools cover the following features: |
| 5 | |
| 6 | * Integrating [http://archimedes.fas.harvard.edu/docs/donatus-api/ Donatus Language Technologies] for creating and searching a Lucene Index |
| 7 | * Indexing ocropus generated documents, so the hits can be displaxed on the original image using [http://digilib.berlios.de/digilb digilib]. |
| 8 | |