= MPIWG-MPDL Software Development: Language technology workshop (16 April 2009, at the [http://www.mpiwg-berlin.mpg.de MPIWG]) =

== Lectures ==

 * [http://www.ling.uni-potsdam.de/~stede/ Prof. Manfred Stede]: The MOTS document processing workbench
 * Peter Kolb: Text mining in a corpus of newspaper articles; Induction of morphological knowledge
 * Christian Chiarcos: The PAULA interchange format and the ANNIS database
 * [http://www.mpiwg-berlin.mpg.de/en/staff/members/hyman Dr. Malcolm Hyman]: Donatus: an architecture for linguistic middleware
 * [http://www.mpiwg-berlin.mpg.de/en/staff/members/jwillenborg Dr. Josef Willenborg]: Integration of Donatus with an eXist-based repository
 * [http://www.mpiwg-berlin.mpg.de/en/staff/members/dwinter Dirk Wintergrün]: OCRopus + Donatus for large-scale digitized page image collections