org.apache.uima.java true org.apache.uima.example.opennlp.annotator.Tokenizer OpenNLPTokenizer Tokenize the text and create token annotations that span the tokens. The tokenization is performed using the OpenNLP MaxEnt tokenizer, which tokenizes according to the Penn Tree Bank tokenization standard. In general, tokens are separated by white space, but punctuation marks (e.g., ".", ",", "!", "?", etc.) and apostrophed endings (e.g., "'s", "'nt", etc.) are separate tokens. 1.0 The Apache Software Foundation ModelFile OpenNLP MaxEnt model file for the tokenizer. String false true ModelFile opennlp-tools-1.3.0/tokenize/EnglishTok.bin org.opennlp.Sentence org.opennlp.Token true true