org.apache.uima.java
true
org.apache.uima.example.opennlp.annotator.Tokenizer
OpenNLPTokenizer
Tokenize the text and create token annotations that span the tokens. The tokenization is performed using the OpenNLP MaxEnt tokenizer, which tokenizes according to the Penn Tree Bank tokenization standard. In general, tokens are separated by white space, but punctuation marks (e.g., ".", ",", "!", "?", etc.) and apostrophed endings (e.g., "'s", "'nt", etc.) are separate tokens.
1.0
The Apache Software Foundation
ModelFile
OpenNLP MaxEnt model file for the tokenizer.
String
false
true
ModelFile
opennlp-tools-1.3.0/tokenize/EnglishTok.bin
org.opennlp.Sentence
org.opennlp.Token
true
true