public interface RLPConstants
Constants used in RLP. Constants whose names begin with REXML_ contain strings used in constructing REXML documents. Constants whose names begin with LICENSE_ contain values used when discovering the available licenses.
Field Summary | |
---|---|
static int |
ALTERNATIVE_LEMMAS
Integer and String vector: alternative LEMMA results. |
static int |
ALTERNATIVE_NORM
Integer and String vector: alternative NORMALIZED_TOKEN results. |
static int |
ALTERNATIVE_PARTS_OF_SPEECH
Integer and String vector: alternative PART_OF_SPEECH results. |
static int |
ALTERNATIVE_ROOTS
Integer and String vector: alternative ROOTS results. |
static int |
ALTERNATIVE_STEMS
Integer and String vector: alternative STEM results. |
static int |
ARBL_FEATURE_DEFINITE_ARTICLE
Package-private tag for internal notices. |
static int |
ARBL_FEATURE_STRIPPABLE_PREFIX
Package-private tag for internal notices. |
static int |
BASE_NOUN_PHRASE
Integer pair: index of first token and index + 1 of last token in the noun phrase. |
static int |
COMPOUND
Integer and string vector: Index to a token that represents a compound word and vector of components that make up the compound. |
static int |
DETECTED_ENCODING
Name of the character encoding. |
static int |
DETECTED_LANGUAGE
RLP-defined language code for the text. |
static int |
DETECTED_SCRIPT
Integer: ISO15924 code for the writing script. |
static int |
FLAGS
Not used. |
static int |
GAZETTEER_NAMES
Not used. |
static int |
HTML_DOC
For internal use. |
static int |
HTML_TABLES
For internal use. |
static int |
HTML_TAGS
For internal use. |
static int |
LANGUAGE_REGION
Six integers (five currently used) defining a language region: Raw-text offset for start of region; Raw-text offset + 1 for end of region; Level: 0 (not nested), 1 (a nested sentence), 2 (a nested phrase or fragment); Type: 0 (entire document), 1 (a text region), 2 (a window scan), 3 (a script region, not used), 4 (unknown); Script (currently not used); Language identifier. |
static int |
LEMMA
String: canonical form of a lexeme. |
static int |
LICENSE_FEATURE_BASE_LINGUISTICS
Numeric ID for the Base Linguistics license. |
static int |
LICENSE_FEATURE_NAMED_ENTITY
Numeric ID for the Named Entities license. |
static String |
LICENSE_NAMED_FEATURE_ANTF
String ID for the Arabic Name Translation Framework feature. |
static int |
LICENSE_NAMED_FEATURE_ANTF_FUNCTION_ENABLED
Numeric ID used to determine if the Arabic Name Translation Framework feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_ASKT
String ID for the Arabic Search Keyword Translator feature. |
static int |
LICENSE_NAMED_FEATURE_ASKT_FUNCTION_ENABLED
Numeric ID used to determine if the Arabic Search Keyword Translator feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_GNM
String ID for the Global Name Matcher feature. |
static int |
LICENSE_NAMED_FEATURE_GNM_FUNCTION_ENABLED
Numeric ID used to determine if the Global Name Matcher feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_HTMLSTRIPPER
String ID for the HTML Stripper feature. |
static int |
LICENSE_NAMED_FEATURE_HTMLSTRIPPER_FUNCTION_ENABLED
Numeric ID used to determine if the HTML Stripper feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_NAMEMATCHING
String ID for the Name Matching feature. |
static int |
LICENSE_NAMED_FEATURE_NAMEMATCHING_FUNCTION_ENABLED
Numeric ID used to determine if the Name Matching feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_RAA
String ID for the Rosette Arabic Term Analyzer feature. |
static int |
LICENSE_NAMED_FEATURE_RAA_FUNCTION_ENABLED
Numeric ID used to determine if the Rosette Arabic Term Analyzer feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_RDIF
String ID for the Rosette Document Ingestion Framework feature. |
static int |
LICENSE_NAMED_FEATURE_RDIF_FUNCTION_ENABLED
Numeric ID used to determine if the Rosette Document Ingestion Framework feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_RLBL
String ID for the Rosette Language Boundary Locater feature. |
static int |
LICENSE_NAMED_FEATURE_RLBL_FUNCTION_ENABLED
Numeric ID used to determine if the Rosette Language Boundary Locater feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_RLI
String ID for the Rosette Language Identifier feature. |
static int |
LICENSE_NAMED_FEATURE_RLI_FUNCTION_ENABLED
Numeric ID used to determine if the Rosette Language Identifier feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_RNC
String ID for the Rosette Name Cleaner feature. |
static int |
LICENSE_NAMED_FEATURE_RNC_FUNCTION_ENABLED
Numeric ID used to determine if the Rosette Name Cleaner feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_RNT
String ID for the Rosette Name Translator feature. |
static int |
LICENSE_NAMED_FEATURE_RNT_FUNCTION_ENABLED
Numeric ID used to determine if the Rosette Name Translator feature is licensed. |
static String |
LICENSE_NAMED_FEATURE_XA
String ID for the Translation Assistant feature. |
static int |
LICENSE_NAMED_FEATURE_XA_FUNCTION_ENABLED
Numeric ID used to determine if the Translation Assistant feature is licensed. |
static int |
MANY_TO_ONE_NORMALIZED_TOKEN
For each token, provides the many-to-one normalized token index and many-to-one normalized tokens. |
static int |
MAP_OFFSETS
Integer: Maps a character in normalized text to its location prior to normalization. |
static int |
MIME_TYPE
String: MIME type of input. |
static int |
MORPHO_TAGS_VECTOR
Package-private tag for internal notices. |
static int |
NAMED_ENTITY
Integer triple: Index of first token in the named entity; index + 1 of last token in the named entity; RLP-defined integer designating the type, optional subtype, and source (Named Entity Extractor, Regular Expressions Processor, or Gazetteer) of the entity. |
static int |
NORMALIZED_TOKEN
For each token, provides the token index and the alternative normalized tokens. |
static int |
PART_OF_SPEECH
String: Part of speech for the token. |
static int |
PATHNAME
For internal use. |
static int |
RAW_DATA
For internal use. |
static int |
RAW_TEXT
The input text. |
static int |
READING
Integer and string vector: Alternate readings (transcriptions) for a token. |
static String |
REXML_BASE_NP_ELEMENT_NAME
A single base noun phrase. |
static String |
REXML_BASE_NP_END_ELEMENT_NAME
The end index for a base noun phrase. |
static String |
REXML_BASE_NP_START_ELEMENT_NAME
The start index for a base noun phrase. |
static String |
REXML_BASE_NPS_ELEMENT_NAME
All the base noun phrases. |
static String |
REXML_COMPOUND_COMPONENT_ELEMENT_NAME
A single compound component. |
static String |
REXML_COMPOUND_COMPONENTS_ELEMENT_NAME
Compound components for a token. |
static String |
REXML_CONTENTS_ELEMENT_NAME
Contents element. |
static String |
REXML_DETECTED_ENCODING_ELEMENT_NAME
Detected encoding element. |
static String |
REXML_DOCUMENT_ELEMENT_NAME
Top element in a REXML document. |
static String |
REXML_END_ATTR_NAME
end= attribute. |
static String |
REXML_HEADER_ELEMENT_NAME
Header element. |
static String |
REXML_INDEX_ATTR_NAME
index= attribute for tokens. |
static String |
REXML_LANGUAGE_ELEMENT_NAME
Detected language element. |
static String |
REXML_MANY_TO_ONE_NORMALIZED_ELEMENT_NAME
Many-To-One Normalized token. |
static String |
REXML_NAMED_ENTITIES_ELEMENT_NAME
All the named entities. |
static String |
REXML_NAMED_ENTITY_ELEMENT_NAME
A single named entity. |
static String |
REXML_NAMED_ENTITY_END_ELEMENT_NAME
The end index for a named entity. |
static String |
REXML_NAMED_ENTITY_START_ELEMENT_NAME
The start token index for a named entity. |
static String |
REXML_NAMED_ENTITY_TYPE_ELEMENT_NAME
The type of the named entity as an integer. |
static String |
REXML_NAMED_ENTITY_TYPE_NAME_ELEMENT_NAME
The type of a named entity as a string. |
static String |
REXML_NAMESPACE_PREFIX
rexml:, the REXML namespace prefix. |
static String |
REXML_NAMESPACE_URI
Namespace URI for the rexml: namespace prefix. |
static String |
REXML_NORMALIZED_ELEMENT_NAME
Normalized token. |
static String |
REXML_POS_ELEMENT_NAME
Part Of Speech element. |
static String |
REXML_POSITION_ELEMENT_NAME
Position element for token positions. |
static String |
REXML_PUBLIC_ID
Public ID for REXML documents. |
static String |
REXML_READING_ELEMENT_NAME
A single reading. |
static String |
REXML_READINGS_ELEMENT_NAME
The readings for a token. |
static String |
REXML_ROOT_ELEMENT_NAME
A root. |
static String |
REXML_SENTENCE_ELEMENT_NAME
A single sentence. |
static String |
REXML_SENTENCE_END_ELEMENT_NAME
Sentence end. |
static String |
REXML_SENTENCE_START_ELEMENT_NAME
Sentence start. |
static String |
REXML_SENTENCES_ELEMENT_NAME
Sentence boundaries. |
static String |
REXML_START_ATTR_NAME
start= attribute. |
static String |
REXML_STEM_ELEMENT_NAME
A stemmed form. |
static String |
REXML_SYSTEM_ID
System ID for REXML documents. |
static String |
REXML_TOKEN_ELEMENT_NAME
Token element. |
static String |
REXML_TOKENS_ELEMENT_NAME
Tokens element. |
static String |
REXML_WORD_ELEMENT_NAME
Word element. |
static int |
ROOTS
For Semitic languages, the root for the stem or normalized token. |
static int |
SCRIPT_REGION
Integer triple: Raw-text offset for start of region; Raw-text offset + 1 for end of region; ISO15924 script identifier. |
static int |
SENTENCE_BOUNDARY
Integer: Index of last token + 1 for the sentence. |
static int |
STEM
The dictionary form for the token. |
static int |
STOPWORD
Integer: Index for a token that is a stopword. |
static int |
TEXT_BOUNDARIES
Integer: Raw-text offset + 1 of a sentence-level text boundary. |
static int |
TOKEN
String: An atomic element from the input text, such as word, number, multiword expression, possessive affix, or punctuation. |
static int |
TOKEN_OFFSET
Integer pair: Raw-text start and end + 1 offsets for the token. |
static int |
TOKEN_PREFIX_STEM_LENGTHS
Package-private tag for internal notices. |
static int |
TOKEN_SOURCE_ID
Integer: identifies the dictionary in which the token was found. |
static int |
TOKEN_SOURCE_NAME
Name of the dictionary identified by TOKEN_SOURCE_ID. |
static int |
TOKEN_VARIATIONS
Integer and string vector: Variant orthographic representations of a token (word). |
static int |
TRANSCRIBED_TEXT
Currently unused. |
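
Several of the result types above (BASE_NOUN_PHRASE, NAMED_ENTITY, SENTENCE_BOUNDARY) describe token ranges whose end value is the index of the last token + 1, that is, a half-open range. The following is a minimal sketch of how such values might be interpreted. It does not call RLP: the class `TokenSpans`, its methods, and the sample tokens and index values are all hypothetical, and how the values are actually retrieved from RLP is not shown here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class TokenSpans {
    /** Joins tokens[start, endExclusive) with single spaces. */
    public static String spanText(String[] tokens, int start, int endExclusive) {
        return String.join(" ", Arrays.copyOfRange(tokens, start, endExclusive));
    }

    /**
     * Splits tokens into sentences. Each boundary is assumed to be the
     * index of the last token + 1 for a sentence, so it also serves as
     * the start index of the next sentence.
     */
    public static List<String> sentences(String[] tokens, int[] boundaries) {
        List<String> out = new ArrayList<>();
        int start = 0;
        for (int end : boundaries) {
            out.add(spanText(tokens, start, end));
            start = end;
        }
        return out;
    }

    public static void main(String[] args) {
        // Hypothetical sample data, not output produced by RLP.
        String[] tokens = {"The", "old", "bridge", "closed", ".", "Traffic", "moved", "."};
        int[] nounPhrase = {0, 3}; // BASE_NOUN_PHRASE-style pair: first index, last index + 1
        int[] boundaries = {5, 8}; // SENTENCE_BOUNDARY-style values: last index + 1

        System.out.println(spanText(tokens, nounPhrase[0], nounPhrase[1]));
        System.out.println(sentences(tokens, boundaries));
    }
}
```

Because the end value is exclusive, the length of a range is simply end minus start, and consecutive sentences share a boundary value without overlapping.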
Field Detail |
---|
static final int TOKEN
static final int PART_OF_SPEECH
static final int SENTENCE_BOUNDARY
static final int BASE_NOUN_PHRASE
static final int DETECTED_LANGUAGE
See Also: LanguageCode, Constant Field Values
static final int DETECTED_ENCODING
static final int NAMED_ENTITY
static final int TOKEN_OFFSET
static final int STEM
static final int NORMALIZED_TOKEN
static final int COMPOUND
static final int READING
static final int SCRIPT_REGION
See Also: ISO15924, Constant Field Values
static final int LANGUAGE_REGION
See Also: DETECTED_LANGUAGE, Constant Field Values
static final int TEXT_BOUNDARIES
See Also: SENTENCE_BOUNDARY, Constant Field Values
static final int RAW_TEXT
static final int STOPWORD
static final int TRANSCRIBED_TEXT
static final int TOKEN_VARIATIONS
static final int GAZETTEER_NAMES
static final int ROOTS
static final int FLAGS
static final int HTML_TAGS
static final int HTML_DOC
static final int RAW_DATA
static final int PATHNAME
static final int MIME_TYPE
static final int HTML_TABLES
static final int MAP_OFFSETS
static final int DETECTED_SCRIPT
See Also: ISO15924, Constant Field Values
static final int TOKEN_SOURCE_ID
static final int TOKEN_SOURCE_NAME
static final int LEMMA
static final int ALTERNATIVE_NORM
See Also: NORMALIZED_TOKEN, Constant Field Values
static final int ALTERNATIVE_LEMMAS
See Also: LEMMA, Constant Field Values
static final int ALTERNATIVE_ROOTS
See Also: ROOTS, Constant Field Values
static final int ALTERNATIVE_STEMS
See Also: STEM, Constant Field Values
static final int ALTERNATIVE_PARTS_OF_SPEECH
See Also: PART_OF_SPEECH, Constant Field Values
static final int MANY_TO_ONE_NORMALIZED_TOKEN
static final int TOKEN_PREFIX_STEM_LENGTHS
static final int MORPHO_TAGS_VECTOR
static final int ARBL_FEATURE_DEFINITE_ARTICLE
static final int ARBL_FEATURE_STRIPPABLE_PREFIX
static final String REXML_PUBLIC_ID
static final String REXML_SYSTEM_ID
static final String REXML_NAMESPACE_URI
static final String REXML_NAMESPACE_PREFIX
static final String REXML_DOCUMENT_ELEMENT_NAME
static final String REXML_HEADER_ELEMENT_NAME
static final String REXML_LANGUAGE_ELEMENT_NAME
static final String REXML_DETECTED_ENCODING_ELEMENT_NAME
static final String REXML_CONTENTS_ELEMENT_NAME
static final String REXML_TOKENS_ELEMENT_NAME
static final String REXML_TOKEN_ELEMENT_NAME
static final String REXML_WORD_ELEMENT_NAME
static final String REXML_INDEX_ATTR_NAME
static final String REXML_POSITION_ELEMENT_NAME
static final String REXML_START_ATTR_NAME
static final String REXML_END_ATTR_NAME
static final String REXML_POS_ELEMENT_NAME
static final String REXML_NORMALIZED_ELEMENT_NAME
static final String REXML_MANY_TO_ONE_NORMALIZED_ELEMENT_NAME
static final String REXML_SENTENCES_ELEMENT_NAME
static final String REXML_SENTENCE_ELEMENT_NAME
static final String REXML_SENTENCE_START_ELEMENT_NAME
static final String REXML_SENTENCE_END_ELEMENT_NAME
static final String REXML_NAMED_ENTITIES_ELEMENT_NAME
static final String REXML_NAMED_ENTITY_ELEMENT_NAME
static final String REXML_NAMED_ENTITY_START_ELEMENT_NAME
static final String REXML_NAMED_ENTITY_END_ELEMENT_NAME
static final String REXML_NAMED_ENTITY_TYPE_ELEMENT_NAME
static final String REXML_NAMED_ENTITY_TYPE_NAME_ELEMENT_NAME
static final String REXML_COMPOUND_COMPONENTS_ELEMENT_NAME
static final String REXML_COMPOUND_COMPONENT_ELEMENT_NAME
static final String REXML_STEM_ELEMENT_NAME
static final String REXML_READINGS_ELEMENT_NAME
static final String REXML_READING_ELEMENT_NAME
static final String REXML_BASE_NPS_ELEMENT_NAME
static final String REXML_BASE_NP_ELEMENT_NAME
static final String REXML_BASE_NP_START_ELEMENT_NAME
static final String REXML_BASE_NP_END_ELEMENT_NAME
static final String REXML_ROOT_ELEMENT_NAME
static final int LICENSE_FEATURE_BASE_LINGUISTICS
static final int LICENSE_FEATURE_NAMED_ENTITY
static final String LICENSE_NAMED_FEATURE_ANTF
static final int LICENSE_NAMED_FEATURE_ANTF_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_ASKT
static final int LICENSE_NAMED_FEATURE_ASKT_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_GNM
static final int LICENSE_NAMED_FEATURE_GNM_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_HTMLSTRIPPER
static final int LICENSE_NAMED_FEATURE_HTMLSTRIPPER_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_RLBL
static final int LICENSE_NAMED_FEATURE_RLBL_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_NAMEMATCHING
static final int LICENSE_NAMED_FEATURE_NAMEMATCHING_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_RAA
static final int LICENSE_NAMED_FEATURE_RAA_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_RDIF
static final int LICENSE_NAMED_FEATURE_RDIF_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_RLI
static final int LICENSE_NAMED_FEATURE_RLI_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_RNC
static final int LICENSE_NAMED_FEATURE_RNC_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_RNT
static final int LICENSE_NAMED_FEATURE_RNT_FUNCTION_ENABLED
static final String LICENSE_NAMED_FEATURE_XA
static final int LICENSE_NAMED_FEATURE_XA_FUNCTION_ENABLED
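
The LANGUAGE_REGION summary describes a six-integer layout (start offset, end offset + 1, level, type, script, language identifier). As an illustration only, the sketch below decodes one such group into a readable description. It assumes the six values arrive as a flat `int[]` in the order listed in the summary. That representation, the class `LanguageRegionDemo`, and the sample values (including the language identifier 7) are hypothetical and are not taken from RLP.

```java
public class LanguageRegionDemo {
    /** Maps the level code to the wording used in the LANGUAGE_REGION summary. */
    public static String levelName(int level) {
        switch (level) {
            case 0: return "not nested";
            case 1: return "nested sentence";
            case 2: return "nested phrase or fragment";
            default: return "unrecognized level " + level;
        }
    }

    /** Maps the type code to the wording used in the LANGUAGE_REGION summary. */
    public static String typeName(int type) {
        switch (type) {
            case 0: return "entire document";
            case 1: return "text region";
            case 2: return "window scan";
            case 3: return "script region (not used)";
            case 4: return "unknown";
            default: return "unrecognized type " + type;
        }
    }

    /**
     * Describes one region. Assumed order: start, end + 1, level, type,
     * script (currently not used, so ignored here), language identifier.
     */
    public static String describe(int[] r) {
        if (r.length != 6) {
            throw new IllegalArgumentException("expected six integers, got " + r.length);
        }
        return "[" + r[0] + ", " + r[1] + ") level=" + levelName(r[2])
                + ", type=" + typeName(r[3]) + ", language id=" + r[5];
    }

    public static void main(String[] args) {
        // Hypothetical values, not output produced by RLP.
        int[] region = {0, 120, 0, 0, 0, 7};
        System.out.println(describe(region));
    }
}
```

The language identifier is left as a raw integer because the mapping to RLP-defined language codes is documented elsewhere (see LanguageCode).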