C G I M N R S T V

C

close() - Method in class com.basistech.rlp.lucene.RLPTokenizer
For internal use.
com.basistech.rlp.extra - package com.basistech.rlp.extra
This package currently contains just one class that augments RLP.
com.basistech.rlp.lucene - package com.basistech.rlp.lucene
This package contains a Tokenizer, Analyzers, and other classes for the integration of RLP with Lucene and Solr.
com.basistech.rlp.solr - package com.basistech.rlp.solr
This package contains Solr factory classes for integration of RLP with Solr.
create(TokenStream) - Method in class com.basistech.rlp.solr.RLPPOSFilterFactory
Creates a POS (part-of-speech) filter.
create(Reader) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Creates a token stream using RLPTokenizer.
createAnalyzer(String) - Static method in class com.basistech.rlp.lucene.RLPAnalyzerDispatcher
Calls the default constructor of the Analyzer named RLPXxAnalyzer for the given language code xx.

G

getBTRootDirectory() - Static method in class com.basistech.rlp.extra.RLPEnvironmentHolder
A convenient method to return the product root directory, used to initialize the RLPEnvironment object.
getDefaultAllowedPOSTags() - Static method in class com.basistech.rlp.lucene.RLPEnAnalyzer
Gets the array of part-of-speech (POS) tags that is assumed when constructor without such argument is used.
getDefaultAllowedPOSTags() - Static method in class com.basistech.rlp.lucene.RLPJaAnalyzer
Gets the array of part-of-speech (POS) tags that is assumed when constructor without such argument is used.
getDefaultContextDefinition() - Static method in class com.basistech.rlp.lucene.RLPAnalyzer
Gets the RLP Context Definition string that is assumed when a constructor that doesn't take one is used.
getDefaultContextDefinition() - Static method in class com.basistech.rlp.lucene.RLPEnAnalyzer
Gets the default context definition, which only contains the BL1 LP.
getDefaultContextDefinition() - Static method in class com.basistech.rlp.lucene.RLPJaAnalyzer
Gets the context definition that is assumed when a constructor without such argument is used.
getDefaultPostTypes() - Static method in class com.basistech.rlp.lucene.RLPAnalyzer
Gets the set of post types that is assumed when a constructor that doesn't take a post types argument is used.
getDefaultPostTypes() - Static method in class com.basistech.rlp.lucene.RLPEnAnalyzer
Gets the set of post types that is assumed when a constructor without such argument is used.
getDefaultPostTypes() - Static method in class com.basistech.rlp.lucene.RLPJaAnalyzer
Gets the set of post types that is assumed when a constructor without such argument is used.
getDetectedLanguage() - Method in class com.basistech.rlp.lucene.RLPAnalyzer
Returns the language detected by RLI, if it is enabled.
getDetectedLanguage() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Return the detected language by RLI, if it is enabled.
getEnvironment() - Static method in class com.basistech.rlp.extra.RLPEnvironmentHolder
Returns the one and only RLPEnvironment object.
getLanguage() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Returns the language previously set by RLPTokenizer.setLanguage(LanguageCode).
getLanguage() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Retrieves the language code that has been set (LanguageCode.UNKNOWN if it hasn't been set).
getPostTypes() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Bulk getter of types used to generate tokens.
getRLPContextDef() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Retrieves the path to the RLP XML context definition file or string.

I

inform(ResourceLoader) - Method in class com.basistech.rlp.solr.RLPPOSFilterFactory
Specifies the list of allowed POS tags.
inform(ResourceLoader) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
An implementation of ResourceLoaderAware#inform(ResourceLoader) as required by the interface.
isPostCompoundComponents() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether component tokens are generated for each compound word (German, Dutch, Hungarian, Chinese, Japanese, and Korean).
isPostCompoundComponents() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether component tokens are generated for each compound word (German, Dutch, Hungarian, Chinese, Japanese, and Korean).
isPostLemma() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether a token is generated for each lemma (Arabic only).
isPostLemma() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether a token is generated for each lemma (Arabic only).
isPostM1NormalizedToken() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether a token is generated for each many-to-one normalized word (Japanese, and other languages).
isPostM1NormalizedToken() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether a token is generated for each many-to-one normalized word (Japanese, and other languages).
isPostNormalizedToken() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether a token is generated for each normalized word (Arabic only).
isPostNormalizedToken() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether a token is generated for each normalized word (Arabic only).
isPostPartOfSpeech() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether a part-of-speech (POS) is stored in each Token's Payload field.
isPostPartOfSpeech() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether a part-of-speech (POS) is stored in each Token's Payload field.
isPostReadings() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether reading Tokens are generated for each word that the language analyzer can predict (Chinese and Japanese).
isPostReadings() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether reading Tokens are generated for each word that the language analyzer can predict (Chinese and Japanese).
isPostRoot() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether a token is generated for each root (Arabic only).
isPostRoot() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether a token is generated for each root (Arabic only).
isPostStem() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether a token is generated for each stem.
isPostStem() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether a token is generated for each stem.
isPostWord() - Method in class com.basistech.rlp.lucene.RLPTokenizer
Determines whether a token is being generated for each word in its original form in the text
isPostWord() - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Determines whether a token is being generated for each word in its original form in the text

M

main(String[]) - Static method in class com.basistech.rlp.lucene.RLPEnAnalyzer
(Internal use only) Tokenizes an English sentence and displays the results.
main(String[]) - Static method in class com.basistech.rlp.lucene.RLPJaAnalyzer
(Internal use only) Tokenizes a Japanese sentence, and displays the results.

N

next(Token) - Method in class com.basistech.rlp.lucene.RLPPOSFilter
Iterates to the next token until a token whose part-of-speech tag is in the list, or the tokens are exhausted.
next(Token) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Returns the next token in the token stream.

R

reset(Reader) - Method in class com.basistech.rlp.lucene.RLPTokenizer
For internal use only.
reset() - Method in class com.basistech.rlp.lucene.RLPTokenizer
For internal use only.
RLPAnalyzer - Class in com.basistech.rlp.lucene
A generic (language-neutral) Analyzer that uses RLPTokenizer.
RLPAnalyzer(LanguageCode, String, EnumSet<RLPTokenizer.PostType>, String[]) - Constructor for class com.basistech.rlp.lucene.RLPAnalyzer
The most flexible constructor takes 4 arguments.
RLPAnalyzer(LanguageCode, String, EnumSet<RLPTokenizer.PostType>) - Constructor for class com.basistech.rlp.lucene.RLPAnalyzer
When this constructor is used, no POS filter will be used.
RLPAnalyzer(LanguageCode, String) - Constructor for class com.basistech.rlp.lucene.RLPAnalyzer
Same as the three-parameter constructor except that Tokens derived from all the available result types will be generated without POS tags in their payload.
RLPAnalyzer(LanguageCode) - Constructor for class com.basistech.rlp.lucene.RLPAnalyzer
Same as the two-parameter constructor except that an RLP Context that includes RCLU and the base linguistic LPs for all supported languages will be used.
RLPAnalyzer() - Constructor for class com.basistech.rlp.lucene.RLPAnalyzer
This is equivalent of RLPAnalyzer(LanguageCode#UNKNOWN).
RLPAnalyzerDispatcher - Class in com.basistech.rlp.lucene
Utility class to create an appropriate RLP based Analyzer for the given language.
RLPEnAnalyzer - Class in com.basistech.rlp.lucene
An Analyzer for English that uses RLP.
RLPEnAnalyzer(String, EnumSet<RLPTokenizer.PostType>, String[]) - Constructor for class com.basistech.rlp.lucene.RLPEnAnalyzer
This constructor does not use default values.
RLPEnAnalyzer(String, EnumSet<RLPTokenizer.PostType>) - Constructor for class com.basistech.rlp.lucene.RLPEnAnalyzer
This constructor uses the part-of-speech filter with the default part-of-speech tag set.
RLPEnAnalyzer(String) - Constructor for class com.basistech.rlp.lucene.RLPEnAnalyzer
This constructor uses default set of the post types, which are STEM (which is actually a lemma) POS (part-of-speech in Token's payload field)
RLPEnAnalyzer() - Constructor for class com.basistech.rlp.lucene.RLPEnAnalyzer
This default constructor uses a default RLP Context, which only includes BL1 LP, along with the default set of post types, and the default POS tags for English processing.
RLPEnvironmentHolder - Class in com.basistech.rlp.extra
Creates and manages the one and only instance of RLPEnvironment.
RLPJaAnalyzer - Class in com.basistech.rlp.lucene
An Analyzer for Japanese that uses RLP.
RLPJaAnalyzer(String, EnumSet<RLPTokenizer.PostType>, String[]) - Constructor for class com.basistech.rlp.lucene.RLPJaAnalyzer
This constructor does not use default values.
RLPJaAnalyzer(String, EnumSet<RLPTokenizer.PostType>) - Constructor for class com.basistech.rlp.lucene.RLPJaAnalyzer
This constructor uses the part-of-speech filter with the default part-of-speech tag set.
RLPJaAnalyzer(String) - Constructor for class com.basistech.rlp.lucene.RLPJaAnalyzer
This constructor uses default set of the post types, which are STEM (which is actually a lemma) COMP (compound word decomposition) POS (part-of-speech in Token's payload field) Note that the many-to-one normalizer (used to be called JON) output and readings are not used.
RLPJaAnalyzer() - Constructor for class com.basistech.rlp.lucene.RLPJaAnalyzer
This default constructor uses the default RLP Context which includes RCLU and JLA LPs.
RLPLuceneException - Exception in com.basistech.rlp.lucene
A RuntimeException from the RLP Lucene integration classes.
RLPLuceneException(String) - Constructor for exception com.basistech.rlp.lucene.RLPLuceneException
Constructs a new runtime exception with the specified detail message.
RLPLuceneException(Throwable) - Constructor for exception com.basistech.rlp.lucene.RLPLuceneException
Constructs a new runtime exception with the specified cause and a detail message (which typically contains the class and detail message of cause).
RLPLuceneException(String, Throwable) - Constructor for exception com.basistech.rlp.lucene.RLPLuceneException
Constructs a new runtime exception with the specified detail message and cause.
RLPPOSFilter - Class in com.basistech.rlp.lucene
This Filter removes tokens based on the POS (part-of-speech) tags that are stored in each token's payload by RLPTokenizer.
RLPPOSFilter(TokenStream, String[]) - Constructor for class com.basistech.rlp.lucene.RLPPOSFilter
Constructor that takes the allowed POS list as an array of Strings.
RLPPOSFilterFactory - Class in com.basistech.rlp.solr
Factory for the POS (part-of-speech) Filter that passes only tokens with predetermined POS tags.
RLPPOSFilterFactory() - Constructor for class com.basistech.rlp.solr.RLPPOSFilterFactory
 
RLPSolrException - Class in com.basistech.rlp.solr
A SolrException from RLP For Lucene factory classes.
RLPSolrException(SolrException.ErrorCode, String) - Constructor for class com.basistech.rlp.solr.RLPSolrException
Constructs a new runtime exception with the error code and specified detail message.
RLPSolrException(SolrException.ErrorCode, Throwable) - Constructor for class com.basistech.rlp.solr.RLPSolrException
Constructs a new runtime exception with the error code and the cause.
RLPTokenizer - Class in com.basistech.rlp.lucene
A Tokenizer that uses RLP Basis Linguistic language processors, such as BL1, CLA, JLA, KLA, and ARBL.
RLPTokenizer(Reader, LanguageCode, String, EnumSet<RLPTokenizer.PostType>) - Constructor for class com.basistech.rlp.lucene.RLPTokenizer
This is the only constructor.
RLPTokenizer.PostType - Enum in com.basistech.rlp.lucene
This enum is used to specify a set of result types that this Tokenizer will use, in the form of EnumSet<PostType>.
RLPTokenizerFactory - Class in com.basistech.rlp.solr
Solr tokenizer factory for RLPTokenizer.
RLPTokenizerFactory() - Constructor for class com.basistech.rlp.solr.RLPTokenizerFactory
 

S

setLanguage(LanguageCode) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Set the language of the text.
setLanguage(LanguageCode) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Designates the language of the text to be processed.
setPostCompoundComponents(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether component tokens are generated for each compound word.
setPostCompoundComponents(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether component tokens are generated for each compound word.
setPostLemma(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether a token is generated for each lemma (Arabic only).
setPostLemma(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether a token is generated for each lemma (Arabic only).
setPostM1NormalizedToken(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether a token is generated for each many-to-one normalized word (Japanese, and other languages).
setPostM1NormalizedToken(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether a token is generated for each many-to-one normalized word (Japanese, and other languages).
setPostNormalizedToken(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether a token is generated for each normalized word (Arabic only).
setPostNormalizedToken(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether a token is generated for each normalized word (Arabic only).
setPostPartOfSpeech(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether a part-of-speech (POS) is stored in each Token's Payload field.
setPostPartOfSpeech(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether a part-of-speech (POS) is stored in each Token's Payload field.
setPostReadings(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether reading Tokens are generated for each word that the language analyzer can predict (Chinese and Japanese).
setPostReadings(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether reading Tokens are generated for each word that the language analyzer can predict (Chinese and Japanese).
setPostRoot(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether a token is generated for each root (Arabic only).
setPostRoot(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether a token is generated for each root (Arabic only).
setPostStem(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether a token is generated for each stem.
setPostStem(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether a token is generated for each stem.
setPostTypes(EnumSet<RLPTokenizer.PostType>) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Bulk setter of token-generating features.
setPostWord(boolean) - Method in class com.basistech.rlp.lucene.RLPTokenizer
Specifies whether to generate a token for each of the original words in the text.
setPostWord(boolean) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Specifies whether to generate a token for each of the original words in the text.
setRLPContextDef(String) - Method in class com.basistech.rlp.solr.RLPTokenizerFactory
Sets the path to the RLP XML context definition file or string must start with "<").

T

TOKEN_TYPE_STRING_COMPOUND_COMPONENT - Static variable in class com.basistech.rlp.lucene.RLPTokenizer
The string that will be found in the type field of tokens that correspond to RLPConstants#COMPOUND.
TOKEN_TYPE_STRING_LEMMA - Static variable in class com.basistech.rlp.lucene.RLPTokenizer
The string that will be found in the type field of tokens that correspond to RLPConstants#LEMMA.
TOKEN_TYPE_STRING_MANY_TO_ONE_NORMALIZED_TOKEN - Static variable in class com.basistech.rlp.lucene.RLPTokenizer
The string that will be found in the type field of tokens that correspond to RLPConstants#MANY_TO_ONE_NORMALIZED_TOKEN.
TOKEN_TYPE_STRING_NORMALIZED_TOKEN - Static variable in class com.basistech.rlp.lucene.RLPTokenizer
The string that will be found in the type field of tokens that correspond to RLPConstants#NORMALIZED_TOKEN.
TOKEN_TYPE_STRING_READING - Static variable in class com.basistech.rlp.lucene.RLPTokenizer
The string that will be found in the type field of tokens that correspond to RLPConstants#READING.
TOKEN_TYPE_STRING_ROOT - Static variable in class com.basistech.rlp.lucene.RLPTokenizer
The string that will be found in the type field of tokens that correspond to RLPConstants#ROOTS.
TOKEN_TYPE_STRING_STEM - Static variable in class com.basistech.rlp.lucene.RLPTokenizer
The string that will be found in the type field of tokens that correspond to RLPConstants#STEM.
TOKEN_TYPE_STRING_WORD - Static variable in class com.basistech.rlp.lucene.RLPTokenizer
The string that will be found in the type field of tokens that correspond to RLPConstants#TOKEN.
tokenStream(String, Reader) - Method in class com.basistech.rlp.lucene.RLPAnalyzer
An implementation of Analyzer#tokenStream(String, Reader).

V

valueOf(String) - Static method in enum com.basistech.rlp.lucene.RLPTokenizer.PostType
Returns the enum constant of this type with the specified name.
values() - Static method in enum com.basistech.rlp.lucene.RLPTokenizer.PostType
Returns an array containing the constants of this enum type, in the order they're declared.

C G I M N R S T V

Copyright © 2007-2008 Basis Technology Corporation. All Rights Reserved.