Package opennlp.tools.util.featuregen
Class WordClusterFeatureGenerator
java.lang.Object
opennlp.tools.util.featuregen.WordClusterFeatureGenerator
- All Implemented Interfaces:
AdaptiveFeatureGenerator
An
AdaptiveFeatureGenerator implementation of a word cluster feature generator.
It is based on a pre-defined WordClusterDictionary.-
Constructor Summary
ConstructorsConstructorDescriptionWordClusterFeatureGenerator(WordClusterDictionary dict, String dictResourceKey, boolean lowerCaseDictionary) Instantiates aWordClusterFeatureGeneratorvia a specifiedWordClusterDictionary. -
Method Summary
Modifier and TypeMethodDescriptionvoidcreateFeatures(List<String> features, String[] tokens, int index, String[] previousOutcomes) Adds the appropriate features for the token at the specifiedindexwith the specified array ofpreviousOutcomesto the specified list of features.Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface opennlp.tools.util.featuregen.AdaptiveFeatureGenerator
clearAdaptiveData, updateAdaptiveData
-
Constructor Details
-
WordClusterFeatureGenerator
public WordClusterFeatureGenerator(WordClusterDictionary dict, String dictResourceKey, boolean lowerCaseDictionary) Instantiates aWordClusterFeatureGeneratorvia a specifiedWordClusterDictionary.- Parameters:
dict- The tokendictionaryto use.dictResourceKey- The prefix to use for detected features. Typically, the value for this prefix should be"dict".lowerCaseDictionary-trueif tokens will be lower-cased during dictionary lookup,falseotherwise.
-
-
Method Details
-
createFeatures
public void createFeatures(List<String> features, String[] tokens, int index, String[] previousOutcomes) Description copied from interface:AdaptiveFeatureGeneratorAdds the appropriate features for the token at the specifiedindexwith the specified array ofpreviousOutcomesto the specified list of features.- Specified by:
createFeaturesin interfaceAdaptiveFeatureGenerator- Parameters:
features- The list of features to be added to.tokens- The tokens of the sentence or other text unit being processed.index- The index of the token which is currently being processed.previousOutcomes- The outcomes for the tokens prior to the specified index.
-