Package | Description |
---|---|
org.apache.lucene.analysis | API and code to convert text into indexable/searchable tokens. |
org.apache.lucene.analysis.standard | A fast grammar-based tokenizer constructed with JFlex. |
Modifier and Type | Class and Description |
---|---|
class | CachingTokenFilter: This class can be used if the Tokens of a TokenStream are intended to be consumed more than once. |
class | ISOLatin1AccentFilter: A filter that replaces accented characters in the ISO Latin 1 character set (ISO-8859-1) by their unaccented equivalent. |
class | LengthFilter: Removes words that are too long or too short from the stream. |
class | LowerCaseFilter: Normalizes token text to lower case. |
class | PorterStemFilter: Transforms the token stream as per the Porter stemming algorithm. |
class | StopFilter: Removes stop words from a token stream. |
class | TeeTokenFilter: Works in conjunction with the SinkTokenizer to provide the ability to set aside tokens that have already been analyzed. |
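
The filters listed above are designed to be stacked, each one wrapping an upstream token source and transforming or dropping tokens as they pass through. The following is a minimal plain-Java sketch of that chaining idea only. It does not use the Lucene API: `FilterChainSketch`, `SketchFilter`, `LowerCase`, `Stop`, `Length`, and `analyze` are hypothetical names invented for illustration, tokens are modeled as plain strings, and the stop-word list and length bounds are arbitrary assumptions.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Iterator;
import java.util.List;
import java.util.Locale;
import java.util.NoSuchElementException;
import java.util.Set;

public class FilterChainSketch {

    /** Hypothetical stand-in for a token filter: wraps an upstream token source. */
    abstract static class SketchFilter implements Iterator<String> {
        protected final Iterator<String> input;
        private String pending; // next token to hand out, if already computed

        SketchFilter(Iterator<String> input) { this.input = input; }

        /** Returns the next token to emit, or null once upstream is exhausted. */
        protected abstract String produce();

        @Override public boolean hasNext() {
            if (pending == null) pending = produce();
            return pending != null;
        }

        @Override public String next() {
            if (!hasNext()) throw new NoSuchElementException();
            String token = pending;
            pending = null;
            return token;
        }
    }

    /** Lower-cases every token (the idea behind LowerCaseFilter). */
    static class LowerCase extends SketchFilter {
        LowerCase(Iterator<String> input) { super(input); }
        @Override protected String produce() {
            return input.hasNext() ? input.next().toLowerCase(Locale.ROOT) : null;
        }
    }

    /** Drops tokens found in a stop-word set (the idea behind StopFilter). */
    static class Stop extends SketchFilter {
        private final Set<String> stopWords;
        Stop(Iterator<String> input, Set<String> stopWords) {
            super(input);
            this.stopWords = stopWords;
        }
        @Override protected String produce() {
            while (input.hasNext()) {
                String token = input.next();
                if (!stopWords.contains(token)) return token;
            }
            return null;
        }
    }

    /** Drops tokens shorter than min or longer than max (the idea behind LengthFilter). */
    static class Length extends SketchFilter {
        private final int min;
        private final int max;
        Length(Iterator<String> input, int min, int max) {
            super(input);
            this.min = min;
            this.max = max;
        }
        @Override protected String produce() {
            while (input.hasNext()) {
                String token = input.next();
                if (token.length() >= min && token.length() <= max) return token;
            }
            return null;
        }
    }

    /** Splits on whitespace, then runs the lower-case, stop, and length stages in order. */
    static List<String> analyze(String text) {
        Iterator<String> tokens = Arrays.asList(text.trim().split("\\s+")).iterator();
        Set<String> stopWords = new HashSet<>(Arrays.asList("the", "a", "of")); // assumed list
        Iterator<String> chain = new Length(new Stop(new LowerCase(tokens), stopWords), 2, 10);
        List<String> out = new ArrayList<>();
        while (chain.hasNext()) out.add(chain.next());
        return out;
    }

    public static void main(String[] args) {
        System.out.println(analyze("The Quick brown fox")); // prints [quick, brown, fox]
    }
}
```

Order matters in such a chain: lower-casing runs before stop-word removal here so that "The" matches the lower-case entry "the" in the stop set.
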
Modifier and Type | Class and Description |
---|---|
class | StandardFilter: Normalizes tokens extracted with StandardTokenizer. |
Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.