Package | Description |
---|---|
org.apache.lucene.analysis |
API and code to convert text into indexable/searchable tokens.
|
org.apache.lucene.analysis.br |
Analyzer for Brazilian Portuguese.
|
org.apache.lucene.analysis.cjk |
Analyzer for Chinese, Japanese and Korean.
|
org.apache.lucene.analysis.cn |
Analyzer for Chinese.
|
org.apache.lucene.analysis.cz |
Analyzer for Czech.
|
org.apache.lucene.analysis.de |
Analyzer for German.
|
org.apache.lucene.analysis.el |
Analyzer for Greek.
|
org.apache.lucene.analysis.fr |
Analyzer for French.
|
org.apache.lucene.analysis.nl |
Analyzer for Dutch.
|
org.apache.lucene.analysis.query | |
org.apache.lucene.analysis.ru |
Analyzer for Russian.
|
org.apache.lucene.analysis.shingle | |
org.apache.lucene.analysis.snowball |
TokenFilter and Analyzer implementations that use Snowball
stemmers. |
org.apache.lucene.analysis.standard |
A fast grammar-based tokenizer constructed with JFlex.
|
org.apache.lucene.analysis.th | |
org.apache.lucene.benchmark.byTask |
Benchmarking Lucene By Tasks.
|
org.apache.lucene.benchmark.standard | |
org.apache.lucene.benchmark.stats | |
org.apache.lucene.index |
Code to maintain and access indices.
|
org.apache.lucene.index.memory |
High-performance single-document main memory Apache Lucene fulltext search index.
|
org.apache.lucene.queryParser |
A simple query parser implemented with JavaCC.
|
org.apache.lucene.queryParser.analyzing | |
org.apache.lucene.queryParser.precedence | |
org.apache.lucene.search |
Code to search indices.
|
org.apache.lucene.search.highlight |
The highlight package contains classes to provide "keyword in context" features
typically used to highlight search terms in the text of results pages.
|
org.apache.lucene.search.similar |
Document similarity query generators.
|
org.apache.lucene.store.instantiated |
InstantiatedIndex, alternative RAM store for small corpora.
|
org.apache.lucene.swing.models |
Decorators for JTable TableModel and JList ListModel encapsulating Lucene indexing and searching functionality.
|
org.apache.lucene.wordnet |
This package uses synonyms defined by WordNet to build a
Lucene index storing them, which in turn can be used for query expansion.
|
org.apache.lucene.xmlparser | |
org.apache.lucene.xmlparser.builders |
Modifier and Type | Class and Description |
---|---|
class |
KeywordAnalyzer
"Tokenizes" the entire stream as a single token.
|
class |
PerFieldAnalyzerWrapper
This analyzer is used to facilitate scenarios where different
fields require different analysis techniques.
|
class |
SimpleAnalyzer
An Analyzer that filters LetterTokenizer with LowerCaseFilter.
|
class |
StopAnalyzer
Filters LetterTokenizer with LowerCaseFilter and StopFilter.
|
class |
WhitespaceAnalyzer
An Analyzer that uses WhitespaceTokenizer.
|
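The StopAnalyzer row above describes a three-stage chain: split text into runs of letters, lowercase them, then drop stop words. A minimal plain-Java sketch of that idea follows. It does not use any Lucene classes; `StopChainSketch`, `analyze`, and the stop-word set passed in are hypothetical names for illustration, and the real tokenizers and filters handle details this sketch ignores.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for a letter-tokenize / lowercase / stop-filter chain.
// Not a Lucene class.
class StopChainSketch {

    // Splits on any non-letter character, lowercases each token,
    // and skips tokens found in stopWords.
    static List<String> analyze(String text, Set<String> stopWords) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        // Iterate one position past the end so the last token is flushed.
        for (int i = 0; i <= text.length(); i++) {
            boolean letter = i < text.length() && Character.isLetter(text.charAt(i));
            if (letter) {
                current.append(Character.toLowerCase(text.charAt(i)));
            } else if (current.length() > 0) {
                String token = current.toString();
                if (!stopWords.contains(token)) {
                    tokens.add(token);
                }
                current.setLength(0);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(analyze("The Quick-Brown fox", Set.of("the")));
    }
}
```

Because the stop-word check runs after lowercasing, the stop-word set in this sketch is expected to contain lowercase entries.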
Modifier and Type | Method and Description |
---|---|
void |
PerFieldAnalyzerWrapper.addAnalyzer(java.lang.String fieldName,
Analyzer analyzer)
Defines an analyzer to use for the specified field.
|
Constructor and Description |
---|
PerFieldAnalyzerWrapper(Analyzer defaultAnalyzer)
Constructs with default analyzer.
|
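The PerFieldAnalyzerWrapper entries above describe a default analyzer plus per-field overrides registered through `addAnalyzer`. The dispatch idea can be sketched in plain Java as below. This is a model only, not the Lucene class: `PerFieldSketch` is a hypothetical name, and analyzers are represented as simple `Function<String, List<String>>` values rather than Lucene `Analyzer` instances.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical model of per-field analyzer selection. Not a Lucene class.
class PerFieldSketch {
    private final Function<String, List<String>> defaultAnalyzer;
    private final Map<String, Function<String, List<String>>> perField = new HashMap<>();

    PerFieldSketch(Function<String, List<String>> defaultAnalyzer) {
        this.defaultAnalyzer = defaultAnalyzer;
    }

    // Registers the analyzer to use for one specific field.
    void addAnalyzer(String fieldName, Function<String, List<String>> analyzer) {
        perField.put(fieldName, analyzer);
    }

    // Uses the field's own analyzer if one was registered, else the default.
    List<String> analyze(String fieldName, String text) {
        return perField.getOrDefault(fieldName, defaultAnalyzer).apply(text);
    }

    public static void main(String[] args) {
        PerFieldSketch wrapper =
            new PerFieldSketch(t -> Arrays.asList(t.toLowerCase().split("\\s+")));
        wrapper.addAnalyzer("id", t -> List.of(t)); // keep identifiers as one token
        System.out.println(wrapper.analyze("id", "AB 12"));
        System.out.println(wrapper.analyze("body", "AB 12"));
    }
}
```

In this sketch an identifier-like field keeps its value as a single token, while every other field falls through to the whitespace-splitting default.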
Modifier and Type | Class and Description |
---|---|
class |
BrazilianAnalyzer
Analyzer for the Brazilian Portuguese language.
|
Modifier and Type | Class and Description |
---|---|
class |
CJKAnalyzer
Filters CJKTokenizer with StopFilter.
|
Modifier and Type | Class and Description |
---|---|
class |
ChineseAnalyzer
Subclass of org.apache.lucene.analysis.Analyzer
built from a ChineseTokenizer, filtered with ChineseFilter.
|
Modifier and Type | Class and Description |
---|---|
class |
CzechAnalyzer
Analyzer for Czech language.
|
Modifier and Type | Class and Description |
---|---|
class |
GermanAnalyzer
Analyzer for German language.
|
Modifier and Type | Class and Description |
---|---|
class |
GreekAnalyzer
Analyzer for the Greek language.
|
Modifier and Type | Class and Description |
---|---|
class |
FrenchAnalyzer
Analyzer for French language.
|
Modifier and Type | Class and Description |
---|---|
class |
DutchAnalyzer
Analyzer for Dutch language.
|
Modifier and Type | Class and Description |
---|---|
class |
QueryAutoStopWordAnalyzer |
Constructor and Description |
---|
QueryAutoStopWordAnalyzer(Analyzer delegate)
Initializes this analyzer with the Analyzer object that actually produces the tokens.
|
Modifier and Type | Class and Description |
---|---|
class |
RussianAnalyzer
Analyzer for Russian language.
|
Modifier and Type | Class and Description |
---|---|
class |
ShingleAnalyzerWrapper
A ShingleAnalyzerWrapper wraps a ShingleFilter around another analyzer.
|
Modifier and Type | Field and Description |
---|---|
protected Analyzer |
ShingleAnalyzerWrapper.defaultAnalyzer |
Constructor and Description |
---|
ShingleAnalyzerWrapper(Analyzer defaultAnalyzer) |
ShingleAnalyzerWrapper(Analyzer defaultAnalyzer,
int maxShingleSize) |
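A shingle is a token n-gram: several adjacent tokens joined into one. The sketch below shows, in plain Java, how shingles up to a maximum size could be built from a token list. It is an illustration of the concept only, under stated assumptions: `ShingleSketch` and `shingles` are hypothetical names, tokens are joined with a single space, and only shingles of size 2 and larger are returned. Whether the wrapped filter also emits the original single tokens is not covered here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of shingle (token n-gram) construction.
// Not a Lucene class.
class ShingleSketch {

    // Returns every run of 2..maxShingleSize adjacent tokens,
    // ordered by start position and then by size.
    static List<String> shingles(List<String> tokens, int maxShingleSize) {
        List<String> out = new ArrayList<>();
        for (int start = 0; start < tokens.size(); start++) {
            for (int size = 2;
                 size <= maxShingleSize && start + size <= tokens.size();
                 size++) {
                out.add(String.join(" ", tokens.subList(start, start + size)));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(shingles(List.of("please", "divide", "this"), 3));
    }
}
```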
Modifier and Type | Class and Description |
---|---|
class |
SnowballAnalyzer
|
Modifier and Type | Class and Description |
---|---|
class |
StandardAnalyzer
Filters StandardTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words. |
Modifier and Type | Class and Description |
---|---|
class |
ThaiAnalyzer
Analyzer for Thai language.
|
Modifier and Type | Method and Description |
---|---|
Analyzer |
PerfRunData.getAnalyzer() |
Modifier and Type | Method and Description |
---|---|
void |
PerfRunData.setAnalyzer(Analyzer analyzer) |
Modifier and Type | Method and Description |
---|---|
static Query[] |
StandardBenchmarker.createQueries(java.util.List qs,
Analyzer a)
Deprecated.
Parse the strings containing Lucene queries.
|
Modifier and Type | Method and Description |
---|---|
Analyzer |
TestData.getAnalyzer() |
Modifier and Type | Method and Description |
---|---|
static TestData[] |
TestData.getAll(java.io.File[] sources,
Analyzer[] analyzers)
Prepare a list of benchmark data, using all possible combinations of
benchmark parameters.
|
static TestData[] |
TestData.getTestDataMinMaxMergeAndMaxBuffered(java.io.File[] sources,
Analyzer[] analyzers)
Similar to
TestData.getAll(java.io.File[], org.apache.lucene.analysis.Analyzer[]) but only uses
maxBufferedDocs of 10 and 100 and same for mergeFactor, thus reducing the number of permutations significantly. |
void |
TestData.setAnalyzer(Analyzer analyzer) |
Modifier and Type | Field and Description |
---|---|
protected Analyzer |
IndexModifier.analyzer
Deprecated.
|
Modifier and Type | Method and Description |
---|---|
Analyzer |
IndexWriter.getAnalyzer()
Returns the analyzer used by this index.
|
Modifier and Type | Method and Description |
---|---|
void |
IndexModifier.addDocument(Document doc,
Analyzer docAnalyzer)
Deprecated.
Adds a document to this index, using the provided analyzer instead of the
one specified in the constructor.
|
void |
IndexWriter.addDocument(Document doc,
Analyzer analyzer)
Adds a document to this index, using the provided analyzer instead of the
value of IndexWriter.getAnalyzer(). |
protected void |
IndexModifier.init(Directory directory,
Analyzer analyzer,
boolean create)
Deprecated.
Initialize an IndexWriter.
|
void |
IndexWriter.updateDocument(Term term,
Document doc,
Analyzer analyzer)
Updates a document by first deleting the document(s)
containing term and then adding the new document. |
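The update semantics described for `updateDocument` (delete whatever matches the term, then add the replacement) can be modelled with a toy in-memory store. The sketch below is not Lucene code: `UpdateSketch` is a hypothetical class, documents are plain `Map<String, String>` values, and "containing the term" is simplified to an exact match on one field's value.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory model of delete-then-add update semantics.
// Not a Lucene class.
class UpdateSketch {
    private final List<Map<String, String>> docs = new ArrayList<>();

    void addDocument(Map<String, String> doc) {
        docs.add(doc);
    }

    // Removes every document whose `field` equals `value`,
    // then appends the replacement document.
    void updateDocument(String field, String value, Map<String, String> doc) {
        docs.removeIf(d -> value.equals(d.get(field)));
        docs.add(doc);
    }

    int size() {
        return docs.size();
    }

    List<Map<String, String>> documents() {
        return docs;
    }

    public static void main(String[] args) {
        UpdateSketch index = new UpdateSketch();
        index.addDocument(Map.of("id", "1", "body", "old"));
        index.updateDocument("id", "1", Map.of("id", "1", "body", "new"));
        System.out.println(index.documents());
    }
}
```

In this model, if no document matches, the update behaves like a plain add, and if several match, all of them are removed before the single replacement is added.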
Constructor and Description |
---|
IndexModifier(Directory directory,
Analyzer analyzer,
boolean create)
Deprecated.
Open an index with write access.
|
IndexModifier(java.io.File file,
Analyzer analyzer,
boolean create)
Deprecated.
Open an index with write access.
|
IndexModifier(java.lang.String dirName,
Analyzer analyzer,
boolean create)
Deprecated.
Open an index with write access.
|
IndexWriter(Directory d,
Analyzer a)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(Directory,Analyzer,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(Directory d,
Analyzer a,
boolean create)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(Directory,Analyzer,boolean,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(Directory d,
Analyzer a,
boolean create,
IndexDeletionPolicy deletionPolicy,
IndexWriter.MaxFieldLength mfl)
Expert: constructs an IndexWriter with a custom
IndexDeletionPolicy, for the index in d. |
IndexWriter(Directory d,
Analyzer a,
boolean create,
IndexWriter.MaxFieldLength mfl)
Constructs an IndexWriter for the index in d. |
IndexWriter(Directory d,
Analyzer a,
IndexDeletionPolicy deletionPolicy,
IndexWriter.MaxFieldLength mfl)
Expert: constructs an IndexWriter with a custom
IndexDeletionPolicy, for the index in d,
first creating it if it does not already exist. |
IndexWriter(Directory d,
Analyzer a,
IndexWriter.MaxFieldLength mfl)
Constructs an IndexWriter for the index in d,
first creating it if it does not already exist. |
IndexWriter(Directory d,
boolean autoCommit,
Analyzer a)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(Directory,Analyzer,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(Directory d,
boolean autoCommit,
Analyzer a,
boolean create)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(Directory,Analyzer,boolean,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(Directory d,
boolean autoCommit,
Analyzer a,
boolean create,
IndexDeletionPolicy deletionPolicy)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(Directory,Analyzer,boolean,IndexDeletionPolicy,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(Directory d,
boolean autoCommit,
Analyzer a,
IndexDeletionPolicy deletionPolicy)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(Directory,Analyzer,IndexDeletionPolicy,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(java.io.File path,
Analyzer a)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(File,Analyzer,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(java.io.File path,
Analyzer a,
boolean create)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(File,Analyzer,boolean,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(java.io.File path,
Analyzer a,
boolean create,
IndexWriter.MaxFieldLength mfl)
Constructs an IndexWriter for the index in path. |
IndexWriter(java.io.File path,
Analyzer a,
IndexWriter.MaxFieldLength mfl)
Constructs an IndexWriter for the index in path,
first creating it if it does not already exist. |
IndexWriter(java.lang.String path,
Analyzer a)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(String,Analyzer,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(java.lang.String path,
Analyzer a,
boolean create)
Deprecated.
This constructor will be removed in the 3.0 release.
Use
IndexWriter.IndexWriter(String,Analyzer,boolean,MaxFieldLength)
instead, and call IndexWriter.commit() when needed. |
IndexWriter(java.lang.String path,
Analyzer a,
boolean create,
IndexWriter.MaxFieldLength mfl)
Constructs an IndexWriter for the index in path. |
IndexWriter(java.lang.String path,
Analyzer a,
IndexWriter.MaxFieldLength mfl)
Constructs an IndexWriter for the index in path,
first creating it if it does not already exist. |
Modifier and Type | Class and Description |
---|---|
class |
PatternAnalyzer
Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than a
Reader, that can flexibly separate text into terms via a regular expression Pattern
(with behaviour identical to String.split(String)),
and that combines the functionality of
LetterTokenizer, LowerCaseTokenizer, WhitespaceTokenizer and StopFilter
into a single efficient multi-purpose class. |
Modifier and Type | Method and Description |
---|---|
static Analyzer |
AnalyzerUtil.getLoggingAnalyzer(Analyzer child,
java.io.PrintStream log,
java.lang.String logName)
Returns a simple analyzer wrapper that logs all tokens produced by the
underlying child analyzer to the given log stream (typically System.err);
Otherwise behaves exactly like the child analyzer, delivering the very
same tokens; useful for debugging purposes on custom indexing and/or
querying.
|
static Analyzer |
AnalyzerUtil.getMaxTokenAnalyzer(Analyzer child,
int maxTokens)
Returns an analyzer wrapper that returns at most the first
maxTokens tokens from the underlying child analyzer,
ignoring all remaining tokens. |
static Analyzer |
AnalyzerUtil.getPorterStemmerAnalyzer(Analyzer child)
Returns an English stemming analyzer that stems tokens from the
underlying child analyzer according to the Porter stemming algorithm.
|
static Analyzer |
AnalyzerUtil.getSynonymAnalyzer(Analyzer child,
SynonymMap synonyms,
int maxSynonyms)
Returns an analyzer wrapper that wraps the underlying child analyzer's
token stream into a SynonymTokenFilter. |
static Analyzer |
AnalyzerUtil.getTokenCachingAnalyzer(Analyzer child)
Returns an analyzer wrapper that caches all tokens generated by the underlying child analyzer's
token streams, and delivers those cached tokens on subsequent calls to
tokenStream(String fieldName, Reader reader)
if the fieldName has been seen before, altogether ignoring the Reader parameter on cache lookup. |
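The caching behaviour described for `getTokenCachingAnalyzer` has one notable consequence: the cache key is the field name alone, so a second call for the same field returns the first call's tokens even if the input differs. The plain-Java sketch below models that behaviour. It is not the Lucene implementation: `TokenCacheSketch` is a hypothetical class, and the child analyzer is represented as a `BiFunction` taking a field name and text.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BiFunction;

// Hypothetical model of a token cache keyed only by field name.
// Not a Lucene class.
class TokenCacheSketch {
    private final BiFunction<String, String, List<String>> child;
    private final Map<String, List<String>> cache = new HashMap<>();

    TokenCacheSketch(BiFunction<String, String, List<String>> child) {
        this.child = child;
    }

    // The first call per field runs the child analyzer and stores its tokens.
    // Later calls for the same field return the stored tokens and ignore `text`.
    List<String> tokens(String fieldName, String text) {
        List<String> cached = cache.get(fieldName);
        if (cached == null) {
            cached = new ArrayList<>(child.apply(fieldName, text));
            cache.put(fieldName, cached);
        }
        return cached;
    }

    public static void main(String[] args) {
        TokenCacheSketch caching =
            new TokenCacheSketch((field, text) -> Arrays.asList(text.split(" ")));
        System.out.println(caching.tokens("body", "a b"));
        System.out.println(caching.tokens("body", "c d")); // served from cache
    }
}
```

Under this model, a caching wrapper is only safe to reuse while the text for a given field stays the same, for example when the same document is analyzed more than once.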
Modifier and Type | Method and Description |
---|---|
void |
MemoryIndex.addField(java.lang.String fieldName,
java.lang.String text,
Analyzer analyzer)
Convenience method; Tokenizes the given field text and adds the resulting
terms to the index; Equivalent to adding an indexed non-keyword Lucene
Field that is tokenized, not stored,
termVectorStored with positions (or
termVectorStored with positions and offsets), |
static Analyzer |
AnalyzerUtil.getLoggingAnalyzer(Analyzer child,
java.io.PrintStream log,
java.lang.String logName)
Returns a simple analyzer wrapper that logs all tokens produced by the
underlying child analyzer to the given log stream (typically System.err);
Otherwise behaves exactly like the child analyzer, delivering the very
same tokens; useful for debugging purposes on custom indexing and/or
querying.
|
static Analyzer |
AnalyzerUtil.getMaxTokenAnalyzer(Analyzer child,
int maxTokens)
Returns an analyzer wrapper that returns at most the first
maxTokens tokens from the underlying child analyzer,
ignoring all remaining tokens. |
static java.lang.String[] |
AnalyzerUtil.getMostFrequentTerms(Analyzer analyzer,
java.lang.String text,
int limit)
Returns (frequency:term) pairs for the top N distinct terms (aka words),
sorted descending by frequency (and ascending by term, if tied).
|
static Analyzer |
AnalyzerUtil.getPorterStemmerAnalyzer(Analyzer child)
Returns an English stemming analyzer that stems tokens from the
underlying child analyzer according to the Porter stemming algorithm.
|
static Analyzer |
AnalyzerUtil.getSynonymAnalyzer(Analyzer child,
SynonymMap synonyms,
int maxSynonyms)
Returns an analyzer wrapper that wraps the underlying child analyzer's
token stream into a SynonymTokenFilter. |
static Analyzer |
AnalyzerUtil.getTokenCachingAnalyzer(Analyzer child)
Returns an analyzer wrapper that caches all tokens generated by the underlying child analyzer's
token streams, and delivers those cached tokens on subsequent calls to
tokenStream(String fieldName, Reader reader)
if the fieldName has been seen before, altogether ignoring the Reader parameter on cache lookup. |
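The ordering rule stated for `getMostFrequentTerms` (descending by frequency, ascending by term on ties, truncated to the top N) can be sketched in plain Java as follows. This is an illustration, not the Lucene implementation: `FrequentTermsSketch` is a hypothetical class, it takes an already tokenized list instead of an Analyzer and text, and the exact `frequency:term` string format is an assumption based on the description above.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of top-N term counting. Not a Lucene class.
class FrequentTermsSketch {

    static String[] mostFrequentTerms(List<String> tokens, int limit) {
        // Count occurrences of each distinct term.
        Map<String, Integer> counts = new HashMap<>();
        for (String token : tokens) {
            counts.merge(token, 1, Integer::sum);
        }
        // Sort by frequency descending, then by term ascending.
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byFrequency = Integer.compare(b.getValue(), a.getValue());
            return byFrequency != 0 ? byFrequency : a.getKey().compareTo(b.getKey());
        });
        // Keep at most `limit` entries, formatted as "frequency:term".
        int n = Math.max(0, Math.min(limit, entries.size()));
        String[] result = new String[n];
        for (int i = 0; i < n; i++) {
            result[i] = entries.get(i).getValue() + ":" + entries.get(i).getKey();
        }
        return result;
    }

    public static void main(String[] args) {
        String[] top = mostFrequentTerms(List.of("b", "a", "b", "c", "a", "d"), 3);
        System.out.println(String.join(", ", top));
    }
}
```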
Modifier and Type | Method and Description |
---|---|
Analyzer |
QueryParser.getAnalyzer() |
Modifier and Type | Method and Description |
---|---|
static Query |
MultiFieldQueryParser.parse(java.lang.String[] queries,
java.lang.String[] fields,
Analyzer analyzer)
Parses a query which searches on the fields specified.
|
static Query |
MultiFieldQueryParser.parse(java.lang.String[] queries,
java.lang.String[] fields,
BooleanClause.Occur[] flags,
Analyzer analyzer)
Parses a query, searching on the fields specified.
|
static Query |
MultiFieldQueryParser.parse(java.lang.String query,
java.lang.String[] fields,
BooleanClause.Occur[] flags,
Analyzer analyzer)
Parses a query, searching on the fields specified.
|
Constructor and Description |
---|
MultiFieldQueryParser(java.lang.String[] fields,
Analyzer analyzer)
Creates a MultiFieldQueryParser.
|
MultiFieldQueryParser(java.lang.String[] fields,
Analyzer analyzer,
java.util.Map boosts)
Creates a MultiFieldQueryParser.
|
QueryParser(java.lang.String f,
Analyzer a)
Constructs a query parser.
|
Constructor and Description |
---|
AnalyzingQueryParser(java.lang.String field,
Analyzer analyzer)
Constructs a query parser.
|
Modifier and Type | Method and Description |
---|---|
Analyzer |
PrecedenceQueryParser.getAnalyzer() |
Constructor and Description |
---|
PrecedenceQueryParser(java.lang.String f,
Analyzer a)
Constructs a query parser.
|
Constructor and Description |
---|
FuzzyLikeThisQuery(int maxNumTerms,
Analyzer analyzer) |
QueryTermVector(java.lang.String queryString,
Analyzer analyzer) |
Modifier and Type | Method and Description |
---|---|
static TokenStream |
TokenSources.getAnyTokenStream(IndexReader reader,
int docId,
java.lang.String field,
Analyzer analyzer)
A convenience method that tries a number of approaches to getting a token stream.
|
static TokenStream |
TokenSources.getAnyTokenStream(IndexReader reader,
int docId,
java.lang.String field,
Document doc,
Analyzer analyzer)
A convenience method that first tries to get a TermPositionVector for the specified docId, then falls back to
using the passed-in Document to retrieve the TokenStream. |
java.lang.String |
Highlighter.getBestFragment(Analyzer analyzer,
java.lang.String fieldName,
java.lang.String text)
Highlights chosen terms in a text, extracting the most relevant section.
|
java.lang.String[] |
Highlighter.getBestFragments(Analyzer analyzer,
java.lang.String text,
int maxNumFragments)
Deprecated.
This method incorrectly hardcodes the choice of fieldname. Use the
method of the same name that takes a fieldname.
|
java.lang.String[] |
Highlighter.getBestFragments(Analyzer analyzer,
java.lang.String fieldName,
java.lang.String text,
int maxNumFragments)
Highlights chosen terms in a text, extracting the most relevant sections.
|
static TokenStream |
TokenSources.getTokenStream(Document doc,
java.lang.String field,
Analyzer analyzer) |
static TokenStream |
TokenSources.getTokenStream(IndexReader reader,
int docId,
java.lang.String field,
Analyzer analyzer) |
static TokenStream |
TokenSources.getTokenStream(java.lang.String field,
java.lang.String contents,
Analyzer analyzer) |
Modifier and Type | Field and Description |
---|---|
static Analyzer |
MoreLikeThis.DEFAULT_ANALYZER
Default analyzer to parse source doc with.
|
Modifier and Type | Method and Description |
---|---|
Analyzer |
MoreLikeThisQuery.getAnalyzer() |
Analyzer |
MoreLikeThis.getAnalyzer()
Returns an analyzer that will be used to parse source doc with.
|
Modifier and Type | Method and Description |
---|---|
static Query |
SimilarityQueries.formSimilarQuery(java.lang.String body,
Analyzer a,
java.lang.String field,
java.util.Set stop)
Simple similarity query generators.
|
void |
MoreLikeThisQuery.setAnalyzer(Analyzer analyzer) |
void |
MoreLikeThis.setAnalyzer(Analyzer analyzer)
Sets the analyzer to use.
|
Constructor and Description |
---|
MoreLikeThisQuery(java.lang.String likeText,
java.lang.String[] moreLikeFields,
Analyzer analyzer) |
Modifier and Type | Method and Description |
---|---|
Analyzer |
InstantiatedIndexWriter.getAnalyzer() |
Modifier and Type | Method and Description |
---|---|
void |
InstantiatedIndexWriter.addDocument(Document doc,
Analyzer analyzer)
Adds a document to this index, using the provided analyzer instead of the
value of InstantiatedIndexWriter.getAnalyzer(). |
protected void |
InstantiatedIndexWriter.addDocument(InstantiatedDocument document,
Analyzer analyzer)
Tokenizes a document and adds it to the buffer.
|
InstantiatedIndexWriter |
InstantiatedIndex.indexWriterFactory(Analyzer analyzer,
boolean create) |
void |
InstantiatedIndexWriter.updateDocument(Term term,
Document doc,
Analyzer analyzer) |
Constructor and Description |
---|
InstantiatedIndexWriter(InstantiatedIndex index,
Analyzer analyzer) |
InstantiatedIndexWriter(InstantiatedIndex index,
Analyzer analyzer,
boolean create) |
Modifier and Type | Method and Description |
---|---|
Analyzer |
ListSearcher.getAnalyzer() |
Analyzer |
TableSearcher.getAnalyzer() |
Modifier and Type | Method and Description |
---|---|
void |
ListSearcher.setAnalyzer(Analyzer analyzer) |
void |
TableSearcher.setAnalyzer(Analyzer analyzer) |
Modifier and Type | Method and Description |
---|---|
static Query |
SynExpand.expand(java.lang.String query,
Searcher syns,
Analyzer a,
java.lang.String field,
float boost)
Perform synonym expansion on a query.
|
static Query |
SynLookup.expand(java.lang.String query,
Searcher syns,
Analyzer a,
java.lang.String field,
float boost)
Perform synonym expansion on a query.
|
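Synonym expansion, as named in the `expand` entries above, means adding alternative terms to a query so that documents using a synonym can still match. A minimal plain-Java sketch of the idea follows. It is not the Lucene implementation: `SynonymExpansionSketch` is a hypothetical class, the synonym source is a simple in-memory map rather than a searcher over a synonym index, and giving original terms weight 1.0 while synonyms receive `boost` is this sketch's own choice.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of query-side synonym expansion. Not a Lucene class.
class SynonymExpansionSketch {

    // Returns each original term with weight 1.0, followed by any synonyms
    // not already present, each with weight `boost`. Insertion order is kept.
    static Map<String, Float> expand(List<String> queryTerms,
                                     Map<String, List<String>> synonyms,
                                     float boost) {
        Map<String, Float> weighted = new LinkedHashMap<>();
        for (String term : queryTerms) {
            weighted.put(term, 1.0f);
        }
        for (String term : queryTerms) {
            for (String synonym : synonyms.getOrDefault(term, List.of())) {
                weighted.putIfAbsent(synonym, boost);
            }
        }
        return weighted;
    }

    public static void main(String[] args) {
        Map<String, List<String>> synonyms = Map.of("big", List.of("large", "huge"));
        System.out.println(expand(List.of("big", "dog"), synonyms, 0.5f));
    }
}
```

Using `putIfAbsent` keeps an original query term at full weight even when it also appears as a synonym of another term.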
Modifier and Type | Field and Description |
---|---|
protected Analyzer |
CoreParser.analyzer |
Constructor and Description |
---|
CoreParser(Analyzer analyzer,
QueryParser parser)
Construct an XML parser that uses a single instance QueryParser for handling
UserQuery tags - all parse operations are synchronised on this parser
|
CoreParser(java.lang.String defaultField,
Analyzer analyzer)
Constructs an XML parser that creates a QueryParser for each UserQuery request.
|
CoreParser(java.lang.String defaultField,
Analyzer analyzer,
QueryParser parser) |
CorePlusExtensionsParser(Analyzer analyzer,
QueryParser parser)
Construct an XML parser that uses a single instance QueryParser for handling
UserQuery tags - all parse operations are synchronised on this parser
|
CorePlusExtensionsParser(java.lang.String defaultField,
Analyzer analyzer)
Constructs an XML parser that creates a QueryParser for each UserQuery request.
|
Modifier and Type | Method and Description |
---|---|
protected QueryParser |
UserInputQueryBuilder.createQueryParser(java.lang.String fieldName,
Analyzer analyzer)
Method to create a QueryParser - designed to be overridden
|
Constructor and Description |
---|
FuzzyLikeThisQueryBuilder(Analyzer analyzer) |
LikeThisQueryBuilder(Analyzer analyzer,
java.lang.String[] defaultFieldNames) |
SpanOrTermsBuilder(Analyzer analyzer) |
TermsFilterBuilder(Analyzer analyzer) |
TermsQueryBuilder(Analyzer analyzer) |
UserInputQueryBuilder(java.lang.String defaultField,
Analyzer analyzer) |
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.