public class EnwikiDocMaker extends LineDocMaker
LineDocMaker
Modifier and Type | Field and Description |
---|---|
protected boolean |
keepImages |
BODY_FIELD, BYTES_FIELD, config, DATE_FIELD, forever, ID_FIELD, indexVal, NAME_FIELD, storeVal, termVecVal, TITLE_FIELD
Constructor and Description |
---|
EnwikiDocMaker() |
Modifier and Type | Method and Description |
---|---|
Document |
makeDocument()
Create the next document.
|
void |
setConfig(Config config)
Set the properties
|
getNextDocData, makeDocument, numUniqueTexts, resetInputs
addBytes, addUniqueBytes, collectFiles, getByteCount, getCount, getHtmlParser, numUniqueBytes, printDocStatistics, resetUniqueBytes, setHTMLParser
public void setConfig(Config config)
DocMaker
setConfig
in interface DocMaker
setConfig
in class LineDocMaker
public Document makeDocument() throws Exception
DocMaker
makeDocument
in interface DocMaker
makeDocument
in class LineDocMaker
Exception
Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.