public class DirDocMaker extends BasicDocMaker
Modifier and Type | Class and Description |
---|---|
static class |
DirDocMaker.Iterator |
Modifier and Type | Field and Description |
---|---|
protected File |
dataDir |
protected ThreadLocal |
dateFormat |
protected DirDocMaker.Iterator |
inputFiles |
protected int |
iteration |
BODY_FIELD, BYTES_FIELD, config, DATE_FIELD, forever, ID_FIELD, indexVal, NAME_FIELD, storeVal, termVecVal, TITLE_FIELD
Constructor and Description |
---|
DirDocMaker() |
Modifier and Type | Method and Description |
---|---|
protected DateFormat |
getDateFormat() |
protected DocData |
getNextDocData()
Return the data of the next document.
|
int |
numUniqueTexts()
Return how many real unique texts are available, 0 if not applicable.
|
void |
resetInputs()
Reset inputs so that the test run would behave, input wise, as if it just started.
|
void |
setConfig(Config config)
Set the properties
|
addBytes, addUniqueBytes, collectFiles, getByteCount, getCount, getHtmlParser, makeDocument, makeDocument, numUniqueBytes, printDocStatistics, resetUniqueBytes, setHTMLParser
protected ThreadLocal dateFormat
protected File dataDir
protected int iteration
protected DirDocMaker.Iterator inputFiles
public void setConfig(Config config)
DocMaker
setConfig
in interface DocMaker
setConfig
in class BasicDocMaker
protected DateFormat getDateFormat()
protected DocData getNextDocData() throws Exception
BasicDocMaker
getNextDocData
in class BasicDocMaker
NoMoreDataException
- if data is exhausted (and 'forever' set to false).Exception
public void resetInputs()
DocMaker
resetInputs
in interface DocMaker
resetInputs
in class BasicDocMaker
public int numUniqueTexts()
DocMaker
Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.