lucene/contrib/benchmark/CHANGES.txt

Lucene Benchmark Contrib Change Log

The Benchmark contrib package contains code for benchmarking Lucene in a variety of ways.

$Id:$

7/14/2009
  LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works
  with Benchmark. Benchmark will now throw an exception if you specify sort fields without
  a type. The example sort algorithm is now typed.  (Mark Miller)

7/6/2009
  LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, 
  unless a different encoding is specified. Additionally, ContentSource now supports 
  a content.source.encoding parameter in the configuration file. 
  (Shai Erera via Mark Miller)

6/26/2009
  LUCENE-1716: Added the following support: 
  doc.tokenized.norms: specifies whether to store norms
  doc.body.tokenized.norms: special attribute for the body field
  doc.index.props: specifies whether DocMaker should index the properties set on
  DocData
  writer.info.stream: specifies the info stream to set on IndexWriter (supported
  values are: SystemOut, SystemErr and a file name). (Shai Erera via Mike McCandless)
  
6/23/09
  LUCENE-1714: WriteLineDocTask incorrectly  normalized text, by replacing only 
  occurrences of "\t" with a space. It now replaces "\r\n" in addition to that, 
  so that LineDocMaker won't fail. (Shai Erera via Michael McCandless)
  
6/17/09 
  LUCENE-1595: This issue breaks previous external algorithms. DocMaker has been 
  replaced with a concrete class which accepts a ContentSource for iterating over 
  a content source's documents. Most of the old DocMakers were changed to a 
  ContentSource implementation, and DocMaker is now a default document creation impl
  that provides an easy way for reusing fields. When [doc.maker] is not defined in 
  an algorithm, the new DocMaker is the default. If you have .alg files which 
  specify a DocMaker (like ReutersDocMaker), you should change the [doc.maker] line to: 
  [content.source=org.apache.lucene.benchmark.byTask.feeds.ReutersContentSource]
  
  i.e.
  doc.maker=org.apache.lucene.benchmark.byTask.feeds.ReutersDocMaker
  becomes
  content.source=org.apache.lucene.benchmark.byTask.feeds.ReutersContentSource
  
  doc.maker=org.apache.lucene.benchmark.byTask.feeds.SimpleDocMaker
  becomes
  content.source=org.apache.lucene.benchmark.byTask.feeds.SingleDocSource
 	
  Also, PerfTask now logs a message in tearDown() rather than each Task doing its
  own logging. A new setting called [log.step] is consulted to determine how often 
  to log. [doc.add.log.step] is no longer a valid setting. For easy migration of 
  current .alg files, rename [doc.add.log.step] to [log.step] and [doc.delete.log.step] 
  to [delete.log.step]. 
  
  Additionally, [doc.maker.forever] should be changed to [content.source.forever].
  (Shai Erera via Mark Miller)

6/12/09 
  LUCENE-1539: Added DeleteByPercentTask which enables deleting a
  percentage of documents and searching on them.  Changed CommitIndex
  to optionally accept a label (recorded as userData=<label> in the
  commit point).  Added FlushReaderTask, and modified OpenReaderTask
  to also optionally take a label referencing a commit point to open.
  Also changed default autoCommit (when IndexWriter is opened) to
  true. (Jason Rutherglen via Mike McCandless)

12/20/08
  LUCENE-1495: Allow task sequence to run for specfied number of seconds by adding ": 2.7s" (for example).

12/16/08
  LUCENE-1493: Stop using deprecated Hits API for searching; add new
  param search.num.hits to set top N docs to collect.

12/16/08
  LUCENE-1492: Added optional readOnly param (default true) to OpenReader task.

9/9/08
 LUCENE-1243: Added new sorting benchmark capabilities.  Also Reopen and commit tasks.  (Mark Miller via Grant Ingersoll)

5/10/08
  LUCENE-1090: remove relative paths assumptions from benchmark code.
  Only build.xml was modified: work-dir definition must remain so  
  benchmark tests can run from both trunk-home and benchmark-home.  
  
3/9/08
  LUCENE-1209: Fixed DocMaker settings by round. Prior to this fix, DocMaker settings of 
  first round were used in all rounds.  (E.g. term vectors.)
  (Mark Miller via Doron Cohen) 

1/30/08
  LUCENE-1156: Fixed redirect problem in EnwikiDocMaker.  Refactored ExtractWikipedia to use EnwikiDocMaker.  Added property to EnwikiDocMaker to allow
  for skipping image only documents.

1/24/2008
  LUCENE-1136: add ability to not count sub-task doLogic increment
  
1/23/2008
  LUCENE-1129: ReadTask properly uses the traversalSize value
  LUCENE-1128: Added support for benchmarking the highlighter

01/20/08
  LUCENE-1139: various fixes
  - add merge.scheduler, merge.policy config properties
  - refactor Open/CreateIndexTask to share setting config on IndexWriter
  - added doc.reuse.fields=true|false for LineDocMaker
  - OptimizeTask now takes int param to call optimize(int maxNumSegments)
  - CloseIndexTask now takes bool param to call close(false) (abort running merges)


01/03/08
  LUCENE-1116: quality package improvements:
  - add MRR computation; 
  - allow control of max #queries to run;
  - verify log & report are flushed.
  - add TREC query reader for the 1MQ track.  
      
12/31/07
  LUCENE-1102: EnwikiDocMaker now indexes the docid field, so results might not be comparable with results prior to this change, although
  it is doubted that this one small field makes much difference.
  
12/13/07
  LUCENE-1086: DocMakers setup for the "docs.dir" property
  fixed to properly handle absolute paths. (Shai Erera via Doron Cohen)
  
9/18/07
  LUCENE-941: infinite loop for alg: {[AddDoc(4000)]: 4} : *
  ResetInputsTask fixed to work also after exhaustion.
  All Reset Tasks now subclas ResetInputsTask.

8/9/07
  LUCENE-971: Change enwiki tasks to a doc maker (extending
  LineDocMaker) that directly processes the Wikipedia XML and produces
  documents.  Intermediate files (one per document) are no longer
  created.

8/1/07
  LUCENE-967: Add "ReadTokensTask" to allow for benchmarking just tokenization.

7/27/07
  LUCENE-836: Add support for search quality benchmarking, running 
  a set of queries against a searcher, and, optionally produce a submission
  report, and, if query judgements are available, compute quality measures:
  recall, precision_at_N, average_precision, MAP. TREC specific Judge (based 
  on TREC QRels) and TREC Topics reader are included in o.a.l.benchmark.quality.trec
  but any other format of queries and judgements can be implemented and used.
  
7/24/07
  LUCENE-947: Add support for creating and index "one document per
  line" from a large text file, which reduces per-document overhead of
  opening a single file for each document.

6/30/07
  LUCENE-848: Added support for Wikipedia benchmarking.

6/25/07
- LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat; logging for addDoc/deleteDoc tasks.
- LUCENE-945: tests fail to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
(Doron Cohen)

4/17/07
- LUCENE-863: Deprecated StandardBenchmarker in favour of byTask code.
  (Otis Gospodnetic)

4/13/07

Better error handling and javadocs around "exhaustive" doc making.

3/25/07

LUCENE-849: 
1. which HTML Parser is used is configurable with html.parser property.
2. External classes added to classpath with -Dbenchmark.ext.classpath=path.
3. '*' as repeating number now means "exhaust doc maker - no repetitions".

3/22/07

-Moved withRetrieve() call out of the loop in ReadTask
-Added SearchTravRetLoadFieldSelectorTask to help benchmark some of the FieldSelector capabilities
-Added options to store content bytes on the Reuters Doc (and others, but Reuters is the only one w/ it enabled)

3/21/07

Tests (for benchmarking code correctness) were added - LUCENE-840.
To be invoked by "ant test" from contrib/benchmark. (Doron Cohen)

3/19/07

1. Introduced an AbstractQueryMaker to hold common QueryMaker code. (GSI)
2. Added traversalSize parameter to SearchTravRetTask and SearchTravTask.  Changed SearchTravRetTask to extend SearchTravTask. (GSI)
3. Added FileBasedQueryMaker to run queries from a File or resource. (GSI)
4. Modified query-maker generation for read related tasks to make further read tasks addition simpler and safer. (DC)
5. Changed Taks' setParams() to throw UnsupportedOperationException if that task does not suppot command line param. (DC)
6. Improved javadoc to specify all properties command line params currently supported. (DC)
7. Refactored ReportTasks so that it is easy/possible now to create new report tasks. (DC)

01/09/07

1. Committed Doron Cohen's benchmarking contribution, which provides an easily expandable task based approach to benchmarking.  See the javadocs for information. (Doron Cohen via Grant Ingersoll)

2. Added this file.

3. 2/11/07: LUCENE-790 and 788:  Fixed Locale issue with date formatter. Fixed some minor issues with benchmarking by task.  Added a dependency
 on the Lucene demo to the build classpath.  (Doron Cohen, Grant Ingersoll)

4. 2/13/07: LUCENE-801: build.xml now builds Lucene core and Demo first and has classpath dependencies on the output of that build.  (Doron Cohen, Grant Ingersoll)
Lucene 675: Initial commit of Doron Cohen's byTask benchmarking contribution. Thanks Doron! git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@495834 13f79535-47bb-0310-9956-ffa450edef68 2007-01-12 23:08:23 -05:00			`Lucene Benchmark Contrib Change Log`

			`The Benchmark contrib package contains code for benchmarking Lucene in a variety of ways.`

			$Id:$
LCUENE-1716: allow control over storage of norms (body norms), info stream and whether docs properties should be indexed as fields git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788777 13f79535-47bb-0310-9956-ffa450edef68 2009-06-26 13:26:54 -04:00
LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68 2009-07-14 18:52:58 -04:00			`7/14/2009`
			`LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works`
			`with Benchmark. Benchmark will now throw an exception if you specify sort fields without`
			`a type. The example sort algorithm is now typed. (Mark Miller)`

LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68 2009-07-06 11:56:39 -04:00			`7/6/2009`
			`LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files,`
			`unless a different encoding is specified. Additionally, ContentSource now supports`
			`a content.source.encoding parameter in the configuration file.`
			`(Shai Erera via Mark Miller)`

LCUENE-1716: allow control over storage of norms (body norms), info stream and whether docs properties should be indexed as fields git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788777 13f79535-47bb-0310-9956-ffa450edef68 2009-06-26 13:26:54 -04:00			`6/26/2009`
			`LUCENE-1716: Added the following support:`
			`doc.tokenized.norms: specifies whether to store norms`
			`doc.body.tokenized.norms: special attribute for the body field`
			`doc.index.props: specifies whether DocMaker should index the properties set on`
			`DocData`
			`writer.info.stream: specifies the info stream to set on IndexWriter (supported`
			`values are: SystemOut, SystemErr and a file name). (Shai Erera via Mike McCandless)`

LUCENE-1714: fix WriteLineDocTask to also replace \r, \n (in addition to \t) with space so those chars don't create mal-formed lines git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787750 13f79535-47bb-0310-9956-ffa450edef68 2009-06-23 12:46:17 -04:00			`6/23/09`
			`LUCENE-1714: WriteLineDocTask incorrectly normalized text, by replacing only`
			`occurrences of "\t" with a space. It now replaces "\r\n" in addition to that,`
			`so that LineDocMaker won't fail. (Shai Erera via Michael McCandless)`

LUCENE-1595: Separate DocMaker into DocMaker and ContentSource. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786233 13f79535-47bb-0310-9956-ffa450edef68 2009-06-18 15:58:59 -04:00			`6/17/09`
			`LUCENE-1595: This issue breaks previous external algorithms. DocMaker has been`
			`replaced with a concrete class which accepts a ContentSource for iterating over`
			`a content source's documents. Most of the old DocMakers were changed to a`
			`ContentSource implementation, and DocMaker is now a default document creation impl`
			`that provides an easy way for reusing fields. When [doc.maker] is not defined in`
			`an algorithm, the new DocMaker is the default. If you have .alg files which`
			`specify a DocMaker (like ReutersDocMaker), you should change the [doc.maker] line to:`
			`[content.source=org.apache.lucene.benchmark.byTask.feeds.ReutersContentSource]`

			`i.e.`
			`doc.maker=org.apache.lucene.benchmark.byTask.feeds.ReutersDocMaker`
			`becomes`
			`content.source=org.apache.lucene.benchmark.byTask.feeds.ReutersContentSource`

			`doc.maker=org.apache.lucene.benchmark.byTask.feeds.SimpleDocMaker`
			`becomes`
			`content.source=org.apache.lucene.benchmark.byTask.feeds.SingleDocSource`

			`Also, PerfTask now logs a message in tearDown() rather than each Task doing its`
			`own logging. A new setting called [log.step] is consulted to determine how often`
			`to log. [doc.add.log.step] is no longer a valid setting. For easy migration of`
			`current .alg files, rename [doc.add.log.step] to [log.step] and [doc.delete.log.step]`
			`to [delete.log.step].`

			`Additionally, [doc.maker.forever] should be changed to [content.source.forever].`
			`(Shai Erera via Mark Miller)`

LUCENE-1539: add DeleteByPercent, FlushReader tasks, and ability to open reader on a labelled commit point git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784587 13f79535-47bb-0310-9956-ffa450edef68 2009-06-14 13:07:55 -04:00			`6/12/09`
			`LUCENE-1539: Added DeleteByPercentTask which enables deleting a`
			`percentage of documents and searching on them. Changed CommitIndex`
			`to optionally accept a label (recorded as userData=<label> in the`
			`commit point). Added FlushReaderTask, and modified OpenReaderTask`
			`to also optionally take a label referencing a commit point to open.`
			`Also changed default autoCommit (when IndexWriter is opened) to`
			`true. (Jason Rutherglen via Mike McCandless)`

LUCENE-1495: fix time-based test to reduce change of false failure git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728425 13f79535-47bb-0310-9956-ffa450edef68 2008-12-21 06:07:28 -05:00			`12/20/08`
			`LUCENE-1495: Allow task sequence to run for specfied number of seconds by adding ": 2.7s" (for example).`

LUCENE-1493: allow setting top number of hits to collect with search.num.hits git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727063 13f79535-47bb-0310-9956-ffa450edef68 2008-12-16 10:09:46 -05:00			`12/16/08`
			`LUCENE-1493: Stop using deprecated Hits API for searching; add new`
			`param search.num.hits to set top N docs to collect.`
LUCENE-1090: remove relative paths from benchmark's build.xml. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68 2008-06-10 07:58:00 -04:00
LUCENE-1492: add optional readOnly param to OpenReader task git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727029 13f79535-47bb-0310-9956-ffa450edef68 2008-12-16 06:44:01 -05:00			`12/16/08`
			`LUCENE-1492: Added optional readOnly param (default true) to OpenReader task.`

LUCENE-1243: Added new benchmark tasks git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693495 13f79535-47bb-0310-9956-ffa450edef68 2008-09-09 11:56:41 -04:00			`9/9/08`
			`LUCENE-1243: Added new sorting benchmark capabilities. Also Reopen and commit tasks. (Mark Miller via Grant Ingersoll)`

LUCENE-1090: remove relative paths from benchmark's build.xml. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68 2008-06-10 07:58:00 -04:00			`5/10/08`
			`LUCENE-1090: remove relative paths assumptions from benchmark code.`
			`Only build.xml was modified: work-dir definition must remain so`
			`benchmark tests can run from both trunk-home and benchmark-home.`

LUCENE-1209: Fixed DocMaker settings by round. Prior to this fix, DocMaker settings of first round were used in all rounds. (E.g. term vectors.) git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635280 13f79535-47bb-0310-9956-ffa450edef68 2008-03-09 12:43:32 -04:00			`3/9/08`
			`LUCENE-1209: Fixed DocMaker settings by round. Prior to this fix, DocMaker settings of`
			`first round were used in all rounds. (E.g. term vectors.)`
			`(Mark Miller via Doron Cohen)`

LUCENE-1156: see CHANGES.txt git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616934 13f79535-47bb-0310-9956-ffa450edef68 2008-01-30 17:47:52 -05:00			`1/30/08`
			`LUCENE-1156: Fixed redirect problem in EnwikiDocMaker. Refactored ExtractWikipedia to use EnwikiDocMaker. Added property to EnwikiDocMaker to allow`
			`for skipping image only documents.`
Lucene 675: Initial commit of Doron Cohen's byTask benchmarking contribution. Thanks Doron! git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@495834 13f79535-47bb-0310-9956-ffa450edef68 2007-01-12 23:08:23 -05:00
LUCENE-1136: add ability to not count sub-task doLogic increment to contri/benchmark git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614956 13f79535-47bb-0310-9956-ffa450edef68 2008-01-24 13:46:57 -05:00			`1/24/2008`
			`LUCENE-1136: add ability to not count sub-task doLogic increment`

LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68 2008-01-24 09:39:44 -05:00			`1/23/2008`
			`LUCENE-1129: ReadTask properly uses the traversalSize value`
			`LUCENE-1128: Added support for benchmarking the highlighter`

LUCENE-1139: various additions/fixes to contrib/benchmark git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68 2008-01-20 06:31:38 -05:00			`01/20/08`
			`LUCENE-1139: various fixes`
			`- add merge.scheduler, merge.policy config properties`
			`- refactor Open/CreateIndexTask to share setting config on IndexWriter`
			`- added doc.reuse.fields=true\|false for LineDocMaker`
			`- OptimizeTask now takes int param to call optimize(int maxNumSegments)`
			`- CloseIndexTask now takes bool param to call close(false) (abort running merges)`

LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68 2008-01-24 09:39:44 -05:00
LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ) git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68 2008-01-03 02:44:40 -05:00			`01/03/08`
			`LUCENE-1116: quality package improvements:`
			`- add MRR computation;`
			`- allow control of max #queries to run;`
			`- verify log & report are flushed.`
			`- add TREC query reader for the 1MQ track.`

LUCENE-1102: EnwikiDocMaker now adds a docid field git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68 2007-12-31 08:07:14 -05:00			`12/31/07`
			`LUCENE-1102: EnwikiDocMaker now indexes the docid field, so results might not be comparable with results prior to this change, although`
			`it is doubted that this one small field makes much difference.`

LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68 2007-12-13 03:58:52 -05:00			`12/13/07`
			`LUCENE-1086: DocMakers setup for the "docs.dir" property`
			`fixed to properly handle absolute paths. (Shai Erera via Doron Cohen)`

LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : * git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68 2007-09-18 05:05:06 -04:00			`9/18/07`
			`LUCENE-941: infinite loop for alg: {[AddDoc(4000)]: 4} : *`
LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry) git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68 2007-09-18 05:13:15 -04:00			`ResetInputsTask fixed to work also after exhaustion.`
			`All Reset Tasks now subclas ResetInputsTask.`
LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68 2007-08-18 08:24:21 -04:00
LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68 2007-08-09 04:57:26 -04:00			`8/9/07`
			`LUCENE-971: Change enwiki tasks to a doc maker (extending`
			`LineDocMaker) that directly processes the Wikipedia XML and produces`
			`documents. Intermediate files (one per document) are no longer`
			`created.`

LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68 2007-08-01 14:54:43 -04:00			`8/1/07`
			`LUCENE-967: Add "ReadTokensTask" to allow for benchmarking just tokenization.`

LUCENE-836: Add support for search quality benchmarking. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68 2007-07-27 16:24:52 -04:00			`7/27/07`
			`LUCENE-836: Add support for search quality benchmarking, running`
			`a set of queries against a searcher, and, optionally produce a submission`
			`report, and, if query judgements are available, compute quality measures:`
			`recall, precision_at_N, average_precision, MAP. TREC specific Judge (based`
			`on TREC QRels) and TREC Topics reader are included in o.a.l.benchmark.quality.trec`
			`but any other format of queries and judgements can be implemented and used.`

LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68 2007-07-25 04:54:58 -04:00			`7/24/07`
			`LUCENE-947: Add support for creating and index "one document per`
			`line" from a large text file, which reduces per-document overhead of`
			`opening a single file for each document.`

LUCENE-848. Add Wikipedia benchmarking support git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68 2007-06-30 22:19:10 -04:00			`6/30/07`
			`LUCENE-848: Added support for Wikipedia benchmarking.`

LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat; logging for addDoc/deleteDoc tasks; git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68 2007-06-26 14:27:21 -04:00			`6/25/07`
LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68 2007-06-27 02:49:38 -04:00			`- LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat; logging for addDoc/deleteDoc tasks.`
			`- LUCENE-945: tests fail to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.`
			`(Doron Cohen)`
LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat; logging for addDoc/deleteDoc tasks; git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68 2007-06-26 14:27:21 -04:00
- LUCENE-863: Deprecated StandardBenchmaker in favour of byTask benchmark tasks. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68 2007-04-17 18:11:09 -04:00			`4/17/07`
			`- LUCENE-863: Deprecated StandardBenchmarker in favour of byTask code.`
			`(Otis Gospodnetic)`

contrib/benchmark: better error handling and javadocs around "exhaustive" doc making. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@528617 13f79535-47bb-0310-9956-ffa450edef68 2007-04-13 15:30:03 -04:00			`4/13/07`

			`Better error handling and javadocs around "exhaustive" doc making.`

LUCENE-849: configurable HTML Parser; external classes; exhaustive doc maker - '*'; git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@522569 13f79535-47bb-0310-9956-ffa450edef68 2007-03-26 12:46:33 -04:00			`3/25/07`

			`LUCENE-849:`
			`1. which HTML Parser is used is configurable with html.parser property.`
			`2. External classes added to classpath with -Dbenchmark.ext.classpath=path.`
			`3. '*' as repeating number now means "exhaust doc maker - no repetitions".`

LUCENE-837: Added optional bytes field to store on the Document. Enabled ReutersDocMaker w/ the ability to store byte data in a field. If the param is set (see the javadocs) it will store the contents of the body as a UTF-8 byte array. Then, the SearchTravRetLoadFieldSelectorTask (whew) can take in parameters specifying what fields to load (others are ignored by default) git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521569 13f79535-47bb-0310-9956-ffa450edef68 2007-03-22 23:48:12 -04:00			`3/22/07`

			`-Moved withRetrieve() call out of the loop in ReadTask`
			`-Added SearchTravRetLoadFieldSelectorTask to help benchmark some of the FieldSelector capabilities`
			`-Added options to store content bytes on the Reuters Doc (and others, but Reuters is the only one w/ it enabled)`

LUCENE-840: benchmarking code correctness tests were added. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521526 13f79535-47bb-0310-9956-ffa450edef68 2007-03-22 19:13:48 -04:00			`3/21/07`

			`Tests (for benchmarking code correctness) were added - LUCENE-840.`
			`To be invoked by "ant test" from contrib/benchmark. (Doron Cohen)`

LUCENE-837 applied git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@520890 13f79535-47bb-0310-9956-ffa450edef68 2007-03-21 09:52:34 -04:00			`3/19/07`

			`1. Introduced an AbstractQueryMaker to hold common QueryMaker code. (GSI)`
			`2. Added traversalSize parameter to SearchTravRetTask and SearchTravTask. Changed SearchTravRetTask to extend SearchTravTask. (GSI)`
			`3. Added FileBasedQueryMaker to run queries from a File or resource. (GSI)`
			`4. Modified query-maker generation for read related tasks to make further read tasks addition simpler and safer. (DC)`
			`5. Changed Taks' setParams() to throw UnsupportedOperationException if that task does not suppot command line param. (DC)`
			`6. Improved javadoc to specify all properties command line params currently supported. (DC)`
			`7. Refactored ReportTasks so that it is easy/possible now to create new report tasks. (DC)`

Lucene 675: Initial commit of Doron Cohen's byTask benchmarking contribution. Thanks Doron! git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@495834 13f79535-47bb-0310-9956-ffa450edef68 2007-01-12 23:08:23 -05:00			`01/09/07`

			`1. Committed Doron Cohen's benchmarking contribution, which provides an easily expandable task based approach to benchmarking. See the javadocs for information. (Doron Cohen via Grant Ingersoll)`

Applied 788 and 790 from Doron Cohen. Ran both the micro-standard and the task runs and results look reasonable. Thanks, Doron git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506093 13f79535-47bb-0310-9956-ffa450edef68 2007-02-11 13:59:22 -05:00			`2. Added this file.`

			`3. 2/11/07: LUCENE-790 and 788: Fixed Locale issue with date formatter. Fixed some minor issues with benchmarking by task. Added a dependency`
LUCENE-801: build lucene core and demo first, change classpath to use the build classes instead of the jar git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@507260 13f79535-47bb-0310-9956-ffa450edef68 2007-02-13 17:17:24 -05:00			`on the Lucene demo to the build classpath. (Doron Cohen, Grant Ingersoll)`

- LUCENE-863: Deprecated StandardBenchmaker in favour of byTask benchmark tasks. git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68 2007-04-17 18:11:09 -04:00			`4. 2/13/07: LUCENE-801: build.xml now builds Lucene core and Demo first and has classpath dependencies on the output of that build. (Doron Cohen, Grant Ingersoll)`