Michael McCandless
85fbeddf44
LUCENE-2079: more improvements to contrib/benchmark for testing NRT
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@882648 13f79535-47bb-0310-9956-ffa450edef68
2009-11-20 17:23:34 +00:00
Michael McCandless
443d0093e9
LUCENE-2050: enhance contrib/benchmark for running NRT tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@836154 13f79535-47bb-0310-9956-ffa450edef68
2009-11-14 11:14:18 +00:00
Robert Muir
731f418448
LUCENE-2059: allow TrecContentSource not to change the docname
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@835677 13f79535-47bb-0310-9956-ffa450edef68
2009-11-13 00:47:15 +00:00
Robert Muir
149a0c9b25
LUCENE-2058: specify trec_eval output file from commandline
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@835674 13f79535-47bb-0310-9956-ffa450edef68
2009-11-13 00:41:53 +00:00
Michael McCandless
9bec958bf0
LUCENE-2044: add delete.percent.rand.seed to control random seed for DeleteByPercentTask
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833861 13f79535-47bb-0310-9956-ffa450edef68
2009-11-08 11:54:35 +00:00
Michael McCandless
10e6ceac5d
LUCENE-2043: CommitIndexTask also commits pending changes in the open IndexReader
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833763 13f79535-47bb-0310-9956-ffa450edef68
2009-11-07 21:11:42 +00:00
Michael McCandless
0e2839b2b2
LUCENE-2042: add print.hits.field config
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833745 13f79535-47bb-0310-9956-ffa450edef68
2009-11-07 20:08:00 +00:00
Michael McCandless
73944292d4
LUCENE-2029: allow separate control over body stored/tokenized in DocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833605 13f79535-47bb-0310-9956-ffa450edef68
2009-11-06 23:44:10 +00:00
Michael McCandless
66d8e773b1
LUCENE-1994: fix thread safety of EnwikContentSource and DocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830488 13f79535-47bb-0310-9956-ffa450edef68
2009-10-28 10:06:23 +00:00
Michael McCandless
c471e8e83c
LUCENE-1773: fix highlighter test case; add contrib/memory as dependency to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801160 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 12:09:48 +00:00
Mark Robert Miller
34ab27401c
LUCENE-1770: Add EnwikiQueryMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801043 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 01:39:22 +00:00
Michael McCandless
d3fcafc53e
LUCENE-1773: add FastVectorHighlighter tasks to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800947 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 20:24:02 +00:00
Mark Robert Miller
b155258203
LUCENE-1778: Add support for log.step setting per task type. Perviously, if you included a log.step line in the .alg file, it had been applied to all tasks. Now, you can include a log.step.AddDoc, or log.step.DeleteDoc (for example) to control logging for just these tasks. If you want to ommit logging for any other task, include log.step=-1. The syntax is "log.step." together with the Task's 'short' name (i.e., without the 'Task' part).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800768 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 13:05:03 +00:00
Michael McCandless
094c674c4d
LUCENE-1595: don't use SortField.AUTO; deprecate LineDocMaker & EnwikiDocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798096 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:15:03 +00:00
Mark Robert Miller
3adc61c3ac
LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Mark Robert Miller
add56f5e66
LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller
f780f77366
LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 15:56:39 +00:00
Michael McCandless
c7f865a4c7
LCUENE-1716: allow control over storage of norms (body norms), info stream and whether docs properties should be indexed as fields
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788777 13f79535-47bb-0310-9956-ffa450edef68
2009-06-26 17:26:54 +00:00
Michael McCandless
5f72065d0f
LUCENE-1714: fix WriteLineDocTask to also replace \r, \n (in addition to \t) with space so those chars don't create mal-formed lines
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787750 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 16:46:17 +00:00
Mark Robert Miller
d7d455246f
LUCENE-1595: Separate DocMaker into DocMaker and ContentSource.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786233 13f79535-47bb-0310-9956-ffa450edef68
2009-06-18 19:58:59 +00:00
Michael McCandless
5b472b2d62
LUCENE-1539: add DeleteByPercent, FlushReader tasks, and ability to open reader on a labelled commit point
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784587 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 17:07:55 +00:00
Michael McCandless
51f9b759fa
LUCENE-1495: fix time-based test to reduce change of false failure
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728425 13f79535-47bb-0310-9956-ffa450edef68
2008-12-21 11:07:28 +00:00
Michael McCandless
74e097f8eb
LUCENE-1493: allow setting top number of hits to collect with search.num.hits
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727063 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 15:09:46 +00:00
Michael McCandless
09f661c48c
LUCENE-1492: add optional readOnly param to OpenReader task
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727029 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 11:44:01 +00:00
Grant Ingersoll
dadaebc290
LUCENE-1243: Added new benchmark tasks
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693495 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 15:56:41 +00:00
Doron Cohen
4d9e280788
LUCENE-1090: remove relative paths from benchmark's build.xml.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68
2008-06-10 11:58:00 +00:00
Doron Cohen
25f80c71c9
LUCENE-1209: Fixed DocMaker settings by round. Prior to this fix, DocMaker settings of
...
first round were used in all rounds. (E.g. term vectors.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635280 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 16:43:32 +00:00
Grant Ingersoll
3db9002efb
LUCENE-1156: see CHANGES.txt
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616934 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 22:47:52 +00:00
Doron Cohen
8ace4103a7
LUCENE-1136: add ability to not count sub-task doLogic increment to contri/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614956 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 18:46:57 +00:00
Grant Ingersoll
1183763dbe
LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:39:44 +00:00
Michael McCandless
354a3175d5
LUCENE-1139: various additions/fixes to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68
2008-01-20 11:31:38 +00:00
Doron Cohen
9e65cd554f
LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 07:44:40 +00:00
Grant Ingersoll
90a735441f
LUCENE-1102: EnwikiDocMaker now adds a docid field
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68
2007-12-31 13:07:14 +00:00
Doron Cohen
b7e167ac8d
LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 08:58:52 +00:00
Doron Cohen
40f0adb507
LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen
9e51c30349
LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Grant Ingersoll
c67fd79a83
LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Michael McCandless
d1422ebd6b
LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless
0fd867732e
LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Doron Cohen
98fa2d898d
LUCENE-836: Add support for search quality benchmarking.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael McCandless
02dd452026
LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll
bc7c586468
LUCENE-848. Add Wikipedia benchmarking support
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen
d9b6aa9c0b
LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen
e6c659269a
LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat;
...
logging for addDoc/deleteDoc tasks;
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Otis Gospodnetic
5b63c00106
- LUCENE-863: Deprecated StandardBenchmaker in favour of byTask benchmark tasks.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 22:11:09 +00:00
Doron Cohen
934a56e55f
contrib/benchmark: better error handling and javadocs around "exhaustive" doc making.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@528617 13f79535-47bb-0310-9956-ffa450edef68
2007-04-13 19:30:03 +00:00
Doron Cohen
031f50c4e7
LUCENE-849: configurable HTML Parser; external classes; exhaustive doc maker - '*';
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@522569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-26 16:46:33 +00:00
Grant Ingersoll
f4fffb3491
LUCENE-837:
...
Added optional bytes field to store on the Document. Enabled ReutersDocMaker w/ the ability to store byte data in a field. If the param is set (see the javadocs) it will store the contents of the body as a UTF-8 byte array.
Then, the SearchTravRetLoadFieldSelectorTask (whew) can take in parameters specifying what fields to load (others are ignored by default)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 03:48:12 +00:00
Doron Cohen
34b560603a
LUCENE-840: benchmarking code correctness tests were added.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521526 13f79535-47bb-0310-9956-ffa450edef68
2007-03-22 23:13:48 +00:00
Grant Ingersoll
e09189a830
LUCENE-837 applied
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@520890 13f79535-47bb-0310-9956-ffa450edef68
2007-03-21 13:52:34 +00:00