Commit Graph

53 Commits

Author SHA1 Message Date
Michael McCandless 85fbeddf44 LUCENE-2079: more improvements to contrib/benchmark for testing NRT
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@882648 13f79535-47bb-0310-9956-ffa450edef68
2009-11-20 17:23:34 +00:00
Michael McCandless 443d0093e9 LUCENE-2050: enhance contrib/benchmark for running NRT tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@836154 13f79535-47bb-0310-9956-ffa450edef68
2009-11-14 11:14:18 +00:00
Robert Muir 731f418448 LUCENE-2059: allow TrecContentSource not to change the docname
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@835677 13f79535-47bb-0310-9956-ffa450edef68
2009-11-13 00:47:15 +00:00
Robert Muir 149a0c9b25 LUCENE-2058: specify trec_eval output file from commandline
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@835674 13f79535-47bb-0310-9956-ffa450edef68
2009-11-13 00:41:53 +00:00
Michael McCandless 9bec958bf0 LUCENE-2044: add delete.percent.rand.seed to control random seed for DeleteByPercentTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833861 13f79535-47bb-0310-9956-ffa450edef68
2009-11-08 11:54:35 +00:00
Michael McCandless 10e6ceac5d LUCENE-2043: CommitIndexTask also commits pending changes in the open IndexReader
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833763 13f79535-47bb-0310-9956-ffa450edef68
2009-11-07 21:11:42 +00:00
Michael McCandless 0e2839b2b2 LUCENE-2042: add print.hits.field config
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833745 13f79535-47bb-0310-9956-ffa450edef68
2009-11-07 20:08:00 +00:00
Michael McCandless 73944292d4 LUCENE-2029: allow separate control over body stored/tokenized in DocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833605 13f79535-47bb-0310-9956-ffa450edef68
2009-11-06 23:44:10 +00:00
Michael McCandless 66d8e773b1 LUCENE-1994: fix thread safety of EnwikiContentSource and DocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830488 13f79535-47bb-0310-9956-ffa450edef68
2009-10-28 10:06:23 +00:00
Michael McCandless c471e8e83c LUCENE-1773: fix highlighter test case; add contrib/memory as dependency to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801160 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 12:09:48 +00:00
Mark Robert Miller 34ab27401c LUCENE-1770: Add EnwikiQueryMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801043 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 01:39:22 +00:00
Michael McCandless d3fcafc53e LUCENE-1773: add FastVectorHighlighter tasks to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800947 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 20:24:02 +00:00
Mark Robert Miller b155258203 LUCENE-1778: Add support for log.step setting per task type. Previously, if you included a log.step line in the .alg file, it was applied to all tasks. Now, you can include a log.step.AddDoc, or log.step.DeleteDoc (for example) to control logging for just these tasks. If you want to omit logging for any other task, include log.step=-1. The syntax is "log.step." together with the Task's 'short' name (i.e., without the 'Task' part).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800768 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 13:05:03 +00:00
Michael McCandless 094c674c4d LUCENE-1595: don't use SortField.AUTO; deprecate LineDocMaker & EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798096 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:15:03 +00:00
Mark Robert Miller 3adc61c3ac LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Mark Robert Miller add56f5e66 LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller f780f77366 LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 15:56:39 +00:00
Michael McCandless c7f865a4c7 LUCENE-1716: allow control over storage of norms (body norms), info stream and whether docs properties should be indexed as fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788777 13f79535-47bb-0310-9956-ffa450edef68
2009-06-26 17:26:54 +00:00
Michael McCandless 5f72065d0f LUCENE-1714: fix WriteLineDocTask to also replace \r, \n (in addition to \t) with space so those chars don't create mal-formed lines
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787750 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 16:46:17 +00:00
Mark Robert Miller d7d455246f LUCENE-1595: Separate DocMaker into DocMaker and ContentSource.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786233 13f79535-47bb-0310-9956-ffa450edef68
2009-06-18 19:58:59 +00:00
Michael McCandless 5b472b2d62 LUCENE-1539: add DeleteByPercent, FlushReader tasks, and ability to open reader on a labelled commit point
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784587 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 17:07:55 +00:00
Michael McCandless 51f9b759fa LUCENE-1495: fix time-based test to reduce chance of false failure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728425 13f79535-47bb-0310-9956-ffa450edef68
2008-12-21 11:07:28 +00:00
Michael McCandless 74e097f8eb LUCENE-1493: allow setting top number of hits to collect with search.num.hits
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727063 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 15:09:46 +00:00
Michael McCandless 09f661c48c LUCENE-1492: add optional readOnly param to OpenReader task
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727029 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 11:44:01 +00:00
Grant Ingersoll dadaebc290 LUCENE-1243: Added new benchmark tasks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693495 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 15:56:41 +00:00
Doron Cohen 4d9e280788 LUCENE-1090: remove relative paths from benchmark's build.xml.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68
2008-06-10 11:58:00 +00:00
Doron Cohen 25f80c71c9 LUCENE-1209: Fixed DocMaker settings by round. Prior to this fix, DocMaker settings of the first round were used in all rounds. (E.g. term vectors.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635280 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 16:43:32 +00:00
Grant Ingersoll 3db9002efb LUCENE-1156: see CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616934 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 22:47:52 +00:00
Doron Cohen 8ace4103a7 LUCENE-1136: add ability to not count sub-task doLogic increment to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614956 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 18:46:57 +00:00
Grant Ingersoll 1183763dbe LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:39:44 +00:00
Michael McCandless 354a3175d5 LUCENE-1139: various additions/fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68
2008-01-20 11:31:38 +00:00
Doron Cohen 9e65cd554f LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 07:44:40 +00:00
Grant Ingersoll 90a735441f LUCENE-1102: EnwikiDocMaker now adds a docid field
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68
2007-12-31 13:07:14 +00:00
Doron Cohen b7e167ac8d LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 08:58:52 +00:00
Doron Cohen 40f0adb507 LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen 9e51c30349 LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Grant Ingersoll c67fd79a83 LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Michael McCandless d1422ebd6b LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless 0fd867732e LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Doron Cohen 98fa2d898d LUCENE-836: Add support for search quality benchmarking.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael McCandless 02dd452026 LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll bc7c586468 LUCENE-848. Add Wikipedia benchmarking support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen d9b6aa9c0b LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen e6c659269a LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat; logging for addDoc/deleteDoc tasks;
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Otis Gospodnetic 5b63c00106 LUCENE-863: Deprecated StandardBenchmarker in favour of byTask benchmark tasks.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 22:11:09 +00:00
Doron Cohen 934a56e55f contrib/benchmark: better error handling and javadocs around "exhaustive" doc making.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@528617 13f79535-47bb-0310-9956-ffa450edef68
2007-04-13 19:30:03 +00:00
Doron Cohen 031f50c4e7 LUCENE-849: configurable HTML Parser; external classes; exhaustive doc maker - '*';
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@522569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-26 16:46:33 +00:00
Grant Ingersoll f4fffb3491 LUCENE-837: Added optional bytes field to store on the Document. Enabled ReutersDocMaker w/ the ability to store byte data in a field. If the param is set (see the javadocs) it will store the contents of the body as a UTF-8 byte array. Then, the SearchTravRetLoadFieldSelectorTask (whew) can take in parameters specifying what fields to load (others are ignored by default).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 03:48:12 +00:00
Doron Cohen 34b560603a LUCENE-840: benchmarking code correctness tests were added.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521526 13f79535-47bb-0310-9956-ffa450edef68
2007-03-22 23:13:48 +00:00
Grant Ingersoll e09189a830 LUCENE-837 applied
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@520890 13f79535-47bb-0310-9956-ffa450edef68
2007-03-21 13:52:34 +00:00