180 Commits

Author SHA1 Message Date
Michael McCandless afb58ca0d2 when reinit() is called, reset the start time (used only when printing # docs indexed)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617991 13f79535-47bb-0310-9956-ffa450edef68
2008-02-03 14:24:30 +00:00
Grant Ingersoll 3db9002efb LUCENE-1156: see CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616934 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 22:47:52 +00:00
Grant Ingersoll 61f2aa3078 allow TrecDocMaker to be more easily extended by making values protected
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616882 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 20:03:39 +00:00
Grant Ingersoll 4fc8654bc1 Made some member variables protected so they can be accessed by extending classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616806 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 16:29:35 +00:00
Michael McCandless ee835ccf21 LUCENE-1084: change all IW ctors to require up-front specification of max field length; deprecated existing ones
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615168 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 10:21:15 +00:00
Doron Cohen 8ace4103a7 LUCENE-1136: add ability to not count sub-task doLogic increment to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614956 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 18:46:57 +00:00
Grant Ingersoll 1183763dbe LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:39:44 +00:00
Michael McCandless dd0265469a LUCENE-1139: fix IndexWriter settings (compound, merge.factor, etc.) to work again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614022 13f79535-47bb-0310-9956-ffa450edef68
2008-01-21 21:42:43 +00:00
Michael McCandless 354a3175d5 LUCENE-1139: various additions/fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68
2008-01-20 11:31:38 +00:00
Michael McCandless ea951785eb LUCENE-1117: fix EnwikiDocMaker to not hang when the producer thread hits exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610738 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 10:12:18 +00:00
Michael McCandless f0d5002066 LUCENE-1117: fix intermittent thread safety issue w/ EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@609080 13f79535-47bb-0310-9956-ffa450edef68
2008-01-05 01:51:53 +00:00
Michael McCandless f12862426a fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608523 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 15:20:41 +00:00
Doron Cohen 9e65cd554f LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 07:44:40 +00:00
Michael McCandless 263244312d LUCENE-1115: some small fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608306 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 01:48:18 +00:00
Grant Ingersoll 90a735441f LUCENE-1102: EnwikiDocMaker now adds a docid field
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68
2007-12-31 13:07:14 +00:00
Grant Ingersoll bd340a896d
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607291 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:26 +00:00
Grant Ingersoll 5c81934465
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607290 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:16 +00:00
Grant Ingersoll cb94c6aed4
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607288 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:07:33 +00:00
Grant Ingersoll 40d85a7781 Switch to using the EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607281 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 16:29:03 +00:00
Grant Ingersoll ca821526b0 removed bad chars at end of file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605966 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:33:16 +00:00
Grant Ingersoll 36b1206ad8 Restoring ExtractWikipedia, as it is still a handy class to have around. Splitting the documents is useful for debugging purposes when you know you want to look at a specific document instead of grepping through a really large file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605959 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:14:24 +00:00
Doron Cohen b7e167ac8d LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 08:58:52 +00:00
Doron Cohen 73f9e7ebc0 fix potential thread-safety issue in contrib/benchmark's TrecDocMaker.
(follow-up to http://svn.apache.org/viewvc?view=rev&revision=602475)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603846 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 07:26:58 +00:00
Michael McCandless 86ca6f86d7 fix intermittent thread-safety failure in contrib/benchmark unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602475 13f79535-47bb-0310-9956-ffa450edef68
2007-12-08 14:17:07 +00:00
Michael McCandless b0d2b1c90e LUCENE-1044: revert the doSync option to FSDirectory
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602165 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 17:42:33 +00:00
Michael McCandless 6be2c0765c LUCENE-1044: also re-default doSync back to false in contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596874 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 23:17:44 +00:00
Michael McCandless 439ba586fc LUCENE-1044: add doSync option to FSDirectory.getDirectory, defaulting to true, to sync() each file descriptor before close()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@593799 13f79535-47bb-0310-9956-ffa450edef68
2007-11-10 17:51:00 +00:00
Grant Ingersoll a614f0d99a Added some more algorithms for testing things out, implemented basic TREC query driver based on the sample in the javadocs.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@585677 13f79535-47bb-0310-9956-ffa450edef68
2007-10-17 20:36:20 +00:00
Grant Ingersoll b7253a06b7 LUCENE-1027: Added better formatting of doubles, added wikipedia-flush-by-RAM for comparison
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583792 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 12:10:31 +00:00
Grant Ingersoll 9c9ebe5cf4 LUCENE-1027: Added support for doubles to Config, also added copies of standard and micro-standard algorithms that flush by RAM
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583771 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 11:05:40 +00:00
Michael McCandless a28eb4d978 LUCENE-994: change defaults in IndexWriter to maximize 'out of the box' indexing speed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579360 13f79535-47bb-0310-9956-ffa450edef68
2007-09-25 20:02:07 +00:00
Doron Cohen 40f0adb507 LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen 9e51c30349 LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Michael Busch 9c2a036db3 - LUCENE-908: Improvements and simplifications for how the MANIFEST file and the META-INF dir are created.
- LUCENE-935: Various improvements for the maven artifacts. Now the artifacts also include the sources as .jar files. 

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@568766 13f79535-47bb-0310-9956-ffa450edef68
2007-08-22 23:16:48 +00:00
Grant Ingersoll c67fd79a83 LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Grant Ingersoll d1f90c7825 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566454 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:49:32 +00:00
Grant Ingersoll 9192b16643 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566435 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:23:06 +00:00
Grant Ingersoll 477c4e0efe Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566433 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:22:46 +00:00
Michael McCandless d42de32984 LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Michael McCandless d1422ebd6b LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless 0fd867732e LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Doron Cohen f3b9c9407a for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Doron Cohen 98fa2d898d LUCENE-836: Add support for search quality benchmarking.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael McCandless 02dd452026 LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Michael McCandless c03f45a62d LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00
Grant Ingersoll bc7c586468 LUCENE-848. Add Wikipedia benchmarking support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen ba90fb06c1 LUCENE-946: replace text from an online collection (used in a few test cases) with text that is surely 100% free.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552009 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 19:43:19 +00:00
Doron Cohen d9b6aa9c0b LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen e6c659269a LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat;
logging for addDoc/deleteDoc tasks;

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Michael Busch d955a970b6 LUCENE-622: Add ant target and pom.xml files for building maven artifacts of the Lucene core and the contrib modules.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@547860 13f79535-47bb-0310-9956-ffa450edef68
2007-06-16 04:45:13 +00:00
Chris M. Hostetter 373af70a20 LUCENE-930: Various contrib building improvements to ensure contrib dependencies are met, and test compilation errors fail the build.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@546226 13f79535-47bb-0310-9956-ffa450edef68
2007-06-11 19:29:41 +00:00
Michael Busch df0a188415 LUCENE-931: adding missing license headers to various files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545696 13f79535-47bb-0310-9956-ffa450edef68
2007-06-09 06:09:46 +00:00
Grant Ingersoll b5e09521b5 javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@538783 13f79535-47bb-0310-9956-ffa450edef68
2007-05-17 01:13:40 +00:00
Doron Cohen 8ed11daae0 javadoc warnings and errors solved (except for gdata).
build.xml "javadocs" target will now fail if there are
javadoc warnings/errors under jdk1.4.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@535448 13f79535-47bb-0310-9956-ffa450edef68
2007-05-05 05:36:49 +00:00
Otis Gospodnetic 5b63c00106 - LUCENE-863: Deprecated StandardBenchmaker in favour of byTask benchmark tasks.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 22:11:09 +00:00
Doron Cohen 33694c7d86 LUCENE-864: set svn:eol-style=native.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529511 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 07:11:04 +00:00
Doron Cohen 934a56e55f contrib/benchmark: better error handling and javadocs around "exhaustive" doc making.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@528617 13f79535-47bb-0310-9956-ffa450edef68
2007-04-13 19:30:03 +00:00
Otis Gospodnetic 5650a7bf92 - find . -name \*java -type f | xargs perl -pi -e 's/(\r\n|\n|\r)/\n/g' \
&& find . -name \*java -type f | xargs svn propset svn:eol-style native

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524969 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 23:37:14 +00:00
Otis Gospodnetic 3a1652a95b - Added the missing ASL 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524958 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:47:10 +00:00
Otis Gospodnetic af77bc03c1 - Fixed javadoc typos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524952 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:56 +00:00
Otis Gospodnetic e158736d91 - Import cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524950 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:05 +00:00
Otis Gospodnetic ec0493167c - Don't use the deprecated FSDirectory.getDirectory(indexDir, boolean)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524949 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:26:44 +00:00
Otis Gospodnetic 023ffe2c29 - Cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524914 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 20:27:34 +00:00
Doron Cohen 031f50c4e7 LUCENE-849: configurable HTML Parser; external classes; exhaustive doc maker - '*';
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@522569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-26 16:46:33 +00:00
Doron Cohen 346b484473 Documentation updates following LUCENE-837.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521830 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 17:56:09 +00:00
Grant Ingersoll 251c2b328a LUCENE-837:
Fixed issues related to Doron's comments on Mar. 22, 07.

Put in tests for ***FieldSelectorTask.  Moved the byte field handling from ReutersDocMaker to BasicDocMaker.

Added in tests for new Task

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521717 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 13:13:46 +00:00
Grant Ingersoll cfff81d1dc LUCENE-837:
fix copyright

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521698 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 12:26:08 +00:00
Grant Ingersoll f4fffb3491 LUCENE-837:
Added optional bytes field to store on the Document.  Enabled ReutersDocMaker w/ the ability to store byte data in a field.  If the param is set (see the javadocs) it will store the contents of the body as a UTF-8 byte array.

Then, the SearchTravRetLoadFieldSelectorTask (whew) can take in parameters specifying what fields to load (others are ignored by default)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 03:48:12 +00:00
Doron Cohen 34b560603a LUCENE-840: benchmarking code correctness tests were added.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521526 13f79535-47bb-0310-9956-ffa450edef68
2007-03-22 23:13:48 +00:00
Grant Ingersoll e09189a830 LUCENE-837 applied
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@520890 13f79535-47bb-0310-9956-ffa450edef68
2007-03-21 13:52:34 +00:00
Doron Cohen 9a31a8828f correct names of dynamically loaded classes (analyzers, doc/query makers)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@519567 13f79535-47bb-0310-9956-ffa450edef68
2007-03-18 06:51:06 +00:00
Doron Cohen 977e729685 LUCENE-829: close done readers in contrib/benchmark.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@518822 13f79535-47bb-0310-9956-ffa450edef68
2007-03-16 01:16:56 +00:00
Doron Cohen ab763bbdae contrib.benchmark.byTask javadocs corrections
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@517566 13f79535-47bb-0310-9956-ffa450edef68
2007-03-13 06:54:03 +00:00
Grant Ingersoll ee12ced8e7 LUCENE-801: build lucene core and demo first, change classpath to use the build classes instead of the jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@507260 13f79535-47bb-0310-9956-ffa450edef68
2007-02-13 22:17:24 +00:00
Grant Ingersoll 13eaccfd56 Lucene 790
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506441 13f79535-47bb-0310-9956-ffa450edef68
2007-02-12 13:32:20 +00:00
Grant Ingersoll bb66099414 Applied 788 and 790 from Doron Cohen. Ran both the micro-standard and the task runs and results look reasonable.
Thanks, Doron

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506093 13f79535-47bb-0310-9956-ffa450edef68
2007-02-11 18:59:22 +00:00
Grant Ingersoll db238e787f Lucene 675: Initial commit of Doron Cohen's byTask benchmarking contribution. Thanks Doron!
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@495834 13f79535-47bb-0310-9956-ffa450edef68
2007-01-13 04:08:23 +00:00
Grant Ingersoll 8cffe45c2a replaced string concatenation with chained appends
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492383 13f79535-47bb-0310-9956-ffa450edef68
2007-01-04 02:17:58 +00:00
Grant Ingersoll 423c1b4c9c Commit benchmark code. Thanks to Doron Cohen for updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@475222 13f79535-47bb-0310-9956-ffa450edef68
2006-11-15 13:09:33 +00:00
Grant Ingersoll 861a964ce4 initial checkin of build and dependencies
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@471623 13f79535-47bb-0310-9956-ffa450edef68
2006-11-06 03:05:23 +00:00