Michael McCandless
354a3175d5
LUCENE-1139: various additions/fixes to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68
2008-01-20 11:31:38 +00:00
Michael McCandless
ea951785eb
LUCENE-1117: fix EnwikiDocMaker to not hang when the producer thread hits exception
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610738 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 10:12:18 +00:00
Michael McCandless
f0d5002066
LUCENE-1117: fix intermittent thread safety issue w/ EnwikiDocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@609080 13f79535-47bb-0310-9956-ffa450edef68
2008-01-05 01:51:53 +00:00
Michael McCandless
f12862426a
fix typo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608523 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 15:20:41 +00:00
Doron Cohen
9e65cd554f
LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 07:44:40 +00:00
Michael McCandless
263244312d
LUCENE-1115: some small fixes to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608306 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 01:48:18 +00:00
Grant Ingersoll
90a735441f
LUCENE-1102: EnwikiDocMaker now adds a docid field
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68
2007-12-31 13:07:14 +00:00
Grant Ingersoll
bd340a896d
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607291 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:26 +00:00
Grant Ingersoll
5c81934465
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607290 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:16 +00:00
Grant Ingersoll
cb94c6aed4
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607288 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:07:33 +00:00
Grant Ingersoll
40d85a7781
Switch to using the EnwikiDocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607281 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 16:29:03 +00:00
Grant Ingersoll
ca821526b0
removed bad chars at end of file
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605966 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:33:16 +00:00
Grant Ingersoll
36b1206ad8
Restoring ExtractWikipedia, as it is still a handy class to have around. Splitting the documents is useful for debugging purposes when you know you want to look at a specific document instead of grepping through a really large file.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605959 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:14:24 +00:00
Doron Cohen
b7e167ac8d
LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 08:58:52 +00:00
Doron Cohen
73f9e7ebc0
fix potential thread-safety issue in contrib/benchmark's TrecDocMaker.
...
(follow-up to http://svn.apache.org/viewvc?view=rev&revision=602475 )
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603846 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 07:26:58 +00:00
Michael McCandless
86ca6f86d7
fix intermittent thread-safety failure in contrib/benchmark unit test
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602475 13f79535-47bb-0310-9956-ffa450edef68
2007-12-08 14:17:07 +00:00
Michael McCandless
b0d2b1c90e
LUCENE-1044: revert the doSync option to FSDirectory
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602165 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 17:42:33 +00:00
Michael McCandless
6be2c0765c
LUCENE-1044: also re-default doSync back to false in contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596874 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 23:17:44 +00:00
Michael McCandless
439ba586fc
LUCENE-1044: add doSync option to FSDirectory.getDirectory, defaulting to true, to sync() each file descriptor before close()
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@593799 13f79535-47bb-0310-9956-ffa450edef68
2007-11-10 17:51:00 +00:00
Grant Ingersoll
a614f0d99a
Added some more algorithms for testing things out, implemented basic TREC query driver based on the sample in the javadocs.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@585677 13f79535-47bb-0310-9956-ffa450edef68
2007-10-17 20:36:20 +00:00
Grant Ingersoll
b7253a06b7
LUCENE-1027: Added better formatting of doubles, added wikipedia-flush-by-RAM for comparison
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583792 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 12:10:31 +00:00
Grant Ingersoll
9c9ebe5cf4
LUCENE-1027: Added support for doubles to Config, also added copies of standard and micro-standard algorithms that flush by RAM
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583771 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 11:05:40 +00:00
Michael McCandless
a28eb4d978
LUCENE-994: change defaults in IndexWriter to maximize 'out of the box' indexing speed
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579360 13f79535-47bb-0310-9956-ffa450edef68
2007-09-25 20:02:07 +00:00
Doron Cohen
40f0adb507
LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen
9e51c30349
LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Michael Busch
9c2a036db3
- LUCENE-908: Improvements and simplifications for how the MANIFEST file and the META-INF dir are created.
...
- LUCENE-935: Various improvements for the maven artifacts. Now the artifacts also include the sources as .jar files.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@568766 13f79535-47bb-0310-9956-ffa450edef68
2007-08-22 23:16:48 +00:00
Grant Ingersoll
c67fd79a83
LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Grant Ingersoll
d1f90c7825
Deprecated all the old benchmarking stuff
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566454 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:49:32 +00:00
Grant Ingersoll
9192b16643
Deprecated all the old benchmarking stuff
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566435 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:23:06 +00:00
Grant Ingersoll
477c4e0efe
Deprecated all the old benchmarking stuff
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566433 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:22:46 +00:00
Michael McCandless
d42de32984
LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Michael McCandless
d1422ebd6b
LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless
0fd867732e
LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Doron Cohen
f3b9c9407a
for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Doron Cohen
98fa2d898d
LUCENE-836: Add support for search quality benchmarking.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael McCandless
02dd452026
LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Michael McCandless
c03f45a62d
LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00
Grant Ingersoll
bc7c586468
LUCENE-848. Add Wikipedia benchmarking support
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen
ba90fb06c1
LUCENE-946: replace text from an online collection (used in
...
few test cases) with text that is surely 100% free.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552009 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 19:43:19 +00:00
Doron Cohen
d9b6aa9c0b
LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen
e6c659269a
LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat;
...
logging for addDoc/deleteDoc tasks;
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Michael Busch
d955a970b6
LUCENE-622: Add ant target and pom.xml files for building maven artifacts of the Lucene core and the contrib modules.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@547860 13f79535-47bb-0310-9956-ffa450edef68
2007-06-16 04:45:13 +00:00
Chris M. Hostetter
373af70a20
LUCENE-930: Various contrib building improvements to ensure contrib dependencies are met, and test compilation errors fail the build.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@546226 13f79535-47bb-0310-9956-ffa450edef68
2007-06-11 19:29:41 +00:00
Michael Busch
df0a188415
LUCENE-931: adding missing license headers to various files
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545696 13f79535-47bb-0310-9956-ffa450edef68
2007-06-09 06:09:46 +00:00
Grant Ingersoll
b5e09521b5
javadoc fix
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@538783 13f79535-47bb-0310-9956-ffa450edef68
2007-05-17 01:13:40 +00:00
Doron Cohen
8ed11daae0
javadoc warnings and errors solved (except for gdata).
...
build.xml "javadocs" target will now fail if there are
javadoc warnings/errors under jdk1.4.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@535448 13f79535-47bb-0310-9956-ffa450edef68
2007-05-05 05:36:49 +00:00
Otis Gospodnetic
5b63c00106
- LUCENE-863: Deprecated StandardBenchmaker in favour of byTask benchmark tasks.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 22:11:09 +00:00
Doron Cohen
33694c7d86
LUCENE-864: set svn:eol-style=native.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529511 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 07:11:04 +00:00
Doron Cohen
934a56e55f
contrib/benchmark: better error handling and javadocs around "exhaustive" doc making.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@528617 13f79535-47bb-0310-9956-ffa450edef68
2007-04-13 19:30:03 +00:00
Otis Gospodnetic
5650a7bf92
- find . -name \*java -type f | xargs perl -pi -e 's/(\r\n|\n|\r)/\n/g' \
...
&& find . -name \*java -type f | xargs svn propset svn:eol-style native
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524969 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 23:37:14 +00:00
Otis Gospodnetic
3a1652a95b
- Added the missing ASL 2.0
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524958 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:47:10 +00:00
Otis Gospodnetic
af77bc03c1
- Fixed javadoc typos.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524952 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:56 +00:00
Otis Gospodnetic
e158736d91
- Import cleanup
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524950 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:05 +00:00
Otis Gospodnetic
ec0493167c
- Don't use the deprecated FSDirectory.getDirectory(indexDir, boolean)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524949 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:26:44 +00:00
Otis Gospodnetic
023ffe2c29
- Cosmetics
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524914 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 20:27:34 +00:00
Doron Cohen
031f50c4e7
LUCENE-849: configurable HTML Parser; external classes; exhaustive doc maker - '*';
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@522569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-26 16:46:33 +00:00
Doron Cohen
346b484473
Documentation updates following LUCENE-837.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521830 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 17:56:09 +00:00
Grant Ingersoll
251c2b328a
LUCENE-837:
...
Fixed issues related to Doron's comments on Mar. 22, 07.
PUt in tests for ***FieldSelectorTask. Moved the byte field handling from ReutersDocMaker to BasicDocMaker.
Added in tests for new Task
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521717 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 13:13:46 +00:00
Grant Ingersoll
cfff81d1dc
LUCENE-837:
...
fix copyright
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521698 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 12:26:08 +00:00
Grant Ingersoll
f4fffb3491
LUCENE-837:
...
Added optional bytes field to store on the Document. Enabled ReutersDocMaker w/ the ability to store byte data in a field. If the param is set (see the javadocs) it will store the contents of the body as a UTF-8 byte array.
Then, the SearchTravRetLoadFieldSelectorTask (whew) can take in parameters specifying what fields to load (others are ignored by default)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 03:48:12 +00:00
Doron Cohen
34b560603a
LUCENE-840: benchmarking code correctness tests were added.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521526 13f79535-47bb-0310-9956-ffa450edef68
2007-03-22 23:13:48 +00:00
Grant Ingersoll
e09189a830
LUCENE-837 applied
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@520890 13f79535-47bb-0310-9956-ffa450edef68
2007-03-21 13:52:34 +00:00
Doron Cohen
9a31a8828f
correct names of dynamically loaded classes (analyzers, doc/query makers)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@519567 13f79535-47bb-0310-9956-ffa450edef68
2007-03-18 06:51:06 +00:00
Doron Cohen
977e729685
LUCENE-829: close done readers in contrib/benchmark.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@518822 13f79535-47bb-0310-9956-ffa450edef68
2007-03-16 01:16:56 +00:00
Doron Cohen
ab763bbdae
contrib.benhmark.byTask javadocs corrections
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@517566 13f79535-47bb-0310-9956-ffa450edef68
2007-03-13 06:54:03 +00:00
Grant Ingersoll
ee12ced8e7
LUCENE-801: build lucene core and demo first, change classpath to use the build classes instead of the jar
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@507260 13f79535-47bb-0310-9956-ffa450edef68
2007-02-13 22:17:24 +00:00
Grant Ingersoll
13eaccfd56
Lucene 790
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506441 13f79535-47bb-0310-9956-ffa450edef68
2007-02-12 13:32:20 +00:00
Grant Ingersoll
bb66099414
Applied 788 and 790 from Doron Cohen. Ran both the micro-standard and the task runs and results look reasonable.
...
Thanks, Doron
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506093 13f79535-47bb-0310-9956-ffa450edef68
2007-02-11 18:59:22 +00:00
Grant Ingersoll
db238e787f
Lucene 675: Initial commit of Doron Cohen's byTask benchmarking contribution. Thanks Doron!
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@495834 13f79535-47bb-0310-9956-ffa450edef68
2007-01-13 04:08:23 +00:00
Grant Ingersoll
8cffe45c2a
replaced string concatenation with chained appends
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492383 13f79535-47bb-0310-9956-ffa450edef68
2007-01-04 02:17:58 +00:00
Grant Ingersoll
423c1b4c9c
Commit benchmark code. Thanks to Doron Cohen for updates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@475222 13f79535-47bb-0310-9956-ffa450edef68
2006-11-15 13:09:33 +00:00
Grant Ingersoll
861a964ce4
initial checkin of build and dependencies
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@471623 13f79535-47bb-0310-9956-ffa450edef68
2006-11-06 03:05:23 +00:00