Mark Harwood
37a060c15c
Additional thread safety around filter creation - old code could create duplicate CachingWrapperFilter if thread1 gets cache miss and thread 2 has a cache miss before thread1 populates cache with new CachingWrapperFilter.
...
Synchronization cost around whole method is OK here because Filter object construction should be a lightweight call.
Note: CachingWrapperFilter currently has a similar bug in bits() method but adding "synchronized" around that whole method would not be a solution there because of the cost of evaluating filter.bits and the unnecessary blocking effect this would have on threads using different readers to the thread with the lock.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@628921 13f79535-47bb-0310-9956-ffa450edef68
2008-02-18 23:39:12 +00:00
Mark Harwood
caed55ef5e
Added thread-safety around use of core's QueryParser.
...
Old XML parser constructors use a mode which will synchronize on use of the user-supplied QueryParser.
New constructors offer alternative option of passing "defaultField" String which is used to create a new single-use QueryParser for each parse operation.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@628568 13f79535-47bb-0310-9956-ffa450edef68
2008-02-17 22:32:58 +00:00
Michael McCandless
fc4bc1f5af
LUCENE-1084: fix more test cases to use user-defined MaxFieldLength ctor for IndexWriter
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619637 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 21:07:35 +00:00
Grant Ingersoll
00941055fd
benchmark autocommit
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619457 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 15:33:02 +00:00
Grant Ingersoll
0dc6c59ac1
LUCENE-494: Added QueryAutoStopWordAnalyzer in a new query subpackage
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619420 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 14:13:38 +00:00
Michael McCandless
7ea8bd35f4
LUCENE-1084: fix the remaining deprecated calls to IndexWriter ctors
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@618764 13f79535-47bb-0310-9956-ffa450edef68
2008-02-05 20:07:29 +00:00
Erik Hatcher
140a867933
Correct javadoc
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@618275 13f79535-47bb-0310-9956-ffa450edef68
2008-02-04 13:31:02 +00:00
Michael McCandless
afb58ca0d2
when reinit() is called, reset the start time (used only when printing # docs indexed)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617991 13f79535-47bb-0310-9956-ffa450edef68
2008-02-03 14:24:30 +00:00
Michael Busch
18b61286fa
LUCENE-584: Changed Filter API to return a DocIdSet instead of a java.util.BitSet. This allows using more efficient data structures for Filters and makes them more flexible.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617859 13f79535-47bb-0310-9956-ffa450edef68
2008-02-02 19:04:03 +00:00
Grant Ingersoll
3db9002efb
LUCENE-1156: see CHANGES.txt
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616934 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 22:47:52 +00:00
Grant Ingersoll
61f2aa3078
allow TrecDocMaker to be more easily extended by making values protected
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616882 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 20:03:39 +00:00
Grant Ingersoll
4fc8654bc1
Made some member variables protected so they can accessed by extending classes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616806 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 16:29:35 +00:00
Grant Ingersoll
b95e9a5154
LUCENE-1132: Updated documentation and deprecated bytes references in favor of character references
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616305 13f79535-47bb-0310-9956-ffa450edef68
2008-01-29 13:35:13 +00:00
Michael McCandless
38f8d4e837
LUCENE-1150: make StandardAnalyzer tokenizer constants public again (public access was accidentally removed with LUCENE-966)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616248 13f79535-47bb-0310-9956-ffa450edef68
2008-01-29 10:51:44 +00:00
Grant Ingersoll
7dd34ef888
LUCENE-1153: added lib/junit and build support
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615329 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 20:27:55 +00:00
Michael McCandless
ee835ccf21
LUCENE-1084: change all IW ctors to require up-front specification of max field length; deprecated existing ones
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615168 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 10:21:15 +00:00
Doron Cohen
8ace4103a7
LUCENE-1136: add ability to not count sub-task doLogic increment to contri/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614956 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 18:46:57 +00:00
Grant Ingersoll
7f09c9f00e
Added javadocs about UNTOKENIZED_TOKEN_FLAG conundrum
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614905 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 15:42:50 +00:00
Grant Ingersoll
305c47f500
LUCENE-1133: Adds ability to keep certain strings as single tokens
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614895 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 15:05:53 +00:00
Grant Ingersoll
1183763dbe
LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:39:44 +00:00
Grant Ingersoll
f75f490eb9
LUCENE-1127: added couple of convenience methods to TokenSources
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614884 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:36:46 +00:00
Michael McCandless
dd0265469a
LUCENE-1139: fix IndexWriter settings (compound, merge.factor, etc.) to work again
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614022 13f79535-47bb-0310-9956-ffa450edef68
2008-01-21 21:42:43 +00:00
Michael McCandless
354a3175d5
LUCENE-1139: various additions/fixes to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68
2008-01-20 11:31:38 +00:00
Grant Ingersoll
ed9fdba601
LUCENE-1050 and LUCENE-1138 fixes for lock problem
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612868 13f79535-47bb-0310-9956-ffa450edef68
2008-01-17 17:00:43 +00:00
Grant Ingersoll
0a54ca1920
LUCENE-1135. Added Javadoc marking the WikipediaTokenizer as experimental
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612244 13f79535-47bb-0310-9956-ffa450edef68
2008-01-15 22:01:21 +00:00
Michael Busch
e5da9db486
Rename README files to uppercase letters
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610961 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:38:38 +00:00
Michael Busch
49dd305f69
Rename README files to uppercase letters
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610960 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:37:27 +00:00
Michael McCandless
ea951785eb
LUCENE-1117: fix EnwikiDocMaker to not hang when the producer thread hits exception
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610738 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 10:12:18 +00:00
Michael McCandless
f0d5002066
LUCENE-1117: fix intermittent thread safety issue w/ EnwikiDocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@609080 13f79535-47bb-0310-9956-ffa450edef68
2008-01-05 01:51:53 +00:00
Grant Ingersoll
79e09db401
LUCENE-1103: Internal links should increment as all tokens do, since the first token is valid too
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608989 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 20:36:23 +00:00
Grant Ingersoll
b18f6ae959
LUCENE-1103: The link is now incremented 1, but then the next token in the link is not incremented. This way, the link is not associated with the previous term. Instead it associated with the next term in the link, which would be the display tokens. If there are no display tokens, then it will take it's proper place in the token chain.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608978 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 20:15:22 +00:00
Grant Ingersoll
f715fc6031
LUCENE-1103
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608852 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 14:29:15 +00:00
Michael McCandless
f12862426a
fix typo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608523 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 15:20:41 +00:00
Doron Cohen
9e65cd554f
LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 07:44:40 +00:00
Michael McCandless
263244312d
LUCENE-1115: some small fixes to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608306 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 01:48:18 +00:00
Grant Ingersoll
ed893f770c
LUCENE-1114: Updated example
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608126 13f79535-47bb-0310-9956-ffa450edef68
2008-01-02 15:30:40 +00:00
Michael Busch
75473edb02
LUCENE-746: Fix error message in AnalyzingQueryParser.getPrefixQuery.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607832 13f79535-47bb-0310-9956-ffa450edef68
2008-01-01 12:49:44 +00:00
Grant Ingersoll
90a735441f
LUCENE-1102: EnwikiDocMaker now adds a docid field
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68
2007-12-31 13:07:14 +00:00
Doron Cohen
ece8361ab5
LUCENE-749: ChainedFilter behavior fixed when logic of first filter is ANDNOT.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607606 13f79535-47bb-0310-9956-ffa450edef68
2007-12-30 22:47:59 +00:00
Grant Ingersoll
bd340a896d
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607291 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:26 +00:00
Grant Ingersoll
5c81934465
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607290 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:16 +00:00
Grant Ingersoll
cb94c6aed4
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607288 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:07:33 +00:00
Grant Ingersoll
40d85a7781
Switch to using the EnwikiDocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607281 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 16:29:03 +00:00
Grant Ingersoll
ca821526b0
removed bad chars at end of file
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605966 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:33:16 +00:00
Grant Ingersoll
36b1206ad8
Restoring ExtractWikipedia, as it is still a handy class to have around. Splitting the documents is useful for debugging purposes when you know you want to look at a specific document instead of grepping through a really large file.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605959 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:14:24 +00:00
Grant Ingersoll
55d0c3a2f8
LUCENE-1077: refactored to have a common PayloadHelper classes. Also added TokenOffsetPayloadTokenFilter, which encodes the Token offset into the payloads
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@604870 13f79535-47bb-0310-9956-ffa450edef68
2007-12-17 13:55:46 +00:00
Doron Cohen
b7e167ac8d
LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 08:58:52 +00:00
Doron Cohen
73f9e7ebc0
fix potential thread-safety issue in contrib/benchmark's TrecDocMaker.
...
(follow-up to http://svn.apache.org/viewvc?view=rev&revision=602475 )
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603846 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 07:26:58 +00:00
Michael McCandless
86ca6f86d7
fix intermittent thread-safety failure in contrib/benchmark unit test
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602475 13f79535-47bb-0310-9956-ffa450edef68
2007-12-08 14:17:07 +00:00
Michael McCandless
b0d2b1c90e
LUCENE-1044: revert the doSync option to FSDirectory
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602165 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 17:42:33 +00:00
Grant Ingersoll
f9b2e971f2
LUCENE-1077 new sinks and payloads analysis packages
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602081 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 12:21:49 +00:00
Michael McCandless
6be2c0765c
LUCENE-1044: also re-default doSync back to false in contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596874 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 23:17:44 +00:00
Michael Busch
b04703fe8f
LUCENE-1055: Remove gdata from trunk.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596501 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 00:46:27 +00:00
Michael Busch
1abb04580f
Disable verbose standard output in MemoryIndexTest by default.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596000 13f79535-47bb-0310-9956-ffa450edef68
2007-11-17 20:19:17 +00:00
Mark Harwood
04ae927f38
Added toString implementation on BooleanFilter.java, provided by Jason Calabrese
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@595996 13f79535-47bb-0310-9956-ffa450edef68
2007-11-17 20:08:06 +00:00
Michael Busch
bb37d2bcff
LUCENE-1051: Generate separate javadocs for core, demo and contrib classes, as well as an unified view.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@595013 13f79535-47bb-0310-9956-ffa450edef68
2007-11-14 19:16:19 +00:00
Michael McCandless
439ba586fc
LUCENE-1044: add doSync option to FSDirectory.getDirectory, defaulting to true, to sync() each file descriptor before close()
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@593799 13f79535-47bb-0310-9956-ffa450edef68
2007-11-10 17:51:00 +00:00
Daniel Naber
2f5507bfc9
fix returning unbalanced quotes in describeParams()
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@586918 13f79535-47bb-0310-9956-ffa450edef68
2007-10-21 17:26:16 +00:00
Grant Ingersoll
a614f0d99a
Added some more algorithms for testing things out, implemented basic TREC query driver based on the sample in the javadocs.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@585677 13f79535-47bb-0310-9956-ffa450edef68
2007-10-17 20:36:20 +00:00
Grant Ingersoll
b7253a06b7
LUCENE-1027: Added better formatting of doubles, added wikipedia-flush-by-RAM for comparison
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583792 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 12:10:31 +00:00
Grant Ingersoll
9c9ebe5cf4
LUCENE-1027: Added support for doubles to Config, also added copies of standard and micro-standard algorithms that flush by RAM
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583771 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 11:05:40 +00:00
Mark Harwood
21a07ee41e
Provided DTDs for core and contrib XML query syntax. The "docs" directory contains detailed documentation generated by DTDdoc from the DTDs. The ant script used to generate these docs is also included but not hooked up to the main build process due to license issues with DTDdoc.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583307 13f79535-47bb-0310-9956-ffa450edef68
2007-10-09 21:45:27 +00:00
Mark Harwood
3872d3bfcc
Updated hashcode/equals to test all fields
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583305 13f79535-47bb-0310-9956-ffa450edef68
2007-10-09 21:40:45 +00:00
Chris M. Hostetter
243861715b
cleaning up a ton of javadoc warnings from gdata. most of these fixes related to either: clarifying packages for @link tags; changing @link or @see tags that pointed at classes/methods that didn't exist (by picking classes with very similar names that do exist); or removing incomplete stub javadocs (that added no information beyond the signature
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@582380 13f79535-47bb-0310-9956-ffa450edef68
2007-10-05 20:30:59 +00:00
Mark Harwood
62fa7b4b82
Added new DuplicateFilter functionality to filter documents sharing a field value (e.g. primary key/url)
...
Also includes Junit test and XML Query support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@581426 13f79535-47bb-0310-9956-ffa450edef68
2007-10-02 22:56:46 +00:00
Grant Ingersoll
dce47c6401
LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579766 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:15:26 +00:00
Grant Ingersoll
bcfad28d69
LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579765 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:12:59 +00:00
Michael McCandless
a28eb4d978
LUCENE-994: change defaults in IndexWriter to maximize 'out of the box' indexing speed
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579360 13f79535-47bb-0310-9956-ffa450edef68
2007-09-25 20:02:07 +00:00
Michael McCandless
511406ecbe
remove temporary print for GData unit test
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577892 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 19:41:42 +00:00
Chris M. Hostetter
3f517bff75
don't just write date to stdout, include date string in failure message
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577873 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:52:42 +00:00
Michael McCandless
fada31fa7f
adding temporary print to figure out why this gdata-server test is failing on build machine
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577861 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:38:23 +00:00
Doron Cohen
40f0adb507
LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen
9e51c30349
LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Michael Busch
9c2a036db3
- LUCENE-908: Improvements and simplifications for how the MANIFEST file and the META-INF dir are created.
...
- LUCENE-935: Various improvements for the maven artifacts. Now the artifacts also include the sources as .jar files.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@568766 13f79535-47bb-0310-9956-ffa450edef68
2007-08-22 23:16:48 +00:00
Grant Ingersoll
c67fd79a83
LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Grant Ingersoll
d1f90c7825
Deprecated all the old benchmarking stuff
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566454 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:49:32 +00:00
Grant Ingersoll
9192b16643
Deprecated all the old benchmarking stuff
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566435 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:23:06 +00:00
Grant Ingersoll
477c4e0efe
Deprecated all the old benchmarking stuff
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566433 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:22:46 +00:00
Michael McCandless
d42de32984
LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Grant Ingersoll
82eb074afd
LUCENE-974: Removed Author tags from all existing code
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564236 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 15:21:19 +00:00
Michael McCandless
d1422ebd6b
LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless
2d954694dc
LUCENE-966: sizable (~6X faster) speedups to StandardTokenizer by using JFlex instead of JavaCC
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564036 13f79535-47bb-0310-9956-ffa450edef68
2007-08-08 22:26:44 +00:00
Michael McCandless
0fd867732e
LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Doron Cohen
f3b9c9407a
for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Doron Cohen
98fa2d898d
LUCENE-836: Add support for search quality benchmarking.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael McCandless
02dd452026
LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll
e97d5830ce
LUCENE-868: New Term Vector access mechanism. Allows for applications to define how they access term vector information instead of having to pack/unpack the TV info returned by the old way.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558592 13f79535-47bb-0310-9956-ffa450edef68
2007-07-23 03:17:25 +00:00
Michael McCandless
96ea45d193
LUCENE-952: force synchronized access to writer instance variable to fix infinite spin loop in TestGdataIndexer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@554189 13f79535-47bb-0310-9956-ffa450edef68
2007-07-07 12:28:04 +00:00
Michael McCandless
cff5767e44
LUCENE-843: speed up IndexWriter performance
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553236 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 15:16:38 +00:00
Michael McCandless
c03f45a62d
LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00
Grant Ingersoll
bc7c586468
LUCENE-848. Add Wikipedia benchmarking support
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen
9ff9bf8142
fix javadoc unknown tag warning.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552111 13f79535-47bb-0310-9956-ffa450edef68
2007-06-30 07:04:27 +00:00
Doron Cohen
ba90fb06c1
LUCENE-946: replace text from an online collection (used in
...
few test cases) with text that is surely 100% free.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552009 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 19:43:19 +00:00
Otis Gospodnetic
71f2c1da8b
- LUCENE-906: Elision filter for French.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551744 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 00:36:09 +00:00
Doron Cohen
d9b6aa9c0b
LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen
e6c659269a
LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat;
...
logging for addDoc/deleteDoc tasks;
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Michael Busch
d955a970b6
LUCENE-622: Add ant target and pom.xml files for building maven artifacts of the Lucene core and the contrib modules.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@547860 13f79535-47bb-0310-9956-ffa450edef68
2007-06-16 04:45:13 +00:00
Chris M. Hostetter
373af70a20
LUCENE-930: Various contrib building improvements to ensure contrib dependencies are met, and test compilation errors fail the build.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@546226 13f79535-47bb-0310-9956-ffa450edef68
2007-06-11 19:29:41 +00:00
Michael Busch
df0a188415
LUCENE-931: adding missing license headers to various files
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545696 13f79535-47bb-0310-9956-ffa450edef68
2007-06-09 06:09:46 +00:00
Mark Harwood
1f1528e3ca
Exposed the MoreLikeThis "minDocFreq" property for use in MoreLikeThisQuery.java and in XML queries
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545333 13f79535-47bb-0310-9956-ffa450edef68
2007-06-07 22:13:37 +00:00