2588 Commits

Author SHA1 Message Date
Yonik Seeley
a94db219bd LUCENE-977: internal hashing improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@565368 13f79535-47bb-0310-9956-ffa450edef68
2007-08-13 14:59:03 +00:00
Michael McCandless
c25c1282ef LUCENE-976: add missing openInput(String, int) to MMapDirectory
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564785 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 22:20:35 +00:00
Michael McCandless
d42de32984 LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Grant Ingersoll
82eb074afd LUCENE-974: Removed Author tags from all existing code
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564236 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 15:21:19 +00:00
Michael McCandless
d1422ebd6b LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless
2d954694dc LUCENE-966: sizable (~6X faster) speedups to StandardTokenizer by using JFlex instead of JavaCC
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564036 13f79535-47bb-0310-9956-ffa450edef68
2007-08-08 22:26:44 +00:00
Michael McCandless
32b981044f LUCENE-948: handle stale dir listing caches on NFS (so quickly switching writers between machines works)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@562546 13f79535-47bb-0310-9956-ffa450edef68
2007-08-03 18:41:22 +00:00
Michael McCandless
19ca21a1dd correcting my inability to count from 2 to 3
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@562282 13f79535-47bb-0310-9956-ffa450edef68
2007-08-02 23:01:46 +00:00
Michael McCandless
0fd867732e LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Michael Busch
60c6a847b5 LUCENE-832: ParallelReader fixed to not throw NPE if isCurrent(), isOptimized() or getVersion() is called.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561611 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 00:43:11 +00:00
Michael Busch
4b11c35b08 LUCENE-970: FilterIndexReader now implements isOptimized().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561162 13f79535-47bb-0310-9956-ffa450edef68
2007-07-30 22:29:56 +00:00
Doron Cohen
f3b9c9407a for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Michael McCandless
1bc5a68549 LUCENE-964: remove DocumentWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:50:19 +00:00
Doron Cohen
98fa2d898d LUCENE-836: Add support for search quality benchmarking.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael Busch
c1496444b2 LUCENE-781: MultiReader fixed to not throw NPE if isCurrent(), isOptimized() or getVersion() is called.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560024 13f79535-47bb-0310-9956-ffa450edef68
2007-07-26 22:52:08 +00:00
Michael McCandless
02dd452026 LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll
2d16613438 LUCENE-868: Added a Javadoc comment
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559083 13f79535-47bb-0310-9956-ffa450edef68
2007-07-24 15:30:45 +00:00
Michael Busch
273d28f000 LUCENE-939: Check explicitly for boundary conditions in FieldInfos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558917 13f79535-47bb-0310-9956-ffa450edef68
2007-07-24 03:05:21 +00:00
Grant Ingersoll
e97d5830ce LUCENE-868: New Term Vector access mechanism. Allows for applications to define how they access term vector information instead of having to pack/unpack the TV info returned by the old way.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558592 13f79535-47bb-0310-9956-ffa450edef68
2007-07-23 03:17:25 +00:00
Daniel Naber
86432275f6 protect object against NPE in finally block, as an NPE here would hide the original exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558160 13f79535-47bb-0310-9956-ffa450edef68
2007-07-20 21:38:55 +00:00
Daniel Naber
58027a24bb improve setMaxFieldLength documentation; remove unused import statements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558132 13f79535-47bb-0310-9956-ffa450edef68
2007-07-20 20:53:58 +00:00
Doron Cohen
e2d3fe1787 LUCENE-957: RAMDirectory fixed for more than Integer.MAX_VALUE bytes.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557678 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 16:27:48 +00:00
Michael McCandless
a3298ce85c LUCENE-963: added caveats to javadocs for new Field setValue(...) methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557553 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 10:16:30 +00:00
Michael McCandless
66bacd0b13 LUCENE-963: add setters to Field to allow re-using Field instances during indexing (for better performance)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557445 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 01:33:59 +00:00
Grant Ingersoll
4293019522 LUCENE-960: Added SpanFilter mechanism that provides BitSet information and Span information in a filter. This can be used to get position information on where in a Document that is "on" in the filter matched.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557105 13f79535-47bb-0310-9956-ffa450edef68
2007-07-18 00:52:07 +00:00
Michael McCandless
aae68e0ba6 LUCENE-962: fixed case where unhandled exception in addDocument or updateDocument could fail to delete unreferenced files in the index
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557082 13f79535-47bb-0310-9956-ffa450edef68
2007-07-17 23:22:56 +00:00
Yonik Seeley
52b5113a04 remove binary (was UTF-8 BOM) from start of CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556995 13f79535-47bb-0310-9956-ffa450edef68
2007-07-17 17:57:35 +00:00
Yonik Seeley
230ecaa169 remove synchronization in Document: LUCENE-959
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556693 13f79535-47bb-0310-9956-ffa450edef68
2007-07-16 18:46:58 +00:00
Michael McCandless
f55e4057d2 LUCENE-938: fixed certain cases where an unhandled exception could cause IndexWriter to lose buffered deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556010 13f79535-47bb-0310-9956-ffa450edef68
2007-07-13 14:23:47 +00:00
Michael McCandless
8355ab88b3 LUCENE-843: fixed the triggerMerger logic from LUCENE-887 that I accidentally lost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555793 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 22:15:20 +00:00
Michael Busch
5b944011ff LUCENE-944: Remove deprecated methods setUseScorer14() and getUseScorer14() from BooleanQuery.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555717 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 18:41:40 +00:00
Michael Busch
7c612dcb2d LUCENE-951: Fixed NullPointerException in MultiLevelSkipListReader that was thrown after a call of TermPositions.seek().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555683 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 16:23:46 +00:00
Michael Busch
c8f84d6707 LUCENE-955: Fixed SegmentTermPositions to work correctly with the first term in the dictionary.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555678 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 16:14:54 +00:00
Michael McCandless
96ea45d193 LUCENE-952: force synchronized access to writer instance variable to fix infinite spin loop in TestGdataIndexer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@554189 13f79535-47bb-0310-9956-ffa450edef68
2007-07-07 12:28:04 +00:00
Yonik Seeley
d7c0af3f30 replace div with shift since idiv takes ~40 cycles and compiler can't do strength reduction w/o knowing ops are non-negative
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553947 13f79535-47bb-0310-9956-ffa450edef68
2007-07-06 16:34:40 +00:00
Michael McCandless
4db9625e28 LUCENE-843: add missing 'synchronized' to allThreadsIdle() method
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553854 13f79535-47bb-0310-9956-ffa450edef68
2007-07-06 11:57:18 +00:00
Michael McCandless
3e1a31748e LUCENE-843: SegmentInfo.clone() failed to copy some fields; in certain cases _x.nrm might not be created even when fieldInfos says it has norms
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553298 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 19:03:21 +00:00
Michael McCandless
50787ab505 LUCENE-892: don't do extra buffer-to-buffer copies in CompoundFileReader.CSIndexInput
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553269 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 17:00:22 +00:00
Michael McCandless
39ade207b8 LUCENE-843: woops ... move the CHANGES.txt entries into the right section (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553268 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 16:59:18 +00:00
Michael McCandless
cff5767e44 LUCENE-843: speed up IndexWriter performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553236 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 15:16:38 +00:00
Chris M. Hostetter
4fe90e4086 hack to prevent ant from recursively stating all files in all directories looking for files to include in the META-INF
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552589 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 20:42:24 +00:00
Michael McCandless
c03f45a62d LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00
Grant Ingersoll
bc7c586468 LUCENE-848. Add Wikipedia benchmarking support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen
9ff9bf8142 fix javadoc unknown tag warning.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552111 13f79535-47bb-0310-9956-ffa450edef68
2007-06-30 07:04:27 +00:00
Doron Cohen
ba90fb06c1 LUCENE-946: replace text from an online collection (used in few test cases) with text that is surely 100% free.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552009 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 19:43:19 +00:00
Otis Gospodnetic
71f2c1da8b - LUCENE-906: Elision filter for French.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551744 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 00:36:09 +00:00
Doron Cohen
d9b6aa9c0b LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen
e6c659269a LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat; logging for addDoc/deleteDoc tasks;
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Chris M. Hostetter
3e39b0a9a0 LUCENE-936: example doesn't match text
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550680 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 04:35:29 +00:00
Chris M. Hostetter
290c339ef9 regening all docs using forrest 0.8 ... removing sitemap.xmap per forrest 0.8 upgrade instructions, all other changes are just the effects of skin changes in 0.8
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550678 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 04:29:29 +00:00