Commit Graph

2796 Commits

Author SHA1 Message Date
Michael McCandless 27c4c6a3b1 LUCENE-985: throw clearer exception when term is too long (> 16383 chars)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567338 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 23:15:14 +00:00
Grant Ingersoll c67fd79a83 LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Michael McCandless c1a76d9dbe LUCENE-978: ensure all streams are closed when we hit exception in constructor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567253 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 09:42:21 +00:00
Michael McCandless 06d4849ca8 LUCENE-871: speedups for ISOLatin1AccentFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567085 13f79535-47bb-0310-9956-ffa450edef68
2007-08-17 16:58:36 +00:00
Grant Ingersoll d1f90c7825 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566454 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:49:32 +00:00
Grant Ingersoll 9192b16643 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566435 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:23:06 +00:00
Grant Ingersoll 477c4e0efe Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566433 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:22:46 +00:00
Grant Ingersoll f38b1cf2f8 LUCENE-975: New PositionBasedTermVectorMapper for getting term vector information on a position by position basis.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@565994 13f79535-47bb-0310-9956-ffa450edef68
2007-08-15 01:14:54 +00:00
Yonik Seeley a94db219bd LUCENE-977: internal hashing improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@565368 13f79535-47bb-0310-9956-ffa450edef68
2007-08-13 14:59:03 +00:00
Michael McCandless c25c1282ef LUCENE-976: add missing openInput(String, int) to MMapDirectory
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564785 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 22:20:35 +00:00
Michael McCandless d42de32984 LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Grant Ingersoll 82eb074afd LUCENE-974: Removed Author tags from all existing code
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564236 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 15:21:19 +00:00
Michael McCandless d1422ebd6b LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless 2d954694dc LUCENE-966: sizable (~6X faster) speedups to StandardTokenizer by using JFlex instead of JavaCC
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564036 13f79535-47bb-0310-9956-ffa450edef68
2007-08-08 22:26:44 +00:00
Michael McCandless 32b981044f LUCENE-948: handle stale dir listing caches on NFS (so quickly switching writers between machines works)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@562546 13f79535-47bb-0310-9956-ffa450edef68
2007-08-03 18:41:22 +00:00
Michael McCandless 19ca21a1dd correcting my inability to count from 2 to 3
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@562282 13f79535-47bb-0310-9956-ffa450edef68
2007-08-02 23:01:46 +00:00
Michael McCandless 0fd867732e LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Michael Busch 60c6a847b5 LUCENE-832: ParallelReader fixed to not throw NPE if isCurrent(), isOptimized() or getVersion() is called.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561611 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 00:43:11 +00:00
Michael Busch 4b11c35b08 LUCENE-970: FilterIndexReader now implements isOptimized().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561162 13f79535-47bb-0310-9956-ffa450edef68
2007-07-30 22:29:56 +00:00
Doron Cohen f3b9c9407a for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Michael McCandless 1bc5a68549 LUCENE-964: remove DocumentWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:50:19 +00:00
Doron Cohen 98fa2d898d LUCENE-836: Add support for search quality benchmarking.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael Busch c1496444b2 LUCENE-781: MultiReader fixed to not throw NPE if isCurrent(), isOptimized() or getVersion() is called.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560024 13f79535-47bb-0310-9956-ffa450edef68
2007-07-26 22:52:08 +00:00
Michael McCandless 02dd452026 LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll 2d16613438 LUCENE-868: Added a Javadoc comment
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559083 13f79535-47bb-0310-9956-ffa450edef68
2007-07-24 15:30:45 +00:00
Michael Busch 273d28f000 LUCENE-939: Check explicitly for boundary conditions in FieldInfos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558917 13f79535-47bb-0310-9956-ffa450edef68
2007-07-24 03:05:21 +00:00
Grant Ingersoll e97d5830ce LUCENE-868: New Term Vector access mechanism. Allows for applications to define how they access term vector information instead of having to pack/unpack the TV info returned by the old way.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558592 13f79535-47bb-0310-9956-ffa450edef68
2007-07-23 03:17:25 +00:00
Daniel Naber 86432275f6 protect object against NPE in finally block, as an NPE here would hide the original exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558160 13f79535-47bb-0310-9956-ffa450edef68
2007-07-20 21:38:55 +00:00
Daniel Naber 58027a24bb improve setMaxFieldLength documentation; remove unused import statements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558132 13f79535-47bb-0310-9956-ffa450edef68
2007-07-20 20:53:58 +00:00
Doron Cohen e2d3fe1787 LUCENE-957: RAMDirectory fixed for more than Integer.MAX_VALUE bytes.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557678 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 16:27:48 +00:00
Michael McCandless a3298ce85c LUCENE-963: added caveats to javadocs for new Field setValue(...) methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557553 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 10:16:30 +00:00
Michael McCandless 66bacd0b13 LUCENE-963: add setters to Field to allow re-using Field instances during indexing (for better performance)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557445 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 01:33:59 +00:00
Grant Ingersoll 4293019522 LUCENE-960: Added SpanFilter mechanism that provides BitSet information and Span information in a filter. This can be used to get position information on where in a Document that is "on" in the filter matched.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557105 13f79535-47bb-0310-9956-ffa450edef68
2007-07-18 00:52:07 +00:00
Michael McCandless aae68e0ba6 LUCENE-962: fixed case where unhandled exception in addDocument or updateDocument could fail to delete unreferenced files in the index
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557082 13f79535-47bb-0310-9956-ffa450edef68
2007-07-17 23:22:56 +00:00
Yonik Seeley 52b5113a04 remove binary (was UTF-8 BOM) from start of CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556995 13f79535-47bb-0310-9956-ffa450edef68
2007-07-17 17:57:35 +00:00
Yonik Seeley 230ecaa169 remove synchronization in Document: LUCENE-959
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556693 13f79535-47bb-0310-9956-ffa450edef68
2007-07-16 18:46:58 +00:00
Michael McCandless f55e4057d2 LUCENE-938: fixed certain cases where an unhandled exception could cause IndexWriter to lose buffered deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556010 13f79535-47bb-0310-9956-ffa450edef68
2007-07-13 14:23:47 +00:00
Michael McCandless 8355ab88b3 LUCENE-843: fixed the triggerMerger logic from LUCENE-887 that I accidentally lost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555793 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 22:15:20 +00:00
Michael Busch 5b944011ff LUCENE-944: Remove deprecated methods setUseScorer14() and getUseScorer14() from BooleanQuery.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555717 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 18:41:40 +00:00
Michael Busch 7c612dcb2d LUCENE-951: Fixed NullPointerException in MultiLevelSkipListReader that was thrown after a call of TermPositions.seek().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555683 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 16:23:46 +00:00
Michael Busch c8f84d6707 LUCENE-955: Fixed SegmentTermPositions to work correctly with the first term in the dictionary.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555678 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 16:14:54 +00:00
Michael McCandless 96ea45d193 LUCENE-952: force synchronized access to writer instance variable to fix infinite spin loop in TestGdataIndexer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@554189 13f79535-47bb-0310-9956-ffa450edef68
2007-07-07 12:28:04 +00:00
Yonik Seeley d7c0af3f30 replace div with shift since idiv takes ~40 cycles and compiler can't do strength reduction w/o knowing ops are non-negative
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553947 13f79535-47bb-0310-9956-ffa450edef68
2007-07-06 16:34:40 +00:00
Michael McCandless 4db9625e28 LUCENE-843: add missing 'synchronized' to allThreadsIdle() method
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553854 13f79535-47bb-0310-9956-ffa450edef68
2007-07-06 11:57:18 +00:00
Michael McCandless 3e1a31748e LUCENE-843: SegmentInfo.clone() failed to copy some fields; in certain cases _x.nrm might not be created even when fieldInfos says it has norms
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553298 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 19:03:21 +00:00
Michael McCandless 50787ab505 LUCENE-892: don't do extra buffer-to-buffer copies in CompoundFileReader.CSIndexInput
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553269 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 17:00:22 +00:00
Michael McCandless 39ade207b8 LUCENE-843: woops ... move the CHANGES.txt entries into the right section (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553268 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 16:59:18 +00:00
Michael McCandless cff5767e44 LUCENE-843: speed up IndexWriter performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553236 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 15:16:38 +00:00
Chris M. Hostetter 4fe90e4086 hack to prevent ant from recursively stating all files in all directories looking for files to include in the META-INF
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552589 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 20:42:24 +00:00
Michael McCandless c03f45a62d LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00