Commit Graph

2688 Commits

Author SHA1 Message Date
Yonik Seeley a94db219bd LUCENE-977: internal hashing improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@565368 13f79535-47bb-0310-9956-ffa450edef68
2007-08-13 14:59:03 +00:00
Michael McCandless c25c1282ef LUCENE-976: add missing openInput(String, int) to MMapDirectory
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564785 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 22:20:35 +00:00
Michael McCandless d42de32984 LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Grant Ingersoll 82eb074afd LUCENE-974: Removed Author tags from all existing code
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564236 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 15:21:19 +00:00
Michael McCandless d1422ebd6b LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless 2d954694dc LUCENE-966: sizable (~6X faster) speedups to StandardTokenizer by using JFlex instead of JavaCC
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564036 13f79535-47bb-0310-9956-ffa450edef68
2007-08-08 22:26:44 +00:00
Michael McCandless 32b981044f LUCENE-948: handle stale dir listing caches on NFS (so quickly switching writers between machines works)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@562546 13f79535-47bb-0310-9956-ffa450edef68
2007-08-03 18:41:22 +00:00
Michael McCandless 19ca21a1dd correcting my inability to count from 2 to 3
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@562282 13f79535-47bb-0310-9956-ffa450edef68
2007-08-02 23:01:46 +00:00
Michael McCandless 0fd867732e LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Michael Busch 60c6a847b5 LUCENE-832: ParallelReader fixed to not throw NPE if isCurrent(), isOptimized() or getVersion() is called.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561611 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 00:43:11 +00:00
Michael Busch 4b11c35b08 LUCENE-970: FilterIndexReader now implements isOptimized().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561162 13f79535-47bb-0310-9956-ffa450edef68
2007-07-30 22:29:56 +00:00
Doron Cohen f3b9c9407a for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Michael McCandless 1bc5a68549 LUCENE-964: remove DocumentWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:50:19 +00:00
Doron Cohen 98fa2d898d LUCENE-836: Add support for search quality benchmarking.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael Busch c1496444b2 LUCENE-781: MultiReader fixed to not throw NPE if isCurrent(), isOptimized() or getVersion() is called.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560024 13f79535-47bb-0310-9956-ffa450edef68
2007-07-26 22:52:08 +00:00
Michael McCandless 02dd452026 LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll 2d16613438 LUCENE-868: Added a Javadoc comment
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559083 13f79535-47bb-0310-9956-ffa450edef68
2007-07-24 15:30:45 +00:00
Michael Busch 273d28f000 LUCENE-939: Check explicitly for boundary conditions in FieldInfos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558917 13f79535-47bb-0310-9956-ffa450edef68
2007-07-24 03:05:21 +00:00
Grant Ingersoll e97d5830ce LUCENE-868: New Term Vector access mechanism. Allows for applications to define how they access term vector information instead of having to pack/unpack the TV info returned by the old way.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558592 13f79535-47bb-0310-9956-ffa450edef68
2007-07-23 03:17:25 +00:00
Daniel Naber 86432275f6 protect object against NPE in finally block, as an NPE here would hide the original exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558160 13f79535-47bb-0310-9956-ffa450edef68
2007-07-20 21:38:55 +00:00
Daniel Naber 58027a24bb improve setMaxFieldLength documentation; remove unused import statements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558132 13f79535-47bb-0310-9956-ffa450edef68
2007-07-20 20:53:58 +00:00
Doron Cohen e2d3fe1787 LUCENE-957: RAMDirectory fixed for more than Integer.MAX_VALUE bytes.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557678 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 16:27:48 +00:00
Michael McCandless a3298ce85c LUCENE-963: added caveats to javadocs for new Field setValue(...) methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557553 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 10:16:30 +00:00
Michael McCandless 66bacd0b13 LUCENE-963: add setters to Field to allow re-using Field instances during indexing (for better performance)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557445 13f79535-47bb-0310-9956-ffa450edef68
2007-07-19 01:33:59 +00:00
Grant Ingersoll 4293019522 LUCENE-960: Added SpanFilter mechanism that provides BitSet information and Span information in a filter. This can be used to get position information on where in a Document that is "on" in the filter matched.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557105 13f79535-47bb-0310-9956-ffa450edef68
2007-07-18 00:52:07 +00:00
Michael McCandless aae68e0ba6 LUCENE-962: fixed case where unhandled exception in addDocument or updateDocument could fail to delete unreferenced files in the index
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@557082 13f79535-47bb-0310-9956-ffa450edef68
2007-07-17 23:22:56 +00:00
Yonik Seeley 52b5113a04 remove binary (was UTF-8 BOM) from start of CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556995 13f79535-47bb-0310-9956-ffa450edef68
2007-07-17 17:57:35 +00:00
Yonik Seeley 230ecaa169 remove synchronization in Document: LUCENE-959
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556693 13f79535-47bb-0310-9956-ffa450edef68
2007-07-16 18:46:58 +00:00
Michael McCandless f55e4057d2 LUCENE-938: fixed certain cases where an unhandled exception could cause IndexWriter to lose buffered deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@556010 13f79535-47bb-0310-9956-ffa450edef68
2007-07-13 14:23:47 +00:00
Michael McCandless 8355ab88b3 LUCENE-843: fixed the triggerMerger logic from LUCENE-887 that I accidentally lost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555793 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 22:15:20 +00:00
Michael Busch 5b944011ff LUCENE-944: Remove deprecated methods setUseScorer14() and getUseScorer14() from BooleanQuery.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555717 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 18:41:40 +00:00
Michael Busch 7c612dcb2d LUCENE-951: Fixed NullPointerException in MultiLevelSkipListReader that was thrown after a call of TermPositions.seek().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555683 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 16:23:46 +00:00
Michael Busch c8f84d6707 LUCENE-955: Fixed SegmentTermPositions to work correctly with the first term in the dictionary.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@555678 13f79535-47bb-0310-9956-ffa450edef68
2007-07-12 16:14:54 +00:00
Michael McCandless 96ea45d193 LUCENE-952: force synchronized access to writer instance variable to fix infinite spin loop in TestGdataIndexer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@554189 13f79535-47bb-0310-9956-ffa450edef68
2007-07-07 12:28:04 +00:00
Yonik Seeley d7c0af3f30 replace div with shift since idiv takes ~40 cycles and compiler can't do strength reduction w/o knowing ops are non-negative
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553947 13f79535-47bb-0310-9956-ffa450edef68
2007-07-06 16:34:40 +00:00
Michael McCandless 4db9625e28 LUCENE-843: add missing 'synchronized' to allThreadsIdle() method
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553854 13f79535-47bb-0310-9956-ffa450edef68
2007-07-06 11:57:18 +00:00
Michael McCandless 3e1a31748e LUCENE-843: SegmentInfo.clone() failed to copy some fields; in certain cases _x.nrm might not be created even when fieldInfos says it has norms
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553298 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 19:03:21 +00:00
Michael McCandless 50787ab505 LUCENE-892: don't do extra buffer-to-buffer copies in CompoundFileReader.CSIndexInput
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553269 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 17:00:22 +00:00
Michael McCandless 39ade207b8 LUCENE-843: woops ... move the CHANGES.txt entries into the right section (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553268 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 16:59:18 +00:00
Michael McCandless cff5767e44 LUCENE-843: speed up IndexWriter performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553236 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 15:16:38 +00:00
Chris M. Hostetter 4fe90e4086 hack to prevent ant from recursively stating all files in all directories looking for files to include in the META-INF
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552589 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 20:42:24 +00:00
Michael McCandless c03f45a62d LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00
Grant Ingersoll bc7c586468 LUCENE-848. Add Wikipedia benchmarking support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen 9ff9bf8142 fix javadoc unknown tag warning.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552111 13f79535-47bb-0310-9956-ffa450edef68
2007-06-30 07:04:27 +00:00
Doron Cohen ba90fb06c1 LUCENE-946: replace text from an online collection (used in
few test cases) with text that is surely 100% free.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552009 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 19:43:19 +00:00
Otis Gospodnetic 71f2c1da8b - LUCENE-906: Elision filter for French.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551744 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 00:36:09 +00:00
Doron Cohen d9b6aa9c0b LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen e6c659269a LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat;
logging for addDoc/deleteDoc tasks;


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Chris M. Hostetter 3e39b0a9a0 LUCENE-936: example doesn't match text
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550680 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 04:35:29 +00:00
Chris M. Hostetter 290c339ef9 regening all docs using forrest 0.8 ... removing sitemap.xmap per forrest 0.8 upgrade instructures, all other changes are just the effects of skin changes in 0.8
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550678 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 04:29:29 +00:00