Commit Graph

671 Commits

Author SHA1 Message Date
Michael McCandless 022d104c38 LUCENE-1591: add commons-compress JAR
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@765580 13f79535-47bb-0310-9956-ffa450edef68
2009-04-16 11:28:46 +00:00
Michael McCandless 8c4fff6e21 LUCENE-1591: add bzip2 compression/decompress to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@765543 13f79535-47bb-0310-9956-ffa450edef68
2009-04-16 09:46:30 +00:00
Uwe Schindler 87e10db563 Add a missing check for inverse ranges in splitRange() and corresponding tests for inverse ranges and 0-length-ranges.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764829 13f79535-47bb-0310-9956-ffa450edef68
2009-04-14 15:28:06 +00:00
Uwe Schindler 3ef76ae033 Small documentation updates, test fixed according to attributes documentation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764654 13f79535-47bb-0310-9956-ffa450edef68
2009-04-14 00:36:02 +00:00
Michael McCandless ad79d914e4 LUCENE-1575: switch to new Collector API
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764551 13f79535-47bb-0310-9956-ffa450edef68
2009-04-13 18:33:56 +00:00
Mark Robert Miller 413cabef1f Contrib queries package Query implementations do not override equals()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763857 13f79535-47bb-0310-9956-ffa450edef68
2009-04-10 02:21:04 +00:00
Mark Robert Miller a1b3fd7240 Add ConstantScore highlighting support to SpanScorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763856 13f79535-47bb-0310-9956-ffa450edef68
2009-04-10 02:09:46 +00:00
Uwe Schindler ca6451958b Add extra check to TokenStream tests, only get attribute not create it
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763793 13f79535-47bb-0310-9956-ffa450edef68
2009-04-09 20:32:35 +00:00
Uwe Schindler b5c4f8e67d Add field name to term statistics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763778 13f79535-47bb-0310-9956-ffa450edef68
2009-04-09 19:31:42 +00:00
Michael McCandless 3c91517add LUCENE-1516: add near real-time search to IndexWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763737 13f79535-47bb-0310-9956-ffa450edef68
2009-04-09 17:17:46 +00:00
Michael McCandless 6b4831fd1d LUCENE-1591: workaround bug in xerces so we can process Wikipedia's XML
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763416 13f79535-47bb-0310-9956-ffa450edef68
2009-04-08 21:50:33 +00:00
Uwe Schindler c7ac1fdec3 LUCENE-1582: Fix missing public accessor in ShiftAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763004 13f79535-47bb-0310-9956-ffa450edef68
2009-04-07 22:25:46 +00:00
Patrick O'Leary 42783c57e5 Commit LUCENE-1588 DistanceFieldComparatorSource
deprecate BoundaryBoxFilter

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@762801 13f79535-47bb-0310-9956-ffa450edef68
2009-04-07 14:54:50 +00:00
Uwe Schindler 13ae26b7fe LUCENE-1582: Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@762710 13f79535-47bb-0310-9956-ffa450edef68
2009-04-07 11:47:35 +00:00
Michael McCandless bddac65336 share single IndexSearcher instance in ReadTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@762307 13f79535-47bb-0310-9956-ffa450edef68
2009-04-06 12:20:42 +00:00
Michael McCandless c73712d1bb LUCENE-1576: fix BrazilianAnalyzer to downcase before filtering stop words
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@759307 13f79535-47bb-0310-9956-ffa450edef68
2009-03-27 19:04:25 +00:00
Michael McCandless 0f17904f1e remove slow download host; only download zip file once for all tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@759061 13f79535-47bb-0310-9956-ffa450edef68
2009-03-27 08:44:33 +00:00
Michael McCandless 198aa70b35 LUCENE-1500: fix compilation errors in contrib/benchmark by allowing doHighlight method to throw InvalidTokenOfsetsException
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@758575 13f79535-47bb-0310-9956-ffa450edef68
2009-03-26 10:03:34 +00:00
Mark Harwood 73a02ec6fe Fix for Lucene-1500 - new exception added to Highlighter API to handle TokenStreams with Tokens that exceed given text length
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@758460 13f79535-47bb-0310-9956-ffa450edef68
2009-03-25 23:05:14 +00:00
Mark Robert Miller 126f4b18d3 LUCENE-1563:Add example test case for surround query language
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@757762 13f79535-47bb-0310-9956-ffa450edef68
2009-03-24 12:28:13 +00:00
Patrick O'Leary 4f7ef62e1f Fixed NPE for minimal bounding box search
LUCENE-1568

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@756849 13f79535-47bb-0310-9956-ffa450edef68
2009-03-21 00:46:27 +00:00
Michael McCandless c443dcc662 LUCENE-1561: rename omitTf -> omitTermFreqAndPositions in Field*
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@756663 13f79535-47bb-0310-9956-ffa450edef68
2009-03-20 17:26:01 +00:00
Michael McCandless 6bf0e6e09b LUCENE-1435: add contrib/collation (CollationKeyFilter), to convert tokens into indexable CollationKeys
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755914 13f79535-47bb-0310-9956-ffa450edef68
2009-03-19 10:51:55 +00:00
Michael McCandless e44e6b0603 LUCENE-1490: forgot CHANGES.txt update
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755746 13f79535-47bb-0310-9956-ffa450edef68
2009-03-18 21:42:17 +00:00
Michael McCandless 96863198a5 LUCENE-1490: fix latin1 conversion of HALFWIDTH_AND_FULLWIDTH_FORMS characters to only apply to the correct subset
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755666 13f79535-47bb-0310-9956-ffa450edef68
2009-03-18 17:28:53 +00:00
Michael McCandless 68f4d8b1a8 revert accidental changes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755652 13f79535-47bb-0310-9956-ffa450edef68
2009-03-18 17:06:46 +00:00
Michael McCandless cd5028b79a improve IndexWriter.rollback javadoc src/java/org/apache/lucene/index/IndexWriter.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755535 13f79535-47bb-0310-9956-ffa450edef68
2009-03-18 10:22:51 +00:00
Mark Robert Miller e256f679e3 typo in javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@749625 13f79535-47bb-0310-9956-ffa450edef68
2009-03-03 15:00:02 +00:00
Michael McCandless 6248e14515 LUCENE-1548: fix distance normalization in LevenshteinDistance to not produce negative distances
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@748534 13f79535-47bb-0310-9956-ffa450edef68
2009-02-27 14:07:12 +00:00
Yonik Seeley 6c176eb016 LUCENE-1398: Add ReverseStringFilter to contrib/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@747915 13f79535-47bb-0310-9956-ffa450edef68
2009-02-25 20:44:05 +00:00
Chris M. Hostetter b97ee3a1a5 some javadoc cleanup, and deprecation note for older version of norm modification tool
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@747627 13f79535-47bb-0310-9956-ffa450edef68
2009-02-25 00:29:17 +00:00
Uwe Schindler 174e8237b9 Javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@746792 13f79535-47bb-0310-9956-ffa450edef68
2009-02-22 18:29:42 +00:00
Uwe Schindler c314ad000f LUCENE-1470: API change in RangeBuilder
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@746790 13f79535-47bb-0310-9956-ffa450edef68
2009-02-22 18:17:13 +00:00
Uwe Schindler 03db7c81c1 LUCENE-1470: Remove the recursion from splitRange and implement with loop
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@745533 13f79535-47bb-0310-9956-ffa450edef68
2009-02-18 15:08:02 +00:00
Uwe Schindler 8b79fa51a5 LUCENE-1470: New implementation using encoding of TrieUtils and TrieRangeFilter that now also supports 32bit and 64bit fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@744207 13f79535-47bb-0310-9956-ffa450edef68
2009-02-13 18:27:01 +00:00
Karl-Johan Wettin 6e692d38ec LUCENE-1531
Added support for BoostingTermQuery to XML query parser.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@742411 13f79535-47bb-0310-9956-ffa450edef68
2009-02-09 11:49:33 +00:00
Michael McCandless dbe0898d63 LUCENE-1535: use random seed for randomness, but print the seed on failure for reproducibility
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@741311 13f79535-47bb-0310-9956-ffa450edef68
2009-02-05 21:53:40 +00:00
Uwe Schindler 80b471b0cc one Math.max too much, sorry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@740444 13f79535-47bb-0310-9956-ffa450edef68
2009-02-03 21:07:11 +00:00
Uwe Schindler b2c04094ef Fix jUnit test problem with exclusive range if upper==lower (thanks to Michael McCandless):
[junit] junit.framework.AssertionFailedError: Returned count of  
range query must be equal to exclusive range length expected:<0> but  
was:<-1>
at org.apache.lucene.search.trie.TestTrieRangeQuery.testRangeSplit(TestTrieRangeQuery.java:203)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@740442 13f79535-47bb-0310-9956-ffa450edef68
2009-02-03 21:05:29 +00:00
Uwe Schindler e7edb24bc0 fix javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@739441 13f79535-47bb-0310-9956-ffa450edef68
2009-01-30 22:44:20 +00:00
Uwe Schindler cb55b1671b Add some extra check for validity of c'tor parameters in TrieRangeFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@739439 13f79535-47bb-0310-9956-ffa450edef68
2009-01-30 22:38:04 +00:00
Michael McCandless 20753f2e31 LUCENE-1507: add DocIdSet.EMPTY_DOCIDSET
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@738895 13f79535-47bb-0310-9956-ffa450edef68
2009-01-29 15:39:11 +00:00
Michael McCandless 96e0aaf49e LUCENE-1483: switch to newly added MultiReaderHitCollector for all core collectors, that is aware of segment transitions during searching, to improve performance of searching and warming
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@738219 13f79535-47bb-0310-9956-ffa450edef68
2009-01-27 20:15:21 +00:00
Uwe Schindler f1ab1aee64 Implement a shortcut, when range has min>max. In this case a static empty SortedVIntList is returned.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@738182 13f79535-47bb-0310-9956-ffa450edef68
2009-01-27 18:12:34 +00:00
Uwe Schindler 153acfd064 LUCENE-1530: Support inclusive/exclusive for TrieRangeQuery/-Filter, remove default trie variant setters/getters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@738109 13f79535-47bb-0310-9956-ffa450edef68
2009-01-27 15:15:24 +00:00
Michael McCandless c6f6f01643 LUCENE-1316: don't call synchronized IndexReader.isDeleted when scoring MatchAllDocsQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@737513 13f79535-47bb-0310-9956-ffa450edef68
2009-01-25 14:38:43 +00:00
Michael McCandless 4764105ff4 LUCENE-1527: open reader non-readOnly for doing deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@737175 13f79535-47bb-0310-9956-ffa450edef68
2009-01-23 20:33:51 +00:00
Uwe Schindler 7ef048e01c Optimize the test index as preparation for LUCENE-1483, because the statistics on visited terms in TrieRangeQuery/-Filter only work correctly for optimized indexes (as the count is reset on each query execution. So the new multisegment-search must have a one-segment index to only execute the query one time)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@737079 13f79535-47bb-0310-9956-ffa450edef68
2009-01-23 16:44:17 +00:00
Michael McCandless d545a46df5 LUCENE-1525: add missing maven dependencies for contrib/{benchmark,highlighter}; fix svn eolstyle to native
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@736340 13f79535-47bb-0310-9956-ffa450edef68
2009-01-21 16:59:39 +00:00
Uwe Schindler 88e19ab531 Trie encoded fields do not need norms (as they are only used for filtering with TrieRangeFilter or for sorting)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@734414 13f79535-47bb-0310-9956-ffa450edef68
2009-01-14 15:09:25 +00:00
Michael McCandless 03d6b2aa24 LUCENE-1479: if date is missing, don't skip the doc; just don't add 'docdate' field to the doc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733697 13f79535-47bb-0310-9956-ffa450edef68
2009-01-12 11:37:23 +00:00
Uwe Schindler f0300a643e LUCENE-1517: Change superclass of TrieRangeQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733519 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 21:15:24 +00:00
Uwe Schindler 4905d4404c Small optimization because an Object is not needed here
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733449 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 11:04:56 +00:00
Uwe Schindler c2b5e91ddd As suggested by Otis: Convert TABs to spaces, remove @author javadoc tags
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733443 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 10:25:14 +00:00
Karl-Johan Wettin d7376608b2 LUCENE-1514
ShingleMatrixFilter#next(Token) easily throws a StackOverflowException due to recursive invocation. (Karl Wettin)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733064 13f79535-47bb-0310-9956-ffa450edef68
2009-01-09 15:34:52 +00:00
Michael McCandless 8180700de8 LUCENE-1497: minor code cleanups to SimpleHTMLFormatter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732739 13f79535-47bb-0310-9956-ffa450edef68
2009-01-08 16:11:19 +00:00
Karl-Johan Wettin f991524da8 LUCENE-1510
InstantiatedIndexReader#norms methods throws NullPointerException on empty index.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732661 13f79535-47bb-0310-9956-ffa450edef68
2009-01-08 09:28:42 +00:00
Ryan McKinley 3f72bc813f remove unused imports and @author tag
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732410 13f79535-47bb-0310-9956-ffa450edef68
2009-01-07 17:47:26 +00:00
Ryan McKinley 977c9acc71 LUCENE-1512 -- adding GeoHash implementaion
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732400 13f79535-47bb-0310-9956-ffa450edef68
2009-01-07 17:10:58 +00:00
Ryan McKinley 43609e891d LUCENE-1504 -- SerialChainFilter should use DocSet API
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732031 13f79535-47bb-0310-9956-ffa450edef68
2009-01-06 18:13:06 +00:00
Ryan McKinley 4978391e93 LUCENE-1508 -- using the tierPrefix constructors by default
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730662 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 03:21:23 +00:00
Ryan McKinley 0958d83354 LUCENE-1508 -- make the field prefix configurable
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730658 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 02:52:30 +00:00
Ryan McKinley 1f585f0357 LUCENE-1387 -- expose getLat/getLng from DistanceQueryBuilder
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730653 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 00:51:26 +00:00
Michael Busch c9e7642bae Fix pom.xml.template of spatial search contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730552 13f79535-47bb-0310-9956-ffa450edef68
2009-01-01 15:02:48 +00:00
Otis Gospodnetic 72725a0b58 - Small documentation mods.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730207 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 18:20:43 +00:00
Ryan McKinley 0afd451f24 LUCENE-1503 -- refactor spatial Query/Filter classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730205 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 18:00:56 +00:00
Ryan McKinley 581c82ccbe LUCENE-1387 -- adding locallucene as new spatial contrib package.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730067 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 07:37:17 +00:00
Michael McCandless eb64661216 LUCENE-1495: fix TaskSequence.toString to show time-limited configuration
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728751 13f79535-47bb-0310-9956-ffa450edef68
2008-12-22 19:37:58 +00:00
Michael McCandless 51f9b759fa LUCENE-1495: fix time-based test to reduce change of false failure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728425 13f79535-47bb-0310-9956-ffa450edef68
2008-12-21 11:07:28 +00:00
Michael McCandless 7abe0311bc LUCENE-1444: fix broken site links in javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728262 13f79535-47bb-0310-9956-ffa450edef68
2008-12-20 09:43:06 +00:00
Michael McCandless bdc42e5342 LUCENE-1495: allow TaskSequence to run for certain time
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728252 13f79535-47bb-0310-9956-ffa450edef68
2008-12-20 08:45:09 +00:00
Michael McCandless 74e097f8eb LUCENE-1493: allow setting top number of hits to collect with search.num.hits
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727063 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 15:09:46 +00:00
Michael McCandless 09f661c48c LUCENE-1492: add optional readOnly param to OpenReader task
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727029 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 11:44:01 +00:00
Michael McCandless 1f33d7c66a fix off-by-one bug in getting field sort type
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@726213 13f79535-47bb-0310-9956-ffa450edef68
2008-12-13 14:43:32 +00:00
Karl-Johan Wettin 812abd20dd LUCENE-1462
InstantiatedIndexWriter did not reset pre analyzed TokenStreams the same way IndexWriter does. 
Parts of InstantiatedIndex was not Serializable.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@726030 13f79535-47bb-0310-9956-ffa450edef68
2008-12-12 15:12:31 +00:00
Karl-Johan Wettin 219a20a945 LUCENE-1462
InstantiatedIndexWriter did not reset pre analyzed TokenStreams the same way IndexWriter does. 
Parts of InstantiatedIndex was not Serializable.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@725837 13f79535-47bb-0310-9956-ffa450edef68
2008-12-11 22:08:45 +00:00
Grant Ingersoll 2225462178 LUCENE-1380: Add PositionFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@725691 13f79535-47bb-0310-9956-ffa450edef68
2008-12-11 14:17:44 +00:00
Michael McCandless efc985cbf4 LUCENE-1478: allow SortField to use a custom numeric FieldCache parser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@724484 13f79535-47bb-0310-9956-ffa450edef68
2008-12-08 21:07:45 +00:00
Grant Ingersoll 702ea32da7 make constructors public
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@724059 13f79535-47bb-0310-9956-ffa450edef68
2008-12-07 00:39:35 +00:00
Michael McCandless 4a0a5664dd LUCENE-1468: switch Directory.list() to Directory.listAll(), which does no filtering of returned array
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723789 13f79535-47bb-0310-9956-ffa450edef68
2008-12-05 17:03:13 +00:00
Michael McCandless 5e3cd6711a LUCENE-1470: describe TrieRangeFilter/Query in README
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723701 13f79535-47bb-0310-9956-ffa450edef68
2008-12-05 10:16:50 +00:00
Michael McCandless 240897988d LUCENE-1470: switch TestTrieRangeQuery to LuceneTestCase as well
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723313 13f79535-47bb-0310-9956-ffa450edef68
2008-12-04 13:17:23 +00:00
Michael McCandless 05bff1833f LUCENE-1470: add proper dependency to contrib/build.xml so contrib tests can subclass LuceneTestCase; fixed small bug TestTrieUtils test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723287 13f79535-47bb-0310-9956-ffa450edef68
2008-12-04 11:06:59 +00:00
Michael McCandless 481f8080ab LUCENE-1470: add TrieRangeQuery, a much more efficient implementation of RangeQuery at the expense of added space consumed in the index
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723031 13f79535-47bb-0310-9956-ffa450edef68
2008-12-03 19:38:31 +00:00
Michael McCandless 278dc0fc04 LUCENE-1296: add protected method CachingWrapperFilter.docIdSetToCache
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@722174 13f79535-47bb-0310-9956-ffa450edef68
2008-12-01 18:50:45 +00:00
Mark Robert Miller c2f7d554a3 LUCENE-1389: SimpleSpanFragmenter can create very short fragments
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@713569 13f79535-47bb-0310-9956-ffa450edef68
2008-11-13 00:01:43 +00:00
Yonik Seeley 8e8e8ddec4 set svn:eol-style to native on java files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712922 13f79535-47bb-0310-9956-ffa450edef68
2008-11-11 02:35:46 +00:00
Michael McCandless abe6ea0828 LUCENE-1424: enable MultiTermQuery to do constant scoring; refactor Range, Wildcard, Prefix to expose this
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712890 13f79535-47bb-0310-9956-ffa450edef68
2008-11-11 00:40:00 +00:00
Mark Harwood de7ab5b898 Added new web application demo for contrib's XmlQueryParser.
This change involves:
* Adding Tomcat's Servlet jar into the lib directory and appropriate entry in NOTICE.txt following the lead from Solr's packaging
* Adding new "demo" directory to XmlQueryParser src directory
* Changing XMLQueryParser's build file to create a demo War file
* Changing the main build to include the demo War file (and any other future contrib/*/war files) in the binary distributions

The packaged source distribution has NOT been changed currently to add a lib directory with the servlet.jar so building from a cut-down src distro as opposed to the full subversion /trunk directory will not currently build the war file (the xml query parser build file detects the absence of servlet.jar). Not sure if this is a problem currently.

TODO: 
Now that the servlet jar is available in Subversion I would recommend that the other existing WAR file,"luceneweb.war", is changed to move much of the java code which is currently embedded in JSP files into servlet .java files. This would ensure that the build system will check that the code in this application compiles cleanly with the latest Lucene APIs - otherwise any issue will only become apparent when a user tries to run a JSP.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712318 13f79535-47bb-0310-9956-ffa450edef68
2008-11-08 00:20:32 +00:00
Michael McCandless 24d76157a0 LUCENE-1420: let Similarity.computeNorm compute the norm; add option to discount overlap tokens when computing lengthNorm
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@710117 13f79535-47bb-0310-9956-ffa450edef68
2008-11-03 18:03:58 +00:00
Michael McCandless f2c988ae2b don't include contrib/memory class files in highlighter's jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@707834 13f79535-47bb-0310-9956-ffa450edef68
2008-10-25 10:19:30 +00:00
Grant Ingersoll 8dfe073760 LUCENE-1406. Added Arabic stemming and normalization. Also added new method to WordListLoader to allow for comments in word lists.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@706342 13f79535-47bb-0310-9956-ffa450edef68
2008-10-20 17:19:29 +00:00
Michael McCandless da78e20bd0 LUCENE-1416: don't fail contrib/ant's unit test when file path contains non-URL safe characters like space
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@706160 13f79535-47bb-0310-9956-ffa450edef68
2008-10-20 09:12:46 +00:00
Karl-Johan Wettin 456b10fdf9 LUCENE-1423
InstantiatedTermEnum#skipTo(Term) throws ArrayIndexOutOfBoundsException on an empty index.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@705893 13f79535-47bb-0310-9956-ffa450edef68
2008-10-18 16:29:53 +00:00
Daniel Naber 20ddb46fec minor javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@703717 13f79535-47bb-0310-9956-ffa450edef68
2008-10-11 19:45:16 +00:00
Michael McCandless 3f27b17a89 fix non-1.4-compatible throws clause
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@701827 13f79535-47bb-0310-9956-ffa450edef68
2008-10-05 16:40:59 +00:00
Mark Harwood 92d6a0ae7b Fix for a potential null-pointer-introducing bug which came about as part of the DocIdSet changes. TermsFilter no longer implemented bits(IndexReader) and the Filter base class' version of this was changed to return null.
When dropping 2.4 Lucene in as a direct replacement for 2.3.2 my client code was getting NullPointer errors - returning null was never part of the Filter.bits contract and so this could be a problem for others using this class. 

Fix is for TermsFilter to implement bits(IndexReader) - this can safely be removed in later versions because it is a deprecated method going forward.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@700815 13f79535-47bb-0310-9956-ffa450edef68
2008-10-01 15:37:05 +00:00
Michael McCandless 98e1129a14 break off contrib/CHANGES.txt's 2.4.0 release section
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@700743 13f79535-47bb-0310-9956-ffa450edef68
2008-10-01 11:22:58 +00:00
Andreas Vajda 88283a7027 - upgraded to bdb-je 3.3.69
- re-enabled testBytes() and testArrays() (per Aaron, Michael's instructions)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699961 13f79535-47bb-0310-9956-ffa450edef68
2008-09-29 02:21:51 +00:00
Andreas Vajda 96809b1238 upgraded db lib to version 4.7.25
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699882 13f79535-47bb-0310-9956-ffa450edef68
2008-09-28 19:07:39 +00:00
Mark Harwood ba6344a4d7 Fixed bug in FuzzyLikeThisQuery.java. Queries that contain a term with no fuzzy variants caused the query construction logic to exit loop early, producing no fuzzy variants for all subsequent terms in the query string.
Junit test added which recreates the problem conditions and added fix to FuzzyLikeThisQuery that solves the issue.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699512 13f79535-47bb-0310-9956-ffa450edef68
2008-09-26 21:44:55 +00:00
Michael McCandless 0e708a21fe LUCENE-1401: fix accidentally lost 'create=true' in CreateIndexTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698975 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 14:11:41 +00:00
Michael McCandless a40530e4a4 LUCENE-1401: remove new deprecated IndexWriter ctors; default autoCommit=false for new ctors
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698932 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 12:04:38 +00:00
Michael McCandless b866828bab add graph to package.html for contrib/instantiated showing queries per sec vs number of docs indexed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@697126 13f79535-47bb-0310-9956-ffa450edef68
2008-09-19 15:15:36 +00:00
Otis Gospodnetic 0195fcd03d LUCENE-1378 - Removed the remaining 199 @author references
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@695514 13f79535-47bb-0310-9956-ffa450edef68
2008-09-15 15:42:11 +00:00
Karl-Johan Wettin c6e5618674 Added a bunch of <p> in the Javadocs to make the rendered html view look better.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694396 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:26:46 +00:00
Karl-Johan Wettin 71f2d8199b LUCENE-1320
ShingleMatrixFilter JDK downgrade 1.5 -> 1.4 
Grant Ingersoll via Karl Wettin

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694393 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:23:18 +00:00
Michael McCandless d31fb4556c fix wordnet's Syns2Index to not fiddle with mergeFactor & maxBuffereDocs (the latter was hitting an exception)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694222 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 12:11:03 +00:00
Michael McCandless 4218996230 LUCENE-1366: rename Field.Index.* options
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694004 13f79535-47bb-0310-9956-ffa450edef68
2008-09-10 21:38:52 +00:00
Grant Ingersoll dadaebc290 LUCENE-1243: Added new benchmark tasks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693495 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 15:56:41 +00:00
Mark Robert Miller 5147f0563f LUCENE-1357: SpanScorer does not respect ConstantScoreRangeQuery setting
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693432 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 10:52:50 +00:00
Michael McCandless d5a40278bc LUCENE-1369: switch from Hashtable to HashMap and from Vector to List, when possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@692921 13f79535-47bb-0310-9956-ffa450edef68
2008-09-07 19:22:40 +00:00
Michael McCandless 120d01bb09 remove DOCTYPE declaration in testEncoding (it's causing intermittant false failures in nightly build when xerces has trouble loading the DTD from w3.org)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690914 13f79535-47bb-0310-9956-ffa450edef68
2008-09-01 09:07:16 +00:00
Karl-Johan Wettin bf238a5743 Javadocs fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690779 13f79535-47bb-0310-9956-ffa450edef68
2008-08-31 20:46:47 +00:00
Karl-Johan Wettin 82c70c018e LUCENE-1016 : TermVectorAccessor, transparent vector space access via stored vectors or by resolving the inverted index.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688745 13f79535-47bb-0310-9956-ffa450edef68
2008-08-25 15:02:20 +00:00
Karl-Johan Wettin 3034575f66 LUCENE-1142 : Updated Snowball package, org.tartarus distribution revision 500.
Introducing Hungarian, Turkish and Romanian support, updated older stemmers and optimized (reflectionless) SnowballFilter.

IMPORTANT NOTICE ON BACKWARDS COMPATIBILITY: an index created using the 2.3.2 (or older) might not be compatible with these updated classes as some algorithms have changed.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688420 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 22:02:47 +00:00
Karl-Johan Wettin 9fe7a35378 Contrib level CHANGES.txt. I forgot to add this some time ago.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688370 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 17:12:57 +00:00
Michael McCandless 003a853cc8 LUCENE-1333: don't use LuceneTestCase in contrib until we can fix the build dependency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687539 13f79535-47bb-0310-9956-ffa450edef68
2008-08-21 02:45:37 +00:00
Grant Ingersoll 09467ad18e LUCENE-1001: Make payloads accessible from SpanQuery's
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687379 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 16:06:36 +00:00
Michael McCandless bb6b711718 LUCENE-1333: improvements to Token reuse API and full cutover to reuse API for all core and contrib analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687357 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 14:38:07 +00:00
Mark Robert Miller 7675606908 LUCENE-1355: highlighter can incorrectly produce negative idf when index has deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687052 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 13:16:41 +00:00
Michael McCandless e31a9da835 LUCENE-1334: add Term(String fieldName) constructor that sets term text to empty string
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687014 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 10:40:39 +00:00
Doron Cohen 027307857e LUCENE-1353: Javacc target to generate precedence query parser.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@685114 13f79535-47bb-0310-9956-ffa450edef68
2008-08-12 10:49:59 +00:00
Michael McCandless 12e4e67403 add missing synchronized to PerfRunData.getQueryMaker()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@677555 13f79535-47bb-0310-9956-ffa450edef68
2008-07-17 10:50:24 +00:00
Mark Harwood 74c551fb88 Added option to allow UserQuery tag to define a different default fieldName.
Standard use case for this is where users are presented with a GUI form with multiple input boxes, each targetting a different field and allowing "lucene syntax". The XML query template behind such a form would have a <UserQuery> tag for each form field, each defined with the appropriate choice of default field name.

Added Junit test for changing default field name, updated DTD for XML query syntax and regenerated HTML documentation.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674975 13f79535-47bb-0310-9956-ffa450edef68
2008-07-08 21:50:06 +00:00
Karl-Johan Wettin ddc7c290d0 LUCENE-1320
ShingleMatrixFilter, a multidimensional shingle token filter.

Bug fix, did not support empty input token streams.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674367 13f79535-47bb-0310-9956-ffa450edef68
2008-07-07 00:08:41 +00:00
Karl-Johan Wettin bca43ea3ea LUCENE-1320
ShingleMatrixFilter, a multidimensional shingle token filter.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673549 13f79535-47bb-0310-9956-ffa450edef68
2008-07-02 23:53:51 +00:00
Mark Robert Miller 943bf37e66 LUCENE-1321: Highlight fragment does not extend to maxDocCharsToAnalyze
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673220 13f79535-47bb-0310-9956-ffa450edef68
2008-07-01 21:44:33 +00:00
Karl-Johan Wettin 4f7ead8112 LUCENE-1318: InstantiatedIndexReader.norms(String, b[], int) didn't treat the array offset right.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672568 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 18:52:12 +00:00
Karl-Johan Wettin b8fc54e72a LUCENE-1312: Added full support for InstantiatedIndexReader#getFieldNames() and extended the test case to assert deleted documents behaves as they should (they did).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672556 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 17:23:35 +00:00
Erik Hatcher 3c80f97f7c MemoryIndex implements Serializable
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@671579 13f79535-47bb-0310-9956-ffa450edef68
2008-06-25 15:18:29 +00:00
Grant Ingersoll 44dcbe184f add a getter for the StringDistance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669476 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 12:52:11 +00:00
Grant Ingersoll d6dcae49dd make public
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669455 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 11:45:08 +00:00
Otis Gospodnetic 8b11f3b077 - Renamed TRStringDistance to LevensteinDistance (related to LUCENE-1297)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669086 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:09:11 +00:00
Otis Gospodnetic ce19aa28c0 LUCENE-1297 - Allow other string distance measures for the SpellChecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669085 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:01:57 +00:00
Mark Harwood 4adc3f7ba6 Fixed bug parsing boolean attributes. Boolean.getBoolean(s) was being used by mistake which reads a system property with name of s - instead need to parse s as string value of "true" or "false".
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@667294 13f79535-47bb-0310-9956-ffa450edef68
2008-06-12 23:38:39 +00:00
Doron Cohen 4d9e280788 LUCENE-1090: remove relative paths from benchmark's build.xml.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68
2008-06-10 11:58:00 +00:00
Grant Ingersoll 8c97e9b87b LUCENE-1299: Properly handle when IndexReader is not null and Field is in the spell checker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663649 13f79535-47bb-0310-9956-ffa450edef68
2008-06-05 16:12:08 +00:00
Grant Ingersoll f89cda6dde LUCENE-1298: Allow MLT to use custom similarity
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663054 13f79535-47bb-0310-9956-ffa450edef68
2008-06-04 10:41:41 +00:00
Grant Ingersoll 1979d75605 LUCENE-1295: Made method public and added retrieveInterestingTerms variation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@662413 13f79535-47bb-0310-9956-ffa450edef68
2008-06-02 12:29:02 +00:00
Mark Robert Miller f32b5a5698 LUCENE-1285: WeightedSpanTermExtractor incorrectly treats the same terms occurring in different query types
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659965 13f79535-47bb-0310-9956-ffa450edef68
2008-05-25 11:38:55 +00:00
Michael Busch 3e7a15e317 LUCENE-1187: Fix ChainedFilterTest.java and BooleanFilterTest.java to not reference class from test/org/apache/lucene/search/, which is not in the contrib classpath.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659742 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 01:45:12 +00:00
Michael Busch 1d6d186888 LUCENE-1187: Fix ChainedFilterTest.java and BooleanFilterTest.java to not reference class from test/org/apache/lucene/search/, which is not in the contrib classpath.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659740 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 01:16:30 +00:00
Michael Busch 14f3663f56 Set eol-style to native for BooleanFilterTest.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659732 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 00:48:12 +00:00
Michael Busch 0ebfcc663e LUCENE-1187: ChainedFilter and BooleanFilter now work with new Filter API and DocIdSetIterator-based filters.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659635 13f79535-47bb-0310-9956-ffa450edef68
2008-05-23 19:25:05 +00:00
Otis Gospodnetic ce557ee4b3 LUCENE-852: Let the SpellChecker caller specify IndexWriter mergeFactor and RAM buffer size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659021 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:47:36 +00:00
Otis Gospodnetic b4f7c37bd1 LUCENE-1046: Removed dead code in SpellChecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659019 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:34:18 +00:00
Otis Gospodnetic f68c9544de LUCENE-1183: Optimized TRStringDistance class (in contrib/spell) that uses less memory than the previous version
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659016 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:24:55 +00:00
Otis Gospodnetic a379a67875 LUCENE-1152 Fix for calling indexDictionary after clearIndex call
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659013 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:21:17 +00:00
Otis Gospodnetic 90be0daa46 - Made PlainTextDictionary(Reader) public from protected
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657728 13f79535-47bb-0310-9956-ffa450edef68
2008-05-19 05:45:31 +00:00