Commit Graph

3625 Commits

Author SHA1 Message Date
Michael McCandless 312400842f LUCENE-1754: add EmptyDocIdSetIterator
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798091 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:02:34 +00:00
Michael McCandless 3894101042 LUCENE-1754: update back-compat test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798089 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:57:41 +00:00
Michael McCandless 26a2c427d1 LUCENE-1754: BooleanQuery detects up front if it won't match any docs and returns null from its scorer() instead of NonMatchingScorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798086 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:50:02 +00:00
Uwe Schindler 05351a6f63 Remove left-over javadoc <font> tag.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797870 13f79535-47bb-0310-9956-ffa450edef68
2009-07-26 07:30:19 +00:00
Uwe Schindler 0f9d2ec5b8 Some additional changes to LUCENE-1644 / MTQ:
- Deprecate one ctor of MultiTermQuery(Term), as Term field is deprecated
- Add readResolve also for the default auto rewrite method (not really needed because equals/hashcode impl, but singleton status is preserved)
- Some changes in NumericRangeQueryTests (now they use always constant score boolean/filter rewrite in basic tests, the default in all other tests)
- Add some explanations for the rewriteMethod used in NumericRangeQery's ctor (term enum seeks and so on). I also made a switch statement of the valSize (like at other places in this class, this is for later 8 or 16 bit valSize extensions)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797741 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 10:54:30 +00:00
Michael McCandless 228888a882 LUCENE-1644: fix highlighter to rewrite MTQ whenever it's not already a SCORING_BOOLEAN_QUERY
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797728 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:31:17 +00:00
Uwe Schindler 187ac2b4f3 LUCENE-1693: Add AttributeSource ctors to Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797727 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:23:07 +00:00
Michael Busch c311d5730c LUCENE-1448: Add TokenStream.end() to perform end-of-stream operations. Fixes offset problems when multiple fields with the same name are added to a document.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797715 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 04:11:33 +00:00
Michael Busch e4b34462ee LUCENE-1760: TokenStream API javadoc improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797714 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 03:41:39 +00:00
Michael McCandless be66120dff LUCENE-1644: enable different rewrite methods for MultiTermQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797694 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 00:03:33 +00:00
Michael Busch 31a5f0edcc LUCENE-1693: Various improvements to the new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797665 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 21:45:48 +00:00
Otis Gospodnetic f758b4d259 - Typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797310 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 02:43:58 +00:00
Uwe Schindler 7b0b45796b LUCENE-1728: The javadocs-all target still used the old subdirs in contrib/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797213 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 20:27:41 +00:00
Michael McCandless 843f196584 add smartcn to website docs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797181 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 18:32:12 +00:00
Simon Willnauer 999f6157c7 LUCENE-1728: Splitted contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow up to 3MB compressed size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Michael McCandless 91e37bf1fb LUCENE-1609: un-deprecate IndexReader.getTermInfosIndexDivisor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796916 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 00:02:48 +00:00
Michael McCandless 374822897c remove print
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 16:44:28 +00:00
Michael Busch 869aabff8f Small fix for TestPayloads testcase.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796617 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 06:24:54 +00:00
Mark Robert Miller 3adc61c3ac LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll 63402f49c7 Javadoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless 849d072a41 LUCENE-1742: make SegmentReader, SegmentInfo, SegmentInfos public as 'expert' and 'subject-to-change-suddenly' APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795537 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 13:29:39 +00:00
Michael McCandless e4a7f98d71 LUCENE-1665: also deprecate short-hand ctors/setters in Sort that imply SortField.AUTO
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795179 13f79535-47bb-0310-9956-ffa450edef68
2009-07-17 18:23:30 +00:00
Michael McCandless eae3eea874 LUCENE-1566: do very large reads in chunks, to prevent hitting Sun JVM bug that throws invalid OOME
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794770 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 18:07:35 +00:00
Michael McCandless c79f54975e LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Michael McCandless d7579b7e1e LUCENE-1614: if a DISI doesn't implement advance, emulate it using skipTo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794399 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 20:21:53 +00:00
Michael McCandless 229a0a84a1 LUCENE-1744: fix BooleanScorer2.doc() to be correct when it's the top scorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794233 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 11:15:58 +00:00
Michael McCandless 6361faa7a3 fix javadoc warning
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794228 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 10:58:18 +00:00
Mark Robert Miller add56f5e66 LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller ea7e4ad344 LUCENE-1688: Deprecate static final String stop word array in and StopAnalzyer and replace it with an immutable implementation of CharArraySet.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Mark Robert Miller 6bf4d35ce8 LUCENE-1583: SpanOrQuery skipTo() doesn't always move forwards as Spans documentation indicates it should.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794063 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 20:56:50 +00:00
Otis Gospodnetic b393e4d0af LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
- line, and those below, will be ignored--

M    CHANGES.txt
M    analyzers/src/test/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilterTest.java
M    analyzers/src/test/org/apache/lucene/analysis/ngram/NGramTokenFilterTest.java
M    analyzers/src/java/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilter.java
M    analyzers/src/java/org/apache/lucene/analysis/ngram/NGramTokenFilter.java


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless 65494af827 LUCENE-1272: add MoreLikeThis.set/getBoost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless 23e2a395c8 LUCENE-1678: if a subclass of a core Analyzer overrides tokenStream but not reusableTokenStream, then the core analyzer's reusableTokenStream will now fallback to tokenStream to fix the previous back-compat break
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793966 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:32:18 +00:00
Uwe Schindler 70b7e0b41b LUCENE-1741: Make MMapDirectory.MAX_BBUF user configureable to support chunking the index files in smaller parts
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793826 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:30:31 +00:00
Uwe Schindler b2a4bc68ad LUCENE-1712: Set default precisionStep for NumericField and NumericRange*
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793823 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:17:44 +00:00
Michael McCandless 91aedd6685 LUCENE-1740: add 'analyzer' command to Lucli, to change analyzer from the default StandardAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793526 13f79535-47bb-0310-9956-ffa450edef68
2009-07-13 10:06:01 +00:00
Michael McCandless a284608778 LUCENE-1609: remove synchronization from terms dict by loading index up front
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793104 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 20:48:36 +00:00
Michael McCandless 4a63ea0f6c LUCENE-1726: move unchanged readers to private CoreReaders class that's shared across cloned SegmentReaders
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793039 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 17:08:19 +00:00
Michael McCandless ce1231b37c LUCENE-1739: when computing FieldInfos.hasProx(), disregard un-indexed fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792918 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 12:31:17 +00:00
Michael McCandless 9cbe5f4ff4 LUCENE-1522: adding new Fast Vector Highlighter contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792542 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 13:06:51 +00:00
Michael McCandless e49af570d1 LUCENE-1727: make sure fields are stored in the exact order they were added to the document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792535 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 12:53:28 +00:00
Michael McCandless 1f1fa05a7e LUCENE-1717: properly account for RAM used by buffered deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792532 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 12:44:57 +00:00
Michael McCandless 343f292dad add missing images to forrest's source dirs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791951 13f79535-47bb-0310-9956-ffa450edef68
2009-07-07 19:37:15 +00:00
Michael McCandless 333e77a431 LUCENE-1704: allow specifying the Tidy configuration file when parsing HTML docs with contrib/ant
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791587 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:55:05 +00:00
Mark Harwood f7fa579971 Initial commit of LUCENE-1486 - a subclass of the default QueryParser that overrides the parsing of PhraseQueries to allow more complex syntax e.g. wildcards in phrase queries
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791579 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:32:54 +00:00
Mark Robert Miller 28d65ceee7 remove java 1.5 dependency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791576 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:18:19 +00:00
Uwe Schindler 10fbc9814a Fix some JavaDoc errors in Numeric*
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791535 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 16:36:05 +00:00
Mark Robert Miller f780f77366 LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 15:56:39 +00:00
Uwe Schindler 705f099238 Convert and cleanup the test files to UTF-8. What is still broken is the incorrect usage of KOI8 and CP1251 encodings. Added svn:eol-style=native to all files again.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791483 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 13:50:17 +00:00
Uwe Schindler 6a3752feb8 LUCENE-1734: CharReader should delegate reset/mark/markSupported
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791415 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 08:55:52 +00:00