3941 Commits

Author SHA1 Message Date
Mark Robert Miller f73a4f4324 LUCENE-1695: Update the Highlighter to use the new TokenStream API. This issue breaks backwards compatibility with some public classes. If you have implemented custom Fragmenters or Scorers, you will need to adjust them to work with the new TokenStream API. Rather than being passed one Token at a time, you will be given a TokenStream to init your impl with - store the Attributes you are interested in locally and access them on each call to the method that used to pass a new Token. Look at the included updated impls for examples.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799455 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 22:00:47 +00:00
Mark Robert Miller 7ecaa8c990 wikipedia-flush-by-RAM.alg should use content.source
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799379 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:35:10 +00:00
Mark Robert Miller e505413fae wikipedia.alg should use content.source
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799377 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:34:28 +00:00
Uwe Schindler be85ca650a Add extra test for new TokenStream API backwards layer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799359 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 16:45:43 +00:00
Uwe Schindler 1073d29a53 LUCENE-1762: Slightly more readable code in Token/TermAttributeImpl, additional tests, deep cloning bug with Token.copyTo fixed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799025 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 19:14:07 +00:00
Michael McCandless 9f12a2be6b LUCENE-1766: document IndexReader/Searcher's thread safety as well
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799011 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 18:53:25 +00:00
Mark Robert Miller afb517e832 LUCENE-1752: Missing highlights when terms were repeated in separate, nested, boolean or disjunction queries.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798976 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:47:05 +00:00
Michael McCandless 1286589fd5 fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798974 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:34:40 +00:00
Michael McCandless 57b50dbbfe LUCENE-1705: adding missing CHANGES entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798959 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 15:55:13 +00:00
Michael McCandless 8d5b662535 LUCENE-1766: spell out IndexWriter's thread safety in the javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798835 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 10:18:30 +00:00
Chris M. Hostetter e4585c507d removing empty file that was left in by mistake in r798086 (part of LUCENE-1754)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798772 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 03:41:21 +00:00
Chris M. Hostetter 5a21421a61 LUCENE-1764 minor tweaks to SampleComparable so that it causes more sane FieldCache usage in contrib/remote tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798726 13f79535-47bb-0310-9956-ffa450edef68
2009-07-28 23:14:13 +00:00
Uwe Schindler 72a17d83d3 LUCENE-1754: JavaDoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798110 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 12:03:51 +00:00
Michael McCandless 45cf5ebe33 LUCENE-1754: EMPTY_DOCIDSET subclasses DocIdSet directly
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798104 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:23:11 +00:00
Michael McCandless dbff1fc9b5 LUCENE-1754: just use EMPTY_DOCIDSET.iterator() instead of new EmptyDocIdSetIterator
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:12:36 +00:00
Michael McCandless 094c674c4d LUCENE-1595: don't use SortField.AUTO; deprecate LineDocMaker & EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798096 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:15:03 +00:00
Michael McCandless 312400842f LUCENE-1754: add EmptyDocIdSetIterator
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798091 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:02:34 +00:00
Michael McCandless 3894101042 LUCENE-1754: update back-compat test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798089 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:57:41 +00:00
Michael McCandless 26a2c427d1 LUCENE-1754: BooleanQuery detects up front if it won't match any docs and returns null from its scorer() instead of NonMatchingScorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798086 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:50:02 +00:00
Uwe Schindler 05351a6f63 Remove left-over javadoc <font> tag.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797870 13f79535-47bb-0310-9956-ffa450edef68
2009-07-26 07:30:19 +00:00
Uwe Schindler 0f9d2ec5b8 Some additional changes to LUCENE-1644 / MTQ:
- Deprecate one ctor of MultiTermQuery(Term), as Term field is deprecated
- Add readResolve also for the default auto rewrite method (not really needed because equals/hashcode impl, but singleton status is preserved)
- Some changes in NumericRangeQueryTests (they now always use constant score boolean/filter rewrite in basic tests, the default in all other tests)
- Add some explanations for the rewriteMethod used in NumericRangeQuery's ctor (term enum seeks and so on). I also made a switch statement of the valSize (like at other places in this class, this is for later 8 or 16 bit valSize extensions)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797741 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 10:54:30 +00:00
Michael McCandless 228888a882 LUCENE-1644: fix highlighter to rewrite MTQ whenever it's not already a SCORING_BOOLEAN_QUERY
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797728 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:31:17 +00:00
Uwe Schindler 187ac2b4f3 LUCENE-1693: Add AttributeSource ctors to Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797727 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:23:07 +00:00
Michael Busch c311d5730c LUCENE-1448: Add TokenStream.end() to perform end-of-stream operations. Fixes offset problems when multiple fields with the same name are added to a document.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797715 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 04:11:33 +00:00
Michael Busch e4b34462ee LUCENE-1760: TokenStream API javadoc improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797714 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 03:41:39 +00:00
Michael McCandless be66120dff LUCENE-1644: enable different rewrite methods for MultiTermQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797694 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 00:03:33 +00:00
Michael Busch 31a5f0edcc LUCENE-1693: Various improvements to the new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797665 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 21:45:48 +00:00
Otis Gospodnetic f758b4d259 - Typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797310 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 02:43:58 +00:00
Uwe Schindler 7b0b45796b LUCENE-1728: The javadocs-all target still used the old subdirs in contrib/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797213 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 20:27:41 +00:00
Michael McCandless 843f196584 add smartcn to website docs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797181 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 18:32:12 +00:00
Simon Willnauer 999f6157c7 LUCENE-1728: Split contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow up to 3MB compressed size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Michael McCandless 91e37bf1fb LUCENE-1609: un-deprecate IndexReader.getTermInfosIndexDivisor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796916 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 00:02:48 +00:00
Michael McCandless 374822897c remove print
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 16:44:28 +00:00
Michael Busch 869aabff8f Small fix for TestPayloads testcase.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796617 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 06:24:54 +00:00
Mark Robert Miller 3adc61c3ac LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll 63402f49c7 Javadoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless 849d072a41 LUCENE-1742: make SegmentReader, SegmentInfo, SegmentInfos public as 'expert' and 'subject-to-change-suddenly' APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795537 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 13:29:39 +00:00
Michael McCandless e4a7f98d71 LUCENE-1665: also deprecate short-hand ctors/setters in Sort that imply SortField.AUTO
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795179 13f79535-47bb-0310-9956-ffa450edef68
2009-07-17 18:23:30 +00:00
Michael McCandless eae3eea874 LUCENE-1566: do very large reads in chunks, to prevent hitting Sun JVM bug that throws invalid OOME
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794770 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 18:07:35 +00:00
Michael McCandless c79f54975e LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Michael McCandless d7579b7e1e LUCENE-1614: if a DISI doesn't implement advance, emulate it using skipTo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794399 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 20:21:53 +00:00
Michael McCandless 229a0a84a1 LUCENE-1744: fix BooleanScorer2.doc() to be correct when it's the top scorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794233 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 11:15:58 +00:00
Michael McCandless 6361faa7a3 fix javadoc warning
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794228 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 10:58:18 +00:00
Mark Robert Miller add56f5e66 LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller ea7e4ad344 LUCENE-1688: Deprecate static final String stop word array in StopAnalyzer and replace it with an immutable implementation of CharArraySet.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Mark Robert Miller 6bf4d35ce8 LUCENE-1583: SpanOrQuery skipTo() doesn't always move forwards as Spans documentation indicates it should.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794063 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 20:56:50 +00:00
Otis Gospodnetic b393e4d0af LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless 65494af827 LUCENE-1272: add MoreLikeThis.set/getBoost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless 23e2a395c8 LUCENE-1678: if a subclass of a core Analyzer overrides tokenStream but not reusableTokenStream, then the core analyzer's reusableTokenStream will now fall back to tokenStream to fix the previous back-compat break
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793966 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:32:18 +00:00
Uwe Schindler 70b7e0b41b LUCENE-1741: Make MMapDirectory.MAX_BBUF user configurable to support chunking the index files in smaller parts
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793826 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:30:31 +00:00