Mark Robert Miller
f73a4f4324
LUCENE-1695: Update the Highlighter to use the new TokenStream API. This issue breaks backwards compatibility with some public classes. If you have implemented custom Fragmenters or Scorers, you will need to adjust them to work with the new TokenStream API. Rather than being passed one Token at a time, you will be given a TokenStream to init your impl with; store the Attributes you are interested in locally and access them on each call to the method that used to pass a new Token. Look at the included updated impls for examples.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799455 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 22:00:47 +00:00
Mark Robert Miller
7ecaa8c990
wikipedia-flush-by-RAM.alg should use content.source
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799379 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:35:10 +00:00
Mark Robert Miller
e505413fae
wikipedia.alg should use content.source
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799377 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:34:28 +00:00
Uwe Schindler
be85ca650a
Add extra test for new TokenStream API backwards layer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799359 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 16:45:43 +00:00
Uwe Schindler
1073d29a53
LUCENE-1762: Slightly more readable code in Token/TermAttributeImpl, additional tests, deep cloning bug with Token.copyTo fixed
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799025 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 19:14:07 +00:00
Michael McCandless
9f12a2be6b
LUCENE-1766: document IndexReader/Searcher's thread safety as well
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799011 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 18:53:25 +00:00
Mark Robert Miller
afb517e832
LUCENE-1752: Missing highlights when terms were repeated in separate, nested, boolean or disjunction queries.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798976 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:47:05 +00:00
Michael McCandless
1286589fd5
fix typo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798974 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:34:40 +00:00
Michael McCandless
57b50dbbfe
LUCENE-1705: adding missing CHANGES entry
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798959 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 15:55:13 +00:00
Michael McCandless
8d5b662535
LUCENE-1766: spell out IndexWriter's thread safety in the javadocs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798835 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 10:18:30 +00:00
Chris M. Hostetter
e4585c507d
removing empty file that was left in by mistake in r798086 (part of LUCENE-1754)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798772 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 03:41:21 +00:00
Chris M. Hostetter
5a21421a61
LUCENE-1764 minor tweaks to SampleComparable so that it causes more sane FieldCache usage in contrib/remote tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798726 13f79535-47bb-0310-9956-ffa450edef68
2009-07-28 23:14:13 +00:00
Uwe Schindler
72a17d83d3
LUCENE-1754: JavaDoc updates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798110 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 12:03:51 +00:00
Michael McCandless
45cf5ebe33
LUCENE-1754: EMPTY_DOCIDSET subclasses DocIdSet directly
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798104 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:23:11 +00:00
Michael McCandless
dbff1fc9b5
LUCENE-1754: just use EMPTY_DOCIDSET.iterator() instead of new EmptyDocIdSetIterator
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:12:36 +00:00
Michael McCandless
094c674c4d
LUCENE-1595: don't use SortField.AUTO; deprecate LineDocMaker & EnwikiDocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798096 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:15:03 +00:00
Michael McCandless
312400842f
LUCENE-1754: add EmptyDocIdSetIterator
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798091 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:02:34 +00:00
Michael McCandless
3894101042
LUCENE-1754: update back-compat test
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798089 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:57:41 +00:00
Michael McCandless
26a2c427d1
LUCENE-1754: BooleanQuery detects up front if it won't match any docs and returns null from its scorer() instead of NonMatchingScorer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798086 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:50:02 +00:00
Uwe Schindler
05351a6f63
Remove left-over javadoc <font> tag.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797870 13f79535-47bb-0310-9956-ffa450edef68
2009-07-26 07:30:19 +00:00
Uwe Schindler
0f9d2ec5b8
Some additional changes to LUCENE-1644 / MTQ:
...
- Deprecate one ctor of MultiTermQuery(Term), as Term field is deprecated
- Add readResolve also for the default auto rewrite method (not really needed because equals/hashcode impl, but singleton status is preserved)
- Some changes in NumericRangeQueryTests (they now always use constant score boolean/filter rewrite in the basic tests, and the default in all other tests)
- Add some explanations for the rewriteMethod used in NumericRangeQuery's ctor (term enum seeks and so on). I also made a switch statement of the valSize (as in other places in this class; this is for later 8 or 16 bit valSize extensions)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797741 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 10:54:30 +00:00
Michael McCandless
228888a882
LUCENE-1644: fix highlighter to rewrite MTQ whenever it's not already a SCORING_BOOLEAN_QUERY
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797728 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:31:17 +00:00
Uwe Schindler
187ac2b4f3
LUCENE-1693: Add AttributeSource ctors to Tokenizer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797727 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:23:07 +00:00
Michael Busch
c311d5730c
LUCENE-1448: Add TokenStream.end() to perform end-of-stream operations. Fixes offset problems when multiple fields with the same name are added to a document.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797715 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 04:11:33 +00:00
Michael Busch
e4b34462ee
LUCENE-1760: TokenStream API javadoc improvements
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797714 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 03:41:39 +00:00
Michael McCandless
be66120dff
LUCENE-1644: enable different rewrite methods for MultiTermQuery
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797694 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 00:03:33 +00:00
Michael Busch
31a5f0edcc
LUCENE-1693: Various improvements to the new TokenStream API.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797665 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 21:45:48 +00:00
Otis Gospodnetic
f758b4d259
- Typo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797310 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 02:43:58 +00:00
Uwe Schindler
7b0b45796b
LUCENE-1728: The javadocs-all target still used the old subdirs in contrib/analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797213 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 20:27:41 +00:00
Michael McCandless
843f196584
add smartcn to website docs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797181 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 18:32:12 +00:00
Simon Willnauer
999f6157c7
LUCENE-1728: Split contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow up to 3MB compressed size.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Michael McCandless
91e37bf1fb
LUCENE-1609: un-deprecate IndexReader.getTermInfosIndexDivisor
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796916 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 00:02:48 +00:00
Michael McCandless
374822897c
remove print
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 16:44:28 +00:00
Michael Busch
869aabff8f
Small fix for TestPayloads testcase.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796617 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 06:24:54 +00:00
Mark Robert Miller
3adc61c3ac
LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll
63402f49c7
Javadoc updates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless
849d072a41
LUCENE-1742: make SegmentReader, SegmentInfo, SegmentInfos public as 'expert' and 'subject-to-change-suddenly' APIs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795537 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 13:29:39 +00:00
Michael McCandless
e4a7f98d71
LUCENE-1665: also deprecate short-hand ctors/setters in Sort that imply SortField.AUTO
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795179 13f79535-47bb-0310-9956-ffa450edef68
2009-07-17 18:23:30 +00:00
Michael McCandless
eae3eea874
LUCENE-1566: do very large reads in chunks, to prevent hitting Sun JVM bug that throws invalid OOME
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794770 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 18:07:35 +00:00
Michael McCandless
c79f54975e
LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Michael McCandless
d7579b7e1e
LUCENE-1614: if a DISI doesn't implement advance, emulate it using skipTo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794399 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 20:21:53 +00:00
Michael McCandless
229a0a84a1
LUCENE-1744: fix BooleanScorer2.doc() to be correct when it's the top scorer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794233 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 11:15:58 +00:00
Michael McCandless
6361faa7a3
fix javadoc warning
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794228 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 10:58:18 +00:00
Mark Robert Miller
add56f5e66
LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller
ea7e4ad344
LUCENE-1688: Deprecate static final String stop word array in StopAnalyzer and replace it with an immutable implementation of CharArraySet.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Mark Robert Miller
6bf4d35ce8
LUCENE-1583: SpanOrQuery skipTo() doesn't always move forwards as Spans documentation indicates it should.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794063 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 20:56:50 +00:00
Otis Gospodnetic
b393e4d0af
LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless
65494af827
LUCENE-1272: add MoreLikeThis.set/getBoost
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless
23e2a395c8
LUCENE-1678: if a subclass of a core Analyzer overrides tokenStream but not reusableTokenStream, then the core analyzer's reusableTokenStream will now fall back to tokenStream to fix the previous back-compat break
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793966 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:32:18 +00:00
Uwe Schindler
70b7e0b41b
LUCENE-1741: Make MMapDirectory.MAX_BBUF user configurable to support chunking the index files in smaller parts
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793826 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:30:31 +00:00