Commit Graph

3640 Commits

Author SHA1 Message Date
Mark Robert Miller 7ecaa8c990 wikipedia-flush-by-RAM.alg should use content.source
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799379 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:35:10 +00:00
Mark Robert Miller e505413fae wikipedia.alg should use content.source
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799377 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:34:28 +00:00
Uwe Schindler be85ca650a Add extra test for new TokenStream API backwards layer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799359 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 16:45:43 +00:00
Uwe Schindler 1073d29a53 LUCENE-1762: Slightly more readable code in Token/TermAttributeImpl, additional tests, deep cloning bug with Token.copyTo fixed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799025 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 19:14:07 +00:00
Michael McCandless 9f12a2be6b LUCENE-1766: document IndexReader/Searcher's thread safety as well
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799011 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 18:53:25 +00:00
Mark Robert Miller afb517e832 LUCENE-1752: Missing highlights when terms were repeated in separate, nested, boolean or disjunction queries.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798976 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:47:05 +00:00
Michael McCandless 1286589fd5 fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798974 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:34:40 +00:00
Michael McCandless 57b50dbbfe LUCENE-1705: adding missing CHANGES entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798959 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 15:55:13 +00:00
Michael McCandless 8d5b662535 LUCENE-1766: spell out IndexWriter's thread safety in the javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798835 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 10:18:30 +00:00
Chris M. Hostetter e4585c507d removing empty file that was left in by mistake in r798086 (part of LUCENE-1754)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798772 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 03:41:21 +00:00
Chris M. Hostetter 5a21421a61 LUCENE-1764 minor tweaks to SampleComparable so that it causes more sane FieldCache usage in contrib/remote tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798726 13f79535-47bb-0310-9956-ffa450edef68
2009-07-28 23:14:13 +00:00
Uwe Schindler 72a17d83d3 LUCENE-1754: JavaDoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798110 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 12:03:51 +00:00
Michael McCandless 45cf5ebe33 LUCENE-1754: EMPTY_DOCIDSET subclasses DocIdSet directly
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798104 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:23:11 +00:00
Michael McCandless dbff1fc9b5 LUCENE-1754: just use EMPTY_DOCIDSET.iterator() instead of new EmptyDocIdSetIterator
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:12:36 +00:00
Michael McCandless 094c674c4d LUCENE-1595: don't use SortField.AUTO; deprecate LineDocMaker & EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798096 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:15:03 +00:00
Michael McCandless 312400842f LUCENE-1754: add EmptyDocIdSetIterator
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798091 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:02:34 +00:00
Michael McCandless 3894101042 LUCENE-1754: update back-compat test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798089 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:57:41 +00:00
Michael McCandless 26a2c427d1 LUCENE-1754: BooleanQuery detects up front if it won't match any docs and returns null from its scorer() instead of NonMatchingScorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798086 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:50:02 +00:00
Uwe Schindler 05351a6f63 Remove left-over javadoc <font> tag.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797870 13f79535-47bb-0310-9956-ffa450edef68
2009-07-26 07:30:19 +00:00
Uwe Schindler 0f9d2ec5b8 Some additional changes to LUCENE-1644 / MTQ:
- Deprecate one ctor of MultiTermQuery(Term), as Term field is deprecated
- Add readResolve also for the default auto rewrite method (not really needed because equals/hashcode impl, but singleton status is preserved)
- Some changes in NumericRangeQueryTests (now they use always constant score boolean/filter rewrite in basic tests, the default in all other tests)
- Add some explanations for the rewriteMethod used in NumericRangeQery's ctor (term enum seeks and so on). I also made a switch statement of the valSize (like at other places in this class, this is for later 8 or 16 bit valSize extensions)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797741 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 10:54:30 +00:00
Michael McCandless 228888a882 LUCENE-1644: fix highlighter to rewrite MTQ whenever it's not already a SCORING_BOOLEAN_QUERY
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797728 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:31:17 +00:00
Uwe Schindler 187ac2b4f3 LUCENE-1693: Add AttributeSource ctors to Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797727 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:23:07 +00:00
Michael Busch c311d5730c LUCENE-1448: Add TokenStream.end() to perform end-of-stream operations. Fixes offset problems when multiple fields with the same name are added to a document.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797715 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 04:11:33 +00:00
Michael Busch e4b34462ee LUCENE-1760: TokenStream API javadoc improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797714 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 03:41:39 +00:00
Michael McCandless be66120dff LUCENE-1644: enable different rewrite methods for MultiTermQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797694 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 00:03:33 +00:00
Michael Busch 31a5f0edcc LUCENE-1693: Various improvements to the new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797665 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 21:45:48 +00:00
Otis Gospodnetic f758b4d259 - Typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797310 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 02:43:58 +00:00
Uwe Schindler 7b0b45796b LUCENE-1728: The javadocs-all target still used the old subdirs in contrib/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797213 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 20:27:41 +00:00
Michael McCandless 843f196584 add smartcn to website docs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797181 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 18:32:12 +00:00
Simon Willnauer 999f6157c7 LUCENE-1728: Splitted contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow up to 3MB compressed size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Michael McCandless 91e37bf1fb LUCENE-1609: un-deprecate IndexReader.getTermInfosIndexDivisor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796916 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 00:02:48 +00:00
Michael McCandless 374822897c remove print
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 16:44:28 +00:00
Michael Busch 869aabff8f Small fix for TestPayloads testcase.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796617 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 06:24:54 +00:00
Mark Robert Miller 3adc61c3ac LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll 63402f49c7 Javadoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless 849d072a41 LUCENE-1742: make SegmentReader, SegmentInfo, SegmentInfos public as 'expert' and 'subject-to-change-suddenly' APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795537 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 13:29:39 +00:00
Michael McCandless e4a7f98d71 LUCENE-1665: also deprecate short-hand ctors/setters in Sort that imply SortField.AUTO
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795179 13f79535-47bb-0310-9956-ffa450edef68
2009-07-17 18:23:30 +00:00
Michael McCandless eae3eea874 LUCENE-1566: do very large reads in chunks, to prevent hitting Sun JVM bug that throws invalid OOME
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794770 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 18:07:35 +00:00
Michael McCandless c79f54975e LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Michael McCandless d7579b7e1e LUCENE-1614: if a DISI doesn't implement advance, emulate it using skipTo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794399 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 20:21:53 +00:00
Michael McCandless 229a0a84a1 LUCENE-1744: fix BooleanScorer2.doc() to be correct when it's the top scorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794233 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 11:15:58 +00:00
Michael McCandless 6361faa7a3 fix javadoc warning
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794228 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 10:58:18 +00:00
Mark Robert Miller add56f5e66 LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller ea7e4ad344 LUCENE-1688: Deprecate static final String stop word array in and StopAnalzyer and replace it with an immutable implementation of CharArraySet.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Mark Robert Miller 6bf4d35ce8 LUCENE-1583: SpanOrQuery skipTo() doesn't always move forwards as Spans documentation indicates it should.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794063 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 20:56:50 +00:00
Otis Gospodnetic b393e4d0af LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
- line, and those below, will be ignored--

M    CHANGES.txt
M    analyzers/src/test/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilterTest.java
M    analyzers/src/test/org/apache/lucene/analysis/ngram/NGramTokenFilterTest.java
M    analyzers/src/java/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilter.java
M    analyzers/src/java/org/apache/lucene/analysis/ngram/NGramTokenFilter.java


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless 65494af827 LUCENE-1272: add MoreLikeThis.set/getBoost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless 23e2a395c8 LUCENE-1678: if a subclass of a core Analyzer overrides tokenStream but not reusableTokenStream, then the core analyzer's reusableTokenStream will now fallback to tokenStream to fix the previous back-compat break
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793966 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:32:18 +00:00
Uwe Schindler 70b7e0b41b LUCENE-1741: Make MMapDirectory.MAX_BBUF user configureable to support chunking the index files in smaller parts
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793826 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:30:31 +00:00
Uwe Schindler b2a4bc68ad LUCENE-1712: Set default precisionStep for NumericField and NumericRange*
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793823 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:17:44 +00:00