Simon Willnauer
999f6157c7
LUCENE-1728: Splitted contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow up to 3MB compressed size.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Michael McCandless
91e37bf1fb
LUCENE-1609: un-deprecate IndexReader.getTermInfosIndexDivisor
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796916 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 00:02:48 +00:00
Michael McCandless
374822897c
remove print
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 16:44:28 +00:00
Michael Busch
869aabff8f
Small fix for TestPayloads testcase.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796617 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 06:24:54 +00:00
Mark Robert Miller
3adc61c3ac
LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll
63402f49c7
Javadoc updates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless
849d072a41
LUCENE-1742: make SegmentReader, SegmentInfo, SegmentInfos public as 'expert' and 'subject-to-change-suddenly' APIs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795537 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 13:29:39 +00:00
Michael McCandless
e4a7f98d71
LUCENE-1665: also deprecate short-hand ctors/setters in Sort that imply SortField.AUTO
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795179 13f79535-47bb-0310-9956-ffa450edef68
2009-07-17 18:23:30 +00:00
Michael McCandless
eae3eea874
LUCENE-1566: do very large reads in chunks, to prevent hitting Sun JVM bug that throws invalid OOME
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794770 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 18:07:35 +00:00
Michael McCandless
c79f54975e
LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Michael McCandless
d7579b7e1e
LUCENE-1614: if a DISI doesn't implement advance, emulate it using skipTo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794399 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 20:21:53 +00:00
Michael McCandless
229a0a84a1
LUCENE-1744: fix BooleanScorer2.doc() to be correct when it's the top scorer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794233 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 11:15:58 +00:00
Michael McCandless
6361faa7a3
fix javadoc warning
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794228 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 10:58:18 +00:00
Mark Robert Miller
add56f5e66
LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller
ea7e4ad344
LUCENE-1688: Deprecate static final String stop word array in and StopAnalzyer and replace it with an immutable implementation of CharArraySet.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Mark Robert Miller
6bf4d35ce8
LUCENE-1583: SpanOrQuery skipTo() doesn't always move forwards as Spans documentation indicates it should.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794063 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 20:56:50 +00:00
Otis Gospodnetic
b393e4d0af
LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
...
- line, and those below, will be ignored--
M CHANGES.txt
M analyzers/src/test/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilterTest.java
M analyzers/src/test/org/apache/lucene/analysis/ngram/NGramTokenFilterTest.java
M analyzers/src/java/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilter.java
M analyzers/src/java/org/apache/lucene/analysis/ngram/NGramTokenFilter.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless
65494af827
LUCENE-1272: add MoreLikeThis.set/getBoost
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless
23e2a395c8
LUCENE-1678: if a subclass of a core Analyzer overrides tokenStream but not reusableTokenStream, then the core analyzer's reusableTokenStream will now fallback to tokenStream to fix the previous back-compat break
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793966 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:32:18 +00:00
Uwe Schindler
70b7e0b41b
LUCENE-1741: Make MMapDirectory.MAX_BBUF user configureable to support chunking the index files in smaller parts
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793826 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:30:31 +00:00
Uwe Schindler
b2a4bc68ad
LUCENE-1712: Set default precisionStep for NumericField and NumericRange*
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793823 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:17:44 +00:00
Michael McCandless
91aedd6685
LUCENE-1740: add 'analyzer' command to Lucli, to change analyzer from the default StandardAnalyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793526 13f79535-47bb-0310-9956-ffa450edef68
2009-07-13 10:06:01 +00:00
Michael McCandless
a284608778
LUCENE-1609: remove synchronization from terms dict by loading index up front
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793104 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 20:48:36 +00:00
Michael McCandless
4a63ea0f6c
LUCENE-1726: move unchanged readers to private CoreReaders class that's shared across cloned SegmentReaders
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793039 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 17:08:19 +00:00
Michael McCandless
ce1231b37c
LUCENE-1739: when computing FieldInfos.hasProx(), disregard un-indexed fields
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792918 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 12:31:17 +00:00
Michael McCandless
9cbe5f4ff4
LUCENE-1522: adding new Fast Vector Highlighter contrib
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792542 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 13:06:51 +00:00
Michael McCandless
e49af570d1
LUCENE-1727: make sure fields are stored in the exact order they were added to the document
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792535 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 12:53:28 +00:00
Michael McCandless
1f1fa05a7e
LUCENE-1717: properly account for RAM used by buffered deletes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792532 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 12:44:57 +00:00
Michael McCandless
343f292dad
add missing images to forrest's source dirs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791951 13f79535-47bb-0310-9956-ffa450edef68
2009-07-07 19:37:15 +00:00
Michael McCandless
333e77a431
LUCENE-1704: allow specifying the Tidy configuration file when parsing HTML docs with contrib/ant
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791587 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:55:05 +00:00
Mark Harwood
f7fa579971
Initial commit of LUCENE-1486 - a subclass of the default QueryParser that overrides the parsing of PhraseQueries to allow more complex syntax e.g. wildcards in phrase queries
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791579 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:32:54 +00:00
Mark Robert Miller
28d65ceee7
remove java 1.5 dependency
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791576 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:18:19 +00:00
Uwe Schindler
10fbc9814a
Fix some JavaDoc errors in Numeric*
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791535 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 16:36:05 +00:00
Mark Robert Miller
f780f77366
LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 15:56:39 +00:00
Uwe Schindler
705f099238
Convert and cleanup the test files to UTF-8. What is still broken is the incorrect usage of KOI8 and CP1251 encodings. Added svn:eol-style=native to all files again.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791483 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 13:50:17 +00:00
Uwe Schindler
6a3752feb8
LUCENE-1734: CharReader should delegate reset/mark/markSupported
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791415 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 08:55:52 +00:00
Uwe Schindler
7acabfecf0
LUCENE-1713: Some minor fixes and additions to JavaDocs. ConstantScoreRangeQuery is fixed to this mode now.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791406 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 07:58:28 +00:00
Mark Robert Miller
9689ddf9e8
LUCENE-1599: missed SpanFirstQuery#clone and setting boost on clones
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791302 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 19:09:01 +00:00
Mark Robert Miller
405b71fd70
Add proper credit for LUCENE-1599 to changes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791299 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 18:44:39 +00:00
Mark Robert Miller
e04abc52e7
LUCENE-1599: Add clone support for SpanQuerys. SpanRegexQuery counts on this functionality and does not work correctly without it.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791280 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 17:16:16 +00:00
Uwe Schindler
21fba4ee7b
Fix equals() and hashCode() of FuzzyQuery (boost etc. missing, so use super methods)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791264 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 16:15:00 +00:00
Uwe Schindler
fc44f0dc3c
LUCENE-1713, LUCENE-1673: For completeness, add some getter methods to TermRangeFilter, to be similar like TermRangeQuery and NumericRangeFilter.
...
This also unifies the hashCode and equals methods in this family of queries. It also fixes a bug in hashCode and equals of NumericRangeQuery, missing the constantScoreRewrite property.
TermRangeTermEnum was changed to have a consistent ctor with TermRangeQuery.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791263 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 16:01:52 +00:00
Mark Robert Miller
9789089343
reader should be closed after use
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791246 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 14:01:14 +00:00
Uwe Schindler
b0873e4476
LUCENE-1713: Rename RangeQuery -> TermRangeQuery (part 2)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791176 13f79535-47bb-0310-9956-ffa450edef68
2009-07-04 20:15:20 +00:00
Uwe Schindler
fed4bba63d
LUCENE-1713: Rename RangeQuery -> TermRangeQuery (part 1)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791175 13f79535-47bb-0310-9956-ffa450edef68
2009-07-04 20:14:12 +00:00
Uwe Schindler
3442865534
Set this file to the same eol-style like the other (no tests fail, but this file is of same type/encoding, see mail on java-dev/java-commit)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790800 13f79535-47bb-0310-9956-ffa450edef68
2009-07-03 06:27:28 +00:00
Uwe Schindler
250e619595
Reverted Mike's prop changes in r787424 (see mail on java-dev/java-commit)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790796 13f79535-47bb-0310-9956-ffa450edef68
2009-07-03 06:04:59 +00:00
Mark Robert Miller
14f2a82e28
fix header formatting for changes-to-html
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790628 13f79535-47bb-0310-9956-ffa450edef68
2009-07-02 15:38:07 +00:00
Mark Robert Miller
86c3b356be
missing Apache license header
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790581 13f79535-47bb-0310-9956-ffa450edef68
2009-07-02 13:25:08 +00:00
Simon Willnauer
410afb98bf
LUCENE-1719: Add javadoc notes about ICUCollationKeyFilter's advantages over CollationKeyFilter (Steven Row via Simon Willnauer)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790262 13f79535-47bb-0310-9956-ffa450edef68
2009-07-01 16:50:47 +00:00