Commit Graph

3611 Commits

Author SHA1 Message Date
Simon Willnauer 999f6157c7 LUCENE-1728: Split contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow to as much as 3MB compressed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Michael McCandless 91e37bf1fb LUCENE-1609: un-deprecate IndexReader.getTermInfosIndexDivisor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796916 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 00:02:48 +00:00
Michael McCandless 374822897c remove print
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 16:44:28 +00:00
Michael Busch 869aabff8f Small fix for TestPayloads testcase.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@796617 13f79535-47bb-0310-9956-ffa450edef68
2009-07-22 06:24:54 +00:00
Mark Robert Miller 3adc61c3ac LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll 63402f49c7 Javadoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless 849d072a41 LUCENE-1742: make SegmentReader, SegmentInfo, SegmentInfos public as 'expert' and 'subject-to-change-suddenly' APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795537 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 13:29:39 +00:00
Michael McCandless e4a7f98d71 LUCENE-1665: also deprecate short-hand ctors/setters in Sort that imply SortField.AUTO
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795179 13f79535-47bb-0310-9956-ffa450edef68
2009-07-17 18:23:30 +00:00
Michael McCandless eae3eea874 LUCENE-1566: do very large reads in chunks, to prevent hitting Sun JVM bug that throws invalid OOME
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794770 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 18:07:35 +00:00
Michael McCandless c79f54975e LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Michael McCandless d7579b7e1e LUCENE-1614: if a DISI doesn't implement advance, emulate it using skipTo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794399 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 20:21:53 +00:00
Michael McCandless 229a0a84a1 LUCENE-1744: fix BooleanScorer2.doc() to be correct when it's the top scorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794233 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 11:15:58 +00:00
Michael McCandless 6361faa7a3 fix javadoc warning
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794228 13f79535-47bb-0310-9956-ffa450edef68
2009-07-15 10:58:18 +00:00
Mark Robert Miller add56f5e66 LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller ea7e4ad344 LUCENE-1688: Deprecate static final String stop word array in StopAnalyzer and replace it with an immutable implementation of CharArraySet.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Mark Robert Miller 6bf4d35ce8 LUCENE-1583: SpanOrQuery skipTo() doesn't always move forwards as Spans documentation indicates it should.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794063 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 20:56:50 +00:00
Otis Gospodnetic b393e4d0af LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless 65494af827 LUCENE-1272: add MoreLikeThis.set/getBoost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless 23e2a395c8 LUCENE-1678: if a subclass of a core Analyzer overrides tokenStream but not reusableTokenStream, then the core analyzer's reusableTokenStream will now fallback to tokenStream to fix the previous back-compat break
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793966 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:32:18 +00:00
Uwe Schindler 70b7e0b41b LUCENE-1741: Make MMapDirectory.MAX_BBUF user configurable to support chunking the index files into smaller parts
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793826 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:30:31 +00:00
Uwe Schindler b2a4bc68ad LUCENE-1712: Set default precisionStep for NumericField and NumericRange*
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793823 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 09:17:44 +00:00
Michael McCandless 91aedd6685 LUCENE-1740: add 'analyzer' command to Lucli, to change analyzer from the default StandardAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793526 13f79535-47bb-0310-9956-ffa450edef68
2009-07-13 10:06:01 +00:00
Michael McCandless a284608778 LUCENE-1609: remove synchronization from terms dict by loading index up front
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793104 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 20:48:36 +00:00
Michael McCandless 4a63ea0f6c LUCENE-1726: move unchanged readers to private CoreReaders class that's shared across cloned SegmentReaders
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793039 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 17:08:19 +00:00
Michael McCandless ce1231b37c LUCENE-1739: when computing FieldInfos.hasProx(), disregard un-indexed fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792918 13f79535-47bb-0310-9956-ffa450edef68
2009-07-10 12:31:17 +00:00
Michael McCandless 9cbe5f4ff4 LUCENE-1522: adding new Fast Vector Highlighter contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792542 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 13:06:51 +00:00
Michael McCandless e49af570d1 LUCENE-1727: make sure fields are stored in the exact order they were added to the document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792535 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 12:53:28 +00:00
Michael McCandless 1f1fa05a7e LUCENE-1717: properly account for RAM used by buffered deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792532 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 12:44:57 +00:00
Michael McCandless 343f292dad add missing images to forrest's source dirs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791951 13f79535-47bb-0310-9956-ffa450edef68
2009-07-07 19:37:15 +00:00
Michael McCandless 333e77a431 LUCENE-1704: allow specifying the Tidy configuration file when parsing HTML docs with contrib/ant
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791587 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:55:05 +00:00
Mark Harwood f7fa579971 Initial commit of LUCENE-1486 - a subclass of the default QueryParser that overrides the parsing of PhraseQueries to allow more complex syntax e.g. wildcards in phrase queries
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791579 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:32:54 +00:00
Mark Robert Miller 28d65ceee7 remove java 1.5 dependency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791576 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:18:19 +00:00
Uwe Schindler 10fbc9814a Fix some JavaDoc errors in Numeric*
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791535 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 16:36:05 +00:00
Mark Robert Miller f780f77366 LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 15:56:39 +00:00
Uwe Schindler 705f099238 Convert and cleanup the test files to UTF-8. What is still broken is the incorrect usage of KOI8 and CP1251 encodings. Added svn:eol-style=native to all files again.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791483 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 13:50:17 +00:00
Uwe Schindler 6a3752feb8 LUCENE-1734: CharReader should delegate reset/mark/markSupported
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791415 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 08:55:52 +00:00
Uwe Schindler 7acabfecf0 LUCENE-1713: Some minor fixes and additions to JavaDocs. ConstantScoreRangeQuery is fixed to this mode now.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791406 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 07:58:28 +00:00
Mark Robert Miller 9689ddf9e8 LUCENE-1599: missed SpanFirstQuery#clone and setting boost on clones
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791302 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 19:09:01 +00:00
Mark Robert Miller 405b71fd70 Add proper credit for LUCENE-1599 to changes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791299 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 18:44:39 +00:00
Mark Robert Miller e04abc52e7 LUCENE-1599: Add clone support for SpanQuerys. SpanRegexQuery counts on this functionality and does not work correctly without it.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791280 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 17:16:16 +00:00
Uwe Schindler 21fba4ee7b Fix equals() and hashCode() of FuzzyQuery (boost etc. missing, so use super methods)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791264 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 16:15:00 +00:00
Uwe Schindler fc44f0dc3c LUCENE-1713, LUCENE-1673: For completeness, add some getter methods to TermRangeFilter, similar to those of TermRangeQuery and NumericRangeFilter.
This also unifies the hashCode and equals methods in this family of queries. It also fixes a bug in hashCode and equals of NumericRangeQuery, missing the constantScoreRewrite property.
TermRangeTermEnum was changed to have a consistent ctor with TermRangeQuery.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791263 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 16:01:52 +00:00
Mark Robert Miller 9789089343 reader should be closed after use
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791246 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 14:01:14 +00:00
Uwe Schindler b0873e4476 LUCENE-1713: Rename RangeQuery -> TermRangeQuery (part 2)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791176 13f79535-47bb-0310-9956-ffa450edef68
2009-07-04 20:15:20 +00:00
Uwe Schindler fed4bba63d LUCENE-1713: Rename RangeQuery -> TermRangeQuery (part 1)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791175 13f79535-47bb-0310-9956-ffa450edef68
2009-07-04 20:14:12 +00:00
Uwe Schindler 3442865534 Set this file to the same eol-style as the others (no tests fail, but this file is of the same type/encoding; see mail on java-dev/java-commit)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790800 13f79535-47bb-0310-9956-ffa450edef68
2009-07-03 06:27:28 +00:00
Uwe Schindler 250e619595 Reverted Mike's prop changes in r787424 (see mail on java-dev/java-commit)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790796 13f79535-47bb-0310-9956-ffa450edef68
2009-07-03 06:04:59 +00:00
Mark Robert Miller 14f2a82e28 fix header formatting for changes-to-html
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790628 13f79535-47bb-0310-9956-ffa450edef68
2009-07-02 15:38:07 +00:00
Mark Robert Miller 86c3b356be missing Apache license header
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790581 13f79535-47bb-0310-9956-ffa450edef68
2009-07-02 13:25:08 +00:00
Simon Willnauer 410afb98bf LUCENE-1719: Add javadoc notes about ICUCollationKeyFilter's advantages over CollationKeyFilter (Steven Row via Simon Willnauer)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790262 13f79535-47bb-0310-9956-ffa450edef68
2009-07-01 16:50:47 +00:00