Commit Graph

117 Commits

Author SHA1 Message Date
Uwe Schindler edbff4fb67 Cleanup changes in trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@909828 13f79535-47bb-0310-9956-ffa450edef68
2010-02-13 14:14:55 +00:00
Robert Muir a6b7c5552b LUCENE-2055: better snowball integration, deprecate buggy handcoded snowball impls, restructure lang support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@907125 13f79535-47bb-0310-9956-ffa450edef68
2010-02-05 23:05:46 +00:00
Robert Muir 23d403b6bb LUCENE-2234: Hindi Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@906468 13f79535-47bb-0310-9956-ffa450edef68
2010-02-04 12:41:56 +00:00
Robert Muir fdf4ea2448 LUCENE-2218: Improvements to ShingleFilter (performance, configurable sep. char and min shingle size)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@905043 13f79535-47bb-0310-9956-ffa450edef68
2010-01-31 14:04:01 +00:00
Koji Sekiguchi 65e1223ac4 LUCENE-2243: Add DisjunctionMaxQuery support for FastVectorHighlighter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@904776 13f79535-47bb-0310-9956-ffa450edef68
2010-01-30 13:13:13 +00:00
Robert Muir ba2b0851b8 LUCENE-2226: move contrib/snowball to contrib/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@901505 13f79535-47bb-0310-9956-ffa450edef68
2010-01-21 02:45:09 +00:00
Robert Muir 78e45c92a7 LUCENE-2207: CJKTokenizer generates tokens with incorrect offsets
LUCENE-2219: Chinese, SmartChinese, Wikipedia tokenizers generate incorrect offsets, test end() in BaseTokenStreamTestCase


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@900196 13f79535-47bb-0310-9956-ffa450edef68
2010-01-17 19:25:57 +00:00
Michael McCandless edf6694156 LUCENE-1845: add CHANGES entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@900155 13f79535-47bb-0310-9956-ffa450edef68
2010-01-17 15:14:06 +00:00
Robert Muir 6ac5ebd0a1 LUCENE-2206: add snowball stopword lists
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@899955 13f79535-47bb-0310-9956-ffa450edef68
2010-01-16 14:07:35 +00:00
Uwe Schindler 2bbd00bd1e move changes.txt entry into contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@899682 13f79535-47bb-0310-9956-ffa450edef68
2010-01-15 16:14:24 +00:00
Robert Muir fc4cf94bdf LUCENE-2201: use char[] for snowball, don't create intermediate strings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@898976 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 22:29:21 +00:00
Koji Sekiguchi 260d294111 LUCENE-2204: FastVectorHighlighter: some classes and members should be publicly accessible
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@898323 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 13:53:32 +00:00
Simon Willnauer 673e368bf7 LUCENE-2199: ShingleFilter skipped over tri-gram shingles if outputUnigram was set to false
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@897672 13f79535-47bb-0310-9956-ffa450edef68
2010-01-10 18:06:19 +00:00
Robert Muir 5e8e5a0f05 LUCENE-2194: Improve the efficiency of Snowball
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@897449 13f79535-47bb-0310-9956-ffa450edef68
2010-01-09 13:34:11 +00:00
Mark Robert Miller 27f67473d5 TokenSources.getTokenStream() does not assign positionIncrement.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@896624 13f79535-47bb-0310-9956-ffa450edef68
2010-01-06 19:08:36 +00:00
Simon Willnauer c19d78dd4e LUCENE-2147: Improve Spatial Utility like classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@896240 13f79535-47bb-0310-9956-ffa450edef68
2010-01-05 22:03:48 +00:00
Robert Muir cdac1f7113 LUCENE-2084: remove Byte/CharBuffer wrapping for collation key generation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@895341 13f79535-47bb-0310-9956-ffa450edef68
2010-01-03 09:22:40 +00:00
Robert Muir 6e6acefb05 LUCENE-2124: fix javadocs from the move, thanks Steven
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@892743 13f79535-47bb-0310-9956-ffa450edef68
2009-12-21 09:50:58 +00:00
Robert Muir f616a47036 LUCENE-2165: SnowballAnalyzer was missing Set-based ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@891209 13f79535-47bb-0310-9956-ffa450edef68
2009-12-16 12:13:36 +00:00
Uwe Schindler dad7e60253 LUCENE-2157: DelimitedPayloadTokenFilter no longer copies the buffer over itsself, instead it sets the length to the offset of the delimiter. Also optimizes logic and IdentityEncoder to use NIO.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@890791 13f79535-47bb-0310-9956-ffa450edef68
2009-12-15 13:27:27 +00:00
Michael McCandless 121dbb58ba LUCENE-2144: fix InstantiatedIndex to handle termDocs(null)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@889431 13f79535-47bb-0310-9956-ffa450edef68
2009-12-10 21:38:07 +00:00
Simon Willnauer 6c0c318218 LUCENE-2100: Marked all contrib Analyzer subclasses as final. Analyzers should be only act as a composition of TokenStreams, users should compose their own analyzers instead of subclassing existing ones.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888799 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 13:32:32 +00:00
Simon Willnauer 43c475d296 LUCENE-2117: SnowballAnalyzer uses TurkishLowerCaseFilter instead of LowercaseFilter to correctly handle the unique Turkish casing behavior if used with Version > 3.0 and the TurkishStemmer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888787 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 12:47:37 +00:00
Robert Muir 550a4ef1af LUCENE-2124: move jdk collation to core, icu collation to icu contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888780 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 12:08:06 +00:00
Simon Willnauer 9ee4ce0fd5 LUCENE-2102: Add Turkish LowerCaseFilter which handles Turkish and Azeri unique casing behavior correctly.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887535 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:46:05 +00:00
Simon Willnauer 5556599fad LUCENE-2039: Added extensible query parser which enables arbitrary parser extensions based on field naming scheme
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887533 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:29:59 +00:00
Simon Willnauer ab447b6af0 LUCENE-2108: Enable safe concurrent spell-index modifications in Spellchecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887532 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:17:17 +00:00
Michael McCandless fa65d42e94 LUCENE-2115: cutover contrib tests to use generics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887524 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 09:43:54 +00:00
Michael McCandless fbfd147b23 LUCENE-2108: add SpellChecker.close()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@886911 13f79535-47bb-0310-9956-ffa450edef68
2009-12-03 20:35:19 +00:00
Robert Muir 892bc7f55a LUCENE-2062: Bulgarian Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@886190 13f79535-47bb-0310-9956-ffa450edef68
2009-12-02 16:08:56 +00:00
Robert Muir 2ef402eefa LUCENE-2067: Add a stemmer for Czech
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@885216 13f79535-47bb-0310-9956-ffa450edef68
2009-11-29 11:59:38 +00:00
Simon Willnauer e69141c51a LUCENE-2068: Fixed ReverseStringFilter for Unicode 4.0. Reverse Supplementary Characters correctly.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@883149 13f79535-47bb-0310-9956-ffa450edef68
2009-11-22 21:09:42 +00:00
Uwe Schindler ac2a7f5112 fix changes file in contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@881550 13f79535-47bb-0310-9956-ffa450edef68
2009-11-17 21:50:15 +00:00
Uwe Schindler 470c99dcee change version in trunk to 3.1-dev
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@880756 13f79535-47bb-0310-9956-ffa450edef68
2009-11-16 14:03:05 +00:00
Uwe Schindler 00f07ee460 LUCENE-2051: Contrib Analyzer Setters should be deprecated and replace with ctor arguments, thanks to Simon Willnauer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@880715 13f79535-47bb-0310-9956-ffa450edef68
2009-11-16 11:48:37 +00:00
Michael McCandless 53b807726a fold in 2.9.1 contrib/CHANGES entry to trunk's
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833475 13f79535-47bb-0310-9956-ffa450edef68
2009-11-06 17:10:22 +00:00
Robert Muir 80e8bfbbc9 LUCENE-2031: Move patternanalyzer from memory contrib into analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@832889 13f79535-47bb-0310-9956-ffa450edef68
2009-11-04 22:37:01 +00:00
Robert Muir 1b38f9c24d LUCENE-2014: SmartChineseAnalyzer position increment bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830871 13f79535-47bb-0310-9956-ffa450edef68
2009-10-29 09:22:37 +00:00
Robert Muir 36b65637fc LUCENE-1904: Move wordnet synonym code from contrib/memory to contrib/wordnet
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830699 13f79535-47bb-0310-9956-ffa450edef68
2009-10-28 17:49:53 +00:00
Mark Robert Miller b14fd4bccc credit where credits due ;)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830000 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 22:21:07 +00:00
Uwe Schindler e4fdf4856e LUCENE-1929: Merge missing changes entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829999 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 22:17:51 +00:00
Michael McCandless c2bc750e48 LUCENE-2002: move changes entry back to 'API changes'
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829814 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 14:37:25 +00:00
Michael McCandless 2e4dc5b003 LUCENE-2002: move CHANGES entry under 'Changes in back compat policy' section, since it deprecates APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829774 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 12:44:02 +00:00
Michael McCandless aaddac8992 LUCENE-2002: add Version to QueryParser & contrib analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829206 13f79535-47bb-0310-9956-ffa450edef68
2009-10-23 20:25:17 +00:00
Mark Robert Miller 0557d2ce5a LUCENE-2003 Changes entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829132 13f79535-47bb-0310-9956-ffa450edef68
2009-10-23 17:13:21 +00:00
Robert Muir d1fc6bece6 LUCENE-1359: FrenchAnalyzer tokenstream does not honor the contract of Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@828298 13f79535-47bb-0310-9956-ffa450edef68
2009-10-22 04:03:12 +00:00
Robert Muir afc66e4e66 LUCENE-2001: Fix parsing bug in wordnet contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@828091 13f79535-47bb-0310-9956-ffa450edef68
2009-10-21 16:32:03 +00:00
Michael McCandless 5ceb81834d LUCENE-1993: add maxDocFreq to MoreLikeThis
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@827042 13f79535-47bb-0310-9956-ffa450edef68
2009-10-20 11:59:53 +00:00
Robert Muir e053d80455 LUCENE-1966: ArabicAnalyzer stopwords cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@825110 13f79535-47bb-0310-9956-ffa450edef68
2009-10-14 12:24:18 +00:00
Andrzej Bialecki 7d0d4ecc44 LUCENE-1959 Add MultiPassIndexSplitter.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@824798 13f79535-47bb-0310-9956-ffa450edef68
2009-10-13 14:54:30 +00:00