Uwe Schindler
edbff4fb67
Cleanup changes in trunk
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@909828 13f79535-47bb-0310-9956-ffa450edef68
2010-02-13 14:14:55 +00:00
Robert Muir
a6b7c5552b
LUCENE-2055: better snowball integration, deprecate buggy handcoded snowball impls, restructure lang support
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@907125 13f79535-47bb-0310-9956-ffa450edef68
2010-02-05 23:05:46 +00:00
Robert Muir
23d403b6bb
LUCENE-2234: Hindi Analyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@906468 13f79535-47bb-0310-9956-ffa450edef68
2010-02-04 12:41:56 +00:00
Robert Muir
fdf4ea2448
LUCENE-2218: Improvements to ShingleFilter (performance, configurable sep. char and min shingle size)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@905043 13f79535-47bb-0310-9956-ffa450edef68
2010-01-31 14:04:01 +00:00
Koji Sekiguchi
65e1223ac4
LUCENE-2243: Add DisjunctionMaxQuery support for FastVectorHighlighter
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@904776 13f79535-47bb-0310-9956-ffa450edef68
2010-01-30 13:13:13 +00:00
Robert Muir
ba2b0851b8
LUCENE-2226: move contrib/snowball to contrib/analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@901505 13f79535-47bb-0310-9956-ffa450edef68
2010-01-21 02:45:09 +00:00
Robert Muir
78e45c92a7
LUCENE-2207: CJKTokenizer generates tokens with incorrect offsets
...
LUCENE-2219: Chinese, SmartChinese, Wikipedia tokenizers generate incorrect offsets, test end() in BaseTokenStreamTestCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@900196 13f79535-47bb-0310-9956-ffa450edef68
2010-01-17 19:25:57 +00:00
Michael McCandless
edf6694156
LUCENE-1845: add CHANGES entry
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@900155 13f79535-47bb-0310-9956-ffa450edef68
2010-01-17 15:14:06 +00:00
Robert Muir
6ac5ebd0a1
LUCENE-2206: add snowball stopword lists
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@899955 13f79535-47bb-0310-9956-ffa450edef68
2010-01-16 14:07:35 +00:00
Uwe Schindler
2bbd00bd1e
move changes.txt entry into contrib
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@899682 13f79535-47bb-0310-9956-ffa450edef68
2010-01-15 16:14:24 +00:00
Robert Muir
fc4cf94bdf
LUCENE-2201: use char[] for snowball, don't create intermediate strings
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@898976 13f79535-47bb-0310-9956-ffa450edef68
2010-01-13 22:29:21 +00:00
Koji Sekiguchi
260d294111
LUCENE-2204: FastVectorHighlighter: some classes and members should be publicly accessible
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@898323 13f79535-47bb-0310-9956-ffa450edef68
2010-01-12 13:53:32 +00:00
Simon Willnauer
673e368bf7
LUCENE-2199: ShingleFilter skipped over tri-gram shingles if outputUnigram was set to false
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@897672 13f79535-47bb-0310-9956-ffa450edef68
2010-01-10 18:06:19 +00:00
Robert Muir
5e8e5a0f05
LUCENE-2194: Improve the efficiency of Snowball
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@897449 13f79535-47bb-0310-9956-ffa450edef68
2010-01-09 13:34:11 +00:00
Mark Robert Miller
27f67473d5
TokenSources.getTokenStream() does not assign positionIncrement.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@896624 13f79535-47bb-0310-9956-ffa450edef68
2010-01-06 19:08:36 +00:00
Simon Willnauer
c19d78dd4e
LUCENE-2147: Improve Spatial Utility like classes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@896240 13f79535-47bb-0310-9956-ffa450edef68
2010-01-05 22:03:48 +00:00
Robert Muir
cdac1f7113
LUCENE-2084: remove Byte/CharBuffer wrapping for collation key generation
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@895341 13f79535-47bb-0310-9956-ffa450edef68
2010-01-03 09:22:40 +00:00
Robert Muir
6e6acefb05
LUCENE-2124: fix javadocs from the move, thanks Steven
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@892743 13f79535-47bb-0310-9956-ffa450edef68
2009-12-21 09:50:58 +00:00
Robert Muir
f616a47036
LUCENE-2165: SnowballAnalyzer was missing Set-based ctor
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@891209 13f79535-47bb-0310-9956-ffa450edef68
2009-12-16 12:13:36 +00:00
Uwe Schindler
dad7e60253
LUCENE-2157: DelimitedPayloadTokenFilter no longer copies the buffer over itsself, instead it sets the length to the offset of the delimiter. Also optimizes logic and IdentityEncoder to use NIO.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@890791 13f79535-47bb-0310-9956-ffa450edef68
2009-12-15 13:27:27 +00:00
Michael McCandless
121dbb58ba
LUCENE-2144: fix InstantiatedIndex to handle termDocs(null)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@889431 13f79535-47bb-0310-9956-ffa450edef68
2009-12-10 21:38:07 +00:00
Simon Willnauer
6c0c318218
LUCENE-2100: Marked all contrib Analyzer subclasses as final. Analyzers should be only act as a composition of TokenStreams, users should compose their own analyzers instead of subclassing existing ones.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888799 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 13:32:32 +00:00
Simon Willnauer
43c475d296
LUCENE-2117: SnowballAnalyzer uses TurkishLowerCaseFilter instead of LowercaseFilter to correctly handle the unique Turkish casing behavior if used with Version > 3.0 and the TurkishStemmer.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888787 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 12:47:37 +00:00
Robert Muir
550a4ef1af
LUCENE-2124: move jdk collation to core, icu collation to icu contrib
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888780 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 12:08:06 +00:00
Simon Willnauer
9ee4ce0fd5
LUCENE-2102: Add Turkish LowerCaseFilter which handles Turkish and Azeri unique casing behavior correctly.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887535 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:46:05 +00:00
Simon Willnauer
5556599fad
LUCENE-2039: Added extensible query parser which enables arbitrary parser extensions based on field naming scheme
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887533 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:29:59 +00:00
Simon Willnauer
ab447b6af0
LUCENE-2108: Enable safe concurrent spell-index modifications in Spellchecker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887532 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:17:17 +00:00
Michael McCandless
fa65d42e94
LUCENE-2115: cutover contrib tests to use generics
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887524 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 09:43:54 +00:00
Michael McCandless
fbfd147b23
LUCENE-2108: add SpellChecker.close()
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@886911 13f79535-47bb-0310-9956-ffa450edef68
2009-12-03 20:35:19 +00:00
Robert Muir
892bc7f55a
LUCENE-2062: Bulgarian Analyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@886190 13f79535-47bb-0310-9956-ffa450edef68
2009-12-02 16:08:56 +00:00
Robert Muir
2ef402eefa
LUCENE-2067: Add a stemmer for Czech
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@885216 13f79535-47bb-0310-9956-ffa450edef68
2009-11-29 11:59:38 +00:00
Simon Willnauer
e69141c51a
LUCENE-2068: Fixed ReverseStringFilter for Unicode 4.0. Reverse Supplementary Characters correctly.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@883149 13f79535-47bb-0310-9956-ffa450edef68
2009-11-22 21:09:42 +00:00
Uwe Schindler
ac2a7f5112
fix changes file in contrib
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@881550 13f79535-47bb-0310-9956-ffa450edef68
2009-11-17 21:50:15 +00:00
Uwe Schindler
470c99dcee
change version in trunk to 3.1-dev
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@880756 13f79535-47bb-0310-9956-ffa450edef68
2009-11-16 14:03:05 +00:00
Uwe Schindler
00f07ee460
LUCENE-2051: Contrib Analyzer Setters should be deprecated and replace with ctor arguments, thanks to Simon Willnauer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@880715 13f79535-47bb-0310-9956-ffa450edef68
2009-11-16 11:48:37 +00:00
Michael McCandless
53b807726a
fold in 2.9.1 contrib/CHANGES entry to trunk's
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833475 13f79535-47bb-0310-9956-ffa450edef68
2009-11-06 17:10:22 +00:00
Robert Muir
80e8bfbbc9
LUCENE-2031: Move patternanalyzer from memory contrib into analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@832889 13f79535-47bb-0310-9956-ffa450edef68
2009-11-04 22:37:01 +00:00
Robert Muir
1b38f9c24d
LUCENE-2014: SmartChineseAnalyzer position increment bug
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830871 13f79535-47bb-0310-9956-ffa450edef68
2009-10-29 09:22:37 +00:00
Robert Muir
36b65637fc
LUCENE-1904: Move wordnet synonym code from contrib/memory to contrib/wordnet
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830699 13f79535-47bb-0310-9956-ffa450edef68
2009-10-28 17:49:53 +00:00
Mark Robert Miller
b14fd4bccc
credit where credits due ;)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830000 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 22:21:07 +00:00
Uwe Schindler
e4fdf4856e
LUCENE-1929: Merge missing changes entry
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829999 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 22:17:51 +00:00
Michael McCandless
c2bc750e48
LUCENE-2002: move changes entry back to 'API changes'
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829814 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 14:37:25 +00:00
Michael McCandless
2e4dc5b003
LUCENE-2002: move CHANGES entry under 'Changes in back compat policy' section, since it deprecates APIs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829774 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 12:44:02 +00:00
Michael McCandless
aaddac8992
LUCENE-2002: add Version to QueryParser & contrib analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829206 13f79535-47bb-0310-9956-ffa450edef68
2009-10-23 20:25:17 +00:00
Mark Robert Miller
0557d2ce5a
LUCENE-2003 Changes entry
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829132 13f79535-47bb-0310-9956-ffa450edef68
2009-10-23 17:13:21 +00:00
Robert Muir
d1fc6bece6
LUCENE-1359: FrenchAnalyzer tokenstream does not honor the contract of Analyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@828298 13f79535-47bb-0310-9956-ffa450edef68
2009-10-22 04:03:12 +00:00
Robert Muir
afc66e4e66
LUCENE-2001: Fix parsing bug in wordnet contrib
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@828091 13f79535-47bb-0310-9956-ffa450edef68
2009-10-21 16:32:03 +00:00
Michael McCandless
5ceb81834d
LUCENE-1993: add maxDocFreq to MoreLikeThis
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@827042 13f79535-47bb-0310-9956-ffa450edef68
2009-10-20 11:59:53 +00:00
Robert Muir
e053d80455
LUCENE-1966: ArabicAnalyzer stopwords cleanup
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@825110 13f79535-47bb-0310-9956-ffa450edef68
2009-10-14 12:24:18 +00:00
Andrzej Bialecki
7d0d4ecc44
LUCENE-1959 Add MultiPassIndexSplitter.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@824798 13f79535-47bb-0310-9956-ffa450edef68
2009-10-13 14:54:30 +00:00