Commit Graph

105 Commits

Author SHA1 Message Date
Simon Willnauer 673e368bf7 LUCENE-2199: ShingleFilter skipped over tri-gram shingles if outputUnigram was set to false
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@897672 13f79535-47bb-0310-9956-ffa450edef68
2010-01-10 18:06:19 +00:00
Robert Muir 5e8e5a0f05 LUCENE-2194: Improve the efficiency of Snowball
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@897449 13f79535-47bb-0310-9956-ffa450edef68
2010-01-09 13:34:11 +00:00
Mark Robert Miller 27f67473d5 TokenSources.getTokenStream() does not assign positionIncrement.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@896624 13f79535-47bb-0310-9956-ffa450edef68
2010-01-06 19:08:36 +00:00
Simon Willnauer c19d78dd4e LUCENE-2147: Improve Spatial Utility like classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@896240 13f79535-47bb-0310-9956-ffa450edef68
2010-01-05 22:03:48 +00:00
Robert Muir cdac1f7113 LUCENE-2084: remove Byte/CharBuffer wrapping for collation key generation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@895341 13f79535-47bb-0310-9956-ffa450edef68
2010-01-03 09:22:40 +00:00
Robert Muir 6e6acefb05 LUCENE-2124: fix javadocs from the move, thanks Steven
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@892743 13f79535-47bb-0310-9956-ffa450edef68
2009-12-21 09:50:58 +00:00
Robert Muir f616a47036 LUCENE-2165: SnowballAnalyzer was missing Set-based ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@891209 13f79535-47bb-0310-9956-ffa450edef68
2009-12-16 12:13:36 +00:00
Uwe Schindler dad7e60253 LUCENE-2157: DelimitedPayloadTokenFilter no longer copies the buffer over itsself, instead it sets the length to the offset of the delimiter. Also optimizes logic and IdentityEncoder to use NIO.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@890791 13f79535-47bb-0310-9956-ffa450edef68
2009-12-15 13:27:27 +00:00
Michael McCandless 121dbb58ba LUCENE-2144: fix InstantiatedIndex to handle termDocs(null)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@889431 13f79535-47bb-0310-9956-ffa450edef68
2009-12-10 21:38:07 +00:00
Simon Willnauer 6c0c318218 LUCENE-2100: Marked all contrib Analyzer subclasses as final. Analyzers should be only act as a composition of TokenStreams, users should compose their own analyzers instead of subclassing existing ones.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888799 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 13:32:32 +00:00
Simon Willnauer 43c475d296 LUCENE-2117: SnowballAnalyzer uses TurkishLowerCaseFilter instead of LowercaseFilter to correctly handle the unique Turkish casing behavior if used with Version > 3.0 and the TurkishStemmer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888787 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 12:47:37 +00:00
Robert Muir 550a4ef1af LUCENE-2124: move jdk collation to core, icu collation to icu contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@888780 13f79535-47bb-0310-9956-ffa450edef68
2009-12-09 12:08:06 +00:00
Simon Willnauer 9ee4ce0fd5 LUCENE-2102: Add Turkish LowerCaseFilter which handles Turkish and Azeri unique casing behavior correctly.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887535 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:46:05 +00:00
Simon Willnauer 5556599fad LUCENE-2039: Added extensible query parser which enables arbitrary parser extensions based on field naming scheme
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887533 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:29:59 +00:00
Simon Willnauer ab447b6af0 LUCENE-2108: Enable safe concurrent spell-index modifications in Spellchecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887532 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 12:17:17 +00:00
Michael McCandless fa65d42e94 LUCENE-2115: cutover contrib tests to use generics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@887524 13f79535-47bb-0310-9956-ffa450edef68
2009-12-05 09:43:54 +00:00
Michael McCandless fbfd147b23 LUCENE-2108: add SpellChecker.close()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@886911 13f79535-47bb-0310-9956-ffa450edef68
2009-12-03 20:35:19 +00:00
Robert Muir 892bc7f55a LUCENE-2062: Bulgarian Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@886190 13f79535-47bb-0310-9956-ffa450edef68
2009-12-02 16:08:56 +00:00
Robert Muir 2ef402eefa LUCENE-2067: Add a stemmer for Czech
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@885216 13f79535-47bb-0310-9956-ffa450edef68
2009-11-29 11:59:38 +00:00
Simon Willnauer e69141c51a LUCENE-2068: Fixed ReverseStringFilter for Unicode 4.0. Reverse Supplementary Characters correctly.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@883149 13f79535-47bb-0310-9956-ffa450edef68
2009-11-22 21:09:42 +00:00
Uwe Schindler ac2a7f5112 fix changes file in contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@881550 13f79535-47bb-0310-9956-ffa450edef68
2009-11-17 21:50:15 +00:00
Uwe Schindler 470c99dcee change version in trunk to 3.1-dev
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@880756 13f79535-47bb-0310-9956-ffa450edef68
2009-11-16 14:03:05 +00:00
Uwe Schindler 00f07ee460 LUCENE-2051: Contrib Analyzer Setters should be deprecated and replace with ctor arguments, thanks to Simon Willnauer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@880715 13f79535-47bb-0310-9956-ffa450edef68
2009-11-16 11:48:37 +00:00
Michael McCandless 53b807726a fold in 2.9.1 contrib/CHANGES entry to trunk's
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@833475 13f79535-47bb-0310-9956-ffa450edef68
2009-11-06 17:10:22 +00:00
Robert Muir 80e8bfbbc9 LUCENE-2031: Move patternanalyzer from memory contrib into analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@832889 13f79535-47bb-0310-9956-ffa450edef68
2009-11-04 22:37:01 +00:00
Robert Muir 1b38f9c24d LUCENE-2014: SmartChineseAnalyzer position increment bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830871 13f79535-47bb-0310-9956-ffa450edef68
2009-10-29 09:22:37 +00:00
Robert Muir 36b65637fc LUCENE-1904: Move wordnet synonym code from contrib/memory to contrib/wordnet
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830699 13f79535-47bb-0310-9956-ffa450edef68
2009-10-28 17:49:53 +00:00
Mark Robert Miller b14fd4bccc credit where credits due ;)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@830000 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 22:21:07 +00:00
Uwe Schindler e4fdf4856e LUCENE-1929: Merge missing changes entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829999 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 22:17:51 +00:00
Michael McCandless c2bc750e48 LUCENE-2002: move changes entry back to 'API changes'
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829814 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 14:37:25 +00:00
Michael McCandless 2e4dc5b003 LUCENE-2002: move CHANGES entry under 'Changes in back compat policy' section, since it deprecates APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829774 13f79535-47bb-0310-9956-ffa450edef68
2009-10-26 12:44:02 +00:00
Michael McCandless aaddac8992 LUCENE-2002: add Version to QueryParser & contrib analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829206 13f79535-47bb-0310-9956-ffa450edef68
2009-10-23 20:25:17 +00:00
Mark Robert Miller 0557d2ce5a LUCENE-2003 Changes entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829132 13f79535-47bb-0310-9956-ffa450edef68
2009-10-23 17:13:21 +00:00
Robert Muir d1fc6bece6 LUCENE-1359: FrenchAnalyzer tokenstream does not honor the contract of Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@828298 13f79535-47bb-0310-9956-ffa450edef68
2009-10-22 04:03:12 +00:00
Robert Muir afc66e4e66 LUCENE-2001: Fix parsing bug in wordnet contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@828091 13f79535-47bb-0310-9956-ffa450edef68
2009-10-21 16:32:03 +00:00
Michael McCandless 5ceb81834d LUCENE-1993: add maxDocFreq to MoreLikeThis
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@827042 13f79535-47bb-0310-9956-ffa450edef68
2009-10-20 11:59:53 +00:00
Robert Muir e053d80455 LUCENE-1966: ArabicAnalyzer stopwords cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@825110 13f79535-47bb-0310-9956-ffa450edef68
2009-10-14 12:24:18 +00:00
Andrzej Bialecki 7d0d4ecc44 LUCENE-1959 Add MultiPassIndexSplitter.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@824798 13f79535-47bb-0310-9956-ffa450edef68
2009-10-13 14:54:30 +00:00
Robert Muir 956c8cda82 LUCENE-1963: Lowercase before stopfilter in ArabicAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823534 13f79535-47bb-0310-9956-ffa450edef68
2009-10-09 12:55:47 +00:00
Simon Willnauer b5b0ebb96c fixed spelling
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823296 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 19:49:19 +00:00
Simon Willnauer 8b7d25769b LUCENE-1965, LUCENE-1962: Added possible performance improvments for Persian-, Arabic- and SmartChineseAnalyzer to changes.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823294 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 19:48:28 +00:00
Koji Sekiguchi 90fc7e18c7 LUCENE-1953: FastVectorHighlighter: small fragCharSize can cause StringIndexOutOfBoundsException
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823170 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 13:34:43 +00:00
Michael McCandless 2fc8e01d9e LUCENE-1959: add IndexSplitter tool to pull segment files out of an index into another
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823153 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 12:50:19 +00:00
Karl-Johan Wettin b3f73db537 LUCENE-1939: IndexOutOfBoundsException at ShingleMatrixFilter's Iterator#hasNext method on exhausted streams.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821888 13f79535-47bb-0310-9956-ffa450edef68
2009-10-05 16:01:17 +00:00
Uwe Schindler b75e96f2f4 LUCENE-1257: Change some occurrences of StringBuffer in public/protected APIs of contrib/surround to StringBuilder
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821443 13f79535-47bb-0310-9956-ffa450edef68
2009-10-03 23:10:27 +00:00
Robert Muir 8da43c4bb8 LUCENE-1916: smartcn hhmm doc translation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821325 13f79535-47bb-0310-9956-ffa450edef68
2009-10-03 14:24:45 +00:00
Robert Muir dd9c1b0101 LUCENE-1936: Remove deprecated charset support from Greek and Russian analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@820756 13f79535-47bb-0310-9956-ffa450edef68
2009-10-01 19:20:09 +00:00
Michael McCandless fa3e8b870c LUCENE-1924: add BalancedSegmentMergePolicy
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@820627 13f79535-47bb-0310-9956-ffa450edef68
2009-10-01 12:23:03 +00:00
Michael McCandless da19413d09 LUCENE-1781: move CHANGES entry into unreleased section
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@818633 13f79535-47bb-0310-9956-ffa450edef68
2009-09-24 21:36:11 +00:00
Michael McCandless 06a9b8d290 LUCENE-1781: fix various issues with lat/lng bounding box computation for contrib/spatial
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@817456 13f79535-47bb-0310-9956-ffa450edef68
2009-09-22 00:08:21 +00:00