153 Commits

Author SHA1 Message Date
Michael McCandless aaddac8992 LUCENE-2002: add Version to QueryParser & contrib analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@829206 13f79535-47bb-0310-9956-ffa450edef68
2009-10-23 20:25:17 +00:00
Robert Muir d1fc6bece6 LUCENE-1359: FrenchAnalyzer tokenstream does not honor the contract of Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@828298 13f79535-47bb-0310-9956-ffa450edef68
2009-10-22 04:03:12 +00:00
Uwe Schindler 04da5e73f2 LUCENE-1998: Parameter -> Java 5 enum transition
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@828156 13f79535-47bb-0310-9956-ffa450edef68
2009-10-21 19:30:06 +00:00
Uwe Schindler 1ae5f89cfb LUCENE-1987: Remove rest of analysis deprecations (StandardAnalyzer, StopAnalyzer)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@827979 13f79535-47bb-0310-9956-ffa450edef68
2009-10-21 12:12:11 +00:00
Robert Muir e053d80455 LUCENE-1966: ArabicAnalyzer stopwords cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@825110 13f79535-47bb-0310-9956-ffa450edef68
2009-10-14 12:24:18 +00:00
Uwe Schindler 4cded8042c LUCENE-1946, LUCENE-1753: Remove deprecated TokenStream API. What a pity, my wonderful backwards layer is gone! :-( Enforce decorator pattern by making the rest of TokenStreams final.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@824116 13f79535-47bb-0310-9956-ffa450edef68
2009-10-11 17:35:09 +00:00
Robert Muir 877c9ff521 For fa analyzer, add a test for custom stopwords
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823546 13f79535-47bb-0310-9956-ffa450edef68
2009-10-09 13:27:14 +00:00
Robert Muir 956c8cda82 LUCENE-1963: Lowercase before stopfilter in ArabicAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823534 13f79535-47bb-0310-9956-ffa450edef68
2009-10-09 12:55:47 +00:00
Michael McCandless f20e419aff LUCENE-1950: remove autoCommit=true from IndexWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823321 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 20:57:32 +00:00
Simon Willnauer 05b7822170 LUCENE-1965: Lazy Atomic Loading Stopwords in SmartCN
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823285 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 19:21:36 +00:00
Simon Willnauer 286cb1f9d2 LUCENE-1962: Cleaned up Persian & Arabic Analyzer. Prevent default stopword list from being loaded more than once.
- replace if blocks with a single switch
- marking private members final where needed
- changed protected visibility to final in final class.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@823180 13f79535-47bb-0310-9956-ffa450edef68
2009-10-08 13:54:18 +00:00
Michael McCandless c11776d2c6 remove tags
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@822781 13f79535-47bb-0310-9956-ffa450edef68
2009-10-07 15:41:09 +00:00
Michael Busch d7d9241ef7 LUCENE-1856: Remove Hits.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@822587 13f79535-47bb-0310-9956-ffa450edef68
2009-10-07 05:08:22 +00:00
Karl-Johan Wettin b3f73db537 LUCENE-1939: IndexOutOfBoundsException at ShingleMatrixFilter's Iterator#hasNext method on exhausted streams.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821888 13f79535-47bb-0310-9956-ffa450edef68
2009-10-05 16:01:17 +00:00
Uwe Schindler 236baf9fcb LUCENE-1944: Cleanup contrib to not use deprecated APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821444 13f79535-47bb-0310-9956-ffa450edef68
2009-10-03 23:24:33 +00:00
Robert Muir 8da43c4bb8 LUCENE-1916: smartcn hhmm doc translation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821325 13f79535-47bb-0310-9956-ffa450edef68
2009-10-03 14:24:45 +00:00
Robert Muir 1f9088b038 LUCENE-1943: Improve performance of ChineseFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821322 13f79535-47bb-0310-9956-ffa450edef68
2009-10-03 13:54:12 +00:00
Karl-Johan Wettin 4f878bdc93 LUCENE-1257: Generified ShingleMatrixFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821311 13f79535-47bb-0310-9956-ffa450edef68
2009-10-03 13:17:11 +00:00
Uwe Schindler 835de0b44d LUCENE-1833: Change all new Number() ctors to Number.valueOf()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821186 13f79535-47bb-0310-9956-ffa450edef68
2009-10-02 22:16:44 +00:00
Uwe Schindler af0e97fd72 LUCENE-1257: Replace StringBuffer by StringBuilder where possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@821185 13f79535-47bb-0310-9956-ffa450edef68
2009-10-02 22:11:10 +00:00
Robert Muir dd9c1b0101 LUCENE-1936: Remove deprecated charset support from Greek and Russian analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@820756 13f79535-47bb-0310-9956-ffa450edef68
2009-10-01 19:20:09 +00:00
Uwe Schindler c1f5e753d7 LUCENE-1933: Provide a convenience AttributeFactory that creates a Token instance for all basic attributes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@820658 13f79535-47bb-0310-9956-ffa450edef68
2009-10-01 13:49:46 +00:00
Uwe Schindler ec90bc2202 LUCENE-1855: Change AttributeSource API to use generics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@820553 13f79535-47bb-0310-9956-ffa450edef68
2009-10-01 07:53:43 +00:00
Uwe Schindler 4666489857 LUCENE-1906: Fix backwards problems with CharStream and Tokenizers with custom reset(Reader) method.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@813671 13f79535-47bb-0310-9956-ffa450edef68
2009-09-11 06:12:13 +00:00
Uwe Schindler a8eb5c4b80 LUCENE-1903: Fix incorrect ShingleFilter behavior when outputUnigrams == false
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@812779 13f79535-47bb-0310-9956-ffa450edef68
2009-09-09 06:02:54 +00:00
Mark Robert Miller 53dcf2c320 cleanup unused reusableToken instances that were left behind
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@811984 13f79535-47bb-0310-9956-ffa450edef68
2009-09-07 03:29:38 +00:00
Chris M. Hostetter c56f4c224f LUCENE-1884: massive javadoc and comment cleanup -- primarily dealing with typos
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@811070 13f79535-47bb-0310-9956-ffa450edef68
2009-09-03 18:31:41 +00:00
Chris M. Hostetter 29e6be94c3 LUCENE-1882: improved package level docs for smartcn
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@810247 13f79535-47bb-0310-9956-ffa450edef68
2009-09-01 21:31:18 +00:00
Chris M. Hostetter e5cb7f668a LUCENE-1882: move SmartChineseAnalyzer to the 'correct' package ... this commit is based on a sequence of svn commands and a patch provided by Robert Muir in LUCENE-1862
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@810208 13f79535-47bb-0310-9956-ffa450edef68
2009-09-01 20:10:33 +00:00
Chris M. Hostetter 566aaf28e7 fix eol inconsistencies in some package.html files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@810191 13f79535-47bb-0310-9956-ffa450edef68
2009-09-01 19:30:59 +00:00
Robert Muir 5d05bbfd59 LUCENE-1817: Warn that file formats for smartcn might change in the future, and do not encourage custom dictionary use in the meantime
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808570 13f79535-47bb-0310-9956-ffa450edef68
2009-08-27 19:01:54 +00:00
Mark Robert Miller 26c5af3a33 LUCENE-1865 Add a ton of missing license headers throughout test/demo/contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808567 13f79535-47bb-0310-9956-ffa450edef68
2009-08-27 18:48:16 +00:00
Chris M. Hostetter 6c3d17a399 fix numerous javadoc errors, many of which only show up at the private level
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808459 13f79535-47bb-0310-9956-ffa450edef68
2009-08-27 15:34:23 +00:00
Chris M. Hostetter 2e87fb12f1 incorrect javadoc syntax
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808428 13f79535-47bb-0310-9956-ffa450edef68
2009-08-27 14:29:43 +00:00
Chris M. Hostetter 6b2eae0b5a javadoc is historically very finicky about relative names in @link tags when the name doesn't resolve in the class hierarchy at the current access level (i.e., even if a class is in the same package, an @link to its short name won't resolve if it's not actually part of a signature for a method/field being documented)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808427 13f79535-47bb-0310-9956-ffa450edef68
2009-08-27 14:28:22 +00:00
Mark Robert Miller 56985e3136 add overview.html stubs for contrib to avoid error messages when building javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808233 13f79535-47bb-0310-9956-ffa450edef68
2009-08-27 00:23:09 +00:00
Mark Robert Miller 8cc45886d5 fix javadoc links in ngram contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808221 13f79535-47bb-0310-9956-ffa450edef68
2009-08-26 23:26:10 +00:00
Mark Robert Miller c593328eb0 convert the remaining @todo's to TODO:
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808211 13f79535-47bb-0310-9956-ffa450edef68
2009-08-26 22:39:40 +00:00
Simon Willnauer 67eea7215c LUCENE-1817: Marked analyzers/smartcn as experimental in order to warn users that this API might change in the future due to major refactorings.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@808170 13f79535-47bb-0310-9956-ffa450edef68
2009-08-26 20:21:50 +00:00
Uwe Schindler 367b35f0cb LUCENE-1843: Update contrib tests to conform to onlyUseNewAPI; refactored assertAnalyzesTo and others into the new BaseTokenStreamTestCase class; Rewrote TestMappingCharFilter to use the new assert functions, too; performance improvements of Token.copyTo(); new impl of SingleTokenTokenStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807190 13f79535-47bb-0310-9956-ffa450edef68
2009-08-24 12:44:13 +00:00
Uwe Schindler 5dd1810b0c LUCENE-1846: Fix more Locale problems
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807117 13f79535-47bb-0310-9956-ffa450edef68
2009-08-24 08:31:34 +00:00
Uwe Schindler 4745c8db05 LUCENE-1825: Another one :(
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806990 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 16:35:50 +00:00
Uwe Schindler c2f95d474b LUCENE-1825: Additional incorrect getAttribute usage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806986 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 16:17:08 +00:00
Robert Muir 6847c0e2bd LUCENE-1826: the new tokenizer constructors should not allow deprecated charsets
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806961 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 12:39:28 +00:00
Michael Busch 64ed5f39a5 LUCENE-1826: Add constructors that take AttributeSource and AttributeFactory to all Tokenizer implementations.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806942 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 08:34:22 +00:00
Robert Muir 8e4e7e17d0 LUCENE-1834: Remove unused code in SmartChineseAnalyzer hmm pkg
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806891 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 21:03:08 +00:00
Robert Muir 1ebbe2abd1 LUCENE-1793: Deprecate custom encoding support in Greek and Russian analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806886 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 20:36:06 +00:00
Robert Muir 1d9a96c2fc LUCENE-1813: Add option to ReverseStringFilter to mark reversed tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805769 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 12:07:15 +00:00
Robert Muir 58cd4a04d7 LUCENE-1794: Ensure analyzer options are applied immediately when using reusable token streams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805766 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 11:56:31 +00:00
Robert Muir 3887cf9419 LUCENE-1692: Additional tests and javadocs for contrib/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805400 13f79535-47bb-0310-9956-ffa450edef68
2009-08-18 12:55:26 +00:00