14 Commits

Author SHA1 Message Date
Steven Rowe 1b22e86417 LUCENE-2847: Support all of unicode, including supplementary code points above the basic multilingual plane, in StandardTokenizer and UAX29URLEmailTokenizer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055877 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 13:51:10 +00:00
Steven Rowe 2b9726ae81 LUCENE-2763: Swap URL+Email recognizing StandardTokenizer and UAX29Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1043071 13f79535-47bb-0310-9956-ffa450edef68
2010-12-07 14:53:13 +00:00
Uwe Schindler 6f230c5e08 revert changes (will come in 3.x)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1029347 13f79535-47bb-0310-9956-ffa450edef68
2010-10-31 14:03:50 +00:00
Uwe Schindler 819344aeab LUCENE-2732: Fix charset problems in XML loading in HyphenationCompoundWordTokenFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1029345 13f79535-47bb-0310-9956-ffa450edef68
2010-10-31 13:56:46 +00:00
Steven Rowe 7f6dd505f1 LUCENE-2699: Update StandardTokenizer and UAX29Tokenizer to Unicode 6.0.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1022826 13f79535-47bb-0310-9956-ffa450edef68
2010-10-15 05:41:54 +00:00
Steven Rowe f9e4f551e2 LUCENE-1370: Added ShingleFilter option to output unigrams if no shingles can be generated.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1006187 13f79535-47bb-0310-9956-ffa450edef68
2010-10-09 16:55:23 +00:00
Steven Rowe 3c26a9167c LUCENE-2167: Implement StandardTokenizer with the UAX#29 Standard
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1002032 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 06:16:16 +00:00
Robert Muir 8f71031ac8 LUCENE-2413: consolidate remaining solr tokenstreams into modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@957162 13f79535-47bb-0310-9956-ffa450edef68
2010-06-23 11:25:17 +00:00
Robert Muir a0c72afb31 LUCENE-2413: move more core analysis to analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948225 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 22:28:32 +00:00
Robert Muir 71b59ca566 LUCENE-2413: consolidate remaining concrete core analyzers to modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948195 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 20:16:44 +00:00
Robert Muir 5259d7d90b LUCENE-2413: move KeywordMarkerFilter to analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946621 13f79535-47bb-0310-9956-ffa450edef68
2010-05-20 13:23:12 +00:00
Robert Muir 5ccb3ae286 LUCENE-2413: fold contrib/icu into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946590 13f79535-47bb-0310-9956-ffa450edef68
2010-05-20 10:46:00 +00:00
Robert Muir 1e1296e6f8 sync all changes to reflect reality
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941710 13f79535-47bb-0310-9956-ffa450edef68
2010-05-06 13:08:59 +00:00
Robert Muir bef21b3e18 LUCENE-2444: boilerplate stuff for the analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941369 13f79535-47bb-0310-9956-ffa450edef68
2010-05-05 16:27:58 +00:00