Robert Muir
|
4455345c6e
|
LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
|
2011-05-03 00:29:47 +00:00 |
Robert Muir
|
308e0bd4a9
|
LUCENE-2514, LUCENE-2551: collation uses byte[] keys, deprecate old unscalable locale sort/range, termrangequery/filter work on bytes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075210 13f79535-47bb-0310-9956-ffa450edef68
|
2011-02-28 05:15:50 +00:00 |
Koji Sekiguchi
|
6f31407109
|
SOLR-1057: Add PathHierarchyTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1067131 13f79535-47bb-0310-9956-ffa450edef68
|
2011-02-04 10:19:52 +00:00 |
Steven Rowe
|
1b22e86417
|
LUCENE-2847: Support all of unicode, including supplementary code points above the basic multilingual plane, in StandardTokenizer and UAX29URLEmailTokenizer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055877 13f79535-47bb-0310-9956-ffa450edef68
|
2011-01-06 13:51:10 +00:00 |
Steven Rowe
|
2b9726ae81
|
LUCENE-2763: Swap URL+Email recognizing StandardTokenizer and UAX29Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1043071 13f79535-47bb-0310-9956-ffa450edef68
|
2010-12-07 14:53:13 +00:00 |
Uwe Schindler
|
6f230c5e08
|
revert changes (will come in 3.x)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1029347 13f79535-47bb-0310-9956-ffa450edef68
|
2010-10-31 14:03:50 +00:00 |
Uwe Schindler
|
819344aeab
|
LUCENE-2732: Fix charset problems in XML loading in HyphenationCompoundWordTokenFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1029345 13f79535-47bb-0310-9956-ffa450edef68
|
2010-10-31 13:56:46 +00:00 |
Steven Rowe
|
7f6dd505f1
|
LUCENE-2699: Update StandardTokenizer and UAX29Tokenizer to Unicode 6.0.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1022826 13f79535-47bb-0310-9956-ffa450edef68
|
2010-10-15 05:41:54 +00:00 |
Steven Rowe
|
f9e4f551e2
|
LUCENE-1370: Added ShingleFilter option to output unigrams if no shingles can be generated.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1006187 13f79535-47bb-0310-9956-ffa450edef68
|
2010-10-09 16:55:23 +00:00 |
Steven Rowe
|
3c26a9167c
|
LUCENE-2167: Implement StandardTokenizer with the UAX#29 Standard
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1002032 13f79535-47bb-0310-9956-ffa450edef68
|
2010-09-28 06:16:16 +00:00 |
Robert Muir
|
8f71031ac8
|
LUCENE-2413: consolidate remaining solr tokenstreams into modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@957162 13f79535-47bb-0310-9956-ffa450edef68
|
2010-06-23 11:25:17 +00:00 |
Robert Muir
|
a0c72afb31
|
LUCENE-2413: move more core analysis to analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948225 13f79535-47bb-0310-9956-ffa450edef68
|
2010-05-25 22:28:32 +00:00 |
Robert Muir
|
71b59ca566
|
LUCENE-2413: consolidate remaining concrete core analyzers to modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948195 13f79535-47bb-0310-9956-ffa450edef68
|
2010-05-25 20:16:44 +00:00 |
Robert Muir
|
5259d7d90b
|
LUCENE-2413: move KeywordMarkerFilter to analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946621 13f79535-47bb-0310-9956-ffa450edef68
|
2010-05-20 13:23:12 +00:00 |
Robert Muir
|
5ccb3ae286
|
LUCENE-2413: fold contrib/icu into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946590 13f79535-47bb-0310-9956-ffa450edef68
|
2010-05-20 10:46:00 +00:00 |
Robert Muir
|
1e1296e6f8
|
sync all changes to reflect reality
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941710 13f79535-47bb-0310-9956-ffa450edef68
|
2010-05-06 13:08:59 +00:00 |
Robert Muir
|
bef21b3e18
|
LUCENE-2444: boilerplate stuff for the analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941369 13f79535-47bb-0310-9956-ffa450edef68
|
2010-05-05 16:27:58 +00:00 |