Commit Graph

12 Commits

Author SHA1 Message Date
Steven Rowe f3a363708f LUCENE-3690: Re-implemented HTMLStripCharFilter as a JFlex-generated scanner. Fixes LUCENE-2208, SOLR-882, and SOLR-42.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234452 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 05:20:46 +00:00
Robert Muir a5d2d78cec LUCENE-3378: nuke another useless custom test-classpath
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158857 13f79535-47bb-0310-9956-ffa450edef68
2011-08-17 18:21:41 +00:00
Robert Muir ef56f5d551 LUCENE-3361: port url+email tokenizer to standardtokenizerinterface, fix combining marks bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154936 13f79535-47bb-0310-9956-ffa450edef68
2011-08-08 11:57:59 +00:00
Robert Muir 2dda5bc35f LUCENE-3358: StandardTokenizer wrongly discarded combining marks attached to Han/Hiragana
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154005 13f79535-47bb-0310-9956-ffa450edef68
2011-08-04 20:49:47 +00:00
Robert Muir c8101d708c LUCENE-3160: lucene source build doesn't work correctly by itself
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129445 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 00:08:14 +00:00
Shai Erera ece1524805 LUCENE-2609: Generate jar containing test classes (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1067738 13f79535-47bb-0310-9956-ffa450edef68
2011-02-06 19:48:54 +00:00
Steven Rowe 11146b8c3c changed generate-maven-artifacts target to place all maven artifacts in one place: modules/dist/maven/; added modules/dist/ to list of dirs to remove with the 'clean' target; added modules/dist/ to svn:ignore list on modules/
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1062308 13f79535-47bb-0310-9956-ffa450edef68
2011-01-23 01:42:19 +00:00
Steven Rowe 1b22e86417 LUCENE-2847: Support all of unicode, including supplementary code points above the basic multilingual plane, in StandardTokenizer and UAX29URLEmailTokenizer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055877 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 13:51:10 +00:00
Steven Rowe 2b9726ae81 LUCENE-2763: Swap URL+Email recognizing StandardTokenizer and UAX29Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1043071 13f79535-47bb-0310-9956-ffa450edef68
2010-12-07 14:53:13 +00:00
Steven Rowe 3c26a9167c LUCENE-2167: Implement StandardTokenizer with the UAX#29 Standard
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1002032 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 06:16:16 +00:00
Robert Muir 71b59ca566 LUCENE-2413: consolidate remaining concrete core analyzers to modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948195 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 20:16:44 +00:00
Robert Muir f6e9cc9f32 LUCENE-2444: move contrib/analyzers to modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941308 13f79535-47bb-0310-9956-ffa450edef68
2010-05-05 14:26:59 +00:00