121 Commits

Author SHA1 Message Date
Uwe Schindler 460fa90564 LUCENE-2374: Added Attribute reflection API: It's now possible to inspect the contents of AttributeImpl and AttributeSource using a well-defined API
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1061039 13f79535-47bb-0310-9956-ffa450edef68
2011-01-19 22:41:16 +00:00
Yonik Seeley b2cad88aad SOLR-2316: fail early if synonym file not provided
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1060846 13f79535-47bb-0310-9956-ffa450edef68
2011-01-19 16:11:42 +00:00
Shai Erera 2a0484bd40 LUCENE-2295: remove maxFieldLength (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1060340 13f79535-47bb-0310-9956-ffa450edef68
2011-01-18 12:01:40 +00:00
Robert Muir 4249ef9644 LUCENE-2847: remove obsolete warning
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1059719 13f79535-47bb-0310-9956-ffa450edef68
2011-01-17 01:43:37 +00:00
Steven Rowe 8d7d57abdc LUCENE-2847: Added ASL2 license to supplementary macros generator, and to the generated file, and set svn:eol-style to native for both of them.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1056014 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 19:15:21 +00:00
Robert Muir fbfb07d904 LUCENE-2842: avoid java6-only String.isEmpty in rule parser
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055906 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 15:07:12 +00:00
Robert Muir 66d3f38d52 LUCENE-2842: missing eol-style
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055893 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 14:33:35 +00:00
Robert Muir 61872be09d LUCENE-2842: add Galician analyzer, Portuguese RSLP
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055892 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 14:30:37 +00:00
Steven Rowe 1b22e86417 LUCENE-2847: Support all of Unicode, including supplementary code points above the Basic Multilingual Plane, in StandardTokenizer and UAX29URLEmailTokenizer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055877 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 13:51:10 +00:00
Michael McCandless 87274d00ac LUCENE-2837: collapse Searcher/Searchable into IndexSearcher; remove contrib/remote, MultiSearcher; absorb ParallelMultiSearcher into IndexSearcher as optional ExecutorService to ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055416 13f79535-47bb-0310-9956-ffa450edef68
2011-01-05 11:16:40 +00:00
Robert Muir 8696f549d4 LUCENE-2020: Remove unused imports
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1052926 13f79535-47bb-0310-9956-ffa450edef68
2010-12-26 19:16:42 +00:00
Robert Muir 620b2a0619 LUCENE-2747: Deprecate/remove language-specific tokenizers in favor of StandardTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1043114 13f79535-47bb-0310-9956-ffa450edef68
2010-12-07 16:19:17 +00:00
Steven Rowe 2b9726ae81 LUCENE-2763: Swap URL+Email recognizing StandardTokenizer and UAX29Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1043071 13f79535-47bb-0310-9956-ffa450edef68
2010-12-07 14:53:13 +00:00
Robert Muir f87ca310ec LUCENE-2797: Upgrade icu to 4.6
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1042185 13f79535-47bb-0310-9956-ffa450edef68
2010-12-04 14:08:03 +00:00
Robert Muir a58c26978f LUCENE-2781: drop deprecations from trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1040463 13f79535-47bb-0310-9956-ffa450edef68
2010-11-30 11:22:39 +00:00
Robert Muir ff47493dbd fix bug where StandardFilter isn't respected
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1040390 13f79535-47bb-0310-9956-ffa450edef68
2010-11-30 02:44:47 +00:00
Robert Muir de3d057abc SOLR-2237: Added StempelPolishStemFilterFactory to contrib/analysis-extras
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1035996 13f79535-47bb-0310-9956-ffa450edef68
2010-11-17 12:26:15 +00:00
Uwe Schindler 6f230c5e08 revert changes (will come in 3.x)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1029347 13f79535-47bb-0310-9956-ffa450edef68
2010-10-31 14:03:50 +00:00
Uwe Schindler 819344aeab LUCENE-2732: Fix charset problems in XML loading in HyphenationCompoundWordTokenFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1029345 13f79535-47bb-0310-9956-ffa450edef68
2010-10-31 13:56:46 +00:00
Uwe Schindler 987f32849b LUCENE-2708: when a test Assume fails, display information, improved one
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1023312 13f79535-47bb-0310-9956-ffa450edef68
2010-10-16 15:43:11 +00:00
Steven Rowe 7f6dd505f1 LUCENE-2699: Update StandardTokenizer and UAX29Tokenizer to Unicode 6.0.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1022826 13f79535-47bb-0310-9956-ffa450edef68
2010-10-15 05:41:54 +00:00
Steven Rowe f9e4f551e2 LUCENE-1370: Added ShingleFilter option to output unigrams if no shingles can be generated.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1006187 13f79535-47bb-0310-9956-ffa450edef68
2010-10-09 16:55:23 +00:00
Steven Rowe 42d5b585ce Ignore this test under IntelliJ, which can't use Ant's test file patterns (Test*.java,*Test.java) to ignore this test, and thinks it's a failure since no test methods can be found.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1004853 13f79535-47bb-0310-9956-ffa450edef68
2010-10-05 23:23:55 +00:00
Robert Muir 7c020e317a LUCENE-2683: upgrade icu libraries to 4.4.2
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1004335 13f79535-47bb-0310-9956-ffa450edef68
2010-10-04 17:53:41 +00:00
Robert Muir 98621382be clear up 1.5-only javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1004215 13f79535-47bb-0310-9956-ffa450edef68
2010-10-04 12:03:51 +00:00
Robert Muir afad8123d2 clear up more warnings in modules/contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003990 13f79535-47bb-0310-9956-ffa450edef68
2010-10-03 16:27:34 +00:00
Robert Muir 0789e5f4e7 LUCENE-2681: fix generics violations in contrib/modules
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003978 13f79535-47bb-0310-9956-ffa450edef68
2010-10-03 15:41:57 +00:00
Robert Muir 85a27b8b38 clear up javadocs warnings/errors (forgot to svn add these overview.htmls)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003965 13f79535-47bb-0310-9956-ffa450edef68
2010-10-03 13:30:29 +00:00
Robert Muir e05117884a clear up javadocs warnings/errors
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003962 13f79535-47bb-0310-9956-ffa450edef68
2010-10-03 13:22:51 +00:00
Robert Muir c8b7a21b4b clear up more compiler warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003906 13f79535-47bb-0310-9956-ffa450edef68
2010-10-02 22:20:26 +00:00
Robert Muir fd11477ece clean up some fallthru/deprecation warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003873 13f79535-47bb-0310-9956-ffa450edef68
2010-10-02 19:58:35 +00:00
Robert Muir f5031a6b27 LUCENE-2167: cut over these analyzers also
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1002221 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 15:33:22 +00:00
Steven Rowe 3c26a9167c LUCENE-2167: Implement StandardTokenizer with the UAX#29 Standard
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1002032 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 06:16:16 +00:00
Robert Muir cce20cd820 LUCENE-2070: document how LengthFilter counts characters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1000675 13f79535-47bb-0310-9956-ffa450edef68
2010-09-24 00:42:05 +00:00
Robert Muir c84bd2f1ec LUCENE-2653: ThaiAnalyzer assumes things about your jre
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@998684 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 15:40:06 +00:00
Robert Muir 774eaeada0 LUCENE-2630: fix intl test bugs that rely on cldr version
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@997180 13f79535-47bb-0310-9956-ffa450edef68
2010-09-15 03:30:35 +00:00
Robert Muir feabadea20 LUCENE-2642: merge LuceneTestCase and LuceneTestCaseJ4
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@996611 13f79535-47bb-0310-9956-ffa450edef68
2010-09-13 17:37:20 +00:00
Robert Muir d38ec19a28 LUCENE-2639: remove random juggling in tests, add -Dtests.seed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@995772 13f79535-47bb-0310-9956-ffa450edef68
2010-09-10 12:34:18 +00:00
Robert Muir 912a6152a8 LUCENE-2629: fix analysis/icu's gennorm2 task
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@991053 13f79535-47bb-0310-9956-ffa450edef68
2010-08-31 01:33:02 +00:00
Robert Muir 13fd70521a LUCENE-2624: add Armenian, Basque, Catalan analyzers from Snowball
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@990459 13f79535-47bb-0310-9956-ffa450edef68
2010-08-28 22:42:25 +00:00
Robert Muir 33cc5a041e SOLR-2059: Add types attribute to WordDelimiterFilterFactory
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@990451 13f79535-47bb-0310-9956-ffa450edef68
2010-08-28 21:25:44 +00:00
Robert Muir 48dde8359f LUCENE-2098: speed up BaseCharFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@990161 13f79535-47bb-0310-9956-ffa450edef68
2010-08-27 14:33:22 +00:00
Robert Muir 07df8d5210 LUCENE-2598: factor the behavior of MockRAMDirectory into MockDirectoryWrapper, add experimental -Dtests.directory= to allow running the tests under different directory impls [but the default is still RAMDirectory]
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@988206 13f79535-47bb-0310-9956-ffa450edef68
2010-08-23 17:00:43 +00:00
Ryan McKinley c31c4b63d1 even more pom fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@987378 13f79535-47bb-0310-9956-ffa450edef68
2010-08-20 04:11:50 +00:00
Ryan McKinley 3be9fedd84 getting 'generate-maven-artifacts' to work with analysis module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@987286 13f79535-47bb-0310-9956-ffa450edef68
2010-08-19 19:58:36 +00:00
Robert Muir 1473b59c0e SOLR-1860: expose these analyzers stoplists as .txt like the others
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@986612 13f79535-47bb-0310-9956-ffa450edef68
2010-08-18 09:59:00 +00:00
Robert Muir faed4b4cd0 LUCENE-2598: add newDirectory and track that resources are closed correctly by tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@984968 13f79535-47bb-0310-9956-ffa450edef68
2010-08-12 20:56:23 +00:00
Robert Muir 61954ca249 SOLR-2002: change tests from TestCase to LuceneTestCase for better coverage
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@983530 13f79535-47bb-0310-9956-ffa450edef68
2010-08-09 06:11:16 +00:00
Shai Erera bed729c561 LUCENE-2570: Some improvements to _TestUtil and its usage
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@979646 13f79535-47bb-0310-9956-ffa450edef68
2010-07-27 11:31:25 +00:00
Robert Muir fcc9a4a3c3 LUCENE-2503: add forgotten javadoc/citation (sorry)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@964054 13f79535-47bb-0310-9956-ffa450edef68
2010-07-14 14:06:06 +00:00
Robert Muir 3241eb9291 LUCENE-2503: add light stemmers for European languages
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@964019 13f79535-47bb-0310-9956-ffa450edef68
2010-07-14 12:10:34 +00:00
Robert Muir 8f71031ac8 LUCENE-2413: consolidate remaining solr tokenstreams into modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@957162 13f79535-47bb-0310-9956-ffa450edef68
2010-06-23 11:25:17 +00:00
Michael McCandless c91bddb26b LUCENE-2380: hard cutover of all preflex APIs to flex
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@955257 13f79535-47bb-0310-9956-ffa450edef68
2010-06-16 15:17:32 +00:00
Robert Muir 5a661500c1 LUCENE-2413: directory and package fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@955203 13f79535-47bb-0310-9956-ffa450edef68
2010-06-16 11:33:29 +00:00
Robert Muir 6e51a53189 LUCENE-2372: remove unused import
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@950042 13f79535-47bb-0310-9956-ffa450edef68
2010-06-01 12:42:30 +00:00
Robert Muir ad0e495911 LUCENE-2372: switch over remaining uses of TermAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@950008 13f79535-47bb-0310-9956-ffa450edef68
2010-06-01 10:35:13 +00:00
Uwe Schindler 98b252ed7f LUCENE-2295: Added a LimitTokenCountAnalyzer / LimitTokenCountFilter to wrap any other Analyzer and provide the same functionality as MaxFieldLength provided on IndexWriter. This patch also fixes a bug in the offset calculation in CharTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@949445 13f79535-47bb-0310-9956-ffa450edef68
2010-05-29 23:14:18 +00:00
Uwe Schindler 9e61dd591f Generics Policeman ticket
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948234 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 22:44:36 +00:00
Robert Muir a0c72afb31 LUCENE-2413: move more core analysis to analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948225 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 22:28:32 +00:00
Robert Muir 71b59ca566 LUCENE-2413: consolidate remaining concrete core analyzers to modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948195 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 20:16:44 +00:00
Robert Muir 5259d7d90b LUCENE-2413: move KeywordMarkerFilter to analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946621 13f79535-47bb-0310-9956-ffa450edef68
2010-05-20 13:23:12 +00:00
Robert Muir 5ccb3ae286 LUCENE-2413: fold contrib/icu into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946590 13f79535-47bb-0310-9956-ffa450edef68
2010-05-20 10:46:00 +00:00
Robert Muir fe5f1aabcb LUCENE-1287: Allow usage of HyphenationCompoundWordTokenFilter without a dictionary
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946139 13f79535-47bb-0310-9956-ffa450edef68
2010-05-19 11:58:37 +00:00
Uwe Schindler cd45643b96 LUCENE-2384: Remove hack, as JFlex trunk now has the zzBuffer bug fixed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@945130 13f79535-47bb-0310-9956-ffa450edef68
2010-05-17 13:13:10 +00:00
Robert Muir acbf053b7c LUCENE-2463: Improve Greek analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@945090 13f79535-47bb-0310-9956-ffa450edef68
2010-05-17 11:28:04 +00:00
Robert Muir 26b9faddb2 LUCENE-2413: consolidate SynonymFilter into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@942827 13f79535-47bb-0310-9956-ffa450edef68
2010-05-10 17:37:45 +00:00
Robert Muir 399d373089 fix the compile target... this never worked for contrib/analyzers before either
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@942262 13f79535-47bb-0310-9956-ffa450edef68
2010-05-07 22:51:45 +00:00
Robert Muir 1b020be130 LUCENE-2437: Indonesian Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@942235 13f79535-47bb-0310-9956-ffa450edef68
2010-05-07 21:21:12 +00:00
Robert Muir 1e1296e6f8 sync all changes to reflect reality
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941710 13f79535-47bb-0310-9956-ffa450edef68
2010-05-06 13:08:59 +00:00
Robert Muir bef21b3e18 LUCENE-2444: boilerplate stuff for the analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941369 13f79535-47bb-0310-9956-ffa450edef68
2010-05-05 16:27:58 +00:00
Robert Muir f6e9cc9f32 LUCENE-2444: move contrib/analyzers to modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941308 13f79535-47bb-0310-9956-ffa450edef68
2010-05-05 14:26:59 +00:00