Commit Graph

275 Commits

Author SHA1 Message Date
Christopher John Male 3597bc4bf4 LUCENE-3396: Converted simple Analyzers which got lost in merging
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1169654 13f79535-47bb-0310-9956-ffa450edef68
2011-09-12 09:00:42 +00:00
Christopher John Male 4c5606ee29 LUCENE-3396: Converted most Analyzers over to using ReusableAnalyzerBase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1169607 13f79535-47bb-0310-9956-ffa450edef68
2011-09-12 05:50:26 +00:00
Robert Muir a027a35583 nocommit -> TODO
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1169474 13f79535-47bb-0310-9956-ffa450edef68
2011-09-11 16:39:59 +00:00
Christopher John Male e3172b9239 LUCENE-3414: Added Hunspell for Lucene
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1167467 13f79535-47bb-0310-9956-ffa450edef68
2011-09-10 06:00:39 +00:00
Robert Muir 128aaf8387 LUCENE-3410: move changes to 3.5 and nuke deprecated code in trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1166770 13f79535-47bb-0310-9956-ffa450edef68
2011-09-08 15:56:01 +00:00
Robert Muir b265d499f2 LUCENE-3417: DictionaryCompoundWordFilter did not properly add tokens from the end compound word
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1166728 13f79535-47bb-0310-9956-ffa450edef68
2011-09-08 14:59:15 +00:00
Christopher John Male 4b44bd7d83 LUCENE-3410: Deprecated multi-int constructors in WordDelimiterFilter. Now uses int bitfield
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1165995 13f79535-47bb-0310-9956-ffa450edef68
2011-09-07 04:43:10 +00:00
Michael McCandless 4dad0ba89f LUCENE-2308: cutover to FieldType
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1162347 13f79535-47bb-0310-9956-ffa450edef68
2011-08-27 13:27:01 +00:00
Christopher John Male 0f2d7ad556 LUCENE-3397: Cleaned up remaining test TSs and PatterAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1161986 13f79535-47bb-0310-9956-ffa450edef68
2011-08-26 04:16:19 +00:00
Christopher John Male 1057d24e7f LUCENE-3400: Removed DutchAnalyzer.setStemDictionary
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1161484 13f79535-47bb-0310-9956-ffa450edef68
2011-08-25 10:32:21 +00:00
Christopher John Male 0ef9c3c25f LUCENE-3376: Moved ReusableAnalyzerBase to core
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1160117 13f79535-47bb-0310-9956-ffa450edef68
2011-08-22 06:01:31 +00:00
Robert Muir a5d2d78cec LUCENE-3378: nuke another useless custom test-classpath
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158857 13f79535-47bb-0310-9956-ffa450edef68
2011-08-17 18:21:41 +00:00
Robert Muir 99ac972281 LUCENE-3378: move collationtestbase to tests-framework
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158819 13f79535-47bb-0310-9956-ffa450edef68
2011-08-17 16:43:13 +00:00
Robert Muir 8a0578dfe2 LUCENE-3378: move VocabularyAssert to test-framework
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158730 13f79535-47bb-0310-9956-ffa450edef68
2011-08-17 14:19:15 +00:00
Robert Muir 7eab19aff7 LUCENE-3375: fix synonyms bug where keepOrig=false would discard unmatched inputs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158342 13f79535-47bb-0310-9956-ffa450edef68
2011-08-16 16:01:05 +00:00
Robert Muir f7237cb165 LUCENE-3361: remove api deprecations in trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154943 13f79535-47bb-0310-9956-ffa450edef68
2011-08-08 12:17:33 +00:00
Robert Muir ef56f5d551 LUCENE-3361: port url+email tokenizer to standardtokenizerinterface, fix combining marks bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154936 13f79535-47bb-0310-9956-ffa450edef68
2011-08-08 11:57:59 +00:00
Robert Muir 2dda5bc35f LUCENE-3358: StandardTokenizer wrongly discarded combining marks attached to Han/Hiragana
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154005 13f79535-47bb-0310-9956-ffa450edef68
2011-08-04 20:49:47 +00:00
Uwe Schindler 014dee7cf5 revert accidental commit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150488 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 21:22:07 +00:00
Uwe Schindler 9c73f9d03b LUCENE-3336: Speed up javadocs-all builds by minimizing compile costs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150486 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 21:19:42 +00:00
Robert Muir 3626220146 use a different character for test, one that is still enclosing mark in 6.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150091 13f79535-47bb-0310-9956-ffa450edef68
2011-07-23 12:18:48 +00:00
Michael McCandless fbf9f4ccad LUCENE-3289: add options to FST Builder to tradeoff RAM/CPU used during build vs how small the resulting FST is
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145292 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 18:53:13 +00:00
Robert Muir 015ecfa0a0 LUCENE-3233: improve ram/perf of SynonymFilter, add wordnet parsing, nuke contrib/wordnet
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145158 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 12:58:52 +00:00
Christopher John Male 5f30bedccc LUCENE-3283: Moved core QueryParsers to queryparser module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145016 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 03:37:00 +00:00
Christopher John Male f16f395a30 LUCENE-3284: Decoupled remaining module/contrib tests from QueryParser
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1144566 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 01:11:18 +00:00
Simon Willnauer 6c5621f16c fixed dead store variable
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1144269 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 11:26:03 +00:00
Michael McCandless b55eeb510d LUCENE-3246: invert getDelDocs to getLiveDocs as pre-cursor for LUCENE-1536
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1143415 13f79535-47bb-0310-9956-ffa450edef68
2011-07-06 13:54:38 +00:00
Christopher John Male f9ed2c19cd LUCENE-3219: Moved SortField types to Enum
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1138276 13f79535-47bb-0310-9956-ffa450edef68
2011-06-22 01:48:45 +00:00
Robert Muir eca56e0564 LUCENE-152: minor optimization to avoid some char[]/String creation
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1134328 13f79535-47bb-0310-9956-ffa450edef68
2011-06-10 14:00:32 +00:00
Simon Willnauer 2007a4b4e0 Remove @Version tags from JavaDoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1133805 13f79535-47bb-0310-9956-ffa450edef68
2011-06-09 11:43:35 +00:00
Robert Muir b7277878e8 LUCENE-152: add KStem
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130527 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 12:58:22 +00:00
Robert Muir 7bf34183d0 LUCENE-3158: put licensing information in the LICENSE.txt file where it belongs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129450 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 01:22:03 +00:00
Robert Muir c8101d708c LUCENE-3160: lucene source build doesn't work correctly by itself
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129445 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 00:08:14 +00:00
Simon Willnauer 68a840c2b7 SOLR-2530: Remove Noggit CharArr from FieldType
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1127326 13f79535-47bb-0310-9956-ffa450edef68
2011-05-24 22:44:36 +00:00
Robert Muir f1a1844fb9 LUCENE-3113: fix analyzer bugs found by MockTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104519 13f79535-47bb-0310-9956-ffa450edef68
2011-05-17 20:16:40 +00:00
Robert Muir a871b29ed6 LUCENE-3086: add ElisionFilter to ItalianAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1102120 13f79535-47bb-0310-9956-ffa450edef68
2011-05-11 22:43:54 +00:00
Ryan McKinley 96878534a0 LUCENE-3071: Add ReversePathHierarchyTokenizer and enable skip on PathHierarchyTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1099999 13f79535-47bb-0310-9956-ffa450edef68
2011-05-05 23:30:05 +00:00
Robert Muir 4455345c6e LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-03 00:29:47 +00:00
Robert Muir a75e5282c7 collation tests: try to find less jre bugs and just test thread safety
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098532 13f79535-47bb-0310-9956-ffa450edef68
2011-05-02 12:03:14 +00:00
Robert Muir 593d7a54ea LUCENE-3044: ThaiWordFilter uses AttributeSource.copyTo incorrectly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096334 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 15:45:45 +00:00
Robert Muir 7db98455e7 LUCENE-3043: GermanStemmer threw IOOBE on zero-length tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096194 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:48:17 +00:00
Robert Muir c0c695053c LUCENE-2560: remove copy/paste unused import
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096183 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:16:51 +00:00
Robert Muir 68061ef921 LUCENE-2560: add basic stress tests for analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096178 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 16:55:15 +00:00
Robert Muir c3f6331639 LUCENE-3016: add analyzer for Latvian
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092396 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 17:07:10 +00:00
Robert Muir 7d07d206b5 LUCENE-3020: better payload testing with mockanalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091132 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 17:20:31 +00:00
Robert Muir e67bf6b089 LUCENE-2944: fix BytesRef reuse bugs, TermToBytesRefAttribute owns the bytes like other attributes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083784 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 13:52:15 +00:00
Robert Muir 52fbd34849 clear java 1.5-only javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078639 13f79535-47bb-0310-9956-ffa450edef68
2011-03-07 00:55:32 +00:00
Robert Muir d51068ffd6 LUCENE-2894: apply formatting to more code samples
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076237 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 14:59:02 +00:00
Robert Muir 6600f5acdf LUCENE-2943: tone down test even more
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076223 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 13:56:15 +00:00
Robert Muir 7e5d696d7d LUCENE-2943: tone down test with multiplier a bit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075999 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 19:53:05 +00:00
Robert Muir 2509d35c11 LUCENE-2943: fix thread-safety issues with ICU collation
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075850 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 15:47:14 +00:00
Robert Muir 308e0bd4a9 LUCENE-2514, LUCENE-2551: collation uses byte[] keys, deprecate old unscalable locale sort/range, termrangequery/filter work on bytes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075210 13f79535-47bb-0310-9956-ffa450edef68
2011-02-28 05:15:50 +00:00
Uwe Schindler 5691bea096 LUCENE-2920: Removed ShingleMatrixFilter as it is unmaintained and does not work with custom Attributes or custom payload encoders
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1070821 13f79535-47bb-0310-9956-ffa450edef68
2011-02-15 09:24:06 +00:00
Robert Muir 6386f77138 LUCENE-2911: synchronize grammar/token types across StandardTokenizer, UAX29EmailURLTokenizer, ICUTokenizer; add CJK types
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1068979 13f79535-47bb-0310-9956-ffa450edef68
2011-02-09 17:07:46 +00:00
Robert Muir 70a9910b38 LUCENE-2908: clean up serialization in the codebase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1068526 13f79535-47bb-0310-9956-ffa450edef68
2011-02-08 19:05:28 +00:00
Shai Erera ece1524805 LUCENE-2609: Generate jar containing test classes (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1067738 13f79535-47bb-0310-9956-ffa450edef68
2011-02-06 19:48:54 +00:00
Koji Sekiguchi 6f31407109 SOLR-1057: Add PathHierarchyTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1067131 13f79535-47bb-0310-9956-ffa450edef68
2011-02-04 10:19:52 +00:00
Robert Muir dde8fc7020 LUCENE-2751: add LuceneTestCase.newSearcher. use this to get an indexsearcher that randomly uses threads, etc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1066691 13f79535-47bb-0310-9956-ffa450edef68
2011-02-02 23:27:25 +00:00
Robert Muir 5ccf063a5d LUCENE-2901: fix consistency of KeywordMarkerFilter, it should only set, not unset the attribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065621 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 14:06:45 +00:00
Robert Muir 107c06324b fix more javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065474 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 02:59:40 +00:00
Uwe Schindler e7088279f7 LUCENE-1253: LengthFilter (and Solr's KeepWordTokenFilter) now require up front specification of enablePositionIncrement. Together with StopFilter they have a common base class (FilteringTokenFilter) that handles the position increments automatically. Implementors only need to override an accept() method that filters tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065343 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 18:30:34 +00:00
Robert Muir d1a5ca1460 add missing @Override and @Deprecated annotations
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065304 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 15:10:15 +00:00
Robert Muir 5629a2b96b add missing license headers where there are none, but the JIRA box was checked
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065265 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 13:28:41 +00:00
Yonik Seeley 51dc4159e6 SOLR-1283: fix numRead counter that caused mark invalid exceptions
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1063920 13f79535-47bb-0310-9956-ffa450edef68
2011-01-26 23:40:08 +00:00
Steven Rowe 11146b8c3c changed generate-maven-artifacts target to place all maven artifacts in one place: modules/dist/maven/; added modules/dist/ to list of dirs to remove with the 'clean' target; added modules/dist/ to svn:ignore list on modules/
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1062308 13f79535-47bb-0310-9956-ffa450edef68
2011-01-23 01:42:19 +00:00
Steven Rowe 74360c80f5 LUCENE-2657: Replace Maven POM templates with full POMs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1061613 13f79535-47bb-0310-9956-ffa450edef68
2011-01-21 03:44:13 +00:00
Yonik Seeley b2cad88aad SOLR-2316: fail early if synonym file not provided
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1060846 13f79535-47bb-0310-9956-ffa450edef68
2011-01-19 16:11:42 +00:00
Shai Erera 2a0484bd40 LUCENE-2295: remove maxFieldLength (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1060340 13f79535-47bb-0310-9956-ffa450edef68
2011-01-18 12:01:40 +00:00
Robert Muir 4249ef9644 LUCENE-2847: remove obselete warning
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1059719 13f79535-47bb-0310-9956-ffa450edef68
2011-01-17 01:43:37 +00:00
Steven Rowe 8d7d57abdc LUCENE-2847: Added ASL2 license to supplementary macros generator, and to the generated file, and set svn:eol-style to native for both of them.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1056014 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 19:15:21 +00:00
Robert Muir fbfb07d904 LUCENE-2842: avoid java6-only String.isEmpty in rule parser
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055906 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 15:07:12 +00:00
Robert Muir 66d3f38d52 LUCENE-2842: missing eol-style
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055893 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 14:33:35 +00:00
Robert Muir 61872be09d LUCENE-2842: add Galician analyzer, Portuguese RSLP
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055892 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 14:30:37 +00:00
Steven Rowe 1b22e86417 LUCENE-2847: Support all of unicode, including supplementary code points above the basic multilingual plane, in StandardTokenizer and UAX29URLEmailTokenizer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055877 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 13:51:10 +00:00
Michael McCandless 87274d00ac LUCENE-2837: collapse Searcher/Searchable into IndexSearcher; remove contrib/remote, MultiSearcher; absorb ParallelMultiSearcher into IndexSearcher as optional ExecutorService to ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055416 13f79535-47bb-0310-9956-ffa450edef68
2011-01-05 11:16:40 +00:00
Robert Muir 8696f549d4 LUCENE-2020: Remove unused imports
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1052926 13f79535-47bb-0310-9956-ffa450edef68
2010-12-26 19:16:42 +00:00
Robert Muir 620b2a0619 LUCENE-2747: Deprecate/remove language-specific tokenizers in favor of StandardTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1043114 13f79535-47bb-0310-9956-ffa450edef68
2010-12-07 16:19:17 +00:00
Steven Rowe 2b9726ae81 LUCENE-2763: Swap URL+Email recognizing StandardTokenizer and UAX29Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1043071 13f79535-47bb-0310-9956-ffa450edef68
2010-12-07 14:53:13 +00:00
Robert Muir f87ca310ec LUCENE-2797: Upgrade icu to 4.6
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1042185 13f79535-47bb-0310-9956-ffa450edef68
2010-12-04 14:08:03 +00:00
Robert Muir a58c26978f LUCENE-2781: drop deprecations from trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1040463 13f79535-47bb-0310-9956-ffa450edef68
2010-11-30 11:22:39 +00:00
Robert Muir ff47493dbd fix bug where StandardFilter isn't respected
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1040390 13f79535-47bb-0310-9956-ffa450edef68
2010-11-30 02:44:47 +00:00
Uwe Schindler 819344aeab LUCENE-2732: Fix charset problems in XML loading in HyphenationCompoundWordTokenFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1029345 13f79535-47bb-0310-9956-ffa450edef68
2010-10-31 13:56:46 +00:00
Uwe Schindler 987f32849b LUCENE-2708: when a test Assume fails, display information, improved one
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1023312 13f79535-47bb-0310-9956-ffa450edef68
2010-10-16 15:43:11 +00:00
Steven Rowe 7f6dd505f1 LUCENE-2699: Update StandardTokenizer and UAX29Tokenizer to Unicode 6.0.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1022826 13f79535-47bb-0310-9956-ffa450edef68
2010-10-15 05:41:54 +00:00
Steven Rowe f9e4f551e2 LUCENE-1370: Added ShingleFilter option to output unigrams if no shingles can be generated.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1006187 13f79535-47bb-0310-9956-ffa450edef68
2010-10-09 16:55:23 +00:00
Steven Rowe 42d5b585ce Ignore this test under IntelliJ, which can't use Ant's test file patterns (Test*.java,*Test.java) to ignore this test, and thinks it's a failure since no test methods can be found.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1004853 13f79535-47bb-0310-9956-ffa450edef68
2010-10-05 23:23:55 +00:00
Robert Muir afad8123d2 clear up more warnings in modules/contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003990 13f79535-47bb-0310-9956-ffa450edef68
2010-10-03 16:27:34 +00:00
Robert Muir 0789e5f4e7 LUCENE-2681: fix generics violations in contrib/modules
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003978 13f79535-47bb-0310-9956-ffa450edef68
2010-10-03 15:41:57 +00:00
Robert Muir e05117884a clear up javadocs warnings/errors
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003962 13f79535-47bb-0310-9956-ffa450edef68
2010-10-03 13:22:51 +00:00
Robert Muir c8b7a21b4b clear up more compiler warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003906 13f79535-47bb-0310-9956-ffa450edef68
2010-10-02 22:20:26 +00:00
Robert Muir fd11477ece clean up some fallthru/deprecation warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1003873 13f79535-47bb-0310-9956-ffa450edef68
2010-10-02 19:58:35 +00:00
Robert Muir f5031a6b27 LUCENE-2167: cut over these analyzers also
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1002221 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 15:33:22 +00:00
Steven Rowe 3c26a9167c LUCENE-2167: Implement StandardTokenizer with the UAX#29 Standard
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1002032 13f79535-47bb-0310-9956-ffa450edef68
2010-09-28 06:16:16 +00:00
Robert Muir cce20cd820 LUCENE-2070: document how LengthFilter counts characters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1000675 13f79535-47bb-0310-9956-ffa450edef68
2010-09-24 00:42:05 +00:00
Robert Muir c84bd2f1ec LUCENE-2653: ThaiAnalyzer assumes things about your jre
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@998684 13f79535-47bb-0310-9956-ffa450edef68
2010-09-19 15:40:06 +00:00
Robert Muir 774eaeada0 LUCENE-2630: fix intl test bugs that rely on cldr version
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@997180 13f79535-47bb-0310-9956-ffa450edef68
2010-09-15 03:30:35 +00:00
Robert Muir feabadea20 LUCENE-2642: merge LuceneTestCase and LuceneTestCaseJ4
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@996611 13f79535-47bb-0310-9956-ffa450edef68
2010-09-13 17:37:20 +00:00
Robert Muir d38ec19a28 LUCENE-2639: remove random juggling in tests, add -Dtests.seed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@995772 13f79535-47bb-0310-9956-ffa450edef68
2010-09-10 12:34:18 +00:00
Robert Muir 13fd70521a LUCENE-2624: add armenian, basque, catalan analyzers from snowball
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@990459 13f79535-47bb-0310-9956-ffa450edef68
2010-08-28 22:42:25 +00:00
Robert Muir 33cc5a041e SOLR-2059: Add types attribute to WordDelimiterFilterFactory
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@990451 13f79535-47bb-0310-9956-ffa450edef68
2010-08-28 21:25:44 +00:00
Robert Muir 48dde8359f LUCENE-2098: speed up BaseCharFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@990161 13f79535-47bb-0310-9956-ffa450edef68
2010-08-27 14:33:22 +00:00
Robert Muir 07df8d5210 LUCENE-2598: factor the behavior of MockRAMDirectory into MockDirectoryWrapper, add experimental -Dtests.directory= to allow running the tests under different directory impls [but the default is still RAMDirectory]
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@988206 13f79535-47bb-0310-9956-ffa450edef68
2010-08-23 17:00:43 +00:00
Ryan McKinley 3be9fedd84 getting 'generate-maven-artifacts' to work with analysis module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@987286 13f79535-47bb-0310-9956-ffa450edef68
2010-08-19 19:58:36 +00:00
Robert Muir 1473b59c0e SOLR-1860: expose these analyzers stoplists as .txt like the others
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@986612 13f79535-47bb-0310-9956-ffa450edef68
2010-08-18 09:59:00 +00:00
Robert Muir faed4b4cd0 LUCENE-2598: add newDirectory and track that resources are closed correctly by tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@984968 13f79535-47bb-0310-9956-ffa450edef68
2010-08-12 20:56:23 +00:00
Robert Muir 61954ca249 SOLR-2002: change tests from TestCase to LuceneTestCase for better coverage
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@983530 13f79535-47bb-0310-9956-ffa450edef68
2010-08-09 06:11:16 +00:00
Shai Erera bed729c561 LUCENE-2570: Some improvements to _TestUtil and its usage
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@979646 13f79535-47bb-0310-9956-ffa450edef68
2010-07-27 11:31:25 +00:00
Robert Muir fcc9a4a3c3 LUCENE-2503: add forgotten javadoc/citation (sorry)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@964054 13f79535-47bb-0310-9956-ffa450edef68
2010-07-14 14:06:06 +00:00
Robert Muir 3241eb9291 LUCENE-2503: add light stemmers for european languages
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@964019 13f79535-47bb-0310-9956-ffa450edef68
2010-07-14 12:10:34 +00:00
Robert Muir 8f71031ac8 LUCENE-2413: consolidate remaining solr tokenstreams into modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@957162 13f79535-47bb-0310-9956-ffa450edef68
2010-06-23 11:25:17 +00:00
Michael McCandless c91bddb26b LUCENE-2380: hard cutover of all preflex APIs to flex
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@955257 13f79535-47bb-0310-9956-ffa450edef68
2010-06-16 15:17:32 +00:00
Robert Muir 5a661500c1 LUCENE-2413: directory and package fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@955203 13f79535-47bb-0310-9956-ffa450edef68
2010-06-16 11:33:29 +00:00
Robert Muir 6e51a53189 LUCENE-2372: remove unused import
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@950042 13f79535-47bb-0310-9956-ffa450edef68
2010-06-01 12:42:30 +00:00
Robert Muir ad0e495911 LUCENE-2372: switch over remaining uses of TermAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@950008 13f79535-47bb-0310-9956-ffa450edef68
2010-06-01 10:35:13 +00:00
Uwe Schindler 98b252ed7f LUCENE-2295: Added a LimitTokenCountAnalyzer / LimitTokenCountFilter to wrap any other Analyzer and provide the same functionality as MaxFieldLength provided on IndexWriter. This patch also fixes a bug in the offset calculation in CharTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@949445 13f79535-47bb-0310-9956-ffa450edef68
2010-05-29 23:14:18 +00:00
Uwe Schindler 9e61dd591f Generics Policeman ticket
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948234 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 22:44:36 +00:00
Robert Muir a0c72afb31 LUCENE-2413: move more core analysis to analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948225 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 22:28:32 +00:00
Robert Muir 71b59ca566 LUCENE-2413: consolidate remaining concrete core analyzers to modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@948195 13f79535-47bb-0310-9956-ffa450edef68
2010-05-25 20:16:44 +00:00
Robert Muir 5259d7d90b LUCENE-2413: move KeywordMarkerFilter to analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946621 13f79535-47bb-0310-9956-ffa450edef68
2010-05-20 13:23:12 +00:00
Robert Muir fe5f1aabcb LUCENE-1287: Allow usage of HyphenationCompoundWordTokenFilter without a dictionary
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@946139 13f79535-47bb-0310-9956-ffa450edef68
2010-05-19 11:58:37 +00:00
Uwe Schindler cd45643b96 LUCENE-2384: Remove hack, as JFlex trunk now has the zzBuffer bug fixed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@945130 13f79535-47bb-0310-9956-ffa450edef68
2010-05-17 13:13:10 +00:00
Robert Muir acbf053b7c LUCENE-2463: Improve Greek analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@945090 13f79535-47bb-0310-9956-ffa450edef68
2010-05-17 11:28:04 +00:00
Robert Muir 26b9faddb2 LUCENE-2413: consolidate SynonymFilter into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@942827 13f79535-47bb-0310-9956-ffa450edef68
2010-05-10 17:37:45 +00:00
Robert Muir 1b020be130 LUCENE-2437: Indonesian Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@942235 13f79535-47bb-0310-9956-ffa450edef68
2010-05-07 21:21:12 +00:00
Robert Muir f6e9cc9f32 LUCENE-2444: move contrib/analyzers to modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@941308 13f79535-47bb-0310-9956-ffa450edef68
2010-05-05 14:26:59 +00:00