Commit Graph

109 Commits

Author SHA1 Message Date
Robert Muir 4455345c6e LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-03 00:29:47 +00:00
Robert Muir a75e5282c7 collation tests: try to find less jre bugs and just test thread safety
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098532 13f79535-47bb-0310-9956-ffa450edef68
2011-05-02 12:03:14 +00:00
Robert Muir 44ba0859db LUCENE-2560: stress tests for icu integration
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096339 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 16:07:16 +00:00
Robert Muir 593d7a54ea LUCENE-3044: ThaiWordFilter uses AttributeSource.copyTo incorrectly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096334 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 15:45:45 +00:00
Robert Muir 7db98455e7 LUCENE-3043: GermanStemmer threw IOOBE on zero-length tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096194 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:48:17 +00:00
Robert Muir c0c695053c LUCENE-2560: remove copy/paste unused import
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096183 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:16:51 +00:00
Robert Muir 68061ef921 LUCENE-2560: add basic stress tests for analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096178 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 16:55:15 +00:00
Robert Muir c3f6331639 LUCENE-3016: add analyzer for Latvian
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092396 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 17:07:10 +00:00
Robert Muir ecd795c585 LUCENE-3026: SmartChineseAnalyzer's WordTokenFilter threw NullPointerException on sentences longer than 32,767 characters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092328 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 15:15:31 +00:00
Robert Muir 7d07d206b5 LUCENE-3020: better payload testing with mockanalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091132 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 17:20:31 +00:00
Steven Rowe c613d642a0 LUCENE-3006: specialized definition of javadoc.classpath is not required for building ICU analysis module's javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088063 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 16:47:24 +00:00
Steven Rowe 14eb02ffa4 LUCENE-3006: die javadoc warnings die (modules/ edition)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087830 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 17:43:24 +00:00
Robert Muir e67bf6b089 LUCENE-2944: fix BytesRef reuse bugs, TermToBytesRefAttribute owns the bytes like other attributes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083784 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 13:52:15 +00:00
Grant Ingersoll 746d0ef5a0 LUCENE-2952: add notices
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1082516 13f79535-47bb-0310-9956-ffa450edef68
2011-03-17 15:00:51 +00:00
Grant Ingersoll 9352885d1b LUCENE-2952: normalize license files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1081792 13f79535-47bb-0310-9956-ffa450edef68
2011-03-15 14:07:16 +00:00
Steven Rowe 7180bb3cb9 LUCENE-2957: generate-maven-artifacts target should include all non-Mavenized Lucene & Solr dependencies
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1080443 13f79535-47bb-0310-9956-ffa450edef68
2011-03-11 04:32:14 +00:00
Robert Muir 52fbd34849 clear java 1.5-only javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078639 13f79535-47bb-0310-9956-ffa450edef68
2011-03-07 00:55:32 +00:00
Robert Muir 48dbe35e69 correct minor problems with dates and copyright owners in NOTICE.txts
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078529 13f79535-47bb-0310-9956-ffa450edef68
2011-03-06 18:22:48 +00:00
Robert Muir 28ea4b7561 add xyz-LICENSE.txt for all third party jars
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078512 13f79535-47bb-0310-9956-ffa450edef68
2011-03-06 16:50:22 +00:00
Robert Muir d51068ffd6 LUCENE-2894: apply formatting to more code samples
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076237 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 14:59:02 +00:00
Robert Muir 6600f5acdf LUCENE-2943: tone down test even more
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076223 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 13:56:15 +00:00
Robert Muir 7e5d696d7d LUCENE-2943: tone down test with multiplier a bit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075999 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 19:53:05 +00:00
Robert Muir 2509d35c11 LUCENE-2943: fix thread-safety issues with ICU collation
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075850 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 15:47:14 +00:00
Robert Muir 308e0bd4a9 LUCENE-2514, LUCENE-2551: collation uses byte[] keys, deprecate old unscalable locale sort/range, termrangequery/filter work on bytes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075210 13f79535-47bb-0310-9956-ffa450edef68
2011-02-28 05:15:50 +00:00
Uwe Schindler 5691bea096 LUCENE-2920: Removed ShingleMatrixFilter as it is unmaintained and does not work with custom Attributes or custom payload encoders
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1070821 13f79535-47bb-0310-9956-ffa450edef68
2011-02-15 09:24:06 +00:00
Robert Muir 6386f77138 LUCENE-2911: synchronize grammar/token types across StandardTokenizer, UAX29EmailURLTokenizer, ICUTokenizer; add CJK types
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1068979 13f79535-47bb-0310-9956-ffa450edef68
2011-02-09 17:07:46 +00:00
Robert Muir 70a9910b38 LUCENE-2908: clean up serialization in the codebase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1068526 13f79535-47bb-0310-9956-ffa450edef68
2011-02-08 19:05:28 +00:00
Shai Erera ece1524805 LUCENE-2609: Generate jar containing test classes (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1067738 13f79535-47bb-0310-9956-ffa450edef68
2011-02-06 19:48:54 +00:00
Koji Sekiguchi 6f31407109 SOLR-1057: Add PathHierarchyTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1067131 13f79535-47bb-0310-9956-ffa450edef68
2011-02-04 10:19:52 +00:00
Robert Muir dde8fc7020 LUCENE-2751: add LuceneTestCase.newSearcher. use this to get an indexsearcher that randomly uses threads, etc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1066691 13f79535-47bb-0310-9956-ffa450edef68
2011-02-02 23:27:25 +00:00
Robert Muir 5ccf063a5d LUCENE-2901: fix consistency of KeywordMarkerFilter, it should only set, not unset the attribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065621 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 14:06:45 +00:00
Robert Muir 107c06324b fix more javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065474 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 02:59:40 +00:00
Uwe Schindler e7088279f7 LUCENE-1253: LengthFilter (and Solr's KeepWordTokenFilter) now require up front specification of enablePositionIncrement. Together with StopFilter they have a common base class (FilteringTokenFilter) that handles the position increments automatically. Implementors only need to override an accept() method that filters tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065343 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 18:30:34 +00:00
Robert Muir d1a5ca1460 add missing @Override and @Deprecated annotations
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065304 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 15:10:15 +00:00
Robert Muir 5629a2b96b add missing license headers where there are none, but the JIRA box was checked
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065265 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 13:28:41 +00:00
Yonik Seeley 51dc4159e6 SOLR-1283: fix numRead counter that caused mark invalid exceptions
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1063920 13f79535-47bb-0310-9956-ffa450edef68
2011-01-26 23:40:08 +00:00
Steven Rowe 11146b8c3c changed generate-maven-artifacts target to place all maven artifacts in one place: modules/dist/maven/; added modules/dist/ to list of dirs to remove with the 'clean' target; added modules/dist/ to svn:ignore list on modules/
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1062308 13f79535-47bb-0310-9956-ffa450edef68
2011-01-23 01:42:19 +00:00
Steven Rowe 74360c80f5 LUCENE-2657: Replace Maven POM templates with full POMs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1061613 13f79535-47bb-0310-9956-ffa450edef68
2011-01-21 03:44:13 +00:00
Uwe Schindler 460fa90564 LUCENE-2374: Added Attribute reflection API: It's now possible to inspect the contents of AttributeImpl and AttributeSource using a well-defined API
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1061039 13f79535-47bb-0310-9956-ffa450edef68
2011-01-19 22:41:16 +00:00
Yonik Seeley b2cad88aad SOLR-2316: fail early if synonym file not provided
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1060846 13f79535-47bb-0310-9956-ffa450edef68
2011-01-19 16:11:42 +00:00
Shai Erera 2a0484bd40 LUCENE-2295: remove maxFieldLength (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1060340 13f79535-47bb-0310-9956-ffa450edef68
2011-01-18 12:01:40 +00:00
Robert Muir 4249ef9644 LUCENE-2847: remove obselete warning
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1059719 13f79535-47bb-0310-9956-ffa450edef68
2011-01-17 01:43:37 +00:00
Steven Rowe 8d7d57abdc LUCENE-2847: Added ASL2 license to supplementary macros generator, and to the generated file, and set svn:eol-style to native for both of them.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1056014 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 19:15:21 +00:00
Robert Muir fbfb07d904 LUCENE-2842: avoid java6-only String.isEmpty in rule parser
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055906 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 15:07:12 +00:00
Robert Muir 66d3f38d52 LUCENE-2842: missing eol-style
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055893 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 14:33:35 +00:00
Robert Muir 61872be09d LUCENE-2842: add Galician analyzer, Portuguese RSLP
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055892 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 14:30:37 +00:00
Steven Rowe 1b22e86417 LUCENE-2847: Support all of unicode, including supplementary code points above the basic multilingual plane, in StandardTokenizer and UAX29URLEmailTokenizer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055877 13f79535-47bb-0310-9956-ffa450edef68
2011-01-06 13:51:10 +00:00
Michael McCandless 87274d00ac LUCENE-2837: collapse Searcher/Searchable into IndexSearcher; remove contrib/remote, MultiSearcher; absorb ParallelMultiSearcher into IndexSearcher as optional ExecutorService to ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1055416 13f79535-47bb-0310-9956-ffa450edef68
2011-01-05 11:16:40 +00:00
Robert Muir 8696f549d4 LUCENE-2020: Remove unused imports
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1052926 13f79535-47bb-0310-9956-ffa450edef68
2010-12-26 19:16:42 +00:00
Robert Muir 620b2a0619 LUCENE-2747: Deprecate/remove language-specific tokenizers in favor of StandardTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1043114 13f79535-47bb-0310-9956-ffa450edef68
2010-12-07 16:19:17 +00:00