108 Commits

Author SHA1 Message Date
Robert Muir 2dda5bc35f LUCENE-3358: StandardTokenizer wrongly discarded combining marks attached to Han/Hiragana
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154005 13f79535-47bb-0310-9956-ffa450edef68
2011-08-04 20:49:47 +00:00
Uwe Schindler 014dee7cf5 revert accidental commit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150488 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 21:22:07 +00:00
Uwe Schindler 9c73f9d03b LUCENE-3336: Speed up javadocs-all builds by minimizing compile costs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150486 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 21:19:42 +00:00
Robert Muir 3626220146 use a different character for test, one that is still enclosing mark in 6.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150091 13f79535-47bb-0310-9956-ffa450edef68
2011-07-23 12:18:48 +00:00
Michael McCandless fbf9f4ccad LUCENE-3289: add options to FST Builder to tradeoff RAM/CPU used during build vs how small the resulting FST is
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145292 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 18:53:13 +00:00
Robert Muir 015ecfa0a0 LUCENE-3233: improve ram/perf of SynonymFilter, add wordnet parsing, nuke contrib/wordnet
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145158 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 12:58:52 +00:00
Christopher John Male 5f30bedccc LUCENE-3283: Moved core QueryParsers to queryparser module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145016 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 03:37:00 +00:00
Christopher John Male f16f395a30 LUCENE-3284: Decoupled remaining module/contrib tests from QueryParser
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1144566 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 01:11:18 +00:00
Simon Willnauer 6c5621f16c fixed dead store variable
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1144269 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 11:26:03 +00:00
Michael McCandless b55eeb510d LUCENE-3246: invert getDelDocs to getLiveDocs as pre-cursor for LUCENE-1536
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1143415 13f79535-47bb-0310-9956-ffa450edef68
2011-07-06 13:54:38 +00:00
Christopher John Male f9ed2c19cd LUCENE-3219: Moved SortField types to Enum
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1138276 13f79535-47bb-0310-9956-ffa450edef68
2011-06-22 01:48:45 +00:00
Robert Muir eca56e0564 LUCENE-152: minor optimization to avoid some char[]/String creation
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1134328 13f79535-47bb-0310-9956-ffa450edef68
2011-06-10 14:00:32 +00:00
Simon Willnauer 2007a4b4e0 Remove @Version tags from JavaDoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1133805 13f79535-47bb-0310-9956-ffa450edef68
2011-06-09 11:43:35 +00:00
Robert Muir b7277878e8 LUCENE-152: add KStem
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130527 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 12:58:22 +00:00
Robert Muir 7bf34183d0 LUCENE-3158: put licensing information in the LICENSE.txt file where it belongs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129450 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 01:22:03 +00:00
Robert Muir c8101d708c LUCENE-3160: lucene source build doesn't work correctly by itself
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129445 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 00:08:14 +00:00
Simon Willnauer 68a840c2b7 SOLR-2530: Remove Noggit CharArr from FieldType
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1127326 13f79535-47bb-0310-9956-ffa450edef68
2011-05-24 22:44:36 +00:00
Robert Muir f1a1844fb9 LUCENE-3113: fix analyzer bugs found by MockTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104519 13f79535-47bb-0310-9956-ffa450edef68
2011-05-17 20:16:40 +00:00
Robert Muir a871b29ed6 LUCENE-3086: add ElisionFilter to ItalianAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1102120 13f79535-47bb-0310-9956-ffa450edef68
2011-05-11 22:43:54 +00:00
Ryan McKinley 96878534a0 LUCENE-3071: Add ReversePathHierarchyTokenizer and enable skip on PathHierarchyTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1099999 13f79535-47bb-0310-9956-ffa450edef68
2011-05-05 23:30:05 +00:00
Robert Muir 4455345c6e LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-03 00:29:47 +00:00
Robert Muir a75e5282c7 collation tests: try to find less jre bugs and just test thread safety
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098532 13f79535-47bb-0310-9956-ffa450edef68
2011-05-02 12:03:14 +00:00
Robert Muir 593d7a54ea LUCENE-3044: ThaiWordFilter uses AttributeSource.copyTo incorrectly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096334 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 15:45:45 +00:00
Robert Muir 7db98455e7 LUCENE-3043: GermanStemmer threw IOOBE on zero-length tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096194 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:48:17 +00:00
Robert Muir c0c695053c LUCENE-2560: remove copy/paste unused import
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096183 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:16:51 +00:00
Robert Muir 68061ef921 LUCENE-2560: add basic stress tests for analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096178 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 16:55:15 +00:00
Robert Muir c3f6331639 LUCENE-3016: add analyzer for Latvian
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092396 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 17:07:10 +00:00
Robert Muir 7d07d206b5 LUCENE-3020: better payload testing with mockanalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091132 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 17:20:31 +00:00
Robert Muir e67bf6b089 LUCENE-2944: fix BytesRef reuse bugs, TermToBytesRefAttribute owns the bytes like other attributes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083784 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 13:52:15 +00:00
Robert Muir 52fbd34849 clear java 1.5-only javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078639 13f79535-47bb-0310-9956-ffa450edef68
2011-03-07 00:55:32 +00:00
Robert Muir d51068ffd6 LUCENE-2894: apply formatting to more code samples
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076237 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 14:59:02 +00:00
Robert Muir 6600f5acdf LUCENE-2943: tone down test even more
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076223 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 13:56:15 +00:00
Robert Muir 7e5d696d7d LUCENE-2943: tone down test with multiplier a bit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075999 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 19:53:05 +00:00
Robert Muir 2509d35c11 LUCENE-2943: fix thread-safety issues with ICU collation
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075850 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 15:47:14 +00:00
Robert Muir 308e0bd4a9 LUCENE-2514, LUCENE-2551: collation uses byte[] keys, deprecate old unscalable locale sort/range, termrangequery/filter work on bytes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075210 13f79535-47bb-0310-9956-ffa450edef68
2011-02-28 05:15:50 +00:00
Uwe Schindler 5691bea096 LUCENE-2920: Removed ShingleMatrixFilter as it is unmaintained and does not work with custom Attributes or custom payload encoders
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1070821 13f79535-47bb-0310-9956-ffa450edef68
2011-02-15 09:24:06 +00:00
Robert Muir 6386f77138 LUCENE-2911: synchronize grammar/token types across StandardTokenizer, UAX29EmailURLTokenizer, ICUTokenizer; add CJK types
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1068979 13f79535-47bb-0310-9956-ffa450edef68
2011-02-09 17:07:46 +00:00
Robert Muir 70a9910b38 LUCENE-2908: clean up serialization in the codebase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1068526 13f79535-47bb-0310-9956-ffa450edef68
2011-02-08 19:05:28 +00:00
Shai Erera ece1524805 LUCENE-2609: Generate jar containing test classes (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1067738 13f79535-47bb-0310-9956-ffa450edef68
2011-02-06 19:48:54 +00:00
Koji Sekiguchi 6f31407109 SOLR-1057: Add PathHierarchyTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1067131 13f79535-47bb-0310-9956-ffa450edef68
2011-02-04 10:19:52 +00:00
Robert Muir dde8fc7020 LUCENE-2751: add LuceneTestCase.newSearcher. use this to get an indexsearcher that randomly uses threads, etc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1066691 13f79535-47bb-0310-9956-ffa450edef68
2011-02-02 23:27:25 +00:00
Robert Muir 5ccf063a5d LUCENE-2901: fix consistency of KeywordMarkerFilter, it should only set, not unset the attribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065621 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 14:06:45 +00:00
Robert Muir 107c06324b fix more javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065474 13f79535-47bb-0310-9956-ffa450edef68
2011-01-31 02:59:40 +00:00
Uwe Schindler e7088279f7 LUCENE-1253: LengthFilter (and Solr's KeepWordTokenFilter) now require up front specification of enablePositionIncrement. Together with StopFilter they have a common base class (FilteringTokenFilter) that handles the position increments automatically. Implementors only need to override an accept() method that filters tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065343 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 18:30:34 +00:00
Robert Muir d1a5ca1460 add missing @Override and @Deprecated annotations
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065304 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 15:10:15 +00:00
Robert Muir 5629a2b96b add missing license headers where there are none, but the JIRA box was checked
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1065265 13f79535-47bb-0310-9956-ffa450edef68
2011-01-30 13:28:41 +00:00
Yonik Seeley 51dc4159e6 SOLR-1283: fix numRead counter that caused mark invalid exceptions
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1063920 13f79535-47bb-0310-9956-ffa450edef68
2011-01-26 23:40:08 +00:00
Steven Rowe 11146b8c3c changed generate-maven-artifacts target to place all maven artifacts in one place: modules/dist/maven/; added modules/dist/ to list of dirs to remove with the 'clean' target; added modules/dist/ to svn:ignore list on modules/
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1062308 13f79535-47bb-0310-9956-ffa450edef68
2011-01-23 01:42:19 +00:00
Steven Rowe 74360c80f5 LUCENE-2657: Replace Maven POM templates with full POMs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1061613 13f79535-47bb-0310-9956-ffa450edef68
2011-01-21 03:44:13 +00:00
Yonik Seeley b2cad88aad SOLR-2316: fail early if synonym file not provided
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1060846 13f79535-47bb-0310-9956-ffa450edef68
2011-01-19 16:11:42 +00:00