152 Commits

Author SHA1 Message Date
Robert Muir 4455345c6e LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-03 00:29:47 +00:00
Robert Muir a75e5282c7 collation tests: try to find less jre bugs and just test thread safety
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098532 13f79535-47bb-0310-9956-ffa450edef68
2011-05-02 12:03:14 +00:00
Robert Muir 1f67321074 missing svn:eol-style
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1097216 13f79535-47bb-0310-9956-ffa450edef68
2011-04-27 19:40:18 +00:00
Robert Muir 44ba0859db LUCENE-2560: stress tests for icu integration
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096339 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 16:07:16 +00:00
Robert Muir 593d7a54ea LUCENE-3044: ThaiWordFilter uses AttributeSource.copyTo incorrectly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096334 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 15:45:45 +00:00
Robert Muir 7db98455e7 LUCENE-3043: GermanStemmer threw IOOBE on zero-length tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096194 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:48:17 +00:00
Robert Muir c0c695053c LUCENE-2560: remove copy/paste unused import
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096183 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:16:51 +00:00
Robert Muir 68061ef921 LUCENE-2560: add basic stress tests for analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096178 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 16:55:15 +00:00
Robert Muir c3f6331639 LUCENE-3016: add analyzer for Latvian
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092396 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 17:07:10 +00:00
Robert Muir ecd795c585 LUCENE-3026: SmartChineseAnalyzer's WordTokenFilter threw NullPointerException on sentences longer than 32,767 characters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092328 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 15:15:31 +00:00
Robert Muir 52b54262dc LUCENE-3020: don't reflect mockanalyzer, it has no no-arg ctor anymore
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091159 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 18:15:50 +00:00
Robert Muir 7d07d206b5 LUCENE-3020: better payload testing with mockanalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091132 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 17:20:31 +00:00
Steven Rowe c613d642a0 LUCENE-3006: specialized definition of javadoc.classpath is not required for building ICU analysis module's javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088063 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 16:47:24 +00:00
Michael McCandless f10d92398b LUCENE-1076: new TieredMergePolicy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088051 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 15:47:12 +00:00
Steven Rowe 14eb02ffa4 LUCENE-3006: die javadoc warnings die (modules/ edition)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087830 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 17:43:24 +00:00
Robert Muir d940c24c03 fix benchmark collation test to match reality
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087548 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 01:58:35 +00:00
Robert Muir 74a065a57f fix collation benchmark to use byte terms
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087529 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 00:47:16 +00:00
Steven Rowe 7402c50058 fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087475 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:58 +00:00
Steven Rowe 085d30ecf3 changes entries for recent commits
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087474 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:21 +00:00
Steven Rowe 1caaea77b2 ReadTokensTask now converts tokens to their indexed forms (char[]->byte[]), just as the indexer does. This allows measurement of the conversion process, which is important for analysis components that customize it, e.g. (ICU)CollationKeyFilter.
NB: as a result, benchmarks that incorporate this task will no longer be directly comparable between 3.X and 4.0

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087471 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:44:20 +00:00
Steven Rowe 9cefe60a4b Removed special case for looking up KeywordAnalyzer, which is *not* alone among analyzers occupying package o.a.l.analysis.core.
Instead, now attempting to instantiate no-package analyzers as core analyzers, then falling back to the previous default package ("org.apache.lucene.analysis.") if that fails. Also, made the same changes in NewShingleAnalyzerTask.

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087468 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:34:46 +00:00
Steven Rowe 3bbfa450e4 Updated to the new method for obtaining a top-level deleted docs bitset. Also checking the bitset for null, when there are no deleted docs.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087435 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 21:03:18 +00:00
Steven Rowe 56c2994f66 Added a special case for looking up KeywordAnalyzer, which alone among analyzers occupies package o.a.l.analysis.core.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087431 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 20:16:10 +00:00
Doron Cohen 8d0c1b62af LUCENE-2977: WriteLineDocTask should write gzip/bzip2/txt according to the extension of specified output file name.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084929 13f79535-47bb-0310-9956-ffa450edef68
2011-03-24 12:22:13 +00:00
Doron Cohen c6f3dd5cc7 LUCENE-2980: Benchmark's ContentSource made insensitive to letter case of file suffix - fix CHANGES entry.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084549 13f79535-47bb-0310-9956-ffa450edef68
2011-03-23 11:47:18 +00:00
Doron Cohen d123b8a224 LUCENE-2980: Benchmark's ContentSource made insensitive to letter case of file suffix.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084544 13f79535-47bb-0310-9956-ffa450edef68
2011-03-23 11:38:54 +00:00
Grant Ingersoll ed20a24d22 LUCENE-2952: restore src/tools and move validation there
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084274 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 18:03:57 +00:00
Doron Cohen 97909a908e fix mis-spelled assert comment (again)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084273 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 18:03:00 +00:00
Doron Cohen 1029aedcfd fix mis-spelled assert comment.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084247 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 16:46:53 +00:00
Doron Cohen bb8e6ae846 LUCENE-2978: Upgrade benchmark's commons-compress from 1.0 to 1.1.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084210 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 15:08:29 +00:00
Doron Cohen a9fda446c3 LUCENE-2958: WriteLineDocTask improvements - flexible line fields definition - port/merge from 3x.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083816 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 14:59:42 +00:00
Robert Muir e67bf6b089 LUCENE-2944: fix BytesRef reuse bugs, TermToBytesRefAttribute owns the bytes like other attributes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083784 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 13:52:15 +00:00
Doron Cohen e45d28a8d3 LUCENE-2964: Allow benchmark tasks from alternative packages - merge/port from 3x.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083731 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 11:23:37 +00:00
Doron Cohen 6d47d7377d LUCENE-2963: Easier way to run benchmark, by calling Benchmark.exec(alg-file) - port from 3x.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083557 13f79535-47bb-0310-9956-ffa450edef68
2011-03-20 20:12:39 +00:00
Grant Ingersoll 8bee953057 LUCENE-2952: drop dev-tools dependency, move to test framework, split out checking to each area: lucene, modules, solr
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083010 13f79535-47bb-0310-9956-ffa450edef68
2011-03-18 18:40:02 +00:00
Grant Ingersoll f36c32405d LUCENE-2952: hook in dependency checking for license, notice
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1082535 13f79535-47bb-0310-9956-ffa450edef68
2011-03-17 15:34:21 +00:00
Grant Ingersoll 746d0ef5a0 LUCENE-2952: add notices
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1082516 13f79535-47bb-0310-9956-ffa450edef68
2011-03-17 15:00:51 +00:00
Grant Ingersoll 372fa574f9 remove unneeded license, notice
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1082512 13f79535-47bb-0310-9956-ffa450edef68
2011-03-17 14:55:34 +00:00
Grant Ingersoll 9352885d1b LUCENE-2952: normalize license files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1081792 13f79535-47bb-0310-9956-ffa450edef68
2011-03-15 14:07:16 +00:00
Steven Rowe 7180bb3cb9 LUCENE-2957: generate-maven-artifacts target should include all non-Mavenized Lucene & Solr dependencies
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1080443 13f79535-47bb-0310-9956-ffa450edef68
2011-03-11 04:32:14 +00:00
Steven Rowe 77371e0433 Obsolete - replaced by apache-extras luceneutil
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1080335 13f79535-47bb-0310-9956-ffa450edef68
2011-03-10 20:23:02 +00:00
Steven Rowe 3fcf6d6525 LUCENE-2961: Remove benchmark/lib/xml-apis-2.9.0.jar - JVM 1.5+ contains these JAXP 1.3 interface classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1080258 13f79535-47bb-0310-9956-ffa450edef68
2011-03-10 15:57:52 +00:00
Uwe Schindler bdaa02c3c0 LUCENE-2953: PriorityQueue's internal heap was made private final
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1079707 13f79535-47bb-0310-9956-ffa450edef68
2011-03-09 09:18:56 +00:00
Robert Muir 52fbd34849 clear java 1.5-only javadocs warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078639 13f79535-47bb-0310-9956-ffa450edef68
2011-03-07 00:55:32 +00:00
Robert Muir b2fcee9822 add missing LICENSE/NOTICE to benchmarks module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078565 13f79535-47bb-0310-9956-ffa450edef68
2011-03-06 20:48:10 +00:00
Robert Muir 48dbe35e69 correct minor problems with dates and copyright owners in NOTICE.txts
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078529 13f79535-47bb-0310-9956-ffa450edef68
2011-03-06 18:22:48 +00:00
Robert Muir 28ea4b7561 add xyz-LICENSE.txt for all third party jars
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078512 13f79535-47bb-0310-9956-ffa450edef68
2011-03-06 16:50:22 +00:00
Robert Muir d51068ffd6 LUCENE-2894: apply formatting to more code samples
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076237 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 14:59:02 +00:00
Robert Muir 6600f5acdf LUCENE-2943: tone down test even more
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076223 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 13:56:15 +00:00
Robert Muir 7e5d696d7d LUCENE-2943: tone down test with multiplier a bit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075999 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 19:53:05 +00:00