Robert Muir
4455345c6e
LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-03 00:29:47 +00:00
Robert Muir
a75e5282c7
collation tests: try to find less jre bugs and just test thread safety
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098532 13f79535-47bb-0310-9956-ffa450edef68
2011-05-02 12:03:14 +00:00
Robert Muir
1f67321074
missing svn:eol-style
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1097216 13f79535-47bb-0310-9956-ffa450edef68
2011-04-27 19:40:18 +00:00
Robert Muir
44ba0859db
LUCENE-2560: stress tests for icu integration
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096339 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 16:07:16 +00:00
Robert Muir
593d7a54ea
LUCENE-3044: ThaiWordFilter uses AttributeSource.copyTo incorrectly
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096334 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 15:45:45 +00:00
Robert Muir
7db98455e7
LUCENE-3043: GermanStemmer threw IOOBE on zero-length tokens
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096194 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:48:17 +00:00
Robert Muir
c0c695053c
LUCENE-2560: remove copy/paste unused import
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096183 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:16:51 +00:00
Robert Muir
68061ef921
LUCENE-2560: add basic stress tests for analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096178 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 16:55:15 +00:00
Robert Muir
c3f6331639
LUCENE-3016: add analyzer for Latvian
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092396 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 17:07:10 +00:00
Robert Muir
ecd795c585
LUCENE-3026: SmartChineseAnalyzer's WordTokenFilter threw NullPointerException on sentences longer than 32,767 characters
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092328 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 15:15:31 +00:00
Robert Muir
52b54262dc
LUCENE-3020: don't reflect mockanalyzer, it has no no-arg ctor anymore
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091159 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 18:15:50 +00:00
Robert Muir
7d07d206b5
LUCENE-3020: better payload testing with mockanalyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091132 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 17:20:31 +00:00
Steven Rowe
c613d642a0
LUCENE-3006: specialized definition of javadoc.classpath is not required for building ICU analysis module's javadocs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088063 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 16:47:24 +00:00
Michael McCandless
f10d92398b
LUCENE-1076: new TieredMergePolicy
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088051 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 15:47:12 +00:00
Steven Rowe
14eb02ffa4
LUCENE-3006: die javadoc warnings die (modules/ edition)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087830 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 17:43:24 +00:00
Robert Muir
d940c24c03
fix benchmark collation test to match reality
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087548 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 01:58:35 +00:00
Robert Muir
74a065a57f
fix collation benchmark to use byte terms
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087529 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 00:47:16 +00:00
Steven Rowe
7402c50058
fix typo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087475 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:58 +00:00
Steven Rowe
085d30ecf3
changes entries for recent commits
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087474 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:21 +00:00
Steven Rowe
1caaea77b2
ReadTokensTask now converts tokens to their indexed forms (char[]->byte[]), just as the indexer does. This allows measurement of the conversion process, which is important for analysis components that customize it, e.g. (ICU)CollationKeyFilter.
...
NB: as a result, benchmarks that incorporate this task will no longer be directly comparable between 3.X and 4.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087471 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:44:20 +00:00
Steven Rowe
9cefe60a4b
Removed special case for looking up KeywordAnalyzer, which is *not* alone among analyzers occupying package o.a.l.analysis.core.
...
Instead, now attempting to instantiate no-package analyzers as core analyzers, then falling back to the previous default package ("org.apache.lucene.analysis.") if that fails. Also, made the same changes in NewShingleAnalyzerTask.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087468 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:34:46 +00:00
Steven Rowe
3bbfa450e4
Updated to the new method for obtaining a top-level deleted docs bitset. Also checking the bitset for null, when there are no deleted docs.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087435 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 21:03:18 +00:00
Steven Rowe
56c2994f66
Added a special case for looking up KeywordAnalyzer, which alone among analyzers occupies package o.a.l.analysis.core.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087431 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 20:16:10 +00:00
Doron Cohen
8d0c1b62af
LUCENE-2977: WriteLineDocTask should write gzip/bzip2/txt according to the extension of specified output file name.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084929 13f79535-47bb-0310-9956-ffa450edef68
2011-03-24 12:22:13 +00:00
Doron Cohen
c6f3dd5cc7
LUCENE-2980: Benchmark's ContentSource made insensitive to letter case of file suffix - fix CHANGES entry.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084549 13f79535-47bb-0310-9956-ffa450edef68
2011-03-23 11:47:18 +00:00
Doron Cohen
d123b8a224
LUCENE-2980: Benchmark's ContentSource made insensitive to letter case of file suffix.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084544 13f79535-47bb-0310-9956-ffa450edef68
2011-03-23 11:38:54 +00:00
Grant Ingersoll
ed20a24d22
LUCENE-2952: restore src/tools and move validation there
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084274 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 18:03:57 +00:00
Doron Cohen
97909a908e
fix mis-spelled assert comment (again)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084273 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 18:03:00 +00:00
Doron Cohen
1029aedcfd
fix mis-spelled assert comment.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084247 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 16:46:53 +00:00
Doron Cohen
bb8e6ae846
LUCENE-2978: Upgrade benchmark's commons-compress from 1.0 to 1.1.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084210 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 15:08:29 +00:00
Doron Cohen
a9fda446c3
LUCENE-2958: WriteLineDocTask improvements - flexible line fields definition - port/merge from 3x.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083816 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 14:59:42 +00:00
Robert Muir
e67bf6b089
LUCENE-2944: fix BytesRef reuse bugs, TermToBytesRefAttribute owns the bytes like other attributes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083784 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 13:52:15 +00:00
Doron Cohen
e45d28a8d3
LUCENE-2964: Allow benchmark tasks from alternative packages - merge/port from 3x.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083731 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 11:23:37 +00:00
Doron Cohen
6d47d7377d
LUCENE-2963: Easier way to run benchmark, by calling Benmchmark.exec(alg-file) - port from 3x.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083557 13f79535-47bb-0310-9956-ffa450edef68
2011-03-20 20:12:39 +00:00
Grant Ingersoll
8bee953057
LUCENE-2952: drop dev-tools dependency, move to test framework, split out checking to each area: lucene, modules, solr
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083010 13f79535-47bb-0310-9956-ffa450edef68
2011-03-18 18:40:02 +00:00
Grant Ingersoll
f36c32405d
LUCENE-2952: hook in dependency checking for license, notice
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1082535 13f79535-47bb-0310-9956-ffa450edef68
2011-03-17 15:34:21 +00:00
Grant Ingersoll
746d0ef5a0
LUCENE-2952: add notices
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1082516 13f79535-47bb-0310-9956-ffa450edef68
2011-03-17 15:00:51 +00:00
Grant Ingersoll
372fa574f9
remove unneeded license, notice
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1082512 13f79535-47bb-0310-9956-ffa450edef68
2011-03-17 14:55:34 +00:00
Grant Ingersoll
9352885d1b
LUCENE-2952: normalize license files
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1081792 13f79535-47bb-0310-9956-ffa450edef68
2011-03-15 14:07:16 +00:00
Steven Rowe
7180bb3cb9
LUCENE-2957: generate-maven-artifacts target should include all non-Mavenized Lucene & Solr dependencies
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1080443 13f79535-47bb-0310-9956-ffa450edef68
2011-03-11 04:32:14 +00:00
Steven Rowe
77371e0433
Obsolete - replaced by apache-extras luceneutil
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1080335 13f79535-47bb-0310-9956-ffa450edef68
2011-03-10 20:23:02 +00:00
Steven Rowe
3fcf6d6525
LUCENE-2961: Remove benchmark/lib/xml-apis-2.9.0.jar - JVM 1.5+ contains these JAXP 1.3 interface classes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1080258 13f79535-47bb-0310-9956-ffa450edef68
2011-03-10 15:57:52 +00:00
Uwe Schindler
bdaa02c3c0
LUCENE-2953: PriorityQueue's internal heap was made private final
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1079707 13f79535-47bb-0310-9956-ffa450edef68
2011-03-09 09:18:56 +00:00
Robert Muir
52fbd34849
clear java 1.5-only javadocs warnings
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078639 13f79535-47bb-0310-9956-ffa450edef68
2011-03-07 00:55:32 +00:00
Robert Muir
b2fcee9822
add missing LICENSE/NOTICE to benchmarks module
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078565 13f79535-47bb-0310-9956-ffa450edef68
2011-03-06 20:48:10 +00:00
Robert Muir
48dbe35e69
correct minor problems with dates and copyright owners in NOTICE.txts
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078529 13f79535-47bb-0310-9956-ffa450edef68
2011-03-06 18:22:48 +00:00
Robert Muir
28ea4b7561
add xyz-LICENSE.txt for all third party jars
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1078512 13f79535-47bb-0310-9956-ffa450edef68
2011-03-06 16:50:22 +00:00
Robert Muir
d51068ffd6
LUCENE-2894: apply formatting to more code samples
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076237 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 14:59:02 +00:00
Robert Muir
6600f5acdf
LUCENE-2943: tone down test even more
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1076223 13f79535-47bb-0310-9956-ffa450edef68
2011-03-02 13:56:15 +00:00
Robert Muir
7e5d696d7d
LUCENE-2943: tone down test with multiplier a bit
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1075999 13f79535-47bb-0310-9956-ffa450edef68
2011-03-01 19:53:05 +00:00