Robert Muir
f5048293b5
LUCENE-2995: factor out a shared spellchecking module
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1126642 13f79535-47bb-0310-9956-ffa450edef68
2011-05-23 18:33:53 +00:00
Koji Sekiguchi
49fe358760
use prettify in grouping package javadoc
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1126275 13f79535-47bb-0310-9956-ffa450edef68
2011-05-22 23:56:43 +00:00
Shai Erera
fa5da66404
LUCENE-3102: add no-wrap ability to CachingCollector
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1124379 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 18:51:39 +00:00
Michael McCandless
92b00742ee
fix javadoc warnings
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1124298 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 15:17:51 +00:00
Shai Erera
fa23344e40
LUCENE-3102: add factory method to CachingCollector
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104683 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 04:00:48 +00:00
Robert Muir
f1a1844fb9
LUCENE-3113: fix analyzer bugs found by MockTokenizer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104519 13f79535-47bb-0310-9956-ffa450edef68
2011-05-17 20:16:40 +00:00
Michael McCandless
1c464e6dcc
LUCENE-3098: add AllGroupsCollector
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104421 13f79535-47bb-0310-9956-ffa450edef68
2011-05-17 17:20:54 +00:00
Shai Erera
f5fdea8dda
LUCENE-3102: first cut - some refactoring, bug fixes, add test, move to core (trunk)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103872 13f79535-47bb-0310-9956-ffa450edef68
2011-05-16 20:29:10 +00:00
Shai Erera
680bb70e3a
use assertArrayEquals instead of the deprecated version
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103601 13f79535-47bb-0310-9956-ffa450edef68
2011-05-16 03:17:55 +00:00
Steven Rowe
acb3c8be3a
LUCENE-1421: Specialize the dist-maven target for the new grouping module, so that generate-maven-artifacts can succeed
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103227 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 21:24:50 +00:00
Michael McCandless
c7db217230
LUCENE-1421: moves CHANGES entry up to contrib's 3.x CHANGES
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103155 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 16:44:04 +00:00
Michael McCandless
a8993ca817
LUCENE-1421: fixup javadocs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103150 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 16:17:58 +00:00
Robert Muir
4c85f799eb
LUCENE-1421: make class public
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103102 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 13:26:26 +00:00
Robert Muir
2b3e066518
LUCENE-1421: add workaround to unbreak the build
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103048 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 11:25:56 +00:00
Michael McCandless
9083cedb92
LUCENE-1421: factor out shared grouping module
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103024 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 10:48:29 +00:00
Robert Muir
a871b29ed6
LUCENE-3086: add ElisionFilter to ItalianAnalyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1102120 13f79535-47bb-0310-9956-ffa450edef68
2011-05-11 22:43:54 +00:00
Ryan McKinley
96878534a0
LUCENE-3071: Add ReversePathHierarchyTokenizer and enable skip on PathHierarchyTokenizer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1099999 13f79535-47bb-0310-9956-ffa450edef68
2011-05-05 23:30:05 +00:00
Robert Muir
4455345c6e
LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-03 00:29:47 +00:00
Robert Muir
a75e5282c7
collation tests: try to find less jre bugs and just test thread safety
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098532 13f79535-47bb-0310-9956-ffa450edef68
2011-05-02 12:03:14 +00:00
Robert Muir
1f67321074
missing svn:eol-style
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1097216 13f79535-47bb-0310-9956-ffa450edef68
2011-04-27 19:40:18 +00:00
Robert Muir
44ba0859db
LUCENE-2560: stress tests for icu integration
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096339 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 16:07:16 +00:00
Robert Muir
593d7a54ea
LUCENE-3044: ThaiWordFilter uses AttributeSource.copyTo incorrectly
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096334 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 15:45:45 +00:00
Robert Muir
7db98455e7
LUCENE-3043: GermanStemmer threw IOOBE on zero-length tokens
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096194 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:48:17 +00:00
Robert Muir
c0c695053c
LUCENE-2560: remove copy/paste unused import
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096183 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:16:51 +00:00
Robert Muir
68061ef921
LUCENE-2560: add basic stress tests for analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096178 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 16:55:15 +00:00
Robert Muir
c3f6331639
LUCENE-3016: add analyzer for Latvian
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092396 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 17:07:10 +00:00
Robert Muir
ecd795c585
LUCENE-3026: SmartChineseAnalyzer's WordTokenFilter threw NullPointerException on sentences longer than 32,767 characters
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092328 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 15:15:31 +00:00
Robert Muir
52b54262dc
LUCENE-3020: don't reflect mockanalyzer, it has no no-arg ctor anymore
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091159 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 18:15:50 +00:00
Robert Muir
7d07d206b5
LUCENE-3020: better payload testing with mockanalyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091132 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 17:20:31 +00:00
Steven Rowe
c613d642a0
LUCENE-3006: specialized definition of javadoc.classpath is not required for building ICU analysis module's javadocs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088063 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 16:47:24 +00:00
Michael McCandless
f10d92398b
LUCENE-1076: new TieredMergePolicy
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088051 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 15:47:12 +00:00
Steven Rowe
14eb02ffa4
LUCENE-3006: die javadoc warnings die (modules/ edition)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087830 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 17:43:24 +00:00
Robert Muir
d940c24c03
fix benchmark collation test to match reality
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087548 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 01:58:35 +00:00
Robert Muir
74a065a57f
fix collation benchmark to use byte terms
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087529 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 00:47:16 +00:00
Steven Rowe
7402c50058
fix typo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087475 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:58 +00:00
Steven Rowe
085d30ecf3
changes entries for recent commits
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087474 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:21 +00:00
Steven Rowe
1caaea77b2
ReadTokensTask now converts tokens to their indexed forms (char[]->byte[]), just as the indexer does. This allows measurement of the conversion process, which is important for analysis components that customize it, e.g. (ICU)CollationKeyFilter.
...
NB: as a result, benchmarks that incorporate this task will no longer be directly comparable between 3.X and 4.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087471 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:44:20 +00:00
Steven Rowe
9cefe60a4b
Removed special case for looking up KeywordAnalyzer, which is *not* alone among analyzers occupying package o.a.l.analysis.core.
...
Instead, now attempting to instantiate no-package analyzers as core analyzers, then falling back to the previous default package ("org.apache.lucene.analysis.") if that fails. Also, made the same changes in NewShingleAnalyzerTask.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087468 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:34:46 +00:00
Steven Rowe
3bbfa450e4
Updated to the new method for obtaining a top-level deleted docs bitset. Also checking the bitset for null, when there are no deleted docs.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087435 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 21:03:18 +00:00
Steven Rowe
56c2994f66
Added a special case for looking up KeywordAnalyzer, which alone among analyzers occupies package o.a.l.analysis.core.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087431 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 20:16:10 +00:00
Doron Cohen
8d0c1b62af
LUCENE-2977: WriteLineDocTask should write gzip/bzip2/txt according to the extension of specified output file name.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084929 13f79535-47bb-0310-9956-ffa450edef68
2011-03-24 12:22:13 +00:00
Doron Cohen
c6f3dd5cc7
LUCENE-2980: Benchmark's ContentSource made insensitive to letter case of file suffix - fix CHANGES entry.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084549 13f79535-47bb-0310-9956-ffa450edef68
2011-03-23 11:47:18 +00:00
Doron Cohen
d123b8a224
LUCENE-2980: Benchmark's ContentSource made insensitive to letter case of file suffix.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084544 13f79535-47bb-0310-9956-ffa450edef68
2011-03-23 11:38:54 +00:00
Grant Ingersoll
ed20a24d22
LUCENE-2952: restore src/tools and move validation there
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084274 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 18:03:57 +00:00
Doron Cohen
97909a908e
fix mis-spelled assert comment (again)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084273 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 18:03:00 +00:00
Doron Cohen
1029aedcfd
fix mis-spelled assert comment.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084247 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 16:46:53 +00:00
Doron Cohen
bb8e6ae846
LUCENE-2978: Upgrade benchmark's commons-compress from 1.0 to 1.1.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084210 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 15:08:29 +00:00
Doron Cohen
a9fda446c3
LUCENE-2958: WriteLineDocTask improvements - flexible line fields definition - port/merge from 3x.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083816 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 14:59:42 +00:00
Robert Muir
e67bf6b089
LUCENE-2944: fix BytesRef reuse bugs, TermToBytesRefAttribute owns the bytes like other attributes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083784 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 13:52:15 +00:00
Doron Cohen
e45d28a8d3
LUCENE-2964: Allow benchmark tasks from alternative packages - merge/port from 3x.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1083731 13f79535-47bb-0310-9956-ffa450edef68
2011-03-21 11:23:37 +00:00