Uwe Schindler
|
7154c5466d
|
LUCENE-3919: Fix generics and additional checks
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310883 13f79535-47bb-0310-9956-ffa450edef68
|
2012-04-07 22:00:28 +00:00 |
Robert Muir
|
ed485b29ec
|
add basic charfilter support to TestRandomChains
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310805 13f79535-47bb-0310-9956-ffa450edef68
|
2012-04-07 17:37:16 +00:00 |
Robert Muir
|
fbc8429905
|
LUCENE-3919: more thorough testing of analysis chains
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310789 13f79535-47bb-0310-9956-ffa450edef68
|
2012-04-07 15:48:02 +00:00 |
Robert Muir
|
2fe2e82584
|
LUCENE-1866: better RAT reporting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309248 13f79535-47bb-0310-9956-ffa450edef68
|
2012-04-04 05:03:53 +00:00 |
Robert Muir
|
8f0d7cc135
|
LUCENE-3930: nuke jars from source tree and use ivy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307563 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-30 18:04:43 +00:00 |
Ryan McKinley
|
49f43806a8
|
LUCENE-2000: remove redundant casts
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307012 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-29 17:34:34 +00:00 |
Michael McCandless
|
e49b69d459
|
tests: get JRE bug workaround working for this test again
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1306931 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-29 15:43:03 +00:00 |
Ryan McKinley
|
05fe168961
|
LUCENE-2000: clone() now returns covariant types where possible.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1306626 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-28 22:22:25 +00:00 |
Robert Muir
|
bca62a44d3
|
LUCENE-3929: add a test demonstrating this works
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1305870 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-27 15:16:42 +00:00 |
Robert Muir
|
ae0f44fcb9
|
remaining eol-style fixes to trunk, native except .sh (LF)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1305492 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-26 18:57:08 +00:00 |
Robert Muir
|
35705cc396
|
LUCENE-3919: fix czechstemmer aioobe on the empty term
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1305177 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-25 23:40:44 +00:00 |
Steven Rowe
|
fb33754168
|
LUCENE-3881: Added UAX29URLEmailAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1304975 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-25 01:20:55 +00:00 |
Steven Rowe
|
ada9780484
|
LUCENE-3913: Fix HTMLStripCharFilter invalid final offset for input containing </br>
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1304912 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-24 20:54:31 +00:00 |
Robert Muir
|
f597b9a1cc
|
LUCENE-3883: Irish Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1304836 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-24 15:59:04 +00:00 |
Michael McCandless
|
7291d38535
|
LUCENE-3905: sometimes run real-ish content (from LineFileDocs) through the analyzers too; fix end() offset bugs in the ngram tokenizers/filters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1304525 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-23 17:39:13 +00:00 |
Robert Muir
|
a6fd306dfb
|
add missing license headers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303738 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-22 11:33:45 +00:00 |
Michael McCandless
|
1a191f4edc
|
LUCENE-3898: reset() was missing some state
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303441 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-21 15:22:28 +00:00 |
Michael McCandless
|
595744089a
|
LUCENE-3896: CharacterUtils.fill must call Reader.read again if it only got a single high surrogate char on the first read
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303374 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-21 12:53:27 +00:00 |
Robert Muir
|
f75d40dad5
|
LUCENE-3894: try toning down for this tokenizer (it builds lots of tokens from the input treated as a path)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303276 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-21 04:30:11 +00:00 |
Robert Muir
|
1156de050f
|
LUCENE-3894: add large docs tests for more tokenizers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303273 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-21 03:59:14 +00:00 |
Robert Muir
|
3d73a3014e
|
LUCENE-3896: beef up TestDuelingAnalyzers for larger documents
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303253 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-21 01:52:22 +00:00 |
Michael McCandless
|
c20242721f
|
LUCENE-3894: some tokenizers weren't reading all input chars
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303193 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-20 23:02:37 +00:00 |
Robert Muir
|
b7a7e5a625
|
LUCENE-3889: remove unnecessary/unused base class
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303026 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-20 17:28:26 +00:00 |
Jan Høydahl
|
5648222e86
|
SOLR-2764: Fix testcase for minimal stemmer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302872 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-20 13:12:39 +00:00 |
Jan Høydahl
|
54d48eb98b
|
SOLR-2764: Create a NorwegianLightStemmer and NorwegianMinimalStemmer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302833 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-20 10:57:50 +00:00 |
Robert Muir
|
790323780f
|
basic javadocs improvements, mostly simple descriptions where the class had nothing before
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302752 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-20 02:09:25 +00:00 |
Robert Muir
|
4a2b1d974a
|
javadocs: add missing package.htmls
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302713 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-19 23:20:25 +00:00 |
Steven Rowe
|
c4f72f61ac
|
LUCENE-3880: UAX29URLEmailTokenizer now recognizes emails when the mailto: scheme is prepended.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302265 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-19 03:13:52 +00:00 |
Robert Muir
|
3d2d144f92
|
LUCENE-3848: don't produce tokenstreams that start with posinc=0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1301478 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-16 13:06:30 +00:00 |
Uwe Schindler
|
3d8b22ffd0
|
LUCENE-3850: Fix rawtypes warnings for Java 7 compiler (#2)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1297162 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-05 18:48:04 +00:00 |
Uwe Schindler
|
989530e17e
|
LUCENE-3850: Fix rawtypes warnings for Java 7 compiler
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1297048 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-05 13:34:40 +00:00 |
Christian Moen
|
430365f7cc
|
Kuromoji now produces both compound words and the segmentation of those words in search mode (LUCENE-3767)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1296805 13f79535-47bb-0310-9956-ffa450edef68
|
2012-03-04 13:34:13 +00:00 |
Dawid Weiss
|
8c2e3cef8f
|
LUCENE-3820: limiting the amount of input for pattern matching to go past exponential time patterns, even if they happen. A nice catch from Mike too -- un-ignore testNastyPattern and look at processing time go wild with each additional input character...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294797 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-28 19:26:05 +00:00 |
Dawid Weiss
|
f3cc65733b
|
Sysout of the randomized pattern.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294518 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-28 08:15:38 +00:00 |
Dawid Weiss
|
4d401ca87d
|
Test thread's name reflects the current seed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294514 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-28 08:04:42 +00:00 |
Dawid Weiss
|
493bd8b42f
|
LUCENE-3820: optimistic limit on running time for the randomized pattern test. This doesn't eliminate the possibility of hitting an exponential time pattern, but I re-run a few times and it seems to be pretty stbale.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294322 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-27 20:50:24 +00:00 |
Dawid Weiss
|
7be5533989
|
LUCENE-3820: Wrong trailing index calculation in PatternReplaceCharFilter.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294141 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-27 13:13:10 +00:00 |
Robert Muir
|
6a07201844
|
don't fail test due to jre bugs in String.toLowerCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1243415 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-13 04:50:12 +00:00 |
Robert Muir
|
590741dcfe
|
LUCENE-3766: Remove Tokenizer's default ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242890 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-10 19:12:35 +00:00 |
Robert Muir
|
8a50cefc6b
|
LUCENE-3748: EnglishPossessiveFilter did not work with a proper right quotation mark
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242740 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-10 11:01:11 +00:00 |
Robert Muir
|
72ae3171be
|
LUCENE-3765: Trappy behavior with StopFilter/ignoreCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242497 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-09 19:59:50 +00:00 |
Robert Muir
|
c0319d5928
|
SOLR-3056: document expectations in these files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241960 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-08 16:27:47 +00:00 |
Robert Muir
|
dac1b58277
|
SOLR-3097, SOLR-3105: add fieldtypes for different languages to the example
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241878 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-08 12:07:52 +00:00 |
Tommaso Teofili
|
6d3bb736f3
|
[LUCENE-3744] - applied patch for whiteList usage in TypeTokenFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1240034 13f79535-47bb-0310-9956-ffa450edef68
|
2012-02-03 09:13:17 +00:00 |
Michael McCandless
|
8e40ea5bf8
|
LUCENE-3742: fix token offset for hangs-off-end output in SynonymFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1238851 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-31 23:01:55 +00:00 |
Uwe Schindler
|
10ba9abeb2
|
Reverse merged revision(s) from lucene/dev/trunk up to 1237502
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237505 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-29 23:19:05 +00:00 |
Michael McCandless
|
d1165b1972
|
LUCENE-3725: add optional packing to FSTs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1237500 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-29 22:48:45 +00:00 |
Robert Muir
|
d7fe56ddae
|
LUCENE-2858: fix analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237312 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-29 15:16:04 +00:00 |
Steven Rowe
|
97d62cc383
|
Fix offset array assertion off-by-one
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1236912 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-27 22:43:48 +00:00 |
Robert Muir
|
6edfe4f157
|
LUCENE-3717: add tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235199 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-24 10:40:46 +00:00 |
Robert Muir
|
35a73d5f55
|
LUCENE-3717: fix broken offsets in ngramtokenizers, and check return value of Reader.read
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235187 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-24 09:50:21 +00:00 |
Robert Muir
|
7fafdd3576
|
LUCENE-3717: add checkRandomData to more analyzers and fix more offsets bugs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234850 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-23 15:19:58 +00:00 |
Steven Rowe
|
059410d424
|
LUCENE-3690: fix handling of unpaired numeric character entity UTF-16 surrogates to output U+FFFD REPLACEMENT CHARACTER; and add handling of properly paired numeric character entity UTF-16 surrogates, to output the corresponding pair of code units.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234687 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-23 07:36:38 +00:00 |
Robert Muir
|
c754c1c9c8
|
LUCENE-3717: add better offsets testing to BaseTokenStreamTestCase, fix offsets bugs in ThaiWordFilter and ICUTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234652 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-23 00:08:52 +00:00 |
Robert Muir
|
a7cfee6b07
|
SOLR-2891: fix CompoundWordTokenFilter to not create invalid offsets when the length of the text was changed by a previous filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234546 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-22 16:41:06 +00:00 |
Steven Rowe
|
f3a363708f
|
LUCENE-3690: Re-implemented HTMLStripCharFilter as a JFlex-generated scanner. Fixes LUCENE-2208, SOLR-882, and SOLR-42.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234452 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-22 05:20:46 +00:00 |
Uwe Schindler
|
af9b4d816f
|
LUCENE-3671: Add TypeTokenFilter that filters tokens based on their TypeAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234396 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-21 19:02:44 +00:00 |
Michael McCandless
|
87bc4521c2
|
LUCENE-3695: move some confusing FST sugar out
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231795 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-15 23:25:38 +00:00 |
Michael McCandless
|
11f33ee521
|
LUCENE-3684: add offsets to postings APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231794 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-15 23:17:45 +00:00 |
Yonik Seeley
|
b2a0040e98
|
tests: silliness
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231526 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-14 16:59:36 +00:00 |
Michael McCandless
|
5ca66287ea
|
woops
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231513 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-14 15:20:58 +00:00 |
Michael McCandless
|
d584f6361d
|
LUCENE-3685: add ToChildBlockJoinQuery, to join from parent to child
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231512 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-14 15:17:04 +00:00 |
Robert Muir
|
8b8c2b4dee
|
LUCENE-3690: since this filter handles all kinds of bad partial and wierd input, this should be fine to enable
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231272 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-13 19:46:30 +00:00 |
Robert Muir
|
cd372bdc83
|
LUCENE-3305: add Kuromoji Japanese morphological analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230748 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-12 20:10:48 +00:00 |
Simon Willnauer
|
3b8458f6de
|
use TEST_VERSION_CURRENT instead of 4_0 in test
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229523 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-10 12:46:38 +00:00 |
Simon Willnauer
|
f19317d318
|
SOLR-3020: Add KeywordAttribute support to HunspellStemFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229519 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-10 12:33:29 +00:00 |
Michael McCandless
|
cdb2ee8a7b
|
LUCENE-3679: replace IR.getFieldNames with IR.getFieldInfos
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229401 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-09 22:29:40 +00:00 |
Michael McCandless
|
defd51a11b
|
fix syn test bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1228704 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-07 19:28:07 +00:00 |
Michael McCandless
|
ed9f0fd5ef
|
LUCENE-3668: if there's only 1 output for a synonym rule then set start/endOffset to match the full span of the input tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1228650 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-07 16:26:15 +00:00 |
Steven Rowe
|
76d1662cb7
|
- Added license declaration
- Removed unused 'length' param to combine()
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225615 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-29 18:53:10 +00:00 |
Robert Muir
|
b2970db4bc
|
LUCENE-2906: filter to process output of Standard/ICUTokenizer and create overlapping bigrams for CJK
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225433 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-29 05:04:49 +00:00 |
Robert Muir
|
a55f511a77
|
LUCENE-3650: move o.a.l.index.codecs to o.a.l.codecs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1215245 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-16 19:03:12 +00:00 |
Robert Muir
|
18febd69e4
|
LUCENE-2208: improve charfilter offset testing
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1215038 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-16 04:37:47 +00:00 |
Steven Rowe
|
60929a5adb
|
LUCENE-3645: Remove unnecessary array wrapping when calling varargs methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1214413 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-14 19:15:47 +00:00 |
Robert Muir
|
7dc025bdce
|
LUCENE-3642: fix invalid offsets from CharTokenizer, [Edge]NGramFilters, SmartChinese, add sanity check to BaseTokenStreamTestCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1213329 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-12 17:28:09 +00:00 |
Robert Muir
|
3899e18ca3
|
LUCENE-3640: Remove IndexSearcher.close
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1213117 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-12 00:21:40 +00:00 |
Uwe Schindler
|
905a0f211c
|
LUCENE-3606: Make IndexReader really read-only
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1212292 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-09 09:13:39 +00:00 |
Robert Muir
|
9b15b1d3b0
|
consolidate assumes in ThaiAnalyzer test so we don't miss it for individual tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1212141 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-08 21:47:12 +00:00 |
Robert Muir
|
3843ac5b8b
|
LUCENE-3606: fix more tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3606@1210308 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-05 01:59:11 +00:00 |
Michael McCandless
|
961b820e53
|
LUCENE-2929: specify up front if you need freqs from DocsEnum
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1210176 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-04 18:50:58 +00:00 |
Chris M. Hostetter
|
3ed5106920
|
SOLR-2819: Improved speed of parsing hex entities in HTMLStripCharFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1208032 13f79535-47bb-0310-9956-ffa450edef68
|
2011-11-29 19:15:54 +00:00 |
Robert Muir
|
7f766cf603
|
LUCENE-3590: nuke BytesRef.utf8ToChars
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206174 13f79535-47bb-0310-9956-ffa450edef68
|
2011-11-25 13:55:41 +00:00 |
Robert Muir
|
3b6da22aa7
|
LUCENE-3590: clearly mark bogus deep-copying apis in BytesRef
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206143 13f79535-47bb-0310-9956-ffa450edef68
|
2011-11-25 12:50:13 +00:00 |
Robert Muir
|
873f199924
|
LUCENE-2621: move TermVectors,FieldInfos,SegmentInfos to codec
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1202842 13f79535-47bb-0310-9956-ffa450edef68
|
2011-11-16 19:09:35 +00:00 |
Robert Muir
|
598920d7bd
|
LUCENE-3571: nuke IndexSearcher(Directory)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1202657 13f79535-47bb-0310-9956-ffa450edef68
|
2011-11-16 12:19:41 +00:00 |
Simon Willnauer
|
ee293e7e7d
|
fix javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1200111 13f79535-47bb-0310-9956-ffa450edef68
|
2011-11-10 03:32:33 +00:00 |
Simon Willnauer
|
dc6b4b6533
|
LUCENE-2564: Cut over WordListLoader to CharArrayMap/Set and use CharSetDecoder to detect encoding problems early
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1200080 13f79535-47bb-0310-9956-ffa450edef68
|
2011-11-10 01:21:25 +00:00 |
Robert Muir
|
fa6500fa6c
|
LUCENE-3490: restructure codec hierarchy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1197603 13f79535-47bb-0310-9956-ffa450edef68
|
2011-11-04 15:43:35 +00:00 |
Uwe Schindler
|
a91efbedd1
|
LUCENE-3530: Remove deprecated methods in CompoundTokenFilters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1188613 13f79535-47bb-0310-9956-ffa450edef68
|
2011-10-25 11:31:16 +00:00 |
Uwe Schindler
|
ec186e7280
|
LUCENE-3508: Decompounders based on CompoundWordTokenFilterBase can now be used with custom attributes. All those attributes are preserved and set on all added decompounded tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1188597 13f79535-47bb-0310-9956-ffa450edef68
|
2011-10-25 10:44:36 +00:00 |
Robert Muir
|
f21ac2f58c
|
LUCENE-3301: add workaround for jre breakiterator bugs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1187900 13f79535-47bb-0310-9956-ffa450edef68
|
2011-10-23 14:55:25 +00:00 |
Robert Muir
|
9ba4ce2ed5
|
javadocs fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1182505 13f79535-47bb-0310-9956-ffa450edef68
|
2011-10-12 18:20:41 +00:00 |
Jan Høydahl
|
22dcd39d9e
|
SOLR-2792: Allow case insensitive Hunspell stemming
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1179459 13f79535-47bb-0310-9956-ffa450edef68
|
2011-10-05 22:08:55 +00:00 |
Michael McCandless
|
ec2b654231
|
LUCENE-3477: add explicit breaks in jflex sources so we don't hit compiler warnings; fix a couple other warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1177723 13f79535-47bb-0310-9956-ffa450edef68
|
2011-09-30 16:23:24 +00:00 |
Christopher John Male
|
8d28270460
|
LUCENE-3470: Changed Field constructor signatures order to value, fieldtype
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1176773 13f79535-47bb-0310-9956-ffa450edef68
|
2011-09-28 08:07:16 +00:00 |
Christopher John Male
|
67c13bd2fe
|
LUCENE-3455: Renamed Analyzer.reusableTokenStream to Analyzer.tokenStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1176728 13f79535-47bb-0310-9956-ffa450edef68
|
2011-09-28 05:26:54 +00:00 |
Christopher John Male
|
0bed3142bb
|
LUCENE-3455: Test Analysis consumers now use reusableTokenStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1175670 13f79535-47bb-0310-9956-ffa450edef68
|
2011-09-26 04:58:48 +00:00 |
Christopher John Male
|
4ff0b2f82c
|
LUCENE-3396: Collapsing Analyzer and ReusableAnalyzerBase together, mandating use of TokenStreamComponents
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1175297 13f79535-47bb-0310-9956-ffa450edef68
|
2011-09-25 05:10:25 +00:00 |
Christopher John Male
|
318911200d
|
LUCENE-3434: Removed state changing setters in ShingleAnalyzerWrapper and PerFieldAnalyzerWrapper
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1170942 13f79535-47bb-0310-9956-ffa450edef68
|
2011-09-15 03:21:17 +00:00 |
Christopher John Male
|
94028fe11a
|
LUCENE-3431: Removed deprecated addStopwords methods in QueryAutoStopWordAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1170424 13f79535-47bb-0310-9956-ffa450edef68
|
2011-09-14 03:33:50 +00:00 |