Robert Muir
6563a58a2a
LUCENE-3969: add new random test for MappingCharFilter (sometimes fails, due to same final offset bug)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311765 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 14:49:36 +00:00
Robert Muir
f97ac2d0cb
LUCENE-3969: add failing test case for MappingCharFilter wrong final offset
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311761 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 14:38:39 +00:00
Robert Muir
8966429dab
LUCENE-3969: disable these for now so we can work on the other issues
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311748 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 14:19:09 +00:00
Robert Muir
d4b5405533
LUCENE-3969: check offsets even if posIncrAtt doesnt yet exist (and vice versa), and check that offsets are non-negative
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311734 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 14:09:15 +00:00
Uwe Schindler
3706fbc5b0
Fix ShingleFilter reuse, some minor changes to testcase for speed and consistency
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311724 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 13:50:03 +00:00
Michael McCandless
a764c0d021
LUCENE-3969: add whitespace to analyzer description
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311667 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 10:28:24 +00:00
Michael McCandless
9e98ec186c
LUCENE-3969: check that startOffset <= endOffset; comment out sops
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311406 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 20:04:55 +00:00
Michael McCandless
3e098abaed
LUCENE-3969: ValidatingTokenFilter shouldn't create new atts
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311405 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 20:00:50 +00:00
Michael McCandless
11a65763d0
LUCENE-3969: remove nocommit
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311400 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 19:45:16 +00:00
Michael McCandless
ad5c89b1b1
LUCENE-3969: validate after each analysis stage; tenatively add posLen to ShingleFilter
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311373 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 19:05:47 +00:00
Uwe Schindler
f6f8e38cfa
LUCENE-3969: Simplify the crazy Reader wrapper
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311358 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 17:53:27 +00:00
Robert Muir
f41576a306
LUCENE-3969: don't get caught by tokenizers that consume in ctor and throw IAE or UOE ever again
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311351 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 17:32:39 +00:00
Robert Muir
2a01acc0e8
LUCENE-3969: don't use scary attsource ctor yet, and always print the analyzer for now
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311339 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 17:21:46 +00:00
Uwe Schindler
79baa1f682
LUCENE-3969: Remove unneeded wildcards
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311331 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 17:08:19 +00:00
Uwe Schindler
eae8e8159d
LUCENE-3969: Remove useless success variable
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311322 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 16:56:35 +00:00
Uwe Schindler
bd8bdb08b3
LUCENE-3969: Remove code duplication
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311320 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 16:52:14 +00:00
Michael McCandless
4456273922
LUCENE-3969: fix PatternTokenizer to not consume chars from the input Reader if it throws IAE
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311318 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 16:47:56 +00:00
Michael McCandless
d76a03214c
LUCENE-3969: add missing IAE to WikipediaTokenizer ctor
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311294 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 16:00:41 +00:00
Uwe Schindler
102ece7710
LUCENE-3969: More cleanups
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311282 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 15:32:08 +00:00
Uwe Schindler
214ab39f68
LUCENE-3969: Minor cleanups and code consistency
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311278 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 15:15:11 +00:00
Robert Muir
ac393486e0
LUCENE-3969: don't allow negative subword params, Hyphenation relies upon this to filter out what appear to be bogus hyphenation points
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311257 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 14:31:25 +00:00
Robert Muir
24f8a9e627
LUCENE-3969: disable PositionFilter for now
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311241 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 14:16:35 +00:00
Robert Muir
f63af6afe5
LUCENE-3969: don't be this evil yet for type char
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311235 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 13:44:18 +00:00
Robert Muir
6311f71de6
LUCENE-3969: commit current state
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311220 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 13:25:28 +00:00
Robert Muir
d55447b25e
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311219 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 13:24:23 +00:00
Robert Muir
4702d72570
fix eclipse classpath
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311203 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 11:45:42 +00:00
Jan Høydahl
ff5206e64f
SOLR-3254: Upgrade Solr to Tika 1.1
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311198 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 10:48:03 +00:00
Jan Høydahl
aed333d367
SOLR-1979: Updated README and CHANGES in trunk
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311116 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 23:05:52 +00:00
Robert Muir
27dbcaefdc
revert bogus fix (assault against a police officer)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311113 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 22:10:08 +00:00
Robert Muir
00c2246e44
fix generification bug
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311110 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 21:56:03 +00:00
Michael McCandless
c63f95911a
LUCENE-3942: syn filter sets posLen when possible
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311100 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 20:55:32 +00:00
Michael McCandless
b5103519f0
add TODOs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311097 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 20:44:58 +00:00
Michael McCandless
78b4be5dc6
LUCENE-3940: fix Kuromoji to not produce invalid token graph due to UNK with punctuation being decompounded
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311072 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 19:17:17 +00:00
Michael McCandless
879e825083
LUCENE-3966: smoke tester can test from local (file://) urls, handle url-shortened base URLs, and build/stage/test unsigned artifacts
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311067 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 18:22:14 +00:00
Steven Rowe
6350ed1b8a
LUCENE-3963: Enable smokeTestRelease.py usage under Windows+Cygwin
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311055 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 17:15:54 +00:00
Michael McCandless
c567961eca
LUCENE-3968: factor out LookaheadTokenFilter from MockGraphTokenFilter
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311043 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 16:56:54 +00:00
Michael McCandless
72cc92c903
LUCENE-3109: revert
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310998 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 13:38:01 +00:00
Michael McCandless
c9031ca725
LUCENE-3109: rename Fields/Producer/Consumer to InvertedFields*
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310969 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 10:09:19 +00:00
Steven Rowe
13b75650e9
LUCENE-3964: fix svn:eol-style
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310950 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 04:10:49 +00:00
Steven Rowe
0c09bfcba4
LUCENE-3964: Enable Maven artifact staging
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310949 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 04:07:42 +00:00
Michael McCandless
755ebafa49
LUCENE-3873: add MockGraphTokenFilter, inserting random graph tokens
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310910 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 23:06:12 +00:00
Uwe Schindler
62890c8089
LUCENE-3919: Remove useless loop
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310898 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:33:13 +00:00
Uwe Schindler
bdaa79206d
LUCENE-3919: Die, context class loader, die. Also don't initialize (run static ctors) unrelated classes!
...
@UweSays: "If you get the context classloader from a thread, in most cases you are doing something wrong because you don't understand how Java classloading works."
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310893 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:27:57 +00:00
Uwe Schindler
7154c5466d
LUCENE-3919: Fix generics and additional checks
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310883 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:00:28 +00:00
Robert Muir
61ee8ef3ec
SOLR-3340: don't use number format of the default locale
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310812 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 18:12:28 +00:00
Robert Muir
4b638a2903
LUCENE-3919: add convenience ctors to MockTokenizer,MockCharFilter, fix MockCharFilter to chain its correctOffset
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310811 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 18:07:56 +00:00
Robert Muir
ed485b29ec
add basic charfilter support to TestRandomChains
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310805 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 17:37:16 +00:00
Stanisław Osiński
55454dd616
SOLR-3338: ignore exceptions from getConfigDir(), it's used only for clearer logs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310800 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 16:50:49 +00:00
Robert Muir
44e4290b4c
LUCENE-3958: javadocs corrections for IndexWriter
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310799 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 16:47:39 +00:00
Robert Muir
fbc8429905
LUCENE-3919: more thorough testing of analysis chains
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310789 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 15:48:02 +00:00