724 Commits

Author SHA1 Message Date
Robert Muir
f97ac2d0cb LUCENE-3969: add failing test case for MappingCharFilter wrong final offset
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311761 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 14:38:39 +00:00
Robert Muir
8966429dab LUCENE-3969: disable these for now so we can work on the other issues
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311748 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 14:19:09 +00:00
Uwe Schindler
3706fbc5b0 Fix ShingleFilter reuse, some minor changes to testcase for speed and consistency
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311724 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 13:50:03 +00:00
Michael McCandless
a764c0d021 LUCENE-3969: add whitespace to analyzer description
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311667 13f79535-47bb-0310-9956-ffa450edef68
2012-04-10 10:28:24 +00:00
Michael McCandless
3e098abaed LUCENE-3969: ValidatingTokenFilter shouldn't create new atts
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311405 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 20:00:50 +00:00
Michael McCandless
11a65763d0 LUCENE-3969: remove nocommit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311400 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 19:45:16 +00:00
Michael McCandless
ad5c89b1b1 LUCENE-3969: validate after each analysis stage; tenatively add posLen to ShingleFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311373 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 19:05:47 +00:00
Uwe Schindler
f6f8e38cfa LUCENE-3969: Simplify the crazy Reader wrapper
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311358 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 17:53:27 +00:00
Robert Muir
f41576a306 LUCENE-3969: don't get caught by tokenizers that consume in ctor and throw IAE or UOE ever again
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311351 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 17:32:39 +00:00
Robert Muir
2a01acc0e8 LUCENE-3969: don't use scary attsource ctor yet, and always print the analyzer for now
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311339 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 17:21:46 +00:00
Uwe Schindler
79baa1f682 LUCENE-3969: Remove unneeded wildcards
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311331 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 17:08:19 +00:00
Uwe Schindler
eae8e8159d LUCENE-3969: Remove useless success variable
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311322 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 16:56:35 +00:00
Uwe Schindler
bd8bdb08b3 LUCENE-3969: Remove code duplication
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311320 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 16:52:14 +00:00
Michael McCandless
4456273922 LUCENE-3969: fix PatternTokenizer to not consume chars from the input Reader if it throws IAE
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311318 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 16:47:56 +00:00
Michael McCandless
d76a03214c LUCENE-3969: add missing IAE to WikipediaTokenizer ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311294 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 16:00:41 +00:00
Uwe Schindler
102ece7710 LUCENE-3969: More cleanups
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311282 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 15:32:08 +00:00
Uwe Schindler
214ab39f68 LUCENE-3969: Minor cleanups and code consistency
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311278 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 15:15:11 +00:00
Robert Muir
ac393486e0 LUCENE-3969: don't allow negative subword params, Hyphenation relies upon this to filter out what appear to be bogus hyphenation points
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311257 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 14:31:25 +00:00
Robert Muir
24f8a9e627 LUCENE-3969: disable PositionFilter for now
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311241 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 14:16:35 +00:00
Robert Muir
f63af6afe5 LUCENE-3969: don't be this evil yet for type char
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311235 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 13:44:18 +00:00
Robert Muir
6311f71de6 LUCENE-3969: commit current state
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311220 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 13:25:28 +00:00
Robert Muir
27dbcaefdc revert bogus fix (assault against a police officer)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311113 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 22:10:08 +00:00
Robert Muir
00c2246e44 fix generification bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311110 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 21:56:03 +00:00
Michael McCandless
c63f95911a LUCENE-3942: syn filter sets posLen when possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311100 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 20:55:32 +00:00
Michael McCandless
78b4be5dc6 LUCENE-3940: fix Kuromoji to not produce invalid token graph due to UNK with punctuation being decompounded
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311072 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 19:17:17 +00:00
Michael McCandless
72cc92c903 LUCENE-3109: revert
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310998 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 13:38:01 +00:00
Michael McCandless
c9031ca725 LUCENE-3109: rename Fields/Producer/Consumer to InvertedFields*
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310969 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 10:09:19 +00:00
Michael McCandless
755ebafa49 LUCENE-3873: add MockGraphTokenFilter, inserting random graph tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310910 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 23:06:12 +00:00
Uwe Schindler
62890c8089 LUCENE-3919: Remove useless loop
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310898 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:33:13 +00:00
Uwe Schindler
bdaa79206d LUCENE-3919: Die, context class loader, die. Also don't initialize (run static ctors) unrelated classes!
@UweSays: "If you get the context classloader from a thread, in most cases you are doing something wrong because you don't understand how Java classloading works."

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310893 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:27:57 +00:00
Uwe Schindler
7154c5466d LUCENE-3919: Fix generics and additional checks
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310883 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:00:28 +00:00
Robert Muir
ed485b29ec add basic charfilter support to TestRandomChains
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310805 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 17:37:16 +00:00
Robert Muir
fbc8429905 LUCENE-3919: more thorough testing of analysis chains
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310789 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 15:48:02 +00:00
Steven Rowe
bb59b83358 LUCENE-3959: remove all @author tags from Lucene/Solr sources
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310622 13f79535-47bb-0310-9956-ffa450edef68
2012-04-06 22:29:35 +00:00
Robert Muir
abf9e7d2b4 fix busted top-level modules/ validate task
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309581 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 20:19:34 +00:00
Chris M. Hostetter
bb7bc2ff44 LUCENE-3945: use sha1 checksums to verify jars pulled from ivy match expectations
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309503 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 17:53:32 +00:00
Steven Rowe
0a47c9d4d9 nuke obsolete comment
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309393 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 14:04:50 +00:00
Robert Muir
6c7c89c3f9 LUCENE-1866: add exclusion for bocchan test file
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309255 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 05:36:52 +00:00
Robert Muir
2fe2e82584 LUCENE-1866: better RAT reporting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309248 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 05:03:53 +00:00
Robert Muir
e5448e2e20 LUCENE-3947: fix rat-sources task to work with tools/ directories
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309207 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 01:51:56 +00:00
Steven Rowe
c9393b9683 LUCENE-3944: Make the 'generate-maven-artifacts' target use filtered POMs placed under lucene/build/poms/, rather than in each module's base directory. The 'clean' target now removes them.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309141 13f79535-47bb-0310-9956-ffa450edef68
2012-04-03 20:52:02 +00:00
Robert Muir
6b16efdc22 LUCENE-3930: kuromoji steals icu's jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1308423 13f79535-47bb-0310-9956-ffa450edef68
2012-04-02 16:31:59 +00:00
Steven Rowe
0945a6521f IntelliJ configuration: enable IntelliJ .alg file copying to benchmark/build/classes/test/conf/ via a new Ant task 'copy-alg-files-for-testing', which is now also called by the benchmark module's compile-test target. (Standard IntelliJ configuration mechanisms don't support copying to a sub-directory of the test output directory.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1308188 13f79535-47bb-0310-9956-ffa450edef68
2012-04-01 19:46:49 +00:00
Robert Muir
8f0d7cc135 LUCENE-3930: nuke jars from source tree and use ivy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307563 13f79535-47bb-0310-9956-ffa450edef68
2012-03-30 18:04:43 +00:00
Steven Rowe
91fb08086a LUCENE-3937: Removed remaining references to the patched xercesImpl jar; added benchmark/CHANGES.txt entry.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307165 13f79535-47bb-0310-9956-ffa450edef68
2012-03-29 23:27:45 +00:00
Uwe Schindler
4d31bb3a7c LUCENE-3937: Workaround a XERCES-J bug in benchmark module.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307141 13f79535-47bb-0310-9956-ffa450edef68
2012-03-29 22:04:15 +00:00
Ryan McKinley
13745b230d remove compiler warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307095 13f79535-47bb-0310-9956-ffa450edef68
2012-03-29 20:25:02 +00:00
Martijn van Groningen
c19e594ea2 LUCENE-3936: Rename StringIndexDocValues to DocTermsIndexDocValues
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307044 13f79535-47bb-0310-9956-ffa450edef68
2012-03-29 18:50:08 +00:00
Ryan McKinley
49f43806a8 LUCENE-2000: remove redundant casts
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307012 13f79535-47bb-0310-9956-ffa450edef68
2012-03-29 17:34:34 +00:00
Michael McCandless
e49b69d459 tests: get JRE bug workaround working for this test again
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1306931 13f79535-47bb-0310-9956-ffa450edef68
2012-03-29 15:43:03 +00:00