Commit Graph

237 Commits

Author SHA1 Message Date
Uwe Schindler 102ece7710 LUCENE-3969: More cleanups
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311282 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 15:32:08 +00:00
Uwe Schindler 214ab39f68 LUCENE-3969: Minor cleanups and code consistency
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311278 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 15:15:11 +00:00
Robert Muir ac393486e0 LUCENE-3969: don't allow negative subword params, Hyphenation relies upon this to filter out what appear to be bogus hyphenation points
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311257 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 14:31:25 +00:00
Robert Muir 24f8a9e627 LUCENE-3969: disable PositionFilter for now
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311241 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 14:16:35 +00:00
Robert Muir f63af6afe5 LUCENE-3969: don't be this evil yet for type char
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311235 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 13:44:18 +00:00
Robert Muir 6311f71de6 LUCENE-3969: commit current state
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3969@1311220 13f79535-47bb-0310-9956-ffa450edef68
2012-04-09 13:25:28 +00:00
Robert Muir 27dbcaefdc revert bogus fix (assault against a police officer)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311113 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 22:10:08 +00:00
Robert Muir 00c2246e44 fix generification bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311110 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 21:56:03 +00:00
Michael McCandless c63f95911a LUCENE-3942: syn filter sets posLen when possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1311100 13f79535-47bb-0310-9956-ffa450edef68
2012-04-08 20:55:32 +00:00
Michael McCandless 755ebafa49 LUCENE-3873: add MockGraphTokenFilter, inserting random graph tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310910 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 23:06:12 +00:00
Uwe Schindler 62890c8089 LUCENE-3919: Remove useless loop
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310898 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:33:13 +00:00
Uwe Schindler bdaa79206d LUCENE-3919: Die, context class loader, die. Also don't initialize (run static ctors) unrelated classes!
@UweSays: "If you get the context classloader from a thread, in most cases you are doing something wrong because you don't understand how Java classloading works."

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310893 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:27:57 +00:00
Uwe Schindler 7154c5466d LUCENE-3919: Fix generics and additional checks
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310883 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 22:00:28 +00:00
Robert Muir ed485b29ec add basic charfilter support to TestRandomChains
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310805 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 17:37:16 +00:00
Robert Muir fbc8429905 LUCENE-3919: more thorough testing of analysis chains
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1310789 13f79535-47bb-0310-9956-ffa450edef68
2012-04-07 15:48:02 +00:00
Robert Muir 2fe2e82584 LUCENE-1866: better RAT reporting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1309248 13f79535-47bb-0310-9956-ffa450edef68
2012-04-04 05:03:53 +00:00
Robert Muir 8f0d7cc135 LUCENE-3930: nuke jars from source tree and use ivy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307563 13f79535-47bb-0310-9956-ffa450edef68
2012-03-30 18:04:43 +00:00
Ryan McKinley 49f43806a8 LUCENE-2000: remove redundant casts
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1307012 13f79535-47bb-0310-9956-ffa450edef68
2012-03-29 17:34:34 +00:00
Michael McCandless e49b69d459 tests: get JRE bug workaround working for this test again
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1306931 13f79535-47bb-0310-9956-ffa450edef68
2012-03-29 15:43:03 +00:00
Ryan McKinley 05fe168961 LUCENE-2000: clone() now returns covariant types where possible.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1306626 13f79535-47bb-0310-9956-ffa450edef68
2012-03-28 22:22:25 +00:00
Robert Muir bca62a44d3 LUCENE-3929: add a test demonstrating this works
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1305870 13f79535-47bb-0310-9956-ffa450edef68
2012-03-27 15:16:42 +00:00
Robert Muir ae0f44fcb9 remaining eol-style fixes to trunk, native except .sh (LF)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1305492 13f79535-47bb-0310-9956-ffa450edef68
2012-03-26 18:57:08 +00:00
Robert Muir 35705cc396 LUCENE-3919: fix czechstemmer aioobe on the empty term
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1305177 13f79535-47bb-0310-9956-ffa450edef68
2012-03-25 23:40:44 +00:00
Steven Rowe fb33754168 LUCENE-3881: Added UAX29URLEmailAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1304975 13f79535-47bb-0310-9956-ffa450edef68
2012-03-25 01:20:55 +00:00
Steven Rowe ada9780484 LUCENE-3913: Fix HTMLStripCharFilter invalid final offset for input containing </br>
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1304912 13f79535-47bb-0310-9956-ffa450edef68
2012-03-24 20:54:31 +00:00
Robert Muir f597b9a1cc LUCENE-3883: Irish Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1304836 13f79535-47bb-0310-9956-ffa450edef68
2012-03-24 15:59:04 +00:00
Michael McCandless 7291d38535 LUCENE-3905: sometimes run real-ish content (from LineFileDocs) through the analyzers too; fix end() offset bugs in the ngram tokenizers/filters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1304525 13f79535-47bb-0310-9956-ffa450edef68
2012-03-23 17:39:13 +00:00
Robert Muir a6fd306dfb add missing license headers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303738 13f79535-47bb-0310-9956-ffa450edef68
2012-03-22 11:33:45 +00:00
Michael McCandless 1a191f4edc LUCENE-3898: reset() was missing some state
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303441 13f79535-47bb-0310-9956-ffa450edef68
2012-03-21 15:22:28 +00:00
Michael McCandless 595744089a LUCENE-3896: CharacterUtils.fill must call Reader.read again if it only got a single high surrogate char on the first read
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303374 13f79535-47bb-0310-9956-ffa450edef68
2012-03-21 12:53:27 +00:00
Robert Muir f75d40dad5 LUCENE-3894: try toning down for this tokenizer (it builds lots of tokens from the input treated as a path)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303276 13f79535-47bb-0310-9956-ffa450edef68
2012-03-21 04:30:11 +00:00
Robert Muir 1156de050f LUCENE-3894: add large docs tests for more tokenizers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303273 13f79535-47bb-0310-9956-ffa450edef68
2012-03-21 03:59:14 +00:00
Robert Muir 3d73a3014e LUCENE-3896: beef up TestDuelingAnalyzers for larger documents
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303253 13f79535-47bb-0310-9956-ffa450edef68
2012-03-21 01:52:22 +00:00
Michael McCandless c20242721f LUCENE-3894: some tokenizers weren't reading all input chars
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303193 13f79535-47bb-0310-9956-ffa450edef68
2012-03-20 23:02:37 +00:00
Robert Muir b7a7e5a625 LUCENE-3889: remove unnecessary/unused base class
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1303026 13f79535-47bb-0310-9956-ffa450edef68
2012-03-20 17:28:26 +00:00
Jan Høydahl 5648222e86 SOLR-2764: Fix testcase for minimal stemmer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302872 13f79535-47bb-0310-9956-ffa450edef68
2012-03-20 13:12:39 +00:00
Jan Høydahl 54d48eb98b SOLR-2764: Create a NorwegianLightStemmer and NorwegianMinimalStemmer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302833 13f79535-47bb-0310-9956-ffa450edef68
2012-03-20 10:57:50 +00:00
Robert Muir 790323780f basic javadocs improvements, mostly simple descriptions where the class had nothing before
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302752 13f79535-47bb-0310-9956-ffa450edef68
2012-03-20 02:09:25 +00:00
Robert Muir 4a2b1d974a javadocs: add missing package.htmls
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302713 13f79535-47bb-0310-9956-ffa450edef68
2012-03-19 23:20:25 +00:00
Steven Rowe c4f72f61ac LUCENE-3880: UAX29URLEmailTokenizer now recognizes emails when the mailto: scheme is prepended.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1302265 13f79535-47bb-0310-9956-ffa450edef68
2012-03-19 03:13:52 +00:00
Robert Muir 3d2d144f92 LUCENE-3848: don't produce tokenstreams that start with posinc=0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1301478 13f79535-47bb-0310-9956-ffa450edef68
2012-03-16 13:06:30 +00:00
Uwe Schindler 3d8b22ffd0 LUCENE-3850: Fix rawtypes warnings for Java 7 compiler (#2)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1297162 13f79535-47bb-0310-9956-ffa450edef68
2012-03-05 18:48:04 +00:00
Uwe Schindler 989530e17e LUCENE-3850: Fix rawtypes warnings for Java 7 compiler
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1297048 13f79535-47bb-0310-9956-ffa450edef68
2012-03-05 13:34:40 +00:00
Christian Moen 430365f7cc Kuromoji now produces both compound words and the segmentation of those words in search mode (LUCENE-3767)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1296805 13f79535-47bb-0310-9956-ffa450edef68
2012-03-04 13:34:13 +00:00
Dawid Weiss 8c2e3cef8f LUCENE-3820: limiting the amount of input for pattern matching to go past exponential time patterns, even if they happen. A nice catch from Mike too -- un-ignore testNastyPattern and look at processing time go wild with each additional input character...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294797 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 19:26:05 +00:00
Dawid Weiss f3cc65733b Sysout of the randomized pattern.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294518 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 08:15:38 +00:00
Dawid Weiss 4d401ca87d Test thread's name reflects the current seed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294514 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 08:04:42 +00:00
Dawid Weiss 493bd8b42f LUCENE-3820: optimistic limit on running time for the randomized pattern test. This doesn't eliminate the possibility of hitting an exponential time pattern, but I re-run a few times and it seems to be pretty stable.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294322 13f79535-47bb-0310-9956-ffa450edef68
2012-02-27 20:50:24 +00:00
Dawid Weiss 7be5533989 LUCENE-3820: Wrong trailing index calculation in PatternReplaceCharFilter.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294141 13f79535-47bb-0310-9956-ffa450edef68
2012-02-27 13:13:10 +00:00
Robert Muir 6a07201844 don't fail test due to jre bugs in String.toLowerCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1243415 13f79535-47bb-0310-9956-ffa450edef68
2012-02-13 04:50:12 +00:00