Simon Willnauer
f303bcd465
LUCENE-3807: Cleanup Suggest / Lookup API
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1296268 13f79535-47bb-0310-9956-ffa450edef68
2012-03-02 15:59:55 +00:00
Robert Muir
a273239db6
LUCENE-3801: generify FST shortestPaths to any output type
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1296237 13f79535-47bb-0310-9956-ffa450edef68
2012-03-02 14:59:44 +00:00
Robert Muir
adebb1592a
LUCENE-3801: our suggesters are 1000x faster than we thought they were
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1296147 13f79535-47bb-0310-9956-ffa450edef68
2012-03-02 11:22:14 +00:00
Robert Muir
978ce35e40
fix suggester benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1295426 13f79535-47bb-0310-9956-ffa450edef68
2012-03-01 06:33:30 +00:00
Chris M. Hostetter
29a7c260fe
LUCENE-2604: add '/' to the list of chars in the various escape functions
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294965 13f79535-47bb-0310-9956-ffa450edef68
2012-02-29 03:58:41 +00:00
Michael McCandless
9ab8f9b83c
LUCENE-3824: don't do pointless by-value cmp in TermOrdVal/DocValuesComparator.setBottom
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294856 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 22:20:18 +00:00
Dawid Weiss
8c2e3cef8f
LUCENE-3820: limiting the amount of input for pattern matching to go past exponential time patterns, even if they happen. A nice catch from Mike too -- un-ignore testNastyPattern and look at processing time go wild with each additional input character...
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294797 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 19:26:05 +00:00
Dawid Weiss
f3cc65733b
Sysout of the randomized pattern.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294518 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 08:15:38 +00:00
Dawid Weiss
4d401ca87d
Test thread's name reflects the current seed.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294514 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 08:04:42 +00:00
Dawid Weiss
493bd8b42f
LUCENE-3820: optimistic limit on running time for the randomized pattern test. This doesn't eliminate the possibility of hitting an exponential time pattern, but I re-run a few times and it seems to be pretty stbale.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294322 13f79535-47bb-0310-9956-ffa450edef68
2012-02-27 20:50:24 +00:00
Dawid Weiss
7be5533989
LUCENE-3820: Wrong trailing index calculation in PatternReplaceCharFilter.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294141 13f79535-47bb-0310-9956-ffa450edef68
2012-02-27 13:13:10 +00:00
Tommaso Teofili
482c0610fd
[LUCENE-3731] - refactored analyzeText method to initializeIterator and made it abstract inside BaseUIMATokenizer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1293614 13f79535-47bb-0310-9956-ffa450edef68
2012-02-25 14:14:00 +00:00
Simon Willnauer
f29eda768d
LUCENE-3807: Clean up Suggest API
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1293148 13f79535-47bb-0310-9956-ffa450edef68
2012-02-24 09:49:39 +00:00
Tommaso Teofili
930816cc5b
LUCENE-3731 - AEProviderFactory getAEProvider logic cleaned
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1292585 13f79535-47bb-0310-9956-ffa450edef68
2012-02-22 23:39:51 +00:00
Robert Muir
a7a3d5497f
enable tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1292231 13f79535-47bb-0310-9956-ffa450edef68
2012-02-22 10:57:24 +00:00
Simon Willnauer
620622a70e
LUCENE-3807: compare string with string
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1292228 13f79535-47bb-0310-9956-ffa450edef68
2012-02-22 10:39:39 +00:00
Robert Muir
9527a9e397
LUCENE-3813: improve testEmpty to also check that the iterator is exhausted
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291883 13f79535-47bb-0310-9956-ffa450edef68
2012-02-21 15:58:12 +00:00
Robert Muir
9911c1de35
LUCENE-3807: fix missing null check in HighFrequencyDictionary
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291826 13f79535-47bb-0310-9956-ffa450edef68
2012-02-21 14:56:42 +00:00
Robert Muir
51388a2f9c
LUCENE-3811: remove unused benchmark dependencies
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291728 13f79535-47bb-0310-9956-ffa450edef68
2012-02-21 12:11:04 +00:00
Simon Willnauer
70501dd845
LUCENE-3807: consume all terms from the enum
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291506 13f79535-47bb-0310-9956-ffa450edef68
2012-02-20 22:53:55 +00:00
Robert Muir
9d210b0c37
add test for suggester iterators
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291502 13f79535-47bb-0310-9956-ffa450edef68
2012-02-20 22:51:34 +00:00
Simon Willnauer
1860439f15
LUCENE-3807: clean up TermFreqIterator API
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291418 13f79535-47bb-0310-9956-ffa450edef68
2012-02-20 19:35:59 +00:00
Robert Muir
f1345de257
don't uppercase int to İNT
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291343 13f79535-47bb-0310-9956-ffa450edef68
2012-02-20 16:17:17 +00:00
Robert Muir
a519b630ee
LUCENE-3714: add weighted FST suggester impl
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291020 13f79535-47bb-0310-9956-ffa450edef68
2012-02-19 16:23:05 +00:00
Michael McCandless
854c9ac452
LUCENE-3777: separate out Int/Long/Float/DoubleField to reduce traps
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1245583 13f79535-47bb-0310-9956-ffa450edef68
2012-02-17 14:46:35 +00:00
Shai Erera
242092c929
LUCENE-3794: DirectoryTaxonomyWriter can lose the INDEX_CREATE_TIME property, causing DirTaxoReader.refresh() to falsely succeed (or fail)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244964 13f79535-47bb-0310-9956-ffa450edef68
2012-02-16 12:54:56 +00:00
Robert Muir
e51795be39
LUCENE-3731: remove unnecessary code
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244714 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 20:53:53 +00:00
Robert Muir
c97e3edbb9
LUCENE-3731: performance improvements and thread safety fixes to UIMA tokenizers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244688 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 20:29:20 +00:00
Steven Rowe
d47f01c350
LUCENE-3754: Store generated archive manifests in per-module output directories - each artifact gets its own manifest file
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244536 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 15:30:53 +00:00
Robert Muir
a5a0fd421e
LUCENE-3768: fix typos in .alg files and test that all .alg files can be parsed
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244509 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 14:46:05 +00:00
Tommaso Teofili
c454ae6a66
[LUCENE-3731] - creating and using simple wst and pos tagger implementations for analyzers' random string testing
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244474 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 13:17:57 +00:00
Shai Erera
6c34d407cd
fix DocMaker file handle leak: now with the actual fix :)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244380 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 07:02:31 +00:00
Shai Erera
505850c8f2
fix DocMaker file handle leak
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244379 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 07:01:41 +00:00
Ryan McKinley
cea3acb111
LUCENE-3731: fix javadoc warnings, add uima to eclipse project
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244350 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 04:41:32 +00:00
Ryan McKinley
8d9bfe9245
LUCENE-3731: adding missing overview.html
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244340 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 04:01:57 +00:00
Tommaso Teofili
d66d97790b
[LUCENE-3731] - Creating the analysis-uima module for UIMA based tokenizers/analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244236 13f79535-47bb-0310-9956-ffa450edef68
2012-02-14 22:13:34 +00:00
Dawid Weiss
087f1e3126
LUCENE-3774: Optimized and streamlined license and notice file validation
...
by refactoring the build task into an ANT task and modifying build scripts
to perform top-level checks. (Dawid Weiss, Steve Rowe, Robert Muir)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1243527 13f79535-47bb-0310-9956-ffa450edef68
2012-02-13 14:12:59 +00:00
Robert Muir
6a07201844
don't fail test due to jre bugs in String.toLowerCase
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1243415 13f79535-47bb-0310-9956-ffa450edef68
2012-02-13 04:50:12 +00:00
Michael McCandless
bea8fd0fb6
SOLR-3076: fix BJQ to handle incoming liveDocs/filter correctly
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242934 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:28:52 +00:00
Uwe Schindler
70a7d4975f
LUCENE-3764: Remove MapBackedSet, it's already available in Java 6 through Collections.newSetFromMap(Map). BTW: Funny: http://blog.grovehillsoftware.com/2009/12/handy-but-hidden-collectionsnewsetfromm.html
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242932 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:26:55 +00:00
Uwe Schindler
6188bc66d7
LUCENE-3736: ParallelReader was split into ParallelAtomicReader and ParallelCompositeReader. Lucene 3.x's ParallelReader is now ParallelAtomicReader; but the new composite variant has improved performance as it works on the atomic subreaders.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242924 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:13:05 +00:00
Michael McCandless
c74d48b857
LUCENE-3760: clean up DirectoryReader/SegmentInfos methods
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242903 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 19:57:07 +00:00
Robert Muir
590741dcfe
LUCENE-3766: Remove Tokenizer's default ctor
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242890 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 19:12:35 +00:00
Robert Muir
8a50cefc6b
LUCENE-3748: EnglishPossessiveFilter did not work with a proper right quotation mark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242740 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 11:01:11 +00:00
Robert Muir
9f783ead67
SOLR-3115: improve japanese stopwords.txt description
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242557 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 22:17:44 +00:00
Robert Muir
509f4c557d
LUCENE-3751: align default japanese configurations for lucene/solr
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242543 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 21:45:41 +00:00
Robert Muir
72ae3171be
LUCENE-3765: Trappy behavior with StopFilter/ignoreCase
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242497 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 19:59:50 +00:00
Uwe Schindler
25cfcfb61e
LUCENE-3757: Change AtomicReaderContext.leaves() to return itsself as only leave to simplify code and remove an otherwise unneeded ReaderUtil method
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242233 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 08:14:19 +00:00
Robert Muir
c0319d5928
SOLR-3056: document expectations in these files
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241960 13f79535-47bb-0310-9956-ffa450edef68
2012-02-08 16:27:47 +00:00
Robert Muir
dac1b58277
SOLR-3097, SOLR-3105: add fieldtypes for different languages to the example
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241878 13f79535-47bb-0310-9956-ffa450edef68
2012-02-08 12:07:52 +00:00