Commit Graph

563 Commits

Author SHA1 Message Date
Simon Willnauer f303bcd465 LUCENE-3807: Cleanup Suggest / Lookup API
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1296268 13f79535-47bb-0310-9956-ffa450edef68
2012-03-02 15:59:55 +00:00
Robert Muir a273239db6 LUCENE-3801: generify FST shortestPaths to any output type
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1296237 13f79535-47bb-0310-9956-ffa450edef68
2012-03-02 14:59:44 +00:00
Robert Muir adebb1592a LUCENE-3801: our suggesters are 1000x faster than we thought they were
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1296147 13f79535-47bb-0310-9956-ffa450edef68
2012-03-02 11:22:14 +00:00
Robert Muir 978ce35e40 fix suggester benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1295426 13f79535-47bb-0310-9956-ffa450edef68
2012-03-01 06:33:30 +00:00
Chris M. Hostetter 29a7c260fe LUCENE-2604: add '/' to the list of chars in the various escape functions
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294965 13f79535-47bb-0310-9956-ffa450edef68
2012-02-29 03:58:41 +00:00
Michael McCandless 9ab8f9b83c LUCENE-3824: don't do pointless by-value cmp in TermOrdVal/DocValuesComparator.setBottom
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294856 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 22:20:18 +00:00
Dawid Weiss 8c2e3cef8f LUCENE-3820: limiting the amount of input for pattern matching to go past exponential time patterns, even if they happen. A nice catch from Mike too -- un-ignore testNastyPattern and look at processing time go wild with each additional input character...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294797 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 19:26:05 +00:00
Dawid Weiss f3cc65733b Sysout of the randomized pattern.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294518 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 08:15:38 +00:00
Dawid Weiss 4d401ca87d Test thread's name reflects the current seed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294514 13f79535-47bb-0310-9956-ffa450edef68
2012-02-28 08:04:42 +00:00
Dawid Weiss 493bd8b42f LUCENE-3820: optimistic limit on running time for the randomized pattern test. This doesn't eliminate the possibility of hitting an exponential time pattern, but I re-run a few times and it seems to be pretty stbale.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294322 13f79535-47bb-0310-9956-ffa450edef68
2012-02-27 20:50:24 +00:00
Dawid Weiss 7be5533989 LUCENE-3820: Wrong trailing index calculation in PatternReplaceCharFilter.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1294141 13f79535-47bb-0310-9956-ffa450edef68
2012-02-27 13:13:10 +00:00
Tommaso Teofili 482c0610fd [LUCENE-3731] - refactored analyzeText method to initializeIterator and made it abstract inside BaseUIMATokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1293614 13f79535-47bb-0310-9956-ffa450edef68
2012-02-25 14:14:00 +00:00
Simon Willnauer f29eda768d LUCENE-3807: Clean up Suggest API
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1293148 13f79535-47bb-0310-9956-ffa450edef68
2012-02-24 09:49:39 +00:00
Tommaso Teofili 930816cc5b LUCENE-3731 - AEProviderFactory getAEProvider logic cleaned
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1292585 13f79535-47bb-0310-9956-ffa450edef68
2012-02-22 23:39:51 +00:00
Robert Muir a7a3d5497f enable tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1292231 13f79535-47bb-0310-9956-ffa450edef68
2012-02-22 10:57:24 +00:00
Simon Willnauer 620622a70e LUCENE-3807: compare string with string
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1292228 13f79535-47bb-0310-9956-ffa450edef68
2012-02-22 10:39:39 +00:00
Robert Muir 9527a9e397 LUCENE-3813: improve testEmpty to also check that the iterator is exhausted
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291883 13f79535-47bb-0310-9956-ffa450edef68
2012-02-21 15:58:12 +00:00
Robert Muir 9911c1de35 LUCENE-3807: fix missing null check in HighFrequencyDictionary
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291826 13f79535-47bb-0310-9956-ffa450edef68
2012-02-21 14:56:42 +00:00
Robert Muir 51388a2f9c LUCENE-3811: remove unused benchmark dependencies
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291728 13f79535-47bb-0310-9956-ffa450edef68
2012-02-21 12:11:04 +00:00
Simon Willnauer 70501dd845 LUCENE-3807: consume all terms from the enum
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291506 13f79535-47bb-0310-9956-ffa450edef68
2012-02-20 22:53:55 +00:00
Robert Muir 9d210b0c37 add test for suggester iterators
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291502 13f79535-47bb-0310-9956-ffa450edef68
2012-02-20 22:51:34 +00:00
Simon Willnauer 1860439f15 LUCENE-3807: clean up TermFreqIterator API
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291418 13f79535-47bb-0310-9956-ffa450edef68
2012-02-20 19:35:59 +00:00
Robert Muir f1345de257 don't uppercase int to İNT
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291343 13f79535-47bb-0310-9956-ffa450edef68
2012-02-20 16:17:17 +00:00
Robert Muir a519b630ee LUCENE-3714: add weighted FST suggester impl
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1291020 13f79535-47bb-0310-9956-ffa450edef68
2012-02-19 16:23:05 +00:00
Michael McCandless 854c9ac452 LUCENE-3777: separate out Int/Long/Float/DoubleField to reduce traps
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1245583 13f79535-47bb-0310-9956-ffa450edef68
2012-02-17 14:46:35 +00:00
Shai Erera 242092c929 LUCENE-3794: DirectoryTaxonomyWriter can lose the INDEX_CREATE_TIME property, causing DirTaxoReader.refresh() to falsely succeed (or fail)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244964 13f79535-47bb-0310-9956-ffa450edef68
2012-02-16 12:54:56 +00:00
Robert Muir e51795be39 LUCENE-3731: remove unnecessary code
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244714 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 20:53:53 +00:00
Robert Muir c97e3edbb9 LUCENE-3731: performance improvements and thread safety fixes to UIMA tokenizers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244688 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 20:29:20 +00:00
Steven Rowe d47f01c350 LUCENE-3754: Store generated archive manifests in per-module output directories - each artifact gets its own manifest file
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244536 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 15:30:53 +00:00
Robert Muir a5a0fd421e LUCENE-3768: fix typos in .alg files and test that all .alg files can be parsed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244509 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 14:46:05 +00:00
Tommaso Teofili c454ae6a66 [LUCENE-3731] - creating and using simple wst and pos tagger implementations for analyzers' random string testing
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244474 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 13:17:57 +00:00
Shai Erera 6c34d407cd fix DocMaker file handle leak: now with the actual fix :)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244380 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 07:02:31 +00:00
Shai Erera 505850c8f2 fix DocMaker file handle leak
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244379 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 07:01:41 +00:00
Ryan McKinley cea3acb111 LUCENE-3731: fix javadoc warnings, add uima to eclipse project
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244350 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 04:41:32 +00:00
Ryan McKinley 8d9bfe9245 LUCENE-3731: adding missing overview.html
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244340 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 04:01:57 +00:00
Tommaso Teofili d66d97790b [LUCENE-3731] - Creating the analysis-uima module for UIMA based tokenizers/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244236 13f79535-47bb-0310-9956-ffa450edef68
2012-02-14 22:13:34 +00:00
Dawid Weiss 087f1e3126 LUCENE-3774: Optimized and streamlined license and notice file validation
by refactoring the build task into an ANT task and modifying build scripts
to perform top-level checks. (Dawid Weiss, Steve Rowe, Robert Muir)

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1243527 13f79535-47bb-0310-9956-ffa450edef68
2012-02-13 14:12:59 +00:00
Robert Muir 6a07201844 don't fail test due to jre bugs in String.toLowerCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1243415 13f79535-47bb-0310-9956-ffa450edef68
2012-02-13 04:50:12 +00:00
Michael McCandless bea8fd0fb6 SOLR-3076: fix BJQ to handle incoming liveDocs/filter correctly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242934 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:28:52 +00:00
Uwe Schindler 70a7d4975f LUCENE-3764: Remove MapBackedSet, it's already available in Java 6 through Collections.newSetFromMap(Map). BTW: Funny: http://blog.grovehillsoftware.com/2009/12/handy-but-hidden-collectionsnewsetfromm.html
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242932 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:26:55 +00:00
Uwe Schindler 6188bc66d7 LUCENE-3736: ParallelReader was split into ParallelAtomicReader and ParallelCompositeReader. Lucene 3.x's ParallelReader is now ParallelAtomicReader; but the new composite variant has improved performance as it works on the atomic subreaders.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242924 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:13:05 +00:00
Michael McCandless c74d48b857 LUCENE-3760: clean up DirectoryReader/SegmentInfos methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242903 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 19:57:07 +00:00
Robert Muir 590741dcfe LUCENE-3766: Remove Tokenizer's default ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242890 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 19:12:35 +00:00
Robert Muir 8a50cefc6b LUCENE-3748: EnglishPossessiveFilter did not work with a proper right quotation mark
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242740 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 11:01:11 +00:00
Robert Muir 9f783ead67 SOLR-3115: improve japanese stopwords.txt description
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242557 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 22:17:44 +00:00
Robert Muir 509f4c557d LUCENE-3751: align default japanese configurations for lucene/solr
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242543 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 21:45:41 +00:00
Robert Muir 72ae3171be LUCENE-3765: Trappy behavior with StopFilter/ignoreCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242497 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 19:59:50 +00:00
Uwe Schindler 25cfcfb61e LUCENE-3757: Change AtomicReaderContext.leaves() to return itsself as only leave to simplify code and remove an otherwise unneeded ReaderUtil method
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242233 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 08:14:19 +00:00
Robert Muir c0319d5928 SOLR-3056: document expectations in these files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241960 13f79535-47bb-0310-9956-ffa450edef68
2012-02-08 16:27:47 +00:00
Robert Muir dac1b58277 SOLR-3097, SOLR-3105: add fieldtypes for different languages to the example
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241878 13f79535-47bb-0310-9956-ffa450edef68
2012-02-08 12:07:52 +00:00