Commit Graph

634 Commits

Author SHA1 Message Date
Robert Muir a5a0fd421e LUCENE-3768: fix typos in .alg files and test that all .alg files can be parsed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244509 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 14:46:05 +00:00
Tommaso Teofili c454ae6a66 [LUCENE-3731] - creating and using simple wst and pos tagger implementations for analyzers' random string testing
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244474 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 13:17:57 +00:00
Shai Erera 6c34d407cd fix DocMaker file handle leak: now with the actual fix :)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244380 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 07:02:31 +00:00
Shai Erera 505850c8f2 fix DocMaker file handle leak
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244379 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 07:01:41 +00:00
Ryan McKinley cea3acb111 LUCENE-3731: fix javadoc warnings, add uima to eclipse project
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244350 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 04:41:32 +00:00
Ryan McKinley 8d9bfe9245 LUCENE-3731: adding missing overview.html
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244340 13f79535-47bb-0310-9956-ffa450edef68
2012-02-15 04:01:57 +00:00
Tommaso Teofili d66d97790b [LUCENE-3731] - Creating the analysis-uima module for UIMA based tokenizers/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1244236 13f79535-47bb-0310-9956-ffa450edef68
2012-02-14 22:13:34 +00:00
Dawid Weiss 087f1e3126 LUCENE-3774: Optimized and streamlined license and notice file validation
by refactoring the build task into an ANT task and modifying build scripts
to perform top-level checks. (Dawid Weiss, Steve Rowe, Robert Muir)

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1243527 13f79535-47bb-0310-9956-ffa450edef68
2012-02-13 14:12:59 +00:00
Robert Muir 6a07201844 don't fail test due to jre bugs in String.toLowerCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1243415 13f79535-47bb-0310-9956-ffa450edef68
2012-02-13 04:50:12 +00:00
Michael McCandless bea8fd0fb6 SOLR-3076: fix BJQ to handle incoming liveDocs/filter correctly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242934 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:28:52 +00:00
Uwe Schindler 70a7d4975f LUCENE-3764: Remove MapBackedSet, it's already available in Java 6 through Collections.newSetFromMap(Map). BTW: Funny: http://blog.grovehillsoftware.com/2009/12/handy-but-hidden-collectionsnewsetfromm.html
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242932 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:26:55 +00:00
Uwe Schindler 6188bc66d7 LUCENE-3736: ParallelReader was split into ParallelAtomicReader and ParallelCompositeReader. Lucene 3.x's ParallelReader is now ParallelAtomicReader; but the new composite variant has improved performance as it works on the atomic subreaders.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242924 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 21:13:05 +00:00
Michael McCandless c74d48b857 LUCENE-3760: clean up DirectoryReader/SegmentInfos methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242903 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 19:57:07 +00:00
Robert Muir 590741dcfe LUCENE-3766: Remove Tokenizer's default ctor
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242890 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 19:12:35 +00:00
Robert Muir 8a50cefc6b LUCENE-3748: EnglishPossessiveFilter did not work with a proper right quotation mark
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242740 13f79535-47bb-0310-9956-ffa450edef68
2012-02-10 11:01:11 +00:00
Robert Muir 9f783ead67 SOLR-3115: improve japanese stopwords.txt description
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242557 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 22:17:44 +00:00
Robert Muir 509f4c557d LUCENE-3751: align default japanese configurations for lucene/solr
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242543 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 21:45:41 +00:00
Robert Muir 72ae3171be LUCENE-3765: Trappy behavior with StopFilter/ignoreCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242497 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 19:59:50 +00:00
Uwe Schindler 25cfcfb61e LUCENE-3757: Change AtomicReaderContext.leaves() to return itsself as only leave to simplify code and remove an otherwise unneeded ReaderUtil method
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1242233 13f79535-47bb-0310-9956-ffa450edef68
2012-02-09 08:14:19 +00:00
Robert Muir c0319d5928 SOLR-3056: document expectations in these files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241960 13f79535-47bb-0310-9956-ffa450edef68
2012-02-08 16:27:47 +00:00
Robert Muir dac1b58277 SOLR-3097, SOLR-3105: add fieldtypes for different languages to the example
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241878 13f79535-47bb-0310-9956-ffa450edef68
2012-02-08 12:07:52 +00:00
Steven Rowe eb0ab3d392 LUCENE-3753: Restructure the Lucene build system
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241588 13f79535-47bb-0310-9956-ffa450edef68
2012-02-07 19:59:05 +00:00
Michael McCandless 29329e1787 LUCENE-3736: revert per discussion on IRC until we can reach agreement on the chaining
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241500 13f79535-47bb-0310-9956-ffa450edef68
2012-02-07 15:49:31 +00:00
Uwe Schindler fb27a1f4e6 LUCENE-3736: Refactor ParallelReader to ParallelAtomicReader and ParallelCompositeReader
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241470 13f79535-47bb-0310-9956-ffa450edef68
2012-02-07 14:24:40 +00:00
Doron Cohen 36ff785606 LUCENE-3746: suggest.fst.Sort.BufferSize should not automatically fail just because of freeMemory()
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241355 13f79535-47bb-0310-9956-ffa450edef68
2012-02-07 06:00:39 +00:00
Robert Muir 4710d32447 LUCENE-3749: replace SimilarityProvider with PerFieldSimilarityWrapper
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1241001 13f79535-47bb-0310-9956-ffa450edef68
2012-02-06 12:48:01 +00:00
Robert Muir bef6e3664d LUCENE-3726: additional tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1240760 13f79535-47bb-0310-9956-ffa450edef68
2012-02-05 16:16:02 +00:00
Robert Muir 03497e7595 LUCENE-3745: add proper Japanese stopping
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1240714 13f79535-47bb-0310-9956-ffa450edef68
2012-02-05 13:05:42 +00:00
Robert Muir 009608d9f2 LUCENE-3726: default Kuromoji to search mode
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1240710 13f79535-47bb-0310-9956-ffa450edef68
2012-02-05 12:41:13 +00:00
Tommaso Teofili 6d3bb736f3 [LUCENE-3744] - applied patch for whiteList usage in TypeTokenFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1240034 13f79535-47bb-0310-9956-ffa450edef68
2012-02-03 09:13:17 +00:00
Michael McCandless 60c36c24fb don't let prefix's output bleed into full string's output (potential/latent bug)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1239658 13f79535-47bb-0310-9956-ffa450edef68
2012-02-02 15:01:13 +00:00
Yonik Seeley f6e4fceda2 SOLR-3078: only return update versions on request
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1239437 13f79535-47bb-0310-9956-ffa450edef68
2012-02-02 02:17:54 +00:00
Robert Muir 995c5b9ef1 LUCENE-3730: improve Kuromoji search mode heuristics
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1239061 13f79535-47bb-0310-9956-ffa450edef68
2012-02-01 11:03:17 +00:00
Uwe Schindler f252c064f2 LUCENE-3735: Fix PayloadProcessorProvider to no longer use Directory for lookup, instead AtomicReader
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1239052 13f79535-47bb-0310-9956-ffa450edef68
2012-02-01 10:51:40 +00:00
Uwe Schindler 3f99c3c2fd LUCENE-3733: simplier way to test for closed reader...
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1238892 13f79535-47bb-0310-9956-ffa450edef68
2012-02-01 01:00:03 +00:00
Michael McCandless 8e40ea5bf8 LUCENE-3742: fix token offset for hangs-off-end output in SynonymFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1238851 13f79535-47bb-0310-9956-ffa450edef68
2012-01-31 23:01:55 +00:00
Uwe Schindler 6b51b542b7 Remove redundant cast
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1238119 13f79535-47bb-0310-9956-ffa450edef68
2012-01-31 00:50:58 +00:00
Uwe Schindler 1648670485 LUCENE-2858: Fix remaining TODO: Re-add FieldCache insanity checking, got lost as tricky to implement
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1238112 13f79535-47bb-0310-9956-ffa450edef68
2012-01-31 00:33:19 +00:00
Uwe Schindler 57e33cf135 LUCENE-2858: fix remaining nocommits
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1238050 13f79535-47bb-0310-9956-ffa450edef68
2012-01-30 22:31:51 +00:00
Uwe Schindler 18533af348 LUCENE-2858: Fix Javadoc warnings, still missing some text for new classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1238026 13f79535-47bb-0310-9956-ffa450edef68
2012-01-30 21:45:08 +00:00
Uwe Schindler 9a6e69cf8d LUCENE-2858: Add a workaround for the broken PayloadProcessorProvider API that behaves like before our change (IR.directory() threw UOE for all non-directory based readers). I will open an issue to make PayloadProcessorProvider to get an AtomicReader instance as selection criteria for the processor
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237692 13f79535-47bb-0310-9956-ffa450edef68
2012-01-30 14:56:02 +00:00
Uwe Schindler 721de54e8c LUCENE-2858: Fix facets module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237618 13f79535-47bb-0310-9956-ffa450edef68
2012-01-30 12:19:21 +00:00
Uwe Schindler bfa1dec595 LUCENE-2858: Move the ReaderContexts out of their enclosing classes and make top-level classes. Makes import statements not so verbose and its much easier to understand. Prevent subclassing of IndexReader and IndexReaderContext directly.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237596 13f79535-47bb-0310-9956-ffa450edef68
2012-01-30 10:46:30 +00:00
Uwe Schindler 20fde71669 LUCENE-2858: Hide ReaderContext ctors for users. You should only get top contexts from the reader.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237512 13f79535-47bb-0310-9956-ffa450edef68
2012-01-30 00:35:08 +00:00
Uwe Schindler 2d4c738067 LUCENE-2858: Rename AtomicIndexReader -> AtomicReader, CompositeIndexReader -> CompositeReader. Contexts not yet changed!
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237508 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 23:50:54 +00:00
Uwe Schindler 10ba9abeb2 Reverse merged revision(s) from lucene/dev/trunk up to 1237502
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237505 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 23:19:05 +00:00
Michael McCandless d1165b1972 LUCENE-3725: add optional packing to FSTs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1237500 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 22:48:45 +00:00
Robert Muir adf080e451 LUCENE-2858: fix benchmarks
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237335 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 16:42:21 +00:00
Robert Muir acefe212c1 LUCENE-2858: fix modules facet (partially)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237330 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 16:34:57 +00:00
Robert Muir 4ef047b892 LUCENE-2858: fix grouping module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237325 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 16:24:30 +00:00
Robert Muir f38af41141 LUCENE-2858: fix join module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237321 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 16:03:40 +00:00
Robert Muir 45c377007f LUCENE-2858: fix queries module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237320 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 16:00:32 +00:00
Robert Muir 6d4484cb4a LUCENE-2858: fix qp module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237314 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 15:25:35 +00:00
Robert Muir 1906da16d0 LUCENE-2858: fix spellchecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237313 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 15:17:43 +00:00
Robert Muir d7fe56ddae LUCENE-2858: fix analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1237312 13f79535-47bb-0310-9956-ffa450edef68
2012-01-29 15:16:04 +00:00
Steven Rowe 97d62cc383 Fix offset array assertion off-by-one
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1236912 13f79535-47bb-0310-9956-ffa450edef68
2012-01-27 22:43:48 +00:00
Robert Muir f640687877 LUCENE-3720: add warning+experimental and disable test
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1236341 13f79535-47bb-0310-9956-ffa450edef68
2012-01-26 18:26:07 +00:00
Martijn van Groningen 5644c6aa18 LUCENE-3602: Fixed documentation error.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1236143 13f79535-47bb-0310-9956-ffa450edef68
2012-01-26 11:21:26 +00:00
Robert Muir a2443fa4ff missing license headers in suggest package
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1236012 13f79535-47bb-0310-9956-ffa450edef68
2012-01-26 01:10:31 +00:00
Shai Erera 640741783d minor jdoc fixes for PerfTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235714 13f79535-47bb-0310-9956-ffa450edef68
2012-01-25 10:42:00 +00:00
Robert Muir 6edfe4f157 LUCENE-3717: add tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235199 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 10:40:46 +00:00
Robert Muir 35a73d5f55 LUCENE-3717: fix broken offsets in ngramtokenizers, and check return value of Reader.read
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235187 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 09:50:21 +00:00
Robert Muir 7fafdd3576 LUCENE-3717: add checkRandomData to more analyzers and fix more offsets bugs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234850 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 15:19:58 +00:00
Steven Rowe 059410d424 LUCENE-3690: fix handling of unpaired numeric character entity UTF-16 surrogates to output U+FFFD REPLACEMENT CHARACTER; and add handling of properly paired numeric character entity UTF-16 surrogates, to output the corresponding pair of code units.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234687 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 07:36:38 +00:00
Robert Muir c754c1c9c8 LUCENE-3717: add better offsets testing to BaseTokenStreamTestCase, fix offsets bugs in ThaiWordFilter and ICUTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234652 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 00:08:52 +00:00
Robert Muir a7cfee6b07 SOLR-2891: fix CompoundWordTokenFilter to not create invalid offsets when the length of the text was changed by a previous filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234546 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 16:41:06 +00:00
Steven Rowe f3a363708f LUCENE-3690: Re-implemented HTMLStripCharFilter as a JFlex-generated scanner. Fixes LUCENE-2208, SOLR-882, and SOLR-42.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234452 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 05:20:46 +00:00
Shai Erera 17fe719bb5 LUCENE-3703: Calling DirectoryTaxonomyReader.refresh() could mess up reference counting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234451 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 05:14:45 +00:00
Uwe Schindler af9b4d816f LUCENE-3671: Add TypeTokenFilter that filters tokens based on their TypeAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234396 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 19:02:44 +00:00
Martijn van Groningen ac8befe5c3 LUCENE-3602: Fixed some typos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233918 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 14:15:42 +00:00
Robert Muir dea4b911da LUCENE-3646: throw exception for fieldcache on non-atomic reader
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232444 13f79535-47bb-0310-9956-ffa450edef68
2012-01-17 15:34:43 +00:00
Robert Muir e869b1fbf7 LUCENE-3700: give enough ram so that you can build naist-jdic with java 5
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232274 13f79535-47bb-0310-9956-ffa450edef68
2012-01-17 02:27:31 +00:00
Robert Muir f562a8a0dc LUCENE-3700: optionally support naist-jdic for kuromoji
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232268 13f79535-47bb-0310-9956-ffa450edef68
2012-01-17 02:20:24 +00:00
Robert Muir 48c01e5a2b LUCENE-3699: share baseform with surface and flag if the reading can be computed from surface
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232265 13f79535-47bb-0310-9956-ffa450edef68
2012-01-17 02:12:27 +00:00
Robert Muir c902f63125 unbreak clover/nightly builds until we do this right
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232254 13f79535-47bb-0310-9956-ffa450edef68
2012-01-17 01:37:28 +00:00
Martijn van Groningen ac6679a7b6 - Added licence and notice files to join module.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232228 13f79535-47bb-0310-9956-ffa450edef68
2012-01-16 23:31:57 +00:00
Martijn van Groningen 6d4f6b0978 - Removed changes.txt from grouping module and moved the entries to the changes.txt in Lucene contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232227 13f79535-47bb-0310-9956-ffa450edef68
2012-01-16 23:30:16 +00:00
Martijn van Groningen 7cdb8028c1 LUCENE-3602: Added query time joining.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232223 13f79535-47bb-0310-9956-ffa450edef68
2012-01-16 23:16:35 +00:00
Robert Muir 12c9b8b4bf LUCENE-3699: simplify dictionary access and reduce tokeninfodictionary 1.5MB
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232120 13f79535-47bb-0310-9956-ffa450edef68
2012-01-16 19:19:48 +00:00
Robert Muir 354a3be78f LUCENE-3696: fix dictionary construction to work on java5
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232012 13f79535-47bb-0310-9956-ffa450edef68
2012-01-16 14:50:09 +00:00
Michael McCandless 87bc4521c2 LUCENE-3695: move some confusing FST sugar out
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231795 13f79535-47bb-0310-9956-ffa450edef68
2012-01-15 23:25:38 +00:00
Michael McCandless 11f33ee521 LUCENE-3684: add offsets to postings APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231794 13f79535-47bb-0310-9956-ffa450edef68
2012-01-15 23:17:45 +00:00
Michael McCandless 9de01b56eb LUCENE-3453: simplify DocValues/Field API
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231791 13f79535-47bb-0310-9956-ffa450edef68
2012-01-15 23:05:13 +00:00
Robert Muir fbd34b4390 cleanups to 4.x CHANGES
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231552 13f79535-47bb-0310-9956-ffa450edef68
2012-01-14 18:24:48 +00:00
Yonik Seeley b2a0040e98 tests: silliness
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231526 13f79535-47bb-0310-9956-ffa450edef68
2012-01-14 16:59:36 +00:00
Michael McCandless 5ca66287ea woops
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231513 13f79535-47bb-0310-9956-ffa450edef68
2012-01-14 15:20:58 +00:00
Michael McCandless d584f6361d LUCENE-3685: add ToChildBlockJoinQuery, to join from parent to child
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231512 13f79535-47bb-0310-9956-ffa450edef68
2012-01-14 15:17:04 +00:00
Robert Muir 8b8c2b4dee LUCENE-3690: since this filter handles all kinds of bad partial and wierd input, this should be fine to enable
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231272 13f79535-47bb-0310-9956-ffa450edef68
2012-01-13 19:46:30 +00:00
Robert Muir 2ff4bdb04f enable assertions when executing various tools
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231013 13f79535-47bb-0310-9956-ffa450edef68
2012-01-13 11:36:50 +00:00
Robert Muir 05a65507af LUCENE-3305: optimization, don't retrieve the base form twice in this filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230769 13f79535-47bb-0310-9956-ffa450edef68
2012-01-12 20:36:58 +00:00
Robert Muir 4ebdc0872a LUCENE-3305: sorry Mike (thanks for the help with the FST optimization)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230756 13f79535-47bb-0310-9956-ffa450edef68
2012-01-12 20:24:40 +00:00
Robert Muir cd372bdc83 LUCENE-3305: add Kuromoji Japanese morphological analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230748 13f79535-47bb-0310-9956-ffa450edef68
2012-01-12 20:10:48 +00:00
Shai Erera d74db06ddf LUCENE-3686: CategoryEnhancement must override Object.equals()
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230431 13f79535-47bb-0310-9956-ffa450edef68
2012-01-12 07:43:42 +00:00
Simon Willnauer 3b8458f6de use TEST_VERSION_CURRENT instead of 4_0 in test
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229523 13f79535-47bb-0310-9956-ffa450edef68
2012-01-10 12:46:38 +00:00
Simon Willnauer f19317d318 SOLR-3020: Add KeywordAttribute support to HunspellStemFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229519 13f79535-47bb-0310-9956-ffa450edef68
2012-01-10 12:33:29 +00:00
Michael McCandless cdb2ee8a7b LUCENE-3679: replace IR.getFieldNames with IR.getFieldInfos
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229401 13f79535-47bb-0310-9956-ffa450edef68
2012-01-09 22:29:40 +00:00
Michael McCandless defd51a11b fix syn test bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1228704 13f79535-47bb-0310-9956-ffa450edef68
2012-01-07 19:28:07 +00:00
Michael McCandless ed9f0fd5ef LUCENE-3668: if there's only 1 output for a synonym rule then set start/endOffset to match the full span of the input tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1228650 13f79535-47bb-0310-9956-ffa450edef68
2012-01-07 16:26:15 +00:00
Robert Muir bbfc831d52 LUCENE-3677: remove old byte[] norms api from IndexReader
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1227755 13f79535-47bb-0310-9956-ffa450edef68
2012-01-05 18:55:20 +00:00
Michael McCandless ec3c3f81d9 LUCENE-3634: remove some old static main methods and other dead code
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1226871 13f79535-47bb-0310-9956-ffa450edef68
2012-01-03 17:08:19 +00:00
Shai Erera d5932e1149 LUCENE-3649: port changes from 3x to trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1226236 13f79535-47bb-0310-9956-ffa450edef68
2012-01-01 11:41:30 +00:00
Robert Muir f5d1e967db LUCENE-3667: make tests reasonable if you dont have a supercomputer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225920 13f79535-47bb-0310-9956-ffa450edef68
2011-12-30 21:14:15 +00:00
Steven Rowe 76d1662cb7 - Added license declaration
- Removed unused 'length' param to combine()

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225615 13f79535-47bb-0310-9956-ffa450edef68
2011-12-29 18:53:10 +00:00
Robert Muir b2970db4bc LUCENE-2906: filter to process output of Standard/ICUTokenizer and create overlapping bigrams for CJK
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225433 13f79535-47bb-0310-9956-ffa450edef68
2011-12-29 05:04:49 +00:00
Robert Muir e2f81e84f2 SOLR-2982: workaround bug in sun javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225228 13f79535-47bb-0310-9956-ffa450edef68
2011-12-28 16:51:23 +00:00
Robert Muir f3869ef3ce SOLR-2982: add Beider-Morse phonetic filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225211 13f79535-47bb-0310-9956-ffa450edef68
2011-12-28 16:00:52 +00:00
Robert Muir 7c7c7bd077 LUCENE-3662: add note
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1224823 13f79535-47bb-0310-9956-ffa450edef68
2011-12-26 21:36:16 +00:00
Robert Muir b096910a82 LUCENE-3662: extend LevenshteinAutomata to support transpositions as primitive a edit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1224817 13f79535-47bb-0310-9956-ffa450edef68
2011-12-26 21:18:36 +00:00
Robert Muir 17438c9eac don't allocate 128MB of ram in this assert
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1223047 13f79535-47bb-0310-9956-ffa450edef68
2011-12-25 04:24:06 +00:00
Grant Ingersoll f42b2ffd63 SOLR-2950: Improve QEC performance by dropping field cache use and keeping a local smaller map
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1220983 13f79535-47bb-0310-9956-ffa450edef68
2011-12-19 22:39:01 +00:00
Shai Erera 76ee67257d LUCENE-3635: Allow setting arbitrary objects on PerfRunData
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1220795 13f79535-47bb-0310-9956-ffa450edef68
2011-12-19 15:26:42 +00:00
Uwe Schindler 9b938f0dcd LUCENE-3652: Fix javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1215497 13f79535-47bb-0310-9956-ffa450edef68
2011-12-17 14:27:04 +00:00
Uwe Schindler ac96d90b53 LUCENE-3652: Move org.apache.lucene.messages to QueryParser module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1215480 13f79535-47bb-0310-9956-ffa450edef68
2011-12-17 12:57:41 +00:00
Robert Muir a55f511a77 LUCENE-3650: move o.a.l.index.codecs to o.a.l.codecs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1215245 13f79535-47bb-0310-9956-ffa450edef68
2011-12-16 19:03:12 +00:00
Robert Muir 18febd69e4 LUCENE-2208: improve charfilter offset testing
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1215038 13f79535-47bb-0310-9956-ffa450edef68
2011-12-16 04:37:47 +00:00
Steven Rowe 60929a5adb LUCENE-3645: Remove unnecessary array wrapping when calling varargs methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1214413 13f79535-47bb-0310-9956-ffa450edef68
2011-12-14 19:15:47 +00:00
Robert Muir eb173852ba LUCENE-3622: merge trunk (1213324:1213812)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3622@1213825 13f79535-47bb-0310-9956-ffa450edef68
2011-12-13 18:23:22 +00:00
Robert Muir 701000cf27 LUCENE-3622: function DocValues -> FunctionValues
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3622@1213426 13f79535-47bb-0310-9956-ffa450edef68
2011-12-12 20:33:26 +00:00
Robert Muir b15b981081 LUCENE-3622: merge trunk (1212830:1213323)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3622@1213405 13f79535-47bb-0310-9956-ffa450edef68
2011-12-12 20:13:02 +00:00
Robert Muir 7dc025bdce LUCENE-3642: fix invalid offsets from CharTokenizer, [Edge]NGramFilters, SmartChinese, add sanity check to BaseTokenStreamTestCase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1213329 13f79535-47bb-0310-9956-ffa450edef68
2011-12-12 17:28:09 +00:00
Robert Muir 3899e18ca3 LUCENE-3640: Remove IndexSearcher.close
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1213117 13f79535-47bb-0310-9956-ffa450edef68
2011-12-12 00:21:40 +00:00
Robert Muir 4387ff6161 LUCENE-3622: fix naming issues, IDV->DV, ValueType -> DV.Type, move all DV user-facing api into .index, TypePromoter pkg-private, etc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3622@1212506 13f79535-47bb-0310-9956-ffa450edef68
2011-12-09 16:43:45 +00:00
Uwe Schindler 905a0f211c LUCENE-3606: Make IndexReader really read-only
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1212292 13f79535-47bb-0310-9956-ffa450edef68
2011-12-09 09:13:39 +00:00
Robert Muir 9b15b1d3b0 consolidate assumes in ThaiAnalyzer test so we don't miss it for individual tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1212141 13f79535-47bb-0310-9956-ffa450edef68
2011-12-08 21:47:12 +00:00
Uwe Schindler a363e9b564 LUCENE-3606: Merge up to trunk r1212007
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3606@1212008 13f79535-47bb-0310-9956-ffa450edef68
2011-12-08 18:01:13 +00:00
Martijn van Groningen b3a07fad1f LUCENE-3496: Replaced generating simple string with realistic unicode strings.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1211993 13f79535-47bb-0310-9956-ffa450edef68
2011-12-08 17:18:19 +00:00
Robert Muir 1ae93d3697 LUCENE-3606: javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3606@1210498 13f79535-47bb-0310-9956-ffa450edef68
2011-12-05 15:33:15 +00:00
Robert Muir 8e77a1e5e1 LUCENE-3606: fix faceting tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3606@1210489 13f79535-47bb-0310-9956-ffa450edef68
2011-12-05 15:05:26 +00:00
Robert Muir 1dc3a251c4 LUCENE-3606: remove indexreader modification tasks from benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3606@1210486 13f79535-47bb-0310-9956-ffa450edef68
2011-12-05 14:57:44 +00:00
Robert Muir 3843ac5b8b LUCENE-3606: fix more tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3606@1210308 13f79535-47bb-0310-9956-ffa450edef68
2011-12-05 01:59:11 +00:00
Michael McCandless 961b820e53 LUCENE-2929: specify up front if you need freqs from DocsEnum
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1210176 13f79535-47bb-0310-9956-ffa450edef68
2011-12-04 18:50:58 +00:00
Dawid Weiss 3408defc8d JavaDoc clarifications. Removed unused parameter from one of the constructors.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1210173 13f79535-47bb-0310-9956-ffa450edef68
2011-12-04 18:38:32 +00:00
Michael McCandless c9ad724c19 sssshhhh
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1210116 13f79535-47bb-0310-9956-ffa450edef68
2011-12-04 14:35:46 +00:00
Martijn van Groningen 5037d1d5d2 Moved TestBlockJoin test into join package.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1209874 13f79535-47bb-0310-9956-ffa450edef68
2011-12-03 09:53:17 +00:00
Dawid Weiss 92de7e8848 SOLR-2888: FSTSuggester refactoring: internal storage is now UTF-8,
external sorting (on disk) prevents OOMs even with large data sets
(the bottleneck is now FST construction), code cleanups and API cleanups.

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1209265 13f79535-47bb-0310-9956-ffa450edef68
2011-12-01 21:49:27 +00:00
Uwe Schindler 277c4a1a5d Merged revision(s) 1208375 from lucene/dev/branches/branch_3x:
LUCENE-3609: Fix regression in BooleanFilter, introduced in Lucene 3.5, to correctly handle minShouldMatch behaviour of previous versions

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1208381 13f79535-47bb-0310-9956-ffa450edef68
2011-11-30 11:08:45 +00:00
Chris M. Hostetter 3ed5106920 SOLR-2819: Improved speed of parsing hex entities in HTMLStripCharFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1208032 13f79535-47bb-0310-9956-ffa450edef68
2011-11-29 19:15:54 +00:00
Simon Willnauer 9a513f82dc LUCENE-3592: Make BytesRef#startsWith & BytesRef#endsWith static
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1207564 13f79535-47bb-0310-9956-ffa450edef68
2011-11-28 20:15:17 +00:00
Shai Erera 94dda4dcaa update facet/build.xml#jar-examples to include the source files as well
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1207012 13f79535-47bb-0310-9956-ffa450edef68
2011-11-28 08:21:21 +00:00
Doron Cohen ec660c7997 LUCENE-3596: DirectoryTaxonomyWriter extensions can now set internal index writer config attributes such as info stream
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206996 13f79535-47bb-0310-9956-ffa450edef68
2011-11-28 07:05:44 +00:00
Michael McCandless dda0cd73a2 LUCENE-3600: close searchers in test
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206817 13f79535-47bb-0310-9956-ffa450edef68
2011-11-27 19:52:40 +00:00
Michael McCandless 3187a7a26b LUCENE-3600: fix BJQ to not trip assert when parent doc has no children
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206807 13f79535-47bb-0310-9956-ffa450edef68
2011-11-27 19:30:17 +00:00
Erick Erickson 098371446a SOLR-2438, allow an analysis chain to be created for multiterm query terms or synthesize one if not defined explicitly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206229 13f79535-47bb-0310-9956-ffa450edef68
2011-11-25 15:46:26 +00:00
Robert Muir 6870592252 LUCENE-3590: unbreak directspellchecker string handling
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206186 13f79535-47bb-0310-9956-ffa450edef68
2011-11-25 14:20:00 +00:00
Robert Muir 7f766cf603 LUCENE-3590: nuke BytesRef.utf8ToChars
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206174 13f79535-47bb-0310-9956-ffa450edef68
2011-11-25 13:55:41 +00:00
Robert Muir 233a289a66 LUCENE-3590: cut over new DV grouping collectors
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206156 13f79535-47bb-0310-9956-ffa450edef68
2011-11-25 13:28:43 +00:00
Robert Muir 3b6da22aa7 LUCENE-3590: clearly mark bogus deep-copying apis in BytesRef
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206143 13f79535-47bb-0310-9956-ffa450edef68
2011-11-25 12:50:13 +00:00
Martijn van Groningen 9dd60fe58f LUCENE-3496: Support grouping by IndexDocValues
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1205960 13f79535-47bb-0310-9956-ffa450edef68
2011-11-24 19:07:59 +00:00
Shai Erera abd95277d3 LUCENE-3583: LineDocSourceTest could still fail to delete some files - add more protection
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1204859 13f79535-47bb-0310-9956-ffa450edef68
2011-11-22 08:41:31 +00:00
Shai Erera cfe863797c LUCENE-3583: more safety changes to benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1204851 13f79535-47bb-0310-9956-ffa450edef68
2011-11-22 08:14:50 +00:00