Commit Graph

11927 Commits

Author SHA1 Message Date
Jan Høydahl 8f550126f7 SOLR-2901: Upgrade Solr to Tika 1.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235753 13f79535-47bb-0310-9956-ffa450edef68
2012-01-25 14:18:06 +00:00
Robert Muir 1b9e732bed LUCENE-3723: add basic sanity checks for stats in sims
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235747 13f79535-47bb-0310-9956-ffa450edef68
2012-01-25 13:59:28 +00:00
Shai Erera 640741783d minor jdoc fixes for PerfTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235714 13f79535-47bb-0310-9956-ffa450edef68
2012-01-25 10:42:00 +00:00
Jan Høydahl d0e4e731d9 SOLR-2280: commitWithin ignored for a delete query
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235305 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 15:41:16 +00:00
Uwe Schindler 20d3c8335e LUCENE-3712: Removed unused and untested ReaderUtil#subReader methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235228 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 12:52:07 +00:00
Robert Muir 6edfe4f157 LUCENE-3717: add tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235199 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 10:40:46 +00:00
Doron Cohen 0aad3d64ad LUCENE-3718: fix Lucene40PostingsReader.linearScan() to set its doc state also when refill()ing.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235190 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 10:00:47 +00:00
Robert Muir 35a73d5f55 LUCENE-3717: fix broken offsets in ngramtokenizers, and check return value of Reader.read
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235187 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 09:50:21 +00:00
Robert Muir b1da6f5041 LUCENE-3706: add offsets into lucene40 postings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235022 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 22:03:08 +00:00
Steven Rowe edd8b91895 LUCENE-3690: Maven configuration: Added test resources solr/core/src/test/**/* (except *.java) to the solr-core module, to allow LegacyHTMLStripCharFilterTest to find its test resource htmlStripReaderTest.html. (Jenkins builds were failing because of this.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234884 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 16:27:23 +00:00
Steven Rowe 9157338066 LUCENE-3690: Added info about changes in HTMLStripCharFilter surrogate handling to solr/CHANGES.txt.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234867 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 15:56:06 +00:00
Robert Muir 7fafdd3576 LUCENE-3717: add checkRandomData to more analyzers and fix more offsets bugs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234850 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 15:19:58 +00:00
Steven Rowe 059410d424 LUCENE-3690: fix handling of unpaired numeric character entity UTF-16 surrogates to output U+FFFD REPLACEMENT CHARACTER; and add handling of properly paired numeric character entity UTF-16 surrogates, to output the corresponding pair of code units.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234687 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 07:36:38 +00:00
Robert Muir c754c1c9c8 LUCENE-3717: add better offsets testing to BaseTokenStreamTestCase, fix offsets bugs in ThaiWordFilter and ICUTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234652 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 00:08:52 +00:00
Uwe Schindler f7a474d603 LUCENE-3671: Fix wrong issue number!
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234598 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 20:35:22 +00:00
Uwe Schindler 6b5ef1dead SOLR-3054, LUCENE-3671: Remove useless assureMatchVersion check
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234579 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 18:57:24 +00:00
Uwe Schindler 366e1a2cd2 SOLR-3054, LUCENE-3671: Add TypeTokenFilterFactory that creates TypeTokenFilter that filters tokens based on their TypeAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234573 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 18:50:10 +00:00
Robert Muir a7cfee6b07 SOLR-2891: fix CompoundWordTokenFilter to not create invalid offsets when the length of the text was changed by a previous filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234546 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 16:41:06 +00:00
Steven Rowe f3a363708f LUCENE-3690: Re-implemented HTMLStripCharFilter as a JFlex-generated scanner. Fixes LUCENE-2208, SOLR-882, and SOLR-42.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234452 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 05:20:46 +00:00
Shai Erera 17fe719bb5 LUCENE-3703: Calling DirectoryTaxonomyReader.refresh() could mess up reference counting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234451 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 05:14:45 +00:00
Simon Willnauer acf9e1333d LUCENE-3709: properly release primary output lock if wrapper creation fails
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234398 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 19:10:41 +00:00
Uwe Schindler af9b4d816f LUCENE-3671: Add TypeTokenFilter that filters tokens based on their TypeAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234396 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 19:02:44 +00:00
Uwe Schindler 9b86beea98 Remove obsolete javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234318 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 12:50:25 +00:00
Steven Rowe b383f4a044 Added a junit test scope dependency in the grandparent POM to work around a Maven 2.2.X bug in which transitive test dependency resolution omits junit as duplicate everywhere it occurs, resulting in the absence of the junit jar from the test classpath, and then test compilation failure. This manifested only in Solr's clustering contrib, but I'm putting it in the grandfather POM to insure it doesn't happen elsewhere. Under Maven 3.0.4, this behavior no longer occurs, so when we upgrade the minimum Maven version to 3.0.4+, the junit test scope dependency in the grandfather POM can be removed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234219 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 00:31:03 +00:00
Steven Rowe e5e4434bdb Skip compile:test-compile under Maven 3
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234197 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 23:05:12 +00:00
Simon Willnauer 315b6ab63d LUCENE-3709: only load norms if they are present in at least on Field
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234013 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 17:19:41 +00:00
Martijn van Groningen ac8befe5c3 LUCENE-3602: Fixed some typos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233918 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 14:15:42 +00:00
Yonik Seeley 53270bdb9c LUCENE-3711: add CHANGES entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233708 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 01:41:54 +00:00
Michael McCandless 5f33d8d2ae use singletons in FST outputs; add 2 commented out test cases showing non-minimality
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233696 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 23:54:55 +00:00
Yonik Seeley 4fdebf90e5 SentinelIntSet can hang by not resizing really small sets early enough, add random test
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233583 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 21:38:40 +00:00
Robert Muir ab52c0d7ef if classpath is wrong for preflex-rw, throw an exception up front
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233520 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 18:33:56 +00:00
Robert Muir 3c6d62edbb LUCENE-3708: always call postingswriter.finishDoc when writing postings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233405 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 14:38:34 +00:00
Michael McCandless 5bc7f5bbd0 LUCENE-3121: add reverse-lookup by long to FST, when output grows strictly monotonically w/ input (eg, ord, address, etc.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233381 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 14:07:40 +00:00
Simon Willnauer 47f474735f LUCENE-3707: Add a Lucene3x private SegmentInfosFormat implementation
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233380 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 14:06:10 +00:00
Simon Willnauer fc0b7cfb59 don't rely on ThreadAffinity in tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233288 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 11:21:13 +00:00
Simon Willnauer 52dc6a5908 LUCENE-3705: IWFlushQueue deadlocks if deletes are flushed and a segment is published at the same time
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233248 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 08:54:33 +00:00
Steven Rowe fb1884358d SOLR-2718: Move CHANGES entry under 3.6.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233199 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 06:31:46 +00:00
Steven Rowe 6a8fbf780e SOLR-3048: Fix incorrect test classpath order under the Maven build
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233106 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 23:03:41 +00:00
Yonik Seeley 4c078d97c9 build: make property for javac args
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233010 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 19:28:39 +00:00
Steven Rowe 61e9317955 LUCENE-3666: fix example consumer code to follow the TokenStream API contract; fix cast-to-interface code to use interface method instead of instance field.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232987 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 18:34:57 +00:00
Robert Muir 98e5328845 LUCENE-3576: add surrogates shuffle for 3.x term vectors, test surrogates in TestBackwardsCompatibility
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232974 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 17:57:05 +00:00
Simon Willnauer 3d66fa922a remove files of dropped segments right after merge
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232943 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 16:10:17 +00:00
Jan Høydahl 374f6fc6c1 SOLR-2970: CSV ResponseWriter returns fields defined as stored=false in schema
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232918 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 14:55:57 +00:00
Steven Rowe c9361a507d LUCENE-3666: Update org.apache.lucene.analysis package summary
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232909 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 14:36:58 +00:00
Simon Willnauer e58eadc95b LUCENE-3697: fix solr testcase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232818 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 10:31:46 +00:00
Koji Sekiguchi 3150fae20d LUCENE-3697: SimpleBoundaryScanner does not work well when highlighting at the beginning of the text
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232769 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 07:34:56 +00:00
Robert Muir ff5b709b3a LUCENE-3576: check that the order of terms is correct in checkindex
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232656 13f79535-47bb-0310-9956-ffa450edef68
2012-01-18 00:01:48 +00:00
Robert Muir d159f25b63 LUCENE-3613: split out 4.0/3.x term vectors implementations
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232652 13f79535-47bb-0310-9956-ffa450edef68
2012-01-17 23:47:14 +00:00
Robert Muir c4f57c6081 LUCENE-3687: remove now-obselete TODO
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232556 13f79535-47bb-0310-9956-ffa450edef68
2012-01-17 19:57:46 +00:00
Robert Muir b11a68218d LUCENE-3701: merge CHANGES entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232532 13f79535-47bb-0310-9956-ffa450edef68
2012-01-17 18:46:03 +00:00