Commit Graph

12298 Commits

Author SHA1 Message Date
Doron Cohen 0aad3d64ad LUCENE-3718: fix Lucene40PostingsReader.linearScan() to set its doc state also when refill()ing.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235190 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 10:00:47 +00:00
Robert Muir 35a73d5f55 LUCENE-3717: fix broken offsets in ngramtokenizers, and check return value of Reader.read
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235187 13f79535-47bb-0310-9956-ffa450edef68
2012-01-24 09:50:21 +00:00
Uwe Schindler b3a23daca8 LUCENE-2858: Reverse merged revision(s) 1-0 from lucene/dev/trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1235028 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 22:15:15 +00:00
Robert Muir b1da6f5041 LUCENE-3706: add offsets into lucene40 postings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235022 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 22:03:08 +00:00
Steven Rowe edd8b91895 LUCENE-3690: Maven configuration: Added test resources solr/core/src/test/**/* (except *.java) to the solr-core module, to allow LegacyHTMLStripCharFilterTest to find its test resource htmlStripReaderTest.html. (Jenkins builds were failing because of this.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234884 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 16:27:23 +00:00
Steven Rowe 9157338066 LUCENE-3690: Added info about changes in HTMLStripCharFilter surrogate handling to solr/CHANGES.txt.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234867 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 15:56:06 +00:00
Robert Muir 7fafdd3576 LUCENE-3717: add checkRandomData to more analyzers and fix more offsets bugs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234850 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 15:19:58 +00:00
Uwe Schindler 66f17c3bab LUCENE-2858, LUCENE-3716: Move all commit/version/indexExists/reopen/open stuff to DirectoryReader, IR/AIR/CIR is now only an abstract index to search on, nothing more. In the future we might create an interface Reopenable that has all methods. For now SearcherManager & Co. simply check for instanceof DirectoryReader
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1234719 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 09:55:26 +00:00
Steven Rowe 059410d424 LUCENE-3690: fix handling of unpaired numeric character entity UTF-16 surrogates to output U+FFFD REPLACEMENT CHARACTER; and add handling of properly paired numeric character entity UTF-16 surrogates, to output the corresponding pair of code units.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234687 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 07:36:38 +00:00
Robert Muir c754c1c9c8 LUCENE-3717: add better offsets testing to BaseTokenStreamTestCase, fix offsets bugs in ThaiWordFilter and ICUTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234652 13f79535-47bb-0310-9956-ffa450edef68
2012-01-23 00:08:52 +00:00
Uwe Schindler f7a474d603 LUCENE-3671: Fix wrong issue number!
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234598 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 20:35:22 +00:00
Uwe Schindler 6b5ef1dead SOLR-3054, LUCENE-3671: Remove useless assureMatchVersion check
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234579 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 18:57:24 +00:00
Uwe Schindler 366e1a2cd2 SOLR-3054, LUCENE-3671: Add TypeTokenFilterFactory that creates TypeTokenFilter that filters tokens based on their TypeAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234573 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 18:50:10 +00:00
Robert Muir a7cfee6b07 SOLR-2891: fix CompoundWordTokenFilter to not create invalid offsets when the length of the text was changed by a previous filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234546 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 16:41:06 +00:00
Uwe Schindler d986e6442b LUCENE-2858: Reverse merged revision(s) 1-0 from lucene/dev/trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1234501 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 12:25:22 +00:00
Uwe Schindler 005fe75a48 LUCENE-2858: Move docFreq methods back to abstract IndexReader, as its used quite often (and reenable asserts). Move merge for FieldInfos to MultiFields where it is more useful
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1234500 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 12:20:19 +00:00
Uwe Schindler 02ae8b97bc LUCENE-2858: Restructure/move ReaderContexts
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1234484 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 10:59:10 +00:00
Steven Rowe f3a363708f LUCENE-3690: Re-implemented HTMLStripCharFilter as a JFlex-generated scanner. Fixes LUCENE-2208, SOLR-882, and SOLR-42.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234452 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 05:20:46 +00:00
Shai Erera 17fe719bb5 LUCENE-3703: Calling DirectoryTaxonomyReader.refresh() could mess up reference counting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234451 13f79535-47bb-0310-9956-ffa450edef68
2012-01-22 05:14:45 +00:00
Uwe Schindler 42b2c86116 LUCENE-2858: First steps, still incomplete. No tests pass or even compile, but core code now compiles. See issue!
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1234441 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 23:22:50 +00:00
Uwe Schindler 0e64939f81 Create a branch for LUCENE-2858
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene2858@1234440 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 23:18:26 +00:00
Simon Willnauer acf9e1333d LUCENE-3709: properly release primary output lock if wrapper creation fails
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234398 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 19:10:41 +00:00
Uwe Schindler af9b4d816f LUCENE-3671: Add TypeTokenFilter that filters tokens based on their TypeAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234396 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 19:02:44 +00:00
Robert Muir f6c6f5dd4c LUCENE-3661: remove duplciate delete-file-handling
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1234357 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 16:18:11 +00:00
Uwe Schindler 9b86beea98 Remove obsolete javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234318 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 12:50:25 +00:00
Steven Rowe b383f4a044 Added a junit test scope dependency in the grandparent POM to work around a Maven 2.2.X bug in which transitive test dependency resolution omits junit as duplicate everywhere it occurs, resulting in the absence of the junit jar from the test classpath, and then test compilation failure. This manifested only in Solr's clustering contrib, but I'm putting it in the grandfather POM to insure it doesn't happen elsewhere. Under Maven 3.0.4, this behavior no longer occurs, so when we upgrade the minimum Maven version to 3.0.4+, the junit test scope dependency in the grandfather POM can be removed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234219 13f79535-47bb-0310-9956-ffa450edef68
2012-01-21 00:31:03 +00:00
Steven Rowe e5e4434bdb Skip compile:test-compile under Maven 3
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234197 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 23:05:12 +00:00
Robert Muir ac9bb797e1 LUCENE-3661: remove unnecessary method
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1234056 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 18:36:14 +00:00
Robert Muir 7a3542f16a LUCENE-3661: clean up how we handle the case of files outside of CFS
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1234051 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 18:10:22 +00:00
Simon Willnauer 315b6ab63d LUCENE-3709: only load norms if they are present in at least on Field
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234013 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 17:19:41 +00:00
Robert Muir aadd4725cc LUCENE-3661: get tests passing with simpletext
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1234002 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 16:55:47 +00:00
Robert Muir d95b8943c6 LUCENE-3661: hack test for simpletext
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233994 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 16:40:02 +00:00
Robert Muir 0a09551f34 LUCENE-3661: hack test for simpletext case
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233981 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 16:15:29 +00:00
Robert Muir af670f2025 LUCENE-3661: simpletext deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233975 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 16:00:35 +00:00
Robert Muir 52ab0d610e LUCENE-3661: clean this up to use IOUtils method
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233945 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 15:14:14 +00:00
Robert Muir f6b18248a9 LUCENE-3661: nuke obselete nocommit, thanks Mike
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233935 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 14:47:42 +00:00
Michael McCandless 5c920cf73a LUCENE-3661: cut back to Bits (from MutableBits) in some places in IW
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233931 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 14:35:01 +00:00
Martijn van Groningen ac8befe5c3 LUCENE-3602: Fixed some typos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233918 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 14:15:42 +00:00
Robert Muir 6117558c44 LUCENE-3661: remove .del from IndexFileNames
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233726 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 02:18:59 +00:00
Robert Muir 2dee41b88e LUCENE-3661: remove SI.getDelFileName
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233709 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 01:46:07 +00:00
Yonik Seeley 53270bdb9c LUCENE-3711: add CHANGES entry
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233708 13f79535-47bb-0310-9956-ffa450edef68
2012-01-20 01:41:54 +00:00
Michael McCandless 5f33d8d2ae use singletons in FST outputs; add 2 commented out test cases showing non-minimality
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233696 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 23:54:55 +00:00
Yonik Seeley 4fdebf90e5 SentinelIntSet can hang by not resizing really small sets early enough, add random test
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233583 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 21:38:40 +00:00
Robert Muir c2ad31a702 LUCENE-3661: generalize from bitvector->livedocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233532 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 18:57:35 +00:00
Robert Muir 2227b6e668 LUCENE-3661: remove bitvector from string
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233531 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 18:54:14 +00:00
Robert Muir cfdd1f1c2d LUCENE-3661: remove bitvector from segmentreader
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233530 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 18:52:27 +00:00
Robert Muir ab52c0d7ef if classpath is wrong for preflex-rw, throw an exception up front
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233520 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 18:33:56 +00:00
Robert Muir 413823ceaf LUCENE-3661: bitvector->mutablebits in some strings/javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233508 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 18:16:13 +00:00
Robert Muir e2a4b86260 LUCENE-3661: bitvector->mutablebits in indexwriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233507 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 18:10:51 +00:00
Robert Muir da3dbb0e0c move some bitvector -> mutablebits
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/lucene3661@1233498 13f79535-47bb-0310-9956-ffa450edef68
2012-01-19 17:41:14 +00:00