Commit Graph

11342 Commits

Author SHA1 Message Date
Olli Kuonanoja 351e21f620 Fix MLT like text with custom frequencies
When an analyzer with custom term frequencies is used with MLT like
texts, the custom term frequencies are incorrectly omitted and a fixed
frequency of 1 is used instead.

This commit fixes the issue by using `TermFrequencyAttribute` to get
the term frequencies instead of using fixed 1. Also adds test cases
for the mentioned issue.
2019-04-30 12:13:49 -04:00
Andrzej Bialecki 170f5fb7a3 SOLR-13423: Upgrade RRD4j to version 3.5. 2019-04-24 12:33:48 +02:00
Mike McCandless e522147eb1 cosmetic fix in FST.Builder assertion (thanks zhenyuan wei) 2019-04-22 11:52:18 -04:00
Tomoko Uchida 62ed046e52 LUCENE-2562: Preserve map key order for the user interface. 2019-04-22 00:36:41 +09:00
Tomoko Uchida 1204327b56 LUCENE-8738: Force locale to be 'en_US' in javadocs task. 2019-04-21 21:28:14 +09:00
Tomoko Uchida 511efc89fe LUCENE-2562: Use Map#copyOf(), List#copyOf() and Set#copyOf() instead of Collections#unmodifiableMap(), unmodifiableList() and unmodifiableSet() 2019-04-21 18:18:23 +09:00
Uwe Schindler cd0706bd43 Revert previous change to use separate Groovy artifacts: Use groovy-all again (bugs in ivy) 2019-04-20 10:51:05 +02:00
Uwe Schindler a43fa13d11 Revert Groovy update and downgrade Groovy to 2.4.16, as new version is not compatible with Java 13 EA builds! 2019-04-20 02:57:26 +02:00
Uwe Schindler 52090c9b11 Update flexmark to latest version 2019-04-20 01:12:51 +02:00
Uwe Schindler f1911f82d5 LUCENE-8768: Fix Javadocs build in Java 11 2019-04-20 01:01:13 +02:00
Uwe Schindler 77e1bec7dc LUCENE-8738: Add missing dependency for Maven build 2019-04-20 00:51:55 +02:00
Uwe Schindler b8494c8702 LUCENE-8738: Update Groovy to make the warnings with Java 11 a bit more silent (only one warning on first Groovy invocation) 2019-04-20 00:09:56 +02:00
Tomoko Uchida 7ba8bdca8f LUCENE-2562: Suppress SimpleText codec for CommitsImplTest. 2019-04-19 11:07:01 +09:00
Nicholas Knize faa78ad72c LUCENE-8736: Fix Polygon.contains to appropriately check longitude range, and pass correct line segment vertices in EdgeTree 2019-04-18 13:15:07 -05:00
Andrzej Bialecki bd8905150d SOLR-12461: Upgrade Dropwizard Metrics to 4.0.5 release. 2019-04-18 19:08:20 +02:00
Adrien Grand 61d7569f78 LUCENE-7386: Flatten nested disjunctions. 2019-04-18 11:24:50 +02:00
Zhao Yang 48a68365bb Fix OneDimensionBKDWriter valueCount validation
Signed-off-by: Adrien Grand <jpountz@gmail.com>
2019-04-18 11:24:38 +02:00
Adrien Grand 793635eb0e LUCENE-2562: Don't randomly enable term vectors on fields. 2019-04-18 09:27:09 +02:00
Simon Willnauer 3a6f2f7543
Use Map.copyOf in lucene core (#649)
This cuts over several places that create a copy of the supplied map by hand
to use Map.copyOf instead.
2019-04-17 15:26:12 +02:00
Uwe Schindler faaee86efb LUCENE-8738: Move to Java 11 as minimum Java version (merged branch: jira/LUCENE-8738)
Co-authored-by: Adrien Grand <jpountz@apache.org>
2019-04-16 14:00:09 +02:00
Jan Høydahl 181ca4ea7d SOLR-12121: Upgrade jose4j to v0.6.5 which adds Java11 RSA PSS signature support 2019-04-15 22:11:27 +02:00
Simon Willnauer a302be381e
LUCENE-8671: Introduce Reader attributes (#640)
Reader attributes allow per-IndexReader configuration of codec internals.
For instance, this allows configuring per reader whether FSTs are loaded into memory or are left
on disk.
2019-04-15 20:39:36 +02:00
Nicholas Knize 8975ae5954 LUCENE-8736: Fix line detection bug in Polygon2D and Line2D componentRelateTriangle 2019-04-15 10:43:56 -05:00
Uwe Schindler 7830cae571 LUCENE-8752: Add license header to patch file
Revert "LUCENE-8752: Fix precommit error: patch files cannot have a license header" - This reverts commit b60548f6d8.
2019-04-14 02:01:00 +02:00
Uwe Schindler b60548f6d8 LUCENE-8752: Fix precommit error: patch files cannot have a license header 2019-04-14 01:39:49 +02:00
Tomoko Uchida 7619c07d3a LUCENE-8752: Add Japanese new imperial era '令和' (Reiwa) to the dictionary used in JapaneseTokenizer 2019-04-13 21:07:27 +09:00
Tomoko Uchida f85c08224b LUCENE-2562: Luke has no Maven artifacts 2019-04-13 01:23:46 +09:00
Jan Høydahl ef9566e13e LUCENE-8725: Fix precommit (javadoc) 2019-04-12 09:26:46 +02:00
Noble Paul 5f34c49813 LUCENE-8725: Make TermsQuery.SeekingTermSetTermsEnum a top level class and public 2019-04-12 06:22:09 +10:00
Nicholas Knize 55c241d87f LUCENE-8736: Fix LatLonShapePolygonQuery and Polygon2D.contains to correctly include points that fall on the boundary 2019-04-11 09:27:36 -05:00
Uwe Schindler 2ed86c6d54 Fix precommit error 2019-04-10 16:09:19 +02:00
jimczi 81fe814428 LUCENE-8751: Weight#matches now uses the ScorerSupplier to build scorers with a lead cost of 1 (single document). 2019-04-10 12:20:02 +02:00
Tomoko Uchida 3a78c1999a LUCENE-2562: Add Luke as a Lucene module 2019-04-10 19:15:57 +09:00
Simon Willnauer a9503d2e81
LUCENE-8754: Prevent ConcurrentModificationException in SegmentInfo (#637)
In order to prevent ConcurrentModificationException this change makes
an unmodifiable copy on write for all maps in SegmentInfo. MergePolicies
can access these maps without synchronization and cause exceptions if
the maps are modified in the merge thread.
2019-04-10 09:29:22 +02:00
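The copy-on-write idea described in LUCENE-8754 can be illustrated with a minimal sketch. This is not the SegmentInfo code; the class `CopyOnWriteMapSketch`, its `attributes` field and its method names are hypothetical. It only shows the general pattern: writers build a fresh unmodifiable map with `Map.copyOf`, and readers iterate whatever snapshot they obtained without synchronization.

```java
import java.util.HashMap;
import java.util.Map;

public class CopyOnWriteMapSketch {
    // Readers always see an immutable snapshot; volatile publishes each new snapshot.
    private volatile Map<String, String> attributes = Map.of();

    /** Returns the current snapshot; it never changes after it has been handed out. */
    public Map<String, String> getAttributes() {
        return attributes;
    }

    /** Writers copy, modify the copy, then swap in a new unmodifiable map. */
    public synchronized String putAttribute(String key, String value) {
        Map<String, String> copy = new HashMap<>(attributes);
        String previous = copy.put(key, value);
        attributes = Map.copyOf(copy); // rejects null keys and values
        return previous;
    }

    public static void main(String[] args) {
        CopyOnWriteMapSketch info = new CopyOnWriteMapSketch();
        Map<String, String> before = info.getAttributes();
        info.putAttribute("codec", "example");
        // The old snapshot is untouched; only the new one contains the entry.
        System.out.println(before.isEmpty());
        System.out.println(info.getAttributes());
    }
}
```

A thread that is iterating over `before` cannot hit a ConcurrentModificationException, because that map instance is never mutated; the cost is one map copy per write.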
jimczi c1222b57e9 LUCENE-8477: Restore public ctr for FilteredIntervalsSource 2019-04-08 13:29:49 +02:00
jimczi b8829480f8 LUCENE-8701: remove unused import 2019-04-05 17:08:04 +02:00
jimczi 04c33c77bf LUCENE-8701: force the score mode of the child weight to COMPLETE if the score is required 2019-04-05 17:06:57 +02:00
jimczi 926de465ee LUCENE-8701: force the score mode of the child weight to COMPLETE if the score is required 2019-04-05 17:05:40 +02:00
jimczi 4b0c36b17b LUCENE-8701: ToParentBlockJoinQuery now creates a child scorer that disallows skipping over
non-competitive documents if the score of a parent depends on the score of multiple
children (avg, max, min). Additionally the score mode `none` that assigns a constant score to
each parent can terminate the collection of top scores early.
2019-04-05 15:24:09 +02:00
Simon Willnauer 1ec229b604
LUCENE-8671: Expose FST off/on-heap options on Lucene50PostingsFormat (#613)
Before we can expose options to configure this postings format
on a per-reader basis we need to expose the option to load the terms
index FST off or on heap on the postings format. This already allows
changing the default in a per-field postings format if an expert user
wants to change the defaults. This essentially provides the ability to change
defaults globally while still involving some glue code.
2019-04-04 16:59:37 +02:00
Ishan Chattopadhyaya bbf6722e6a Adding 6.6.6 backcompat indexes 2019-04-03 15:23:10 +05:30
Alan Woodward 06a2b73245 Add setMissingValue to sorts from Double/LongValuesSource 2019-04-03 09:42:36 +01:00
erick 58376874aa SOLR-13363: Upgrade to ZooKeeper 3.4.14 2019-04-02 20:18:01 -07:00
Christine Poerschke de13c8e79d LUCENE-8742: add 'missing slop attribute' test coverage for SpanNearBuilder 2019-04-02 19:31:03 +01:00
Alan Woodward 3de0b36719 LUCENE-8730: WordDelimiterGraphFilter always emits its original token first 2019-04-02 08:58:49 +01:00
Erick Erickson e04a28e140 SOLR-12028: BadApple and AwaitsFix annotations usage 2019-04-01 09:27:33 -07:00
jimczi 7e83451610 LUCENE-8732: ConstantScoreQuery can now early terminate the query if the minimum score is greater than the constant score and total hits are not requested 2019-03-27 18:09:23 +01:00
Alan Woodward 1c46dd1f9b LUCENE-8741: Fix ClassCastException in ValueSource.fromDoubleValuesSource() 2019-03-27 16:08:25 +00:00
Alan Woodward 3a63c58db3 LUCENE-8477: Add CHANGES entry 2019-03-27 11:29:20 +00:00
Alan Woodward f1782d0dd1
LUCENE-8477: Automatically rewrite disjunctions when internal gaps matter (#620)
We have a number of IntervalsSource implementations where automatic minimization of
disjunctions can lead to surprising results:

* PHRASE queries can miss matches because a longer matching sub-source is minimized
  away, leaving a gap
* MAXGAPS queries can miss matches for the same reason
* CONTAINING, NOT_CONTAINING, CONTAINED_BY and NOT_CONTAINED_BY queries
  can miss matches if the 'big' interval gets minimized

The proper way to deal with this is to rewrite the queries by pulling disjunctions to the top
of the query tree, so that PHRASE("a", OR(PHRASE("b", "c"), "c")) is rewritten to
OR(PHRASE("a", "b", "c"), PHRASE("a", "c")). To be able to do this generally, we need to
add a new pullUpDisjunctions() method to IntervalsSource that performs this rewriting
for each source that it would apply to.

Because these rewritten queries will in general be less efficient due to the duplication of
effort (eg the rewritten PHRASE query above pulls 5 term iterators rather than 4 in the
original), we also add an option to Intervals.or() that will prevent this happening, so that
consumers can choose speed over accuracy if it suits their usecase.
2019-03-27 11:23:43 +00:00
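The rewrite described in LUCENE-8477 can be sketched on a toy model. This is not the `pullUpDisjunctions()` implementation; the class `PullUpDisjunctionsSketch` and its list-based query representation are assumptions made for illustration. Each phrase position holds a list of alternatives, and pulling the disjunction to the top amounts to expanding every combination of alternatives into its own plain phrase.

```java
import java.util.ArrayList;
import java.util.List;

public class PullUpDisjunctionsSketch {

    /**
     * Each phrase position is a list of alternatives; each alternative is a list
     * of terms (a single term, or a nested phrase flattened to its terms).
     * Returns one plain term sequence per combination of alternatives.
     */
    static List<List<String>> pullUp(List<List<List<String>>> positions) {
        List<List<String>> expanded = new ArrayList<>();
        expanded.add(new ArrayList<>()); // start with one empty phrase
        for (List<List<String>> alternatives : positions) {
            List<List<String>> next = new ArrayList<>();
            for (List<String> prefix : expanded) {
                for (List<String> alternative : alternatives) {
                    List<String> phrase = new ArrayList<>(prefix);
                    phrase.addAll(alternative);
                    next.add(phrase);
                }
            }
            expanded = next;
        }
        return expanded;
    }

    /** Total number of terms across all phrases, i.e. term iterators needed. */
    static int termCount(List<List<String>> phrases) {
        int count = 0;
        for (List<String> phrase : phrases) {
            count += phrase.size();
        }
        return count;
    }

    public static void main(String[] args) {
        // PHRASE("a", OR(PHRASE("b", "c"), "c"))
        List<List<List<String>>> query = List.of(
            List.of(List.of("a")),
            List.of(List.of("b", "c"), List.of("c")));
        List<List<String>> rewritten = pullUp(query);
        System.out.println(rewritten);            // [[a, b, c], [a, c]]
        System.out.println(termCount(rewritten)); // 5
    }
}
```

The output corresponds to OR(PHRASE("a", "b", "c"), PHRASE("a", "c")), and the count of 5 terms against 4 in the original query matches the duplication cost the commit message mentions.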
Simon Willnauer 8d7619f4e6 LUCENE-8735: fix changes entry after backport 2019-03-26 15:57:43 +01:00
Henning Andersen 04afdb6442 LUCENE-8735: Avoid FileAlreadyExistsException on windows. (#619)
FilterDirectory.getPendingDeletions() did not delegate the call, which
resulted in a new IndexWriter on same directory not considering pending
delete files. This could in turn result in a FileAlreadyExistsException
when running on Windows.
2019-03-26 14:56:45 +01:00
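The class of bug fixed in LUCENE-8735 can be shown with a small stand-in. The types below (`Dir`, `BaseDir`, `BrokenFilterDir`, `FixedFilterDir`) are hypothetical and are not Lucene's Directory or FilterDirectory; the file name is made up. The sketch only shows how a wrapper that answers for itself hides state held by the wrapped object.

```java
import java.util.Set;

public class DelegationSketch {
    /** Hypothetical stand-in for a directory abstraction. */
    interface Dir {
        Set<String> pendingDeletions();
    }

    /** A directory that still has a file waiting to be deleted. */
    static class BaseDir implements Dir {
        public Set<String> pendingDeletions() {
            return Set.of("_0.cfs");
        }
    }

    /** Buggy wrapper: reports no pending deletions instead of asking the wrapped directory. */
    static class BrokenFilterDir implements Dir {
        final Dir in;
        BrokenFilterDir(Dir in) { this.in = in; }
        public Set<String> pendingDeletions() { return Set.of(); }
    }

    /** Fixed wrapper: delegates the call. */
    static class FixedFilterDir implements Dir {
        final Dir in;
        FixedFilterDir(Dir in) { this.in = in; }
        public Set<String> pendingDeletions() { return in.pendingDeletions(); }
    }

    public static void main(String[] args) {
        Dir base = new BaseDir();
        // A caller behind the broken wrapper believes the file name is free to reuse.
        System.out.println(new BrokenFilterDir(base).pendingDeletions());
        System.out.println(new FixedFilterDir(base).pendingDeletions());
    }
}
```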
Tommaso Teofili 42a548e28e LUCENE-6687 - avoid unnecessary looping 2019-03-26 11:27:09 +01:00
Kevin Risden 361d1e84ce
SOLR-13112: Upgrade jackson to 2.9.8
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-03-25 10:38:00 -04:00
Kevin Risden 4e230388ba
SOLR-13335: Upgrade to velocity 2.0 and velocity-tools 3.0
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-03-25 09:58:44 -04:00
Simon Willnauer 65e1b3ef2a Add assume to test since it only works with mmap directory 2019-03-21 14:20:23 +01:00
Kevin Risden 6222abf448
SOLR-9079: Remove commons-lang as a dependency
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-03-20 19:37:19 -04:00
Simon Willnauer 14175c46d2
LUCENE-8671: Load FST off-heap if reader is not opened from an index writer (#610)
Today we never load FSTs of ID-like fields off-heap since we need
very fast access for updates. Yet, a reader that is not loaded from
an IndexWriter can also leave the FST on disk. This change adds
this information to SegmentReadState to allow the postings format
to make this decision without configuration.
2019-03-20 11:28:10 +01:00
Uwe Schindler 2a1ed6e484 LUCENE-8729: Workaround to allow compile under JDK13+ 2019-03-19 19:05:15 +01:00
Adrien Grand 07f3535793 LUCENE-8138: Check that dv producers' next/advance and advanceExact impls are consistent. 2019-03-19 10:53:10 +01:00
Adrien Grand 577bef53dd LUCENE-8166: Require merge instances to be consumed in the thread that created them. 2019-03-19 10:51:54 +01:00
Alan Woodward 5ca2524927 LUCENE-8694: Payload-filtered term intervals 2019-03-18 09:41:00 +00:00
Simon Willnauer af5c2b22da Fix more tests to not use concurrent flushing in RIW#commit() 2019-03-15 22:18:20 +01:00
Simon Willnauer ad457d188e Improve RIW exception handling and opt out of concurrent flushing if exception is expected 2019-03-15 11:00:16 +01:00
Adrien Grand 425f207f40 LUCENE-8688: Forced merges merge more than necessary. 2019-03-15 10:27:27 +01:00
Alan Woodward fbd05167f4
LUCENE-3041: QueryVisitor (#581)
This commit adds an introspection API to Query, allowing users to traverse
the nested structure of a query and examine its leaves.  It replaces the existing
`extractTerms` method on Weight, and alters some highlighting code to use
the new API
2019-03-14 15:04:33 +00:00
Simon Willnauer ffb1fc83de
Concurrently flush next buffer during commit in RandomIndexWriter (#607)
This is a spin-off from `LUCENE-8700` that is satisfied by IndexWriter#flushNextBuffer.
The idea here is to additionally call flushNextBuffer in RandomIndexWriter for better
test coverage. This is a test-only change.
2019-03-14 15:43:35 +01:00
jimczi 2afa6cb00a Add 8.0.0 back compat test indexes and reenable backward compatibility tests for 9 2019-03-14 14:00:13 +01:00
Alan Woodward 9096b846af LUCENE-8719: Traverse all paths at the end of a TokenStream in FixedShingleFilter 2019-03-14 11:10:02 +00:00
Alan Woodward 84785e8e45 LUCENE-8726: Changes entry 2019-03-14 10:57:34 +00:00
Alan Woodward d19dcb4ff0 LUCENE-8726: ValueSource.asDoubleValuesSource() could leak a reference to IndexSearcher 2019-03-14 10:17:12 +00:00
Mike McCandless c1bea96cf9 LUCENE-8720: fix int overflow in NameIntCacheLRU 2019-03-12 12:19:33 -04:00
jimczi b2c83de361 LUCENE-8652: remove unused import 2019-03-12 09:56:14 +01:00
Yeongsu Kim b1f870a416 LUCENE-8631: The Korean user dictionary now picks the longest-matching word and discards the other matches. 2019-03-12 09:49:16 +01:00
jimczi c87e7614f1 LUCENE-8652: ensure that the norm doesn't influence the score in TestSynonymQuery#testBoosts 2019-03-12 09:33:40 +01:00
iverase fd8d9d5199 LUCENE-8713: Add Line2D tests 2019-03-12 09:18:54 +01:00
Ignacio Vera 458205396e
LUCENE-8712: Polygon2D does not detect crossings in some cases (#598)
LUCENE-8712: revert crossing logic to use boolean logic and skip lines
over the dateline to support dateline crossing logic
2019-03-12 08:40:54 +01:00
Erick Erickson b893548d97 SOLR-13268: Clean up any test failures resulting from defaulting to async logging. Put TestXmlQParser back 2019-03-11 21:40:23 -07:00
Erick Erickson 3a1603dab3 SOLR-13268: Clean up any test failures resulting from defaulting to async logging. Fix precommit test to not include the baseDir 2019-03-11 11:03:31 -07:00
jimczi d8cfeba912 LUCENE-8652: SynonymQuery can now deboost the document frequency of each term when blending the score of the synonym 2019-03-11 10:20:11 +01:00
erick 9272c29539 SOLR-13268: Clean up any test failures resulting from defaulting to async logging 2019-03-10 11:05:00 -07:00
Ishan Chattopadhyaya aeb0774b38 Adding 7.7.1 version 2019-03-06 01:00:20 +05:30
Adrien Grand a940c40b18 Add 7.7.1 back compat test indexes 2019-03-04 10:51:32 +01:00
Alan Woodward 7ad0ac0191 LUCENE-8714: Don't use NoMergePolicy in norms tests
This can cause spurious failures when run in conjunction with HandleLimitFS,
as we can end up with lots of very small segments which trips the file handles
limit
2019-03-01 14:47:54 +00:00
iverase d381b73999 LUCENE-8709: Set pointWriter to null when size of the HeapPointWriter is 0 2019-03-01 10:44:07 +01:00
iverase 15dd4a1738 LUCENE-8703: Build point writers in the BKD tree only when they are needed 2019-03-01 07:41:16 +01:00
iverase 5b2f064ba7 LUCENE-8709: Handle case of creating a HeapPointWriter with size equal 0 2019-02-28 10:41:06 +01:00
Karl Wright 78a57f75cb Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr 2019-02-27 02:28:50 -05:00
Karl Wright ff799ac03d LUCENE-8696: Rework how endpoint circles are represented to allow for consistency on WGS84. 2019-02-27 02:28:33 -05:00
Mike McCandless 1ad45a7974 fix typo in javadocs 2019-02-26 18:04:24 -05:00
Karl Wright 149469ddbc LUCENE-8696: Refactor, in preparation for creating a new SegmentEndpoint implementation to fix the problem. 2019-02-26 08:46:17 -05:00
Karl Wright 8c34da8a62 LUCENE-8696: Fix precommit objections 2019-02-25 09:46:53 -05:00
Karl Wright 303d11921f Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr 2019-02-25 04:06:09 -05:00
Karl Wright 9bf2c8cb93 LUCENE-8696: Update test to be what's actually failing 2019-02-25 04:05:56 -05:00
Alan Woodward edc73ee77c LUCENE-8702: Simplify some Intervals factory methods 2019-02-25 08:58:46 +00:00
Karl Wright c599ad9802 Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr 2019-02-25 03:34:05 -05:00
Karl Wright 3754959797 LUCENE-8696: Come up with a simple test case that exercises the random failure 2019-02-25 03:33:35 -05:00
erick fe5a96a284 SOLR-13268: Clean up any test failures resulting from SOLR-12055 (async logging). Kevin's upgrades 2019-02-24 09:50:04 -08:00
iverase 9b8a4a9e6e LUCENE-8699: Add lucene internal tag to PointValue interface
and fix some typos
2019-02-22 11:04:46 +01:00
iverase f5a4159d75 LUCENE-8704: Use the HeapPointWriter to spill offline as BKDWriter does 2019-02-21 12:26:37 +01:00
jimczi 4625270ce0 LUCENE-8686: Fix test bug in TestTaxonomyFacetSumValueSource#testRandom 2019-02-21 16:36:26 +09:00
iverase 3ef6e015dd LUCENE-8699: Change HeapPointWriter to use a single byte array instead of a list
of byte arrays. In addition a new interface PointValue is added to abstract out
the different formats between offline and on-heap writers.
2019-02-20 12:14:58 +01:00
Kevin Risden af3ff118ef
SOLR-11763: Upgrade Guava to 25.1-jre (Markus Jelsma, Kevin Risden)
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-02-19 17:27:58 -05:00
Mike McCandless ec801b4c54 LUCENE-8635: add option to move FSTs off-heap, and do so for the FST terms index in the default codec for non-primary-key fields if MMapDirectory is being used 2019-02-19 12:52:22 -05:00
iverase 6f61bdea0e LUCENE-8685: Refactor LatLonShape tests 2019-02-19 16:58:21 +01:00
Alan Woodward 55b4d2dcaa LUCENE-8697: GraphTokenStreamFiniteStrings correctly handles side paths with gaps 2019-02-19 13:56:17 +00:00
Simon Willnauer 4a513fa99f
LUCENE-8292: Make TermsEnum fully abstract (#574) 2019-02-15 17:32:55 +01:00
Alan Woodward 10acaf02f8 SOLR-13126: Correctly combine multiplicative query boosts 2019-02-15 10:09:26 +00:00
Bruno P. Kinoshita 5310152450 Fix some spell check issues 2019-02-12 21:35:30 -08:00
jimczi 4e34a0cb41 Add 7.7.0 back compat test indexes 2019-02-12 15:11:59 +01:00
iverase 9db39ab0ca LUCENE-8687: Optimise radix partitioning for points on heap 2019-02-11 08:11:23 +01:00
yyuan2 a3a4ecd80b LUCENE-8662: Change TermsEnum.seekExact(BytesRef) to abstract 2019-02-08 15:10:38 -08:00
iverase 56007af4a4 LUCENE-8680: Add CHANGES.txt entry 2019-02-08 16:26:22 +01:00
iverase 06c1ebc09e LUCENE-8680: Refactor EdgeTree#relateTriangle method 2019-02-08 16:19:38 +01:00
Kevin Risden 475f2fd36b
SOLR-9515 - Remove pom dependency kerby-kerb
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-02-07 11:38:18 -05:00
iverase 5d1d6448b9 LUCENE-8673: Use radix partitioning when merging dimensional points instead of sorting all dimensions before hand. 2019-02-07 08:12:13 +01:00
Alan Woodward 1b077cf0e3 LUCENE-8655: Add .getSource() method to FunctionScoreQuery 2019-02-05 09:46:15 +00:00
markrmiller 4a3ddc94d8
SOLR-9515: Update to Hadoop 3 (Mark Miller, Kevin Risden)
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-02-01 15:04:56 -05:00
jimczi e9c02a6f71 LUCENE-8676: The Korean tokenizer does not update the last position if the backtrace is caused by a big buffer (1024 chars). 2019-02-01 11:37:31 +01:00
iverase fdb6353539 LUCENE-8679: return WITHIN in EdgeTree#relateTriangle only when polygon and triangle share one edge 2019-02-01 11:23:31 +01:00
Kevin Risden e4f202c1e3
Revert "SOLR-9515: Update to Hadoop 3"
This reverts commit 6bb24673f4.
2019-01-31 19:46:02 -05:00
Nicholas Knize edb05314b3 LUCENE-8669: Fix LatLonShape WITHIN queries that fail with Multiple search Polygons that share the dateline. 2019-01-31 16:19:42 -06:00
markrmiller 6bb24673f4
SOLR-9515: Update to Hadoop 3
Signed-off-by: Kevin Risden <krisden@apache.org>
2019-01-31 13:00:20 -05:00
Luca Cavanna 57f693bfc2 LUCENE-8664: Add equals/hashcode methods to TotalHits class
Adding equals/hashcode to TotalHits is convenient for quick comparisons, especially for users that may use it as part of bigger objects that have their own equals/hashcode.
2019-01-30 13:11:28 +01:00
Dawid Weiss b802a52685 LUCENE-8663: NRTCachingDirectory.slowFileExists may open a file while it's inaccessible 2019-01-30 09:44:32 +01:00
jimczi cf39708609 Add next minor release version 8.1.0 2019-01-29 14:12:38 +01:00
jimczi a269a4d1cb LUCENE-8660: TopDocsCollectors now return an accurate count (instead of a lower bound) if the total hit count is equal to the provided threshold. 2019-01-29 08:58:09 +01:00
Dawid Weiss d7dc53ff7c LUCENE-8474: Remove deprecated RAMDirectory. 2019-01-28 13:49:03 +01:00
Alan Woodward 7713a4f245 LUCENE-8650: Fix end() and reset() in ConcatenatingTokenStream 2019-01-28 09:00:01 +00:00
Mikhail Khludnev f543b4e1f4 LUCENE-8640: fixing DateRangePrefixTreeTest, accepting 0..59 minutes and seconds. 2019-01-28 11:14:34 +03:00
Mikhail Khludnev 8e69d12dd7 LUCENE-8640: date range syntax validation 2019-01-27 21:45:28 +03:00
Tommaso Teofili 000785e68e LUCENE-8659 - updated sha1 for OpenNLP dependency 2019-01-26 23:32:59 +01:00
Tommaso Teofili 48073a9778 LUCENE-8659 - upgrade Lucene/Solr to use OpenNLP 1.9.1 2019-01-26 13:11:30 +01:00
Adrien Grand 23bd97bae7 Refactor IndexedDISI to avoid method call when advancing by small gaps. 2019-01-25 10:21:48 +01:00
Adrien Grand ef47582fd5 LUCENE-8658: Fix illegal assertion in WANDScorer. 2019-01-25 07:21:16 +01:00
Tommaso Teofili c32e9b6c0a Merge branch 'master' of github.com:apache/lucene-solr into LUCENE-5698 2019-01-24 10:25:10 +01:00
Tommaso Teofili 33896dde5a LUCENE-5698 - minor adjustments 2019-01-24 10:24:52 +01:00
Jan Høydahl ea2c8ba38e SOLR-12121: JWT Token authentication plugin with OpenID Connect implicit flow login through Admin UI 2019-01-23 13:03:01 +01:00
iverase 893f5cda57 LUCENE-8654: Polygon2D#relateTriangle returns the wrong answer if polygon is inside the triangle 2019-01-23 08:25:50 +01:00
Uwe Schindler 452ffa3626 LUCENE-8340: Fix typo in CHANGES.txt 2019-01-22 15:54:40 +01:00
Alan Woodward 7d7ab14776 LUCENE-8646: Multi-term intervals 2019-01-22 13:36:02 +00:00
Alan Woodward 87d68c8253 LUCENE-8645: Intervals.fixField() 2019-01-22 13:35:15 +00:00
iverase 01dfe7bf4b LUCENE-8649: LatLonShape's within and disjoint queries can return false positives with indexed multi-shapes 2019-01-21 16:00:28 +01:00
Tommaso Teofili 6df32fbc35 LUCENE-5698 - forbidden APIs related fixes 2019-01-20 09:24:25 +01:00
Tommaso Teofili e32170db83 Merge branch 'master' of github.com:apache/lucene-solr into LUCENE-5698 2019-01-20 09:24:02 +01:00
Erick Erickson 73d1b07f8e LUCENE-8585: fix precommit failure 2019-01-18 18:37:52 -08:00
Toke Eskildsen c13645bd4c LUCENE-8585: Create jump-tables for DocValues at index-time 2019-01-18 22:42:04 +01:00
Tommaso Teofili 552c367c5d LUCENE-5698 - added test for 20n dataset, minor code adjustments 2019-01-18 21:10:33 +01:00
Dawid Weiss efef89adc6 LUCENE-8642: RamUsageTester.sizeOf ignores arrays and collections if --illegal-access=deny. 2019-01-18 11:55:53 +01:00
David Smiley 70dd3ee06a LUCENE-8581: Unreference LatLonPoint.BYTES from LatLonShape & Rectangle2D. 2019-01-17 09:00:18 -05:00
Dawid Weiss f2352e9456 Revert "LUCENE-8642, LUCENE-8641: correct RamUsageTester.sizeOf's handling of ByteBuffers. Throw exceptions on denied reflection to catch problems early. This affects tests only."
This reverts commit a16f0833ed.
2019-01-17 13:05:36 +01:00
Dawid Weiss a16f0833ed LUCENE-8642, LUCENE-8641: correct RamUsageTester.sizeOf's handling of ByteBuffers. Throw exceptions on denied reflection to catch problems early. This affects tests only. 2019-01-17 12:23:30 +01:00
Dawid Weiss 9926b45597 LUCENE-8643: TestLatLonLineShapeQueries.testRandomBig fails with suite
timeout (simple text codec used, decreased iteration counts).
2019-01-17 10:10:11 +01:00
Simon Willnauer e35adf6824
LUCENE-8639: Prevent new threadstates from being created while we cut over to a new delete queue (#535)
This prevents an edge case where suddenly a lot of threads start indexing
while we carry over sequence ids from the previous to the new delete queue.
We now lock creation of new thread states for a very short time until we have created and assigned
a new delete queue.
2019-01-16 16:37:49 +01:00
Alan Woodward a826649241 LUCENE-8633: Remove term weighting from IntervalQuery scores 2019-01-16 13:59:16 +00:00
Dawid Weiss 09778b2133 LUCENE-8641: add awaits fix to stop jenkins from creating subsequent alerts for this problem. 2019-01-16 11:20:11 +01:00
Dawid Weiss d4e016afdf LUCENE-8474: (partial) removal of accesses to RAMFile and RAMDirectory streams. Removal of GrowableByteArrayDataOutput. 2019-01-15 13:42:25 +01:00
Dawid Weiss f083473b89 LUCENE-8636: follow-up speedups and cleanups. 2019-01-15 12:02:15 +01:00
Dawid Weiss d965b3547e LUCENE-8636: TestPointQueries times out on nightly (decreased big range to 50k, excluded simple text codec). 2019-01-14 14:12:45 +01:00
iverase 8762b071bb LUCENE-8623: Decrease I/O pressure when merging high dimensional points 2019-01-14 07:56:06 +01:00
Steve Rowe 283b19a8da LUCENE-8527: Upgrade JFlex to 1.7.0. StandardTokenizer and UAX29URLEmailTokenizer now support Unicode 9.0, and provide UTS#51 v11.0 Emoji tokenization with the '<EMOJI>' token type. 2019-01-08 13:33:49 -05:00
Alan Woodward e015afadaa LUCENE-8622: Minimum-should-match interval function 2019-01-07 14:08:40 +00:00
Alan Woodward 7d34bfdaad LUCENE-8629: New interval functions: before, after, within, overlapping 2019-01-07 13:33:59 +00:00
Alan Woodward 906f94a8cc Remove unnecessary back compatibility 2019-01-07 13:20:01 +00:00
Alan Woodward 04f48dba8b Move to version 9.0.0 2019-01-07 13:04:01 +00:00
Mike McCandless 63dfba4c7d LUCENE-8601: attributes added to IndexableFieldType during indexing will now be preserved in the index and accessible at search time via FieldInfo attributes 2019-01-03 18:44:41 -05:00
Christine Poerschke 63a6c250d7 Update copyright year(s) in lucene/NOTICE.txt and solr/NOTICE.txt files. 2019-01-03 18:46:30 +00:00
Jim Ferenczi 7c70365811 LUCENE-8627: Fix SearchAfter#testQueries to always count the number of hits accurately. 2019-01-02 09:37:10 +01:00
Alan Woodward 2532a5d31c LUCENE-8612: Add Intervals.extend() 2019-01-01 17:55:19 +00:00
Dawid Weiss c9c3ef0ef9 LUCENE-8625: int overflow in ByteBuffersDataInput.sliceBufferList 2018-12-28 12:10:48 +01:00
Dawid Weiss 45c50a0273 LUCENE-8624: int overflow in ByteBuffersDataOutput.size(). 2018-12-28 10:55:25 +01:00
Steve Rowe 86bc028f0d LUCENE-8611: Fix Maven test compilation by making lucene-test-framework's dependencies transitive for all modules that depend on it, by not writing out the <dependencyManagement>/<exclusions> section for it in the grandparent POM. 2018-12-20 09:30:16 -05:00
Dawid Weiss f28c5bec9b LUCENE-8604: TestRuleLimitSysouts now has an optional "hard limit" of bytes that can be written to stderr and stdout (anything beyond the hard limit is ignored). The default hard limit is 2 GB of logs per test class. 2018-12-18 22:03:44 +01:00
Adrien Grand dcd4a288b4 LUCENE-8600: Use a faster sort in DocValuesFieldUpdates. 2018-12-18 17:45:17 +01:00
iverase d185ba99de LUCENE-8581: Change LatLonShape encoding to use 4 bytes Per Dimension 2018-12-18 16:50:24 +01:00
Cao Manh Dat f80e8e1167 Merge jira/http2 branch to master 2018-12-16 16:58:20 +00:00
Alan Woodward dae3e304a1 LUCENE-8597: TestIntervals.assertGaps() needs to take docBase into account 2018-12-16 11:26:40 +00:00
Jim Ferenczi 53b05c5e7e LUCENE-8603 Fix the inversion of right ids for additional nouns in the Korean user dictionary 2018-12-16 15:03:12 +07:00
Dawid Weiss e916f1fb86 LUCENE-8611: Update randomizedtesting to 2.7.2, JUnit to 4.12, add hamcrest-core dependency. 2018-12-15 09:49:36 +01:00
Simon Willnauer 5c5c42cc37 LUCENE-8609: Remove deprecated IW#numDocs() and IW#maxDoc() methods 2018-12-14 19:36:25 +01:00
Simon Willnauer e974311d91 LUCENE-8609: Allow getting consistent docstats from IndexWriter
Today we have #numDocs() and #maxDoc() on IndexWriter. This is enough
to get all stats for the current index but it's subject to concurrency
and might return numbers that are not consistent, i.e. some cases can
return maxDoc < numDocs which is undesirable. This change adds a getDocStats()
method to index writer to allow fetching consistent numbers for these stats.

This change also deprecates IndexWriter#numDocs() and IndexWriter#maxDoc()
and replaces all their usages with IndexWriter#getDocStats()
2018-12-14 19:36:25 +01:00
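The consistency problem described in LUCENE-8609 can be sketched with a toy writer. This is not IndexWriter; the class `DocStatsSketch` and its counters are hypothetical. The point is that reading two counters in two separate calls can interleave with updates, whereas returning both values in one immutable snapshot taken under a single lock cannot.

```java
public class DocStatsSketch {
    /** Immutable pair read in one step, so numDocs <= maxDoc holds for every snapshot. */
    public static final class DocStats {
        public final int maxDoc;
        public final int numDocs;

        DocStats(int maxDoc, int numDocs) {
            this.maxDoc = maxDoc;
            this.numDocs = numDocs;
        }
    }

    private int maxDoc;   // all documents ever added, including deleted ones
    private int deleted;  // documents marked as deleted

    public synchronized void addDocument() {
        maxDoc++;
    }

    public synchronized void deleteDocument() {
        if (deleted < maxDoc) {
            deleted++;
        }
    }

    /** Both numbers are computed under the same lock, so they describe one point in time. */
    public synchronized DocStats getDocStats() {
        return new DocStats(maxDoc, maxDoc - deleted);
    }

    public static void main(String[] args) {
        DocStatsSketch writer = new DocStatsSketch();
        writer.addDocument();
        writer.addDocument();
        writer.deleteDocument();
        DocStats stats = writer.getDocStats();
        System.out.println(stats.maxDoc + " " + stats.numDocs);
    }
}
```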
Nicholas Knize d3ee925bdd Add 7.6.0 back compat test indexes 2018-12-13 21:50:53 -06:00
Simon Willnauer 4272c631ca LUCENE-8608: Extract utility class to iterate over terms docs
Today we re-implement the same algorithm in various places
when we want to consume all docs for a set/list of terms. This
caused serious slowdowns for instance in the case of applying
updates fixed in LUCENE-8602. This change extracts the common
usage and shares the iteration code including logic to reuse
Terms and PostingsEnum instances as much as possible and adds
tests for it.
2018-12-13 15:30:34 +01:00
Alan Woodward fa025e1f78 LUCENE-8607: MatchAllDocsQuery can skip counting hits 2018-12-13 09:01:49 +00:00
iverase ce9a8012c0 LUCENE-8605: Separate bounding box spatial logic from query logic on LatLonShapeBoundingBoxQuery 2018-12-12 13:46:35 +01:00
Adrien Grand 55993ecb9b Fix test failure when floorSegmentMB > maxMergedSegmentMB. 2018-12-12 13:21:06 +01:00
Adrien Grand 21dc237248 Revert "test"
This reverts commit 783f01bab3e096f4f843b9de62af25d6fbe07340.
2018-12-12 13:20:37 +01:00
Adrien Grand adf73ff6bd test 2018-12-12 13:20:37 +01:00
Simon Willnauer 61e448666d LUCENE-8599: Add hasNoValue bitset to ramBytesUsed calculation 2018-12-11 22:36:12 +01:00
Karl Wright 6587f640d2 Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr 2018-12-11 13:48:44 -05:00
Karl Wright 2edf5adedc LUCENE-8587: Fix the equals method of GeoComplexPolygon 2018-12-11 13:48:34 -05:00
Simon Willnauer d08e2d49d3 LUCENE-8602: Share TermsEnum if possible while applying DV updates
Today we pull a new terms enum when we apply DV updates even though the
field stays the same which is the common case. Benchmarking this on a
larger term dictionary with a significant number of updates shows a
2x improvement in performance.
2018-12-11 19:14:16 +01:00
Karl Wright 874937aba8 Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/lucene-solr 2018-12-11 09:30:17 -05:00
Karl Wright da62c73282 LUCENE-8587: Change GeoPoint serialization to make serialization/unserialization non-lossy 2018-12-11 09:30:01 -05:00
Toke Eskildsen 8a20705b82 Revert "LUCENE-8374 part 1/4: Reduce reads for sparse DocValues".
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.

This reverts commit 58a7a8ada5.
2018-12-11 14:17:57 +01:00
Toke Eskildsen 1da6d39b41 Revert "LUCENE-8374 part 2/4: Reduce reads for sparse DocValues".
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.

This reverts commit 7ad027627a.
2018-12-11 14:14:22 +01:00
Toke Eskildsen 6c5d87a505 Revert "LUCENE-8374 part 3/4: Reduce reads for sparse DocValues".
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.

This reverts commit 7949b98f80.
2018-12-11 14:14:07 +01:00
Toke Eskildsen 3158d0c485 Revert "LUCENE-8374 part 4/4: Reduce reads for sparse DocValues".
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.

This reverts commit e356d793ca.
2018-12-11 14:13:37 +01:00
Toke Eskildsen 870bb11cc8 Revert "Pre-commit fixes for LUCENE-8374 (JavaDoc + arguments)".
LUCENE-8374 was committed without consensus and is expected to be superseded by LUCENE-8585.

This reverts commit 6c11161111.
2018-12-11 14:13:17 +01:00
Noble Paul c651cbfd9a SOLR-12885: BinaryResponseWriter (javabin format) should directly copy from BytesRef to output 2018-12-11 18:35:57 +11:00