Commit Graph

34445 Commits

Author SHA1 Message Date
Robert Muir 52f581e351
LUCENE-9605: update snowball to d8cf01ddf37a, adds Yiddish (#2077) 2020-11-14 09:27:08 -05:00
Erick Erickson 93ecd0fa0a SOLR-14986: Add warning to ref guide that using 'properties.name' is an expert option 2020-11-14 09:04:59 -05:00
Michael Sokolov 03c1910bff LUCENE-9004: CHANGES.txt entry 2020-11-13 09:05:17 -05:00
Michael Sokolov b36b4af22b
LUCENE-9004: KNN vector search using NSW graphs (#2022) 2020-11-13 08:53:51 -05:00
Dawid Weiss 80a0154d57 LUCENE-9608: add a hand-triggered test error class. 2020-11-13 14:49:08 +01:00
Tomoko Uchida 8bac4e7f74 LUCENE-9499: javadoc split package workaroud should be applied only to test-framework. 2020-11-13 21:31:37 +09:00
Uwe Schindler af47cb7bcd LUCENE-9600: Fix wrong link 2020-11-13 00:28:15 +01:00
Gautam Worah 3f8f84f9b0
LUCENE-9450 Switch to BinaryDocValues instead of stored fields in Lucene's facet implementation, yielding ~4-5% red-line QPS gain in pure faceting benchmarks (#1733) 2020-11-12 17:13:31 -05:00
Adrien Grand 06877b2c6e
LUCENE-9378: Make it possible to configure how to trade speed for compression on doc values. (#2069)
This adds a switch to `Lucene80DocValuesFormat` which allows to
configure whether to prioritize retrieval speed over compression ratio
or the other way around. When prioritizing retrieval speed, binary doc
values are written using the exact same format as before more aggressive
compression got introduced.
2020-11-12 16:10:00 +01:00
Shintaro Murakami d1297e52d9
Remove redundant fieldType.stored() check during indexing (#2076) 2020-11-12 08:51:53 -05:00
Christine Poerschke 2f02040a4c SOLR-14983: Fix response returning original score instead of reranked score due to query and filter combining.
(Krishan Goyal, Jason Baik, Christine Poerschke)
2020-11-12 12:51:21 +00:00
Mike Drob 66e285e7ae
SOLR-14995 Update Jetty to latest version 2020-11-11 13:27:49 -08:00
Jason Gerlowski a7197ac0ce
SOLR-14971: Handle atomic-removes on uncommitted docs (#2056)
Docs fetched from the update log via RTG look different than docs
fetched from commits in the index: the types of
field-values may be different between the two, etc.

This is a problem for atomic add/remove of field values, where matching
existing values has historically been done by object equals() calls (via
Collection operations).  This relies on equality checks which don't have
flexible enough semantics to match values across these different types.
(For example, `new Long(1).equals(new Integer(1))` returns `false`).
This was causing some add-distinct and remove operations on
uncommitted values to silently fail to remove field values.

This commit patches over this by converting between types in the more
common cases before using the fallback behavior.
2020-11-11 12:28:11 -05:00
Bruno Roustant 91ee53d418
SOLR-14975: Add entry in CHANGES.txt 2020-11-11 11:52:30 +01:00
Bruno Roustant 67f9245ce3
SOLR-14975: Optimize CoreContainer.getAllCoreNames and getLoadedCoreNames.
Also optimize getCoreDescriptors.
2020-11-11 11:37:45 +01:00
Mayya Sharipova 5897d14fe4
LUCENE-9594 Add linear function for FeatureField
This adds a linear function and newLinearQuery for FeatureField
2020-11-10 17:08:08 -05:00
Houston Putman d65041359e
SOLR-14949: Adding githubUrl option for docker build. (#2074) 2020-11-10 13:31:33 -05:00
Andrzej Bialecki 863a388fe7 SOLR-14683: Move the CHANGES.txt entry to the right place. Fix wrong type of null value. 2020-11-10 17:58:45 +01:00
Houston Putman 212b0f8657
SOLR-14949: Ability to customize Solr Docker build (#2020)
Also added a gradlew helpDocker page.
2020-11-10 10:42:38 -05:00
Tomoko Uchida 426a9c25c2
LUCENE-9499: migrate package.html files into package-info.java (#2072) 2020-11-10 23:57:09 +09:00
Tomoko Uchida d1110394e9
LUCENE-9600: Clean up package name conflicts between misc and core modules (#2064) 2020-11-10 22:24:48 +09:00
Jim Ferenczi 36f6359fe4
LUCENE-9023: GlobalOrdinalsWithScore should not compute occurrences when the provided min is 1 (#964) 2020-11-10 13:12:03 +01:00
Andrzej Bialecki 7ec17376be SOLR-14683: Metrics API should ensure consistent placeholders for missing values. 2020-11-10 11:48:59 +01:00
Adrien Grand bac4309326 SOLR-14749: Use h2 instead of h3 so that the javadoc tool doesn't complain about out-or-sequence headers. 2020-11-10 09:22:34 +01:00
Adrien Grand fd98f677b9 Remove unused imports. 2020-11-10 09:22:34 +01:00
Adrien Grand 514c363f1d LUCENE-9322: Move Solr to Lucene90Codec.
And drop configurability of Lucene87Codec since it shouldn't be used for writing anymore.
2020-11-10 09:22:34 +01:00
Julie Tibshirani 849a28e539
LUCENE-9536: Correct the OrdinalMap optimization. (#2070)
Previously we only checked that the first segment's ordinal deltas were all
zero. This didn't account for some rare cases where the segment's ordinals
matched the global ones, but it did not contain all terms. This can happen when
using a FilteredTermsEnum, for example when merging a segment with deletions.
2020-11-10 09:18:18 +01:00
Julie Tibshirani 42c5206cea
LUCENE-9322: Some fixes to SimpleTextVectorFormat. (#2071)
* Make sure the file extensions are unique.

* Fix bug in vector reading.
2020-11-09 21:56:38 -05:00
Mike McCandless ec9a659845 LUCENE-9322: fix minor cosmetic refactoring error in logging string in IndexWriter's infoStream logging. It was always printing 'vector values' for all merging times instead of the other parts of Lucene index ('doc values', 'stored fields', etc.) 2020-11-09 12:47:15 -05:00
Michael Sokolov 8be0cea544
LUCENE-9583: extract separate RandomAccessVectorValues interface (#2037) 2020-11-09 10:46:16 -05:00
Erick Erickson be19432b75 SOLR-14969: Prevent creating multiple cores with the same name which leads to instabilities (race condition) changed error code 2020-11-09 08:16:43 -05:00
Andrzej Bialecki 0bfa2a6908 SOLR-14749: Restructure the docs + add some examples. 2020-11-05 13:54:46 +01:00
Andrzej Bialecki bdc6e8247f SOLR-14749: Provide a clean API for cluster-level event processing. 2020-11-05 12:18:05 +01:00
Adrien Grand bcd9711ab6 LUCENE-9536: Address test failure. 2020-11-05 11:58:15 +01:00
Mike Drob 7c1ff288b7
SOLR-14978 OOM Killer in Foreground (#2055)
Combine Docker and bin/solr OOM handling scripts, move OOM handling to foreground Solr as well.

Co-authored-by: Houston Putman <houstonputman@gmail.com>
2020-11-04 17:20:16 -06:00
Atri Sharma 656ce93c3a DOAP changes for release 8.7.0 2020-11-04 10:46:39 +05:30
Dawid Weiss f7779339d5 Correct sandbox class name. 2020-11-03 17:41:22 +01:00
Dawid Weiss 22296f28a2
SOLR-14912: Unify solr-contrib-extraction with the artifact it produces (#2060) 2020-11-03 14:15:26 +01:00
Dawid Weiss 32bf7bad4b Correct sandbox link failing the check. 2020-11-03 13:06:55 +01:00
Dawid Weiss a29d7c70d5
LUCENE-9597: checkWorkingCopyClean shouldn't complain about untracked empty folders (similar to git status). Piggybacking jgit update. (#2061) 2020-11-03 12:50:05 +01:00
Dawid Weiss 63c4dfa454 LUCENE-9596: Reproduce line for failed tests should have method-level accuracy 2020-11-03 10:56:05 +01:00
Dawid Weiss 0f871b2c56
SOLR-14926: Modernize and clean up search results clustering contrib. 2020-11-03 09:31:53 +01:00
Ignacio Vera 5c02737918
LUCENE-9553: Adds a XYPoint query that accepts an array of XYGeometries (#1939) 2020-11-03 09:21:03 +01:00
Ignacio Vera 8bfbed8d4c
LUCENE-9552: Adds a LatLonPoint query that accepts an array of LatLonGeometries (#1940) 2020-11-03 08:51:28 +01:00
Tomoko Uchida 6a7131ee24
LUCENE-9319: Clean up package name conflicts for sandbox module (#2023) 2020-11-03 12:01:02 +09:00
Michael Aleythe e7f0294d85 SOLR-14961 ZkMaintenanceUtils.clean doesn't remove zk nodes with same length
fixes #2042
2020-11-02 16:54:00 -06:00
Houston Putman 5091e75c9d
SOLR-14907: Adding V2 API for ConfigSet Upload. (#1996) 2020-11-02 14:06:45 -05:00
Christine Poerschke 0729746d77
Rename TestSolrTestCaseJ4 to SolrTestCaseJ4DeleteCoreTest. (#2032) 2020-11-02 16:13:01 +00:00
Adrien Grand 2a2e612db0 LUCENE-9536: CHANGES entry. 2020-11-02 16:46:45 +01:00
Julie Tibshirani 8f004f7a38
LUCENE-9536: Optimize OrdinalMap when one segment contains all distinct values. (#1948)
LUCENE-9536: Optimize OrdinalMap when one segment contains all distinct values.

For doc values that are not too high cardinality, it is common for some large
segments to contain all distinct values. In this case, we can check if the first
segment ords map perfectly to global ords, and if so store the global ord deltas
and first segment indices as `LongValues.ZEROES`
to save some space.
2020-11-02 16:40:16 +01:00