Commit Graph

34195 Commits

Author SHA1 Message Date
Dawid Weiss 5ec2bac91c
LUCENE-9531: Consolidate duplicated generated classes CharStream and FastCharStream (#1886) 2020-09-18 08:53:30 +02:00
Ignacio Vera fbf8e4f044
LUCENE-9523: Speed up query shapes for geometries that generate multiple points (#1866)
In query shapes over shape fields, skip points while traversing the BKD tree when the relationship with the document is already known
2020-09-18 07:50:58 +02:00
Noble Paul ee0a374bb8
SOLR-14875: Make SolrEventListeners load from packages (#1887) 2020-09-18 12:07:29 +10:00
Adrien Grand 33f7280078
LUCENE-9529: Track dirtiness of stored fields via a number of docs, not chunks. (#1882)
The problem of tracking dirtiness via numbers of chunks is that larger
chunks make stored fields readers more likely to be considered dirty, so
I'm trying to work around it by tracking numbers of docs instead.
2020-09-17 18:59:08 +02:00
Adrien Grand e0a64908d8
Further tune Lucene87StoredFieldsFormat for small documents. (#1888)
The increase of the maximum number of chunks per doc done in previous
issues was mostly random. I'd like to provide users with a similar
trade-off with what the old versions of BEST_SPEED and BEST_COMPRESSION
used to do. So since BEST_SPEED used to compress at most 128 docs at
once, I think we should roughly make it 128*10 now since there are 10
sub blocks. I made it 1024 to account for the fact that there is a preset
dict as well that need decompressing. And similarly BEST_COMPRESSION used
to allow 4x more docs than BEST_SPEED, so I made it 4096.

With such larger numbers of docs per chunk, the decoding of metadata
became a bottleneck for stored field access so I made it a bit faster by
doing bulk decoding of the packed longs.
2020-09-17 18:30:57 +02:00
Ilan Ginzburg dbba48b3e5
SOLR-14613: use set-placement-plugin for both setting and unsetting plugin config 2020-09-17 15:01:19 +02:00
Erik Hatcher 2364a7aded SOLR-14792: Remove VelocityResponseWriter 2020-09-17 08:45:13 -04:00
Dawid Weiss 6c9d7adf79
LUCENE-9527: upgrade javacc to 7.0.4 (#1884) 2020-09-17 13:29:18 +02:00
noblepaul 515608a087 SOLR-14151: fixed the classloading issue 2020-09-17 19:10:04 +10:00
Dawid Weiss 4f344cb0d4
LUCENE-9530: cleaned up javacc gradle generation scripts. (#1883)
* LUCENE-9530: cleaned up gradle javacc generation/ tweaks script so that it's consistent across runs. Removed ant remnants.
2020-09-17 10:53:02 +02:00
noblepaul cbb1659640 Revert "Revert "SOLR-14151: Bug fixes (#1815)""
This reverts commit 27a14fe481.

Undoing accidental commit
2020-09-17 11:19:21 +10:00
noblepaul 5bc7fb2861 SOLR-14871: remove unused test 2020-09-17 09:10:40 +10:00
noblepaul e5d3e4d567 Merge branch 'master' of github.com:apache/lucene-solr 2020-09-17 09:06:48 +10:00
Ilan Ginzburg c7d234cafd
SOLR-14613: Autoscaling replacement using placement plugins
Allow using placement plugins to compute replica placement on the cluster for Collection API calls.
This is the first code drop for the replacement of the Autoscaling feature.
Javadoc of sample plugin org.apache.solr.cluster.placement.plugins.SamplePluginAffinityReplicaPlacement details how to enable this replica placement strategy.
PR's #1684 then #1845
2020-09-17 00:10:31 +02:00
Erik Hatcher a0404a7501 SOLR-14799: add CHANGES entry 2020-09-16 13:43:09 -04:00
Erik Hatcher 22022463d7 SOLR-14799: JWT authentication plugin only requires sub claim when principalClaim=sub 2020-09-16 13:41:19 -04:00
Erik Hatcher c63684f93b Revert "SOLR-14799: JWT authentication plugin only requires sub claim when principalClaim=sub"
This reverts commit bc0c9ffee3.
2020-09-16 12:45:03 -04:00
Erik Hatcher bc0c9ffee3 SOLR-14799: JWT authentication plugin only requires sub claim when principalClaim=sub 2020-09-16 12:27:55 -04:00
Adrien Grand ad71bee016
LUCENE-9525: Better handle small documents with Lucene87StoredFieldsFormat. (#1876)
Instead of configuring a dictionary size and a block size, the format
now tries to have 10 sub blocks per bigger block, and adapts the size of
the dictionary and of the sub blocks to this overall block size.
2020-09-16 13:09:00 +02:00
Adrien Grand 93094ef7e4
LUCENE-9510: Don't compress temporary stored fields and term vectors when index sorting is enabled. (#1874)
When index sorting is enabled, stored fields and term vectors can't be
written on the fly like in the normal case, so they are written into
temporary files that then get resorted. For these temporary files,
disabling compression speeds up indexing significantly.

On a synthetic test that indexes stored fields and a doc value field
populated with random values that is used for index sorting, this
resulted in a 3x indexing speedup.
2020-09-16 13:05:22 +02:00
noblepaul 6b87cfb88c Merge branch 'master' of github.com:apache/lucene-solr 2020-09-16 19:19:45 +10:00
Noble Paul 7b8e72e553
SOLR-14871 Use Annotations for v2 APIs in/cluster path (#1878) 2020-09-16 18:06:43 +10:00
Dawid Weiss 9b9b0a6339 Fix corrupted umlaut characters. This was introduced back in 2009... 2020-09-15 19:07:30 +02:00
Mike Drob 3134f10a42
LUCENE-9488 Update release process to work with gradle (#1860)
* Restore lucene/version.properties
* Switch release wizard commands from ant to gradle equivalents
* Remove remaining checks for ant
* Remove checks for Java 8
* Update Copyright year
* Minor bug fixes around determining next version for a major release
2020-09-15 10:10:17 -05:00
Munendra S N 58d13608b4 SOLR-10471: update default zk session timeout in bin/solr* scripts
* zkClientTimeout value is already set to 30s in solr.xml but
  same update was missing from bin/solr* script
2020-09-15 19:32:51 +05:30
Simon Willnauer f655d97b54
LUCENE-9516: Remove DocConsumer and IndexingChain from Lucene (#1867)
This removes the ability to replace the IndexingChain / DocConsumer
in Lucenes IndexWriter. The interface is not sufficient to efficiently
replace the functionality with reasonable efforts. It also seems it's completely
unused at this point and hasn't been maintained in years.
2020-09-15 10:15:24 +02:00
Alexandre Rafalovitch 7d62cad1a8
SOLR-14862: Update RefGuide page for support fied types (#1865)
Added mentions for BBoxField, NestPathField, RankField (and placehold for QParser, from SOLR-14590), RptWithGeometrySpatialField
Moved Deprecated types into separate table to improve reading comprehension
Added some cross-references for more in-depth reading.
2020-09-14 13:30:43 -04:00
Adrien Grand 97a4af6890
LUCENE-9510: Don't pull a merge instance when flushing stored fields out-of-order. (#1872)
With recent changes to stored fields that split blocks into several sub
blocks, the merge instance has become much slower at random access since
it would decompress all sub blocks when accessing a document. Since
stored fields likely get accessed in random order at flush time when
index sorting is enabled, it's better not to use the merge instance.

On a synthetic benchmark that has one stored field and one numeric
doc-value field that is used for sorting and fed with random values,
this made indexing more than 4x faster.
2020-09-14 18:07:04 +02:00
Julie Tibshirani fceab765c1
Make sure to test normal scorers with asserting wrappers. (#1834) 2020-09-14 18:06:41 +02:00
Houston Putman bc0b29eca3
SOLR-14789: Rename docker tests task, adding missing credit in CHANGES (#1871) 2020-09-14 10:55:43 -04:00
Eric Pugh 5edd4b21af
SOLR-14725 update batchSize parameter docs for update() and delete() stream expressions (#1729)
* batchSize is optional parameter with a default value

* typo
2020-09-14 09:35:12 -04:00
Dawid Weiss f92e2656b5
LUCENE-9522: Instructions for reproducing failing tests still mention ant (#1868) 2020-09-14 13:29:32 +02:00
Dawid Weiss 978446e4fd LUCENE-9521: Fix TestPassageSelector.randomizedSanityCheck failure (ensure input ranges for the formatter are within input value size). 2020-09-14 12:02:13 +02:00
Simon Willnauer 8f895d9075
LUCENE-9514: Include TermVectorsWriter in DWPT accounting (#1847)
TermVectorsWriter might consume some heap space memory that
can have a significant impact on decisions made in the IW if
writers should be stalled or DWPTs should be flushed if memory
settings are small in IWC and flushes are frequent. This change adds
RAM accounting to the TermVectorsWriter since it's part of the
DWPT lifecycle and not just present during flush.
2020-09-14 11:11:06 +02:00
noblepaul a8099d6367 Merge branch 'master' of github.com:apache/lucene-solr 2020-09-14 18:47:20 +10:00
Simon Willnauer 4d46caa05d
LUCENE-9515: Detach DWPT from DefaultIndexingChain (#1848)
This change removes the DWPT dependency from DefaultIndexingChain
and rather passes on the primitives needed for creating the chain.
2020-09-14 09:46:56 +02:00
Dawid Weiss 41c5c030aa LUCENE-9521: temporarily disable the offending test. 2020-09-14 09:39:19 +02:00
Dawid Weiss ff6e527a56 SOLR-14854: Add root project name lucene-solr (avoid automatic root project naming) 2020-09-14 09:21:30 +02:00
noblepaul cc31e23341 SOLR-14151: Fixing TestBulkSchemaConcurrent failures 2020-09-14 15:42:11 +10:00
noblepaul 27a14fe481 Revert "SOLR-14151: Bug fixes (#1815)"
This reverts commit 95ab98c920.

revert the previous change
2020-09-14 12:22:58 +10:00
Moses Schwartz 647bd5c59f
Update Json facet documentation (#1857)
Added missing preposition (to) in the sentence
2020-09-13 13:35:20 +05:30
Houston Putman 485d5fb41a
SOLR-14789: Absorb the docker-solr repo. (#1769) 2020-09-11 12:29:29 -04:00
Dawid Weiss ed930f4b29
SOLR-14417: Gradle build sometimes fails RE BlockPoolSlice (#1854)
* SOLR-14417: workaround the compiler scope problem.

* Make the list modifiable.
2020-09-10 22:25:33 +02:00
Julie Tibshirani 37e4dbef6d
LUCENE-9501: Fix invariant violation in IndexSortSortedNumericDocValuesRangeQuery. (#1833)
Previously the DocIdSetIterator returned an old value for docID advance
returned NO_MORE_DOCS. This violates the DocIdSetIterator contract and made it
possiblefor the iterator's advance method to be called even after it was
already exhausted.
2020-09-10 15:02:18 -04:00
Tomas Fernandez Lobbe 0250978869
SOLR-14510: Remove deprecations added with BlockMax WAND support (#1823) 2020-09-10 10:33:09 -07:00
Mike Drob 14e4edc1e9
SOLR-14847: Create Solr Server TGZ (#1844)
Create new targets for building solr binary release artifacts:

gradlew -p solr/packaging distTar distZip

Co-authored-by: Houston Putman <houston@apache.org>
Co-authored-by: Dawid Weiss <dawid.weiss@carrotsearch.com>
Co-authored-by: Mike Drob <mdrob@apache.org>
2020-09-10 10:55:48 -05:00
Dawid Weiss 59d83428bc
LUCENE-9519: Correct behavior for highlights that cross multi-value boundaries (#1853) 2020-09-10 16:15:51 +02:00
Dawid Weiss e2f3f626ee
LUCENE-9464: Add high(er)-level hit highlighter example that demonstrates and uses low-level components (#1820) 2020-09-10 13:17:13 +02:00
Uwe Schindler 8debc9d0c2
LUCENE-9517: Don't subclass Deflater and instead create a patch for setDictionary() using a functional interface (#1850) 2020-09-10 11:12:59 +02:00
Dawid Weiss a05a055d9d
LUCENE-9518: Add sanity to gradle archiving tasks (predictable order, default permissions, fail on duplicates) (#1851) 2020-09-10 09:48:04 +02:00