Commit Graph

34680 Commits

Author SHA1 Message Date
Peter Gromov 0a1a3f4c40
LUCENE-9688: Hunspell: consider prefix's continuation flags when applying suffix (#2229) 2021-01-22 12:03:55 +01:00
Peter Gromov d7968130c3
LUCENE-9684: Hunspell: support COMPOUNDRULE (#2228) 2021-01-22 12:01:53 +01:00
Eric Pugh cf5db8d651
SOLR-14067: v3 Create /contrib/scripting module with ScriptingUpdateProcessor (#2215)
* Creating Scripting contrib module to centralize the less secure code related to scripts.

* tweak the changelog and update notice to explain why the name changed and the security posture thinking

* the test script happens to be a currency.xml, which made me think we were doing something specific to currency types, but instead any xml formatted file will suffice for the test.

* drop the ing, and be more specific on the name of the ref guide page

* use the same name everywhere

Co-authored-by: David Smiley <dsmiley@apache.org>
2021-01-21 13:32:46 -05:00
András Salamon 83e0397c42
SOLR-14297: Replace commons-codec Base64 with JDK8 Base64 (#2222) 2021-01-21 13:10:33 -05:00
Anand e5a16f0b0f
LUCENE-9674: Use binary search in VectorValues.advance()
Lucene90VectorReader now implements advance() with binary search in place of prior linear scan
Co-authored-by: Anand Kotriwal <anakot@amazon.com>
2021-01-21 10:02:21 -05:00
Peter Gromov 37e31f2ac7
LUCENE-9685: Hunspell: prefix condition is only checked on suffix, not stem (#2226) 2021-01-21 13:38:45 +01:00
Peter Gromov fdf04d8c63
LUCENE-9681: Hunspell spellchecker: support numbers with separators (#2224) 2021-01-21 12:33:03 +01:00
Dawid Weiss 053060b925 LUCENE-9670: Workaround for gradle not liking the same stream for stdout and stderr sinks. 2021-01-21 09:51:56 +01:00
Cassandra Targett e8276e09a1
SOLR-13105 - Visual Guide to Math Expressions (#2227)
* SOLR-13105: The Visual Guide to Streaming Expressions and Math Expressions
2021-01-20 16:14:01 -06:00
Dawid Weiss 4da8f08c63
LUCENE-9683: fix incorrect logic for measuring stall time information. (#2225) 2021-01-20 19:05:22 +01:00
Peter Gromov 89456686ed
LUCENE-9681: deduplicate decodeFlags+hasFlag checks (#2223) 2021-01-20 17:27:04 +01:00
Peter Gromov 939699f550
LUCENE-9667: Hunspell: add spellchecker API, support BREAK and FORBIDDENWORD affix rules (#2207) 2021-01-20 10:57:27 +01:00
David Smiley a233ed2fd1
SOLR-15081: Metrics for core: isLeader, replicaState (#2198)
Note that getLastPublished returns an Enum type.  TextWriter.writeVal should probably support Enums, which would simplify this code.
2021-01-19 16:43:39 -05:00
Joel Bernstein 5328ced82e SOLR-14608: Reduce percentage of priority queue size above total hits 2021-01-19 15:35:57 -05:00
Joel Bernstein 64df5a65f0 SOLR-14608: Faster sorting for the /export handler 2021-01-19 15:35:56 -05:00
Simon Willnauer 1aeb1dcb86
LUCENE-9669: Restore Lucene70Codec (#2221)
This change restores the Lucene70Codec for file format compatibility of
indices that are created within the Lucene 7 major version. These indices
can be opened via an expert API on DirectoryReader in read-only mode. Changes
to these indices are prohibited and will be rejected by the IndexWriter.
In fact, IndexWriter will not open an index that was created with a major version
older than N-1, where N is the current major version.
2021-01-19 21:07:54 +01:00
Chris Hostetter df692076a6 rectify 8.9 CHANGES sections from branch_8x 2021-01-19 12:41:33 -07:00
Chris Hostetter a4aa3d1d4f SOLR-15079: Block Collapse - Faster collapse code when groups are co-located via Block Join style nested doc indexing.
Used by default when field=_root_, or explicitly requested for other fields via hint=block.
2021-01-19 11:40:29 -07:00
Cassandra Targett 12e42a2a11 Ref Guide: update upgrade notes for 8.8 release 2021-01-19 12:00:27 -06:00
jimczi 5af12b5f14 LUCENE-9675: Binary doc values fields now expose their configured compression mode in the attributes of the field info. 2021-01-19 10:03:13 +01:00
Patrick Marty 227256d951
LUCENE-9646: Set BM25Similarity discountOverlaps via the constructor 2021-01-19 09:49:57 +01:00
Peter Gromov 9f5bdf43b7
LUCENE-9678: Hunspell: fix off-by-one error to support prefixes of word.length - 1 (#2219) 2021-01-19 09:34:27 +01:00
Peter Gromov 422c89baef
LUCENE-9676: Hunspell: improve stemming of all-caps words (#2217)
Repeat Hunspell's logic:
* when encountering a mixed- or (inflectable) all-case dictionary entry, add its title-case analog as a hidden entry
* use that hidden entry for stemming case variants for title- and uppercase words, but don't consider it a valid word itself
* ...unless there's another explicit dictionary entry of that title case
2021-01-19 09:32:23 +01:00
Simon Willnauer c1ae6dc07c
LUCENE-9669: Add an expert API to allow opening indices created < N-1 (#2212)
Today we force indices that were created with N-2 and older versions of Lucene
to fail on open. This check doesn't even check if the codecs are available. In order
to allow users to open older indices and for us to support N-2 versions this change
adds an API on DirectoryReader to specify a minimum index version on a per reader basis.
This doesn't apply to the IndexWriter, which will fail on opening older indices.
2021-01-19 09:23:49 +01:00
Peter Gromov 426c902bc9
LUCENE-9677: simplify Dictionary.affixData storage (#2218)
Use char[] instead of byte[], get rid of unnecessary byte array readers/writers.
2021-01-19 09:22:33 +01:00
Peter Gromov ab08fdc6f0
LUCENE-9671: Hunspell: shorten Stemmer.applyAffix (#2209)
Call stem() recursively just once with different arguments depending on various conditions. 

NOTE: committing directly as this is a refactoring, not a functional change (no CHANGES.txt entry).
2021-01-18 22:54:22 +01:00
Noble Paul 8505d4d416
SOLR-15052: Per-replica states for reducing overseer bottlenecks (trunk) (#2177) 2021-01-19 02:59:41 +11:00
Uwe Schindler 4b508aef24 LUCENE-8982: Add a note to MIGRATE.md 2021-01-18 00:50:02 +01:00
zacharymorn a7747b63b4
LUCENE-8982: Make NativeUnixDirectory pure java with FileChannel direct IO flag, and rename to DirectIODirectory (#2052)
2021-01-17 23:57:56 +01:00
Namgyu Kim eb24e95731
LUCENE-9661: Fix deadlock in TermsEnum.EMPTY 2021-01-16 06:49:23 +09:00
Cassandra Targett 30aa0f5ba4 Ref Guide: copy edits for 8.8 release 2021-01-15 14:54:41 -06:00
Cassandra Targett 90aabbdde8 Ref guide: add license to cluster-plugins.adoc; fix section title case throughout 2021-01-15 14:54:41 -06:00
Cassandra Targett cb465044d7 SOLR-14560: ref guide: remove references to XML output when examples are all JSON 2021-01-15 14:54:41 -06:00
Mike McCandless cc1d902ade fix typo in Adrien's name! 2021-01-15 09:31:13 -05:00
Peter Gromov 82f6f161ae
LUCENE-9664: Hunspell support: fix most IntelliJ warnings, cleanup (#2202) 2021-01-15 13:52:34 +01:00
Peter Gromov 90131a605a
LUCENE-9665: Hunspell: support default encoding (#2203, Peter Gromov via Dawid Weiss) 2021-01-15 09:35:25 +01:00
Florin Babes f285f02c89
SOLR-15071: add TestEdisMaxSolrFeature.testEdisMaxSolrFeatureCustomMM() test case (#2201)
* add test case for SOLR-15071

* add temporary @Ignore to be removed when the fix is committed

Co-authored-by: Florin Babes <florin.babes@emag.ro>
Co-authored-by: Christine Poerschke <cpoerschke@apache.org>
2021-01-14 10:44:26 +00:00
Noble Paul 9466af576a
SOLR-14155: Load all other SolrCore plugins from packages (#1666) 2021-01-13 22:28:01 +11:00
Cassandra Targett 7a301c736c Ref Guide: clarify backup location requirements for SolrCloud backups 2021-01-11 14:36:14 -06:00
Eric Pugh 3e2fb59272
SOLR-15010 Try to use jattach for threaddump if jstack is missing (#2192)
* introduce jattach check if jstack is missing.  jattach ships in the Solr docker image instead of jstack.
* get the full path to the jattach command

Co-authored-by: Christine Poerschke <cpoerschke@apache.org>
2021-01-11 14:58:11 -05:00
Mike Drob a429b969d8
SOLR-14413 fix unit test to use delayed handler (#2189) 2021-01-11 12:15:30 -06:00
Timothy Potter 6711eb7571
SOLR-15036: auto- select / rollup / sort / plist over facet expression when using a collection alias with multiple collections (#2132) 2021-01-11 10:34:28 -07:00
Adrien Grand f0d6fd84bb LUCENE-9346: Add CHANGES entry. 2021-01-11 15:06:08 +01:00
zacharymorn c2493283a5
LUCENE-9346: Support minimumNumberShouldMatch in WANDScorer (#2141)
Co-authored-by: Adrien Grand <jpountz@gmail.com>
2021-01-11 15:03:29 +01:00
Jason Gerlowski 98c51ca34b
SOLR-15070: Remove HashMap usage in SuggestComponent rsp (#2183)
Prior to this commit, SuggestComponent used a HashMap as part of the
response it built on the server side.  This class is serialized/
deserialized differently depending on the SolrJ ResponseParser used:
a LinkedHashMap when javabin was used, and a SimpleOrderedMap when XML
was used.  This discrepancy led to ClassCastExceptions in downstream
SolrJ code.

This commit fixes the issue by changing SuggestComponent to avoid these
types that are serialized differently.  "suggest" response sections now
deserialize as a NamedList in SolrJ, and the SuggesterResponse POJO has
been updated accordingly.
2021-01-11 07:31:26 -05:00
Houston Putman 7e94a56e81 SOLR-14999: Fixing SolrXmlConfig tests for hostPort. 2021-01-09 10:19:47 -05:00
Munendra S N 2c1ec75eaa SOLR-12559: fix error when multi-val fields are dereferenced in JSON aggs
This ensures all dereferenced fields are not parsed into actual valuesource
but parsed into a placeholder value. This works for 1-level of dereferencing
2021-01-09 19:30:43 +05:30
Tomas Fernandez Lobbe 4789112f91
Remove unused test file (#2174) 2021-01-08 16:40:21 -08:00
Houston Putman 4be49cbdf5
SOLR-14999: Option to set the advertised port for Solr. (#2089) 2021-01-08 18:21:41 -05:00
Houston Putman 86934787fe
Adding local gradle settings for github actions. (#2191) 2021-01-08 18:12:19 -05:00