OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	a45626deb5	Analysis: Wrap at 140 columns (#34494 ) Applies our standard column width to all analysis plugins.	2018-10-17 16:17:25 -04:00
Jim Ferenczi	241c74efb2	upgrade to a new snapshot of Lucene 8 (7d0a7782fa) (#33812 )	2018-09-18 18:16:40 +02:00
Alan Woodward	39c3234c2f	Upgrade to latest Lucene snapshot (#33505 ) * LeafCollector.setScorer() now takes a Scorable * Scorers may not have null Weights * IndexWriter.getFlushingBytes() reports how much memory is being used by IW threads writing to disk	2018-09-10 20:51:55 +01:00
Jim Ferenczi	7ad71f906a	Upgrade to a Lucene 8 snapshot (#33310 ) The main benefit of the upgrade for users is the search optimization for top scored documents when the total hit count is not needed. However this optimization is not activated in this change, there is another issue opened to discuss how it should be integrated smoothly. Some comments about the change: * Tests that can produce negative scores have been adapted but we need to forbid them completely: #33309 Closes #32899	2018-09-06 14:42:06 +02:00
Nicholas Knize	e162127ff3	Upgrade to Lucene-7.5.0-snapshot-13b9e28f9d The main feature is the inclusion of bkd backed geo_shape with INTERSECT, DISJOINT, WITHIN bounding box and polygon query support.	2018-08-09 11:15:02 -05:00
Jim Ferenczi	53ff06e621	Upgrade to Lucene-7.5.0-snapshot-608f0277b0 (#32390 ) The main highlight is the removal of the reclaim_deletes_weight in the TieredMergePolicy. The es setting index.merge.policy.reclaim_deletes_weight is deprecated in this commit and the value is ignored. The new merge policy setting setDeletesPctAllowed should be added in a follow up.	2018-07-27 08:28:51 +02:00
Armin Braun	ed3b44fb4c	Handle TokenizerFactory TODOs (#32063 ) * Don't replace Replace TokenizerFactory with Supplier, this approach was rejected in #32063 * Remove unused parameter from constructor	2018-07-17 14:14:02 +02:00
Adrien Grand	f023e95ae0	Upgrade to Lucene 7.4.0. (#31529 ) This moves Elasticsearch from a recent 7.4.0 snapshot to the GA release.	2018-06-22 16:17:17 +02:00
Nhat Nguyen	8453ca638d	Upgrade to Lucene-7.4.0-snapshot-518d303506 (#31360 )	2018-06-15 10:58:21 -04:00
Nhat Nguyen	abe61159a8	Upgrade to Lucene-7.4.0-snapshot-0a7c3f462f (#31073 ) This snapshot includes: - LUCENE-8341: Record soft deletes in SegmentCommitInfo which will resolve #30851 - LUCENE-8335: Enforce soft-deletes field up-front	2018-06-04 14:18:46 -04:00
Nhat Nguyen	363f1e84ca	Upgrade to Lucene-7.4-snapshot-1cbadda4d3 (#30928 ) This snapshot includes LUCENE-8328 which is needed to stabilize CCR builds.	2018-05-29 12:29:52 -04:00
Nhat Nguyen	1918a30237	Upgrade to Lucene-7.4.0-snapshot-cc2ee23050 (#30778 ) The new snapshot includes LUCENE-8324 which fixes missing checkpoint after a fully deletes segment is dropped on flush. This snapshot should resolves failed tests in the CorruptedFileIT suite. Closes #30741 Closes #30577	2018-05-22 13:11:48 -04:00
Nhat Nguyen	67d8fc222d	Upgrade to Lucene-7.4.0-snapshot-59f2b7aec2 (#30726 ) This snapshot resolves issues related to ShrinkIndexIT.	2018-05-18 18:21:39 -04:00
Nhat Nguyen	519768b5d3	Upgrade to Lucene-7.4-snapshot-6705632810 (#30519 ) This snapshot is to include LUCENE-8298 which allows DocValues updates to reset a value. This is needed for the Lucene rollback work.	2018-05-10 12:31:45 -04:00
Jim Ferenczi	891d3bd9c3	Expose the Lucene Korean analyzer module in a plugin (#30397 ) This change adds a new plugin called `analysis-nori` that exposes Korean text analysis in es using the new Lucene Korean analyzer module named (`nori`). The plugin adds: * a Korean analyzer: `nori` * a Korean tokenizer: `nori_tokenizer` * a part of speech stop filter: `nori_part_of_speech` * a filter that can replace Hanja characters with their Hangul transcription: `nori_readingform`	2018-05-04 20:46:13 +02:00
Jim Ferenczi	dbd857341f	Upgrade to 7.4.0-snapshot-1ed95c097b (#30357 ) Upgrade to lucene-7.4.0-snapshot-1ed95c097b This version contains: * An Analyzer for Korean * An IntervalQuery and IntervalsSource that retrieve minimum intervals of positional queries. * A new API to retrieve matches (offsets and positions) of a query for a single document. * Support for soft deletes in the index writer. * A fixed shingle filter that handles index time synonyms. * Support for emoji sequence in ICUTokenizer (with an upgrade to icu 61.1)	2018-05-04 11:44:22 +02:00
Alan Woodward	dccd43af47	Upgrade to lucene 7.3.0 (#29387 )	2018-04-05 10:34:44 +01:00
Adrien Grand	3bdfc8f3fb	Upgrade to lucene-7.3.0-snapshot-98a6b3d. (#29298 ) Most notable changes include: - this release doesn't have the 7.2.1 version constant so I had to create one - spatial4j and jts were upgraded	2018-04-03 09:27:14 +02:00
Jim Ferenczi	be012b1326	upgrade to lucene 7.2.1 (#28218 )	2018-01-15 16:47:46 +01:00
Adrien Grand	77711508b0	Upgrade to Lucene 7.2.0. (#27910 )	2017-12-20 14:17:40 +01:00
Adrien Grand	6323bb0d97	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27619 ) This new snapshot mostly brings a change to TopFieldCollector which can now early terminate collection when trackTotalHits is `false`. As a follow-up, we should replace our usage of `EarlyTerminatingSortingCollector` with this new option.	2017-12-04 09:40:08 +01:00
Adrien Grand	996990ad1f	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27496 ) The main highlight of this new snapshot is that it introduces the opportunity for queries to opt out of caching. In case a query opts out of caching, not only will it never be cached, but also no compound query that wraps it will be cached.	2017-11-28 14:52:42 +01:00
Colin Goodheart-Smithe	c1b8140c83	Upgrade to Lucene 7.1 (#27225 )	2017-11-02 13:25:33 +00:00
Md. Abdulla-Al-Sun	a40c474e10	Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238)	2017-10-05 13:25:05 +02:00
Martijn van Groningen	dca787ed8a	upgrade to Lucene 7.1.0 snapshot version	2017-10-05 09:06:56 +02:00
Simon Willnauer	aab4655e63	Unify Settings xcontent reading and writing (#26739 ) This change adds a fromXContent method to Settings that allows to read the xcontent that is produced by toXContent. It also replaces the entire settings loader infrastructure and removes the structured map representation. Future PRs will also tackle the `getAsMap` that exposes the internal represenation of settings for better encapsulation.	2017-09-25 13:23:01 +02:00
Jason Tedor	e0db89bc35	Upgrade to Lucene 7.0.0 This commit upgrades to the GA release of Luence 7! Relates #26744	2017-09-21 19:19:33 -04:00
Claudio Bley	7184cf8b5b	Fix kuromoji default stoptags (#26600 ) Initialize the default stop-tags in `KuromojiPartOfSpeechFilterFactory` if the `stoptags` are not given in the config. Also adding a test which checks that part-of-speech tokens are removed when using the kuromoji_part_of_speech filter.	2017-09-15 12:25:09 +02:00
Adrien Grand	78681bc9e5	Upgrade to lucene-7.0.0-snapshot-d94a5f0. (#26441 )	2017-08-31 09:06:40 +02:00
Adrien Grand	eb782492be	Remove support for lenient booleans. Closes #22298	2017-08-28 09:56:01 +02:00
desmorto	292dd8f992	(refactor) some opportunities to use diamond operator (#25585 ) * (refactor) some opportunities to use diamond operator * Update ExceptionRetryIT.java update typo	2017-08-15 16:36:42 -06:00
Adrien Grand	f0c1e30544	Upgrade to lucene-7.0.0-snapshot-a128fcb. (#26090 )	2017-08-08 13:03:19 +02:00
Adrien Grand	481d5d09b2	Upgrade to lucene-7.0.0-snapshot-00142c9. (#25641 ) Lucene 7.0 is feature-frozen now, so there should not be many changes until GA.	2017-07-11 13:58:55 +02:00
Adrien Grand	44e9c0b947	Upgrade to lucene-7.0.0-snapshot-ad2cb77. (#25349 ) Most notable changes: - better update concurrency: LUCENE-7868 - TopDocs.totalHits is now a long: LUCENE-7872 - QueryBuilder does not remove the boolean query around multi-term synonyms: LUCENE-7878 - removal of Fields: LUCENE-7500 For the `TopDocs.totalHits` change, this PR relies on the fact that the encoding of vInts and vLongs are compatible: you can write and read with any of them as long as the value can be represented by a positive int.	2017-06-22 12:35:33 +02:00
Adrien Grand	0c117145f6	Upgrade to lucene-7.0.0-snapshot-92b1783. (#25222 ) This snapshot has faster range queries on range fields (LUCENE-7828), more accurate norms (LUCENE-7730) and the ability to use fake term frequencies (LUCENE-7854).	2017-06-15 09:52:07 +02:00
Nicholas Knize	deb7caf4d3	Upgrade to lucene-7.0.0-snapshot-a0aef2f This commit upgrades master to a current lucene snapshot with commit id a0aef2f.	2017-05-19 10:20:55 -05:00
Ryan Ernst	2a65bed243	Tests: Change rest test extension from .yaml to .yml (#24659 ) This commit renames all rest test files to use the .yml extension instead of .yaml. This way the extension used within all of elasticsearch for yaml is consistent.	2017-05-16 17:24:35 -07:00
Nik Everett	bb06d8ec4f	Allow plugins to build pre-configured token filters (#24223 ) This changes the way we register pre-configured token filters so that plugins can declare them and starts to move all of the pre-configured token filters out of core. It doesn't finish the job because doing so would make the change unreviewably large. So this PR includes a shim that keeps the "old" way of registering pre-configured token filters around. The Lowercase token filter is special because there is a "special" interaction between it and the lowercase tokenizer. I'm not sure exactly what to do about it so for now I'm leaving it alone with the intent of figuring out what to do with it in a followup. This also renames these pre-configured token filters from "pre-built" to "pre-configured" because that seemed like a more descriptive name. This is a part of #23658	2017-05-09 14:50:49 -04:00
Ryan Ernst	212f24aa27	Tests: Clean up rest test file handling (#21392 ) This change simplifies how the rest test runner finds test files and removes all leniency. Previously multiple prefixes and suffixes would be tried, and tests could exist inside or outside of the classpath, although outside of the classpath never quite worked. Now only classpath tests are supported, and only one resource prefix is supported, `/rest-api-spec/tests`. closes #20240	2017-04-18 15:07:08 -07:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Jim Ferenczi	0e95c90e9f	Upgrade to Lucene 6.5.0 (#23750 )	2017-03-27 15:57:54 +02:00
Jim Ferenczi	5c84640126	Upgrade to lucene-6.5.0-snapshot-d00c5ca (#23385 ) Lucene upgrade	2017-02-27 18:39:04 +01:00
Adrien Grand	709cc9ba65	Upgrade to lucene-6.5.0-snapshot-f919485. (#23087 )	2017-02-10 15:08:47 +01:00
Adrien Grand	c8496fc4f4	Upgrade to Lucene 6.4.1. (#22978 )	2017-02-06 09:28:43 +01:00
Jim Ferenczi	8028578305	Upgrade to Lucene 6.4.0 (#22724 ) * Upgrade to Lucene 6.4.0 `ValueSource`s are now converted to `DoubleValueSource`s using the Lucene adapter made for the migration to the new API in 6.4.0.	2017-01-21 04:48:01 +01:00
Jason Tedor	9781b88a38	Fix deprecation logging for lenient booleans This commit fixes an issue with deprecation logging for lenient booleans. The underlying issue is that adding deprecation logging for lenient booleans added a static deprecation logger to the Settings class. However, the Settings class is initialized very early and in CLI tools can be initialized before logging is initialized. This leads to status logger error messages. Additionally, the deprecation logging for a lot of the settings does not provide useful context (for example, in the token filter factories, the deprecation logging only produces the name of the setting, but gives no context which token filter factory it comes from). This commit addresses both of these issues by changing the call sites to push a deprecation logger through to the lenient boolean parsing. Relates #22696	2017-01-19 12:30:33 -05:00
Daniel Mitterdorfer	aece89d6a1	Make boolean conversion strict (#22200 ) This PR removes all leniency in the conversion of Strings to booleans: "true" is converted to the boolean value `true`, "false" is converted to the boolean value `false`. Everything else raises an error.	2017-01-19 07:59:18 +01:00
Adrien Grand	f8998fece5	Upgrade to lucene-6.4.0-snapshot-084f7a0. (#22413 )	2017-01-04 19:03:52 +01:00
Nik Everett	f5f2149ff2	Remove much ceremony from parsing client yaml test suites (#22311 ) * Remove a checked exception, replacing it with `ParsingException`. * Remove all Parser classes for the yaml sections, replacing them with static methods. * Remove `ClientYamlTestFragmentParser`. Isn't used any more. * Remove `ClientYamlTestSuiteParseContext`, replacing it with some static utility methods. I did not rewrite the parsers using `ObjectParser` because I don't think it is worth it right now.	2016-12-22 11:00:34 -05:00
Jim Ferenczi	d791ddf704	Upgrade to lucene-6.4.0-snapshot-ec38570 (#21853 ) Set lucene version to 6.4.0-snapshot-ec38570 and update all the sha1s/license Fix invalid combo after upgrade in query_string query. split_on_whitespace=false is disallowed if auto_generate_phrase_queries=true Adapt the expectations of some tests to the new format of the Lucene explain output	2016-11-29 18:40:31 +01:00

1 2 3

145 Commits