OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	89837eb918	Remove -Xlint exclusions in the ingest-common module. (#40505 ) Fix the generics in processors extending AbstractStringProcessor and its factory. Relates to #40366	2019-03-29 09:43:36 +01:00
Jim Ferenczi	e256eb361a	Fix merging of search_as_you_type field mapper (#40593 ) The merge of the `search_as_you_type` field mapper uses the wrong prefix field and does not update the underlying field types.	2019-03-29 09:02:40 +01:00
Jeff Hajewski	6c13ed7db8	Update max dims for vectors to 1024. (#40597 )	2019-03-28 17:08:14 -04:00
Mayya Sharipova	24755209b4	Add randomScore function in script_score query (#40186 ) To make script_score query to have the same features as function_score query, we need to add randomScore function. This function produces different random scores on different index shards. It is also able to produce random scores based on the internal Lucene Document Ids.	2019-03-28 13:23:47 -04:00
Adrien Grand	65a35c985c	Remove type from VersionConflictEngineException. (#37490 ) (#40514 ) It initially mentioned the type in the exception because the type used to be required to uniquely identify a document. This is not necessary anymore given that indices have at most one type.	2019-03-28 09:32:09 +01:00
Armin Braun	ebcb925afb	Cleanup Duplication in Netty4 Module (#40148 ) (#40563 ) * Just drying up the listener/promise handling a little	2019-03-28 00:57:58 +01:00
Andy Bristol	23395a9b9f	search as you type fieldmapper (#35600 ) Adds the search_as_you_type field type that acts like a text field optimized for as-you-type search completion. It creates a couple of subfields that analyze the indexed terms as shingles, against which full terms are queried, and a prefix subfield that analyzes terms as the largest shingle size used and edge-ngrams, against which partial terms are queried. Adds a match_bool_prefix query type that creates a boolean clause of a term query for each term except the last, for which a boolean clause with a prefix query is created. The match_bool_prefix query is the recommended way of querying a search as you type field, which will boil down to term queries for each shingle of the input text on the appropriate shingle field, and the final (possibly partial) term as a term query on the prefix field. This field type also supports phrase and phrase prefix queries however	2019-03-27 13:29:13 -07:00
Tim Brooks	ab44f5fd5d	Add InboundHandler for inbound message handling (#40430 ) This commit adds an InboundHandler to handle inbound message processing. With this commit, this code is moved out of the TcpTransport. Additionally, finer grained unit tests are added to ensure that the inbound processing works as expected	2019-03-27 12:33:26 -06:00
Julie Tibshirani	419cf1c02f	Fix an off-by-one error in the vector field dimension limit. (#40489 ) Previously only vectors up to 499 dimensions were accepted, whereas the stated limit is 500.	2019-03-27 11:17:58 -07:00
Tim Brooks	3860ddd1a4	Move outbound message handling to OutboundHandler (#40336 ) Currently there are some components of message serializer and sending that still occur in TcpTransport. This commit makes it possible to send a message without the TcpTransport by moving all of the remaining application logic to the OutboundHandler. Additionally, it adds unit tests to ensure that this logic works as expected.	2019-03-27 11:47:36 -06:00
Martijn van Groningen	1d3ece1e96	Remove -Xlint exclusions in the percolator module. (#40372 ) Relates to #40366	2019-03-26 07:55:02 +01:00
Armin Braun	13d76239a0	Use Netty ByteBuf Bulk Operations for Faster Deserialization (#40158 ) (#40339 ) * Use bulk methods to read numbers faster from byte buffers	2019-03-24 19:08:51 +01:00
Jack Conradson	0be7780cb0	Add implicit this for class binding in Painless (#40285 ) This change allows class bindings to add as their first argument, the base script class. The this reference to the base script class will be implicitly passed into a class binding as the first constructor argument upon initialization when specified as the first argument in whitelist entry for the class binding. This allows a class binding access to additional information added to the base script class such as more information about the current document or current shard. One extra requirement for this to work is the appropriate script base class must be whitelisted (should be empty).	2019-03-22 12:55:47 -07:00
Jack Conradson	6ea3272f41	Add double and Double standard casts tests to Painless (#40324 )	2019-03-21 16:10:28 -07:00
Alan Woodward	83d2870308	Add `use_field` option to intervals query (#40157 ) This is the equivalent of the `field_masking_span` query, allowing users to merge intervals from multiple fields - for example, to search for stemmed tokens near unstemmed tokens.	2019-03-20 16:26:04 +00:00
Jack Conradson	5ec56d7d22	Add float and Float standard casting tests to Painless. (#40221 )	2019-03-20 08:55:18 -07:00
Tim Brooks	0b50a670a4	Remove transport name from tcp channel (#40074 ) Currently, we maintain a transport name ("mock-nio", "nio", "netty") that is passed to a `TcpTransportChannel` when a request is received. The value of this name is to associate with the task when we register a task with the task manager. However, it is only possible to run ES with one transport, so having an implementation specific name is unnecessary. This commit removes the name and replaces it with the generic "transport".	2019-03-15 12:04:13 -06:00
Jack Conradson	dcaabdfce8	Add Painless cast tests for long and Long (#40007 )	2019-03-15 09:37:26 -07:00
Jack Conradson	b57af6c401	Add a Painless Context REST API (#39382 ) This PR adds an internal REST API for querying context information about Painless whitelists. Commands include the following: GET /_scripts/painless/_context -- retrieves a list of contexts GET /_scripts/painless/_context?context=%name% retrieves all available information about the API for this specific context	2019-03-14 12:42:12 -07:00
Jim Ferenczi	7a7658707a	Upgrade to Lucene release 8.0.0 (#39998 ) This commit upgrades to the GA release of Lucene 8 Closes #39640	2019-03-13 18:11:50 +01:00
Adrien Grand	9731ba4338	Make the `type` parameter optional when percolating existing documents. (#39987 ) (#39989 ) `document_type` is the type to use for parsing the document to percolate, which is already optional and deprecated. However `percolate` queries also have the ability to percolate existing documents, identified by an index, an id and a type. This change makes the latter optional and deprecated. Closes #39963	2019-03-13 15:04:41 +01:00
Jack Conradson	aeb0116355	Add Painless cast tests for int and Integer (#39813 )	2019-03-12 12:03:36 -07:00
Jack Conradson	ca78e44006	Fix Painless def [char] to String casts (#39759 ) * Start to fix def char casts. * Fix def char to String casts	2019-03-11 10:47:35 -07:00
Jack Conradson	31e6f6cf48	Add char tests and fix String to char cast (#39725 ) This fixes a bug where a String to char cast in Painless could be done implicitly. It is now required that a String to char cast is explicit as documented in the existing specification. This also adds char and Character casting tests.	2019-03-11 10:43:50 -07:00
Julie Tibshirani	be9c37fc76	Small simplifications to mapping validation. (#39777 ) These simplifications to `MapperMergeValidator` are possible now that there is always a single mapping definition. * Remove the type argument in `validateMapperStructure`. * Remove unnecessary checks against existing mappers.	2019-03-08 12:34:09 -08:00
Jake Landis	797d6b8a66	Execute ingest node pipeline before creating the index (#39607 ) (#39796 ) Prior to this commit (and after 6.5.0), if an ingest node changes the _index in a pipeline, the original target index would be created. For daily indexes this could create an extra, empty index per day. This commit changes the TransportBulkAction to execute the ingest node pipeline before attempting to create the index. This ensures that the only index created is the original or one set by the ingest node pipeline. This was the execution order prior to 6.5.0 (#32786). The execution order was changed in 6.5 to better support default pipelines. Specifically the execution order was changed to be able to read the settings from the index meta data. This commit also includes a change in logic such that if the target index does not exist when ingest node pipeline runs, it will now pull the default pipeline (if one exists) from the settings of the best matched of the index template. Relates #32786 Relates #32758 Closes #36545	2019-03-07 13:31:41 -06:00
Armin Braun	f5da028a3d	Chunk + Throttle Netty Writes (#39286 ) (#39778 ) * Chunk large writes and throttle on a non-writable channel to reduce direct memory usage by Netty	2019-03-07 07:24:08 +01:00
Armin Braun	aaecaf59a4	Optimize Bulk Message Parsing and Message Length Parsing (#39634 ) (#39730 ) * Optimize Bulk Message Parsing and Message Length Parsing * findNextMarker took almost 1ms per invocation during the PMC rally track * Fixed to be about an order of magnitude faster by using Netty's bulk `ByteBuf` search * It is unnecessary to instantiate an object (the input stream wrapper) and throw it away, just to read the `int` length from the message bytes * Fixed by adding bulk `int` read to BytesReference	2019-03-06 08:13:15 +01:00
Martijn van Groningen	b78a8a3e80	Use RestToXContentListener in painless execute action rest action. (#39638 )	2019-03-05 08:55:32 +01:00
Jack Conradson	7b8ff2d7c5	Add tests for Painless casting from short and Short (#39587 ) This adds tests for casting from short and Short to other standard types in Painless. This also corrects a few errors from byte and Byte cast tests.	2019-03-04 10:09:29 -08:00
Martijn van Groningen	b8659fcb83	No need to extend from StatusToXContentObject, if RestToXContentListener is used instead of RestStatusToXContentListener	2019-03-04 13:29:10 +01:00
Martijn van Groningen	0550ead176	Cleanup GrokProcessorGetAction class (#39567 ) * Removed request builder. From 7.0, request builders are no longer used. * Use RestStatusToXContentListener instead of custom RestBuilderListener in the rest action. * Changed a few public constructors' and constants' visibility from public to package protected. (these are only used internally, so no need for public visibility)	2019-03-04 08:51:23 +01:00
Jack Conradson	687a66b580	Add byte and Byte to Painless standard cast tests (#39415 )	2019-03-01 08:35:20 -08:00
Alan Woodward	71b8494181	Upgrade to lucene 8.0.0-snapshot-ff9509a8df (#39444 ) Backport of #39350 Contains the following: * LUCENE-8635: Move terms dictionary off-heap for non-primary-key fields in `MMapDirectory` * LUCENE-8292: `TermsEnum` is fully abstract * LUCENE-8679: Return WITHIN in `EdgeTree#relateTriangle` only when polygon and triangle share one edge * LUCENE-8676: Nori tokenizer deals correctly with large buffers * LUCENE-8697: `GraphTokenStreamFiniteStrings` better handles side paths with gaps * LUCENE-8664: Add `equals` and `hashCode` to `TotalHits` * LUCENE-8660: `TopDocsCollector` returns accurate hit counts if the total equals the threshold * LUCENE-8654: `Polygon2D#relateTriangle` fix for when the polygon is inside the triangle * LUCENE-8645: `Intervals#fixField` can merge intervals from different fields * LUCENE-8585: Create jump-tables for DocValues at index time	2019-02-27 14:36:08 +00:00
Marios Trivyzas	11fe8cd16f	[Tests] Fix flakiness by ensuring stable cluster (#39300 ) (#39356 ) In integration tests where `setBootstrapMasterNodeIndex()` is used in combination with `autoMinMasterNodes = false` the cluster can start bootstrapping once the number of nodes set with the `setBootstrapMasterNodeIndex` have been started but it's not ensured that all nodes have successfully joined to form the cluster. This behaviour was introduced with `5db7ed22a0` and in order to ensure that the cluster is properly formed before proceeding with the integration test, use `ensureStableCluster()` with the appropriate number of expected nodes. Fixes: #39220	2019-02-25 17:26:15 +01:00
Mayya Sharipova	e80284231d	Backport distance functions vectors (#39330 ) Distance functions for dense and sparse vectors Backport for #37947, #39313	2019-02-23 11:52:43 -05:00
Zachary Tong	8af0e7c4b6	Only create final MatrixStatsResults on final reduction (#39205 ) MatrixStatsResults is the "final" result object, and runs an additional computation in its ctor to calculate covariance, etc. This means it should only run on the final reduction instead of on every reduce.	2019-02-21 14:18:45 -05:00
Christoph Büscher	4b77d0434a	Remove `nGram` and `edgeNGram` token filter names (#39070 ) In #30209 we deprecated the camel case `nGram` filter name in favour of `ngram` and did the same for `edgeNGram` and `edge_ngram` and we are removing those names in 8.0. This change disallows using the deprecated names for new indices created in 7.0 by throwing an error if these filters are used. Relates to #38911	2019-02-21 16:55:40 +01:00
Jason Tedor	751c05eff9	Bump jackson-databind version for ingest-geoip (#39182 ) This commit bumps the jackson-databind version for ingest-geoip to 2.8.11.3.	2019-02-20 11:40:31 -05:00
Henning Andersen	00a26b9dd2	Blob store compression fix (#39073 ) Blob store compression was not enabled for some of the files in snapshots due to constructor accessing sub-class fields. Fixed to instead accept compress field as constructor param. Also fixed chunk size validation to work. Deprecated repositories.fs.compress setting as well to be able to unify in a future commit.	2019-02-20 09:24:41 +01:00
Ioannis Kakavas	ec2b64af63	Disable date parsing test in non english locale (#39052 ) This ensures we do not attempt to parse non english locale dates in FIPS mode. The error, originally assumed to affect only Joda, affects Java time in the same manner and manifests only with the version of BouncyCastle FIPS certified provider we use in tests. The upstream issue https://github.com/bcgit/bc-java/issues/405 indicates that the behavior is resolved in later versions of the BouncyCastle library and should be tested again when the new versions become FIPS 140 certified	2019-02-20 09:02:37 +02:00
Tal Levy	f30f1fe9b6	fix RethrottleTests retry (#38978 ) (#39131 ) the RethrottleTests assumed that tasks that were unprepared to rethrottle would bubble up into the Rethrottle response as an ElasticsearchException wrapping an IllegalArgumentException. This seems to have changed to potentially involve further levels of wrapping. This change makes the retry logic more resilient to arbitrary nesting of the underlying IllegalArgumentException	2019-02-19 11:10:39 -08:00
Jake Landis	46bb663a09	Make 7.x like 6.7 user agent ecs, but default to true (#38828 ) Forward port of https://github.com/elastic/elasticsearch/pull/38757 This change reverts the initial 7.0 commits and replaces them with the 6.7 variant that still allows for the ecs flag. This commit differs from the 6.7 variants in that ecs flag will now default to true. 6.7: `ecs` : default `false` 7.x: `ecs` : default `true` 8.0: no option, but behaves as `true` * Revert "Ingest node - user agent, move device to an object (#38115)" This reverts commit `5b008a34aa`. * Revert "Add ECS schema for user-agent ingest processor (#37727) (#37984)" This reverts commit `cac6b8e06f`. * cherry-pick 5dfe1935345da3799931fd4a3ebe0b6aa9c17f57 Add ECS schema for user-agent ingest processor (#37727) * cherry-pick ec8ddc890a34853ee8db6af66f608b0ad0cd1099 Ingest node - user agent, move device to an object (#38115) (#38121) * cherry-pick f63cbdb9b426ba24ee4d987ca767ca05a22f2fbb (with manual merge fixes) Dep. check for ECS changes to User Agent processor (#38362) * make true the default for the ecs option, and update 7.0 references and tests	2019-02-13 10:28:01 -06:00
Alexander Reelsen	884b5063a4	Create ISO8601 joda compatible java time formatter (#38434 ) The existing formatter being used was not on par with the joda formatter as it was missing the ability to parse a comma as a separator between seconds and milliseconds. While a real iso8601 would be much more complex, this might be sufficient for some more use-cases. The ingest date formatter now also uses the iso8601 formatter by default. Closes #38345	2019-02-11 15:11:26 +01:00
Christoph Büscher	f61420140d	Use only default type in rank_eval API (#38530 ) Currently tests still use custom type names. In preparation for the final types removal this change moves all of them to use the default "_doc" type in tests.	2019-02-11 10:18:13 +01:00
Alexander Reelsen	56edc8e37f	Fix timezone fallback in ingest processor (#38407 ) (#38664 ) If no timezone was specified in the date processor, then the conversion would lead to wrong time, as UTC was assumed by default, leading to incorrectly parsed dates. This commit does not assume a default timezone and will thus not format the dates in a wrong way.	2019-02-09 20:28:59 +01:00
Luca Cavanna	a7046e001c	Remove support for maxRetryTimeout from low-level REST client (#38085 ) We have had various reports of problems caused by the maxRetryTimeout setting in the low-level REST client. Such setting was initially added in the attempts to not have requests go through retries if the request already took longer than the provided timeout. The implementation was problematic though as such timeout would also expire in the first request attempt (see #31834), would leave the request executing after expiration causing memory leaks (see #33342), and would not take into account the http client internal queuing (see #25951). Given all these issues, it seems that this custom timeout mechanism gives little benefits while causing a lot of harm. We should rather rely on connect and socket timeout exposed by the underlying http client and accept that a request can overall take longer than the configured timeout, which is the case even with a single retry anyways. This commit removes the `maxRetryTimeout` setting and all of its usages.	2019-02-06 08:43:47 +01:00
Boaz Leskes	033ba725af	Remove support for internal versioning for concurrency control (#38254 ) Elasticsearch has long [supported](https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-index_.html#index-versioning) compare and set (a.k.a. optimistic concurrency control) operations using internal document versioning. Sadly that approach is flawed and can sometimes do the wrong thing. Here's the relevant excerpt from the resiliency status page: > When a primary has been partitioned away from the cluster there is a short period of time until it detects this. During that time it will continue indexing writes locally, thereby updating document versions. When it tries to replicate the operation, however, it will discover that it is partitioned away. It won’t acknowledge the write and will wait until the partition is resolved to negotiate with the master on how to proceed. The master will decide to either fail any replicas which failed to index the operations on the primary or tell the primary that it has to step down because a new primary has been chosen in the meantime. Since the old primary has already written documents, clients may already have read from the old primary before it shuts itself down. The version numbers of these reads may not be unique if the new primary has already accepted writes for the same document. We recently [introduced](https://www.elastic.co/guide/en/elasticsearch/reference/6.x/optimistic-concurrency-control.html) a new sequence number based approach that doesn't suffer from this dirty reads problem. This commit removes support for internal versioning as a concurrency control mechanism in favor of the sequence number approach. Relates to #1078	2019-02-05 20:53:35 +01:00
Julie Tibshirani	3ce7d2c9b6	Make sure to reject mappings with type _doc when include_type_name is false. (#38270 ) `CreateIndexRequest#source(Map<String, Object>, ... )`, which is used when deserializing index creation requests, accidentally accepts mappings that are nested twice under the type key (as described in the bug report #38266). This in turn causes us to be too lenient in parsing typeless mappings. In particular, we accept the following index creation request, even though it should not contain the type key `_doc`: ``` PUT index?include_type_name=false { "mappings": { "_doc": { "properties": { ... } } } } ``` There is a similar issue for both 'put templates' and 'put mappings' requests as well. This PR makes the minimal changes to detect and reject these typed mappings in requests. It does not address #38266 generally, or attempt a larger refactor around types in these server-side requests, as I think this should be done at a later time.	2019-02-05 10:52:32 -08:00
David Turner	f2dd5dd6eb	Remove DiscoveryPlugin#getDiscoveryTypes (#38414 ) With this change we no longer support pluggable discovery implementations. No known implementations of `DiscoveryPlugin` actually override this method, so in practice this should have no effect on the wider world. However, we were using this rather extensively in tests to provide the `test-zen` discovery type. We no longer need a separate discovery type for tests as we no longer need to customise its behaviour. Relates #38410	2019-02-05 17:42:24 +00:00
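
The message for commit 23395a9b9f describes how a match_bool_prefix query expands: a term clause for every term except the last, and a prefix clause for the last term. The sketch below illustrates that expansion only; it is not the Elasticsearch implementation. The function name `match_bool_prefix_sketch` is hypothetical, and lowercasing plus whitespace splitting stands in for a real analyzer, which is an assumption.

```python
def match_bool_prefix_sketch(field, text):
    """Build a bool query dict that mimics the expansion described above.

    Assumption: lowercasing and whitespace splitting stand in for the
    field's analyzer; a real analyzer may produce different terms.
    """
    terms = text.lower().split()
    if not terms:
        # Nothing to match on: return an empty bool query.
        return {"bool": {"should": []}}

    # Every term except the last becomes a term clause.
    clauses = [{"term": {field: term}} for term in terms[:-1]]
    # The last (possibly partial) term becomes a prefix clause.
    clauses.append({"prefix": {field: terms[-1]}})
    return {"bool": {"should": clauses}}


if __name__ == "__main__":
    import json
    print(json.dumps(match_bool_prefix_sketch("message", "quick brown f"), indent=2))
```

For the input "quick brown f" on a field named `message` (a made-up field name), this yields term clauses for "quick" and "brown" and a prefix clause for "f".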

5119 Commits
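
The off-by-one fix in commit 419cf1c02f (vectors of up to 499 dimensions accepted, against a stated limit of 500) can be illustrated with a minimal sketch. The real check lives in Elasticsearch's Java code and is not shown in this log; the function names and the shape of the comparison below are assumptions made purely for illustration.

```python
MAX_DIMS = 500  # limit stated in the commit message (a later commit raises it to 1024)


def accepts_dims_buggy(dims):
    # Hypothetical buggy check: the strict comparison rejects dims == MAX_DIMS,
    # so only vectors of up to 499 dimensions pass.
    return dims < MAX_DIMS


def accepts_dims_fixed(dims):
    # Hypothetical fixed check: the inclusive comparison accepts the stated limit.
    return dims <= MAX_DIMS


if __name__ == "__main__":
    for dims in (499, 500, 501):
        print(dims, accepts_dims_buggy(dims), accepts_dims_fixed(dims))
```

The two functions differ only at the boundary value, which is why a test at exactly `MAX_DIMS` is what exposes this kind of bug.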