OpenSearch

Commit Graph

Author	SHA1	Message	Date
Andy Bristol	d07b4ecfa3	awaitsfix testRandomClusterStateUpdates For #32308	2018-07-23 17:20:01 -07:00
Zachary Tong	6ba144ae31	Add WeightedAvg metric aggregation (#31037 ) Adds a new single-value metrics aggregation that computes the weighted average of numeric values that are extracted from the aggregated documents. These values can be extracted from specific numeric fields in the documents. When calculating a regular average, each datapoint has an equal "weight"; it contributes equally to the final value. In contrast, weighted averages scale each datapoint differently. The amount that each datapoint contributes to the final value is extracted from the document, or provided by a script. As a formula, a weighted average is the `∑(value * weight) / ∑(weight)` A regular average can be thought of as a weighted average where every value has an implicit weight of `1`. Closes #15731	2018-07-23 18:33:15 -04:00
Julie Tibshirani	1b1aa4ecff	Fix a test bug around nested aggregations and field aliases. (#32287 ) This issue affected both NestedAggregatorTest and ReverseNestedAggregatorTest.	2018-07-23 12:25:42 -07:00
Andrey Ershov	33f11e637d	Fail shard if IndexShard#storeStats runs into an IOException (#32241 ) Fail shard if IndexShard#storeStats runs into an IOException. Closes #29008	2018-07-23 16:38:55 +02:00
Christoph Büscher	ff87b7aba4	Remove unnecessary warning supressions (#32250 )	2018-07-23 11:31:04 +02:00
itsnotv	4b3284f7cb	CCE when re-throwing "shard not available" exception in TransportShardMultiGetAction (#32185 ) ClassCastException can be thrown by callers of TransportActions.isShardNotAvailableException(e) as e is not always an instance of ElasticSearchException fixes #32173	2018-07-23 11:09:52 +03:00
Christoph Büscher	54d896c4ed	[Tests] Remove QueryStringQueryBuilderTests#toQuery class assertions (#32236 ) Currently we check that the queries that QueryStringQueryBuilder#toQuery returns is one out of a list of many Lucene query classes. This list has extended a lot over time, since QueryStringQueryBuilder can build all sort of queries. This makes the test hard to maintain. The recent addition of alias fields which build a BlendedTermQuery show how easy this test breaks. Also the current assertions doesn't add a lot in terms of catching errors. This is why we decided to remove this check. Closes #32234	2018-07-20 19:08:59 +02:00
Julie Tibshirani	af0c1d30fe	Make sure that field aliases count towards the total fields limit. (#32222 )	2018-07-20 10:06:07 -07:00
Paul Sanwald	320f1d263f	muting failing test for internal auto date histogram to avoid failure before fix is merged	2018-07-20 11:20:51 -04:00
Armin Braun	91a0daf0e4	MINOR: Remove unused `IndexDynamicSettings` (#32237 )	2018-07-20 17:14:17 +02:00
Jim Ferenczi	6ed1ad0b6f	Fix multi level nested sort (#32204 ) The parent filter for nested sort should always match all parents regardless of the child queries. It is used to find the boundaries of a single parent and we use the child query to match all the filters set in the nested tree so there is no need to repeat the nested filters. With this change we ensure that we build bitset filters only to find the root docs (or the docs at the level where the sort applies) that can be reused among queries. Closes #31554 Closes #32130 Closes #31783 Co-authored-by: Dominic Bevacqua <bev@treatwell.com>	2018-07-20 16:55:11 +02:00
Lee Hinman	74aa7b0815	Enhance Parent circuit breaker error message (#32056 ) * Enhance Parent circuit breaker error message This adds information about either the current real usage (if tracking "real" memory usage) or the child breaker usages to the exception message when the parent circuit breaker trips. The messages now look like: ``` [parent] Data too large, data for [my_request] would be [211288064/201.5mb], which is larger than the limit of [209715200/200mb], usages [request=157286400/150mb, fielddata=54001664/51.5mb, in_flight_requests=0/0b, accounting=0/0b] ``` Or when tracking real memory usage: ``` [parent] Data too large, data for [request] would be [251/251b], which is larger than the limit of [200/200b], real usage: [181/181b], new bytes reserved: [70/70b] ``` * Only call currentMemoryUsage once by returning structured object	2018-07-20 08:52:45 -06:00
Alexander Reelsen	c5cde96691	Dependencies: Upgrade to joda time 2.10 (#32160 ) Changelog: http://www.joda.org/joda-time/changes-report.html	2018-07-20 10:18:38 +02:00
Luca Cavanna	00a6ad0e9e	Remove aliases resolution limitations when security is enabled (#31952 ) Resolving wildcards in aliases expression is challenging as we may end up with no aliases to replace the original expression with, but if we replace with an empty array that means _all which is quite the opposite. Now that we support and serialize the original requested aliases, whenever aliases are replaced we will be able to know what was initially requested. `MetaData#findAliases` can then be updated to not return anything in case it gets empty aliases, but the original aliases were not empty. That means that empty aliases are interpreted as _all only if they were originally requested that way. Relates to #31516	2018-07-20 09:23:32 +02:00
Julie Tibshirani	0f0068b91c	Ensure that field aliases cannot be used in multi-fields. (#32219 )	2018-07-20 00:18:54 -07:00
Mayya Sharipova	4c68dfe001	Handle missing values in painless (#32207 ) Throw an exception for doc['field'].value if this document is missing a value for the field. After deprecation changes have been backported to 6.x, make this a default behaviour in 7.0 Closes #29286	2018-07-19 17:41:06 -04:00
Tal Levy	9ae6905657	add support for write index resolution when creating/updating documents (#31520 ) Now write operations like Index, Delete, Update rely on the write-index associated with an alias to operate against. This means writes will be accepted even when an alias points to multiple indices, so long as one is the write index. Routing values will be used from the AliasMetaData for the alias in the write-index. All read operations are left untouched.	2018-07-19 09:17:49 -07:00
Christoph Büscher	f232c36c19	Fix comments causing errors with Java 11	2018-07-19 09:42:33 +02:00
Julie Tibshirani	15ff3da653	Add support for field aliases. (#32172 ) * Add basic support for field aliases in index mappings. (#31287) * Allow for aliases when fetching stored fields. (#31411) * Add tests around accessing field aliases in scripts. (#31417) * Add documentation around field aliases. (#31538) * Add validation for field alias mappings. (#31518) * Return both concrete fields and aliases in DocumentFieldMappers#getMapper. (#31671) * Make sure that field-level security is enforced when using field aliases. (#31807) * Add more comprehensive tests for field aliases in queries + aggregations. (#31565) * Remove the deprecated method DocumentFieldMappers#getFieldMapper. (#32148)	2018-07-18 09:33:09 -07:00
Alan Woodward	cfb30144c9	Call setReferences() on custom referring tokenfilters in _analyze (#32157 ) When building custom tokenfilters without an index in the _analyze endpoint, we need to ensure that referring filters are correctly built by calling their #setReferences() method Fixes #32154	2018-07-18 14:43:20 +01:00
Boaz Leskes	5856c396dd	A replica can be promoted and started in one cluster state update (#32042 ) When a replica is fully recovered (i.e., in `POST_RECOVERY` state) we send a request to the master to start the shard. The master changes the state of the replica and publishes a cluster state to that effect. In certain cases, that cluster state can be processed on the node hosting the replica together with a cluster state that promotes that, now started, replica to a primary. This can happen due to cluster state batched processing or if the master died after having committed the cluster state that starts the shard but before publishing it to the node with the replica. If the master also held the primary shard, the new master node will remove the primary (as it failed) and will also immediately promote the replica (thinking it is started). Sadly our code in IndexShard didn't allow for this which caused [assertions](`13917162ad/server/src/main/java/org/elasticsearch/index/seqno/ReplicationTracker.java (L482)`) to be tripped in some of our tests runs.	2018-07-18 11:30:44 +02:00
Christoph Büscher	15f95a9f93	Fix `range` queries on `_type` field for singe type indices (#31756 ) With the introduction of single types in 6.x, the `_type` field is no longer indexed, which leads to certain queries that were working before throw errors now. One such query is the `range` query, that, if performed on a single typer index, currently throws an IAE since the field is not indexed. This change adds special treatment for this case in the TypeFieldMapper, comparing the range queries lower and upper bound to the one existing type and either returns a MatchAllDocs or a MatchNoDocs query. Relates to #31632 Closes #31476	2018-07-18 09:12:28 +02:00
Nhat Nguyen	df1380b8d3	Remove versionType from translog (#31945 ) With the introduction of sequence number, we no longer use versionType to resolve out of order collision in replication and recovery requests. This PR removes removes the versionType from translog. We can only remove it in 7.0 because it is still required in a mixed cluster between 6.x and 5.x.	2018-07-17 21:59:48 -04:00
Ryan Ernst	6371d51866	Build: Make additional test deps of check (#32015 ) This commit moves additional unit test runners from being dependencies of the test task to dependencies of check. Without this change, reproduce lines are incorrect due to the additional test runner not matching any of the reproduce class/method info. closes #31964	2018-07-17 13:14:46 -07:00
Nhat Nguyen	ef81c1df57	Ensure to release translog snapshot in primary-replica resync (#32045 ) Previously we create a translog snapshot inside the resync method, and that snapshot will be closed by the resync listener. However, if the resync method throws an exception before the resync listener is initialized, the translog snapshot won't be released. Closes #32030	2018-07-17 09:41:34 -04:00
Armin Braun	ed3b44fb4c	Handle TokenizerFactory TODOs (#32063 ) * Don't replace Replace TokenizerFactory with Supplier, this approach was rejected in #32063 * Remove unused parameter from constructor	2018-07-17 14:14:02 +02:00
markharwood	a7e477126f	Relax TermVectors API to work with textual fields other than TextFieldType (#31915 ) This changes the field-eligibility test to check one level up in the class hierarchy to allow any subclasses of StringFieldType. Closes #31902	2018-07-17 13:11:10 +01:00
Ioannis Kakavas	9e529d9d58	Enable testing in FIPS140 JVM (#31666 ) Ensure our tests can run in a FIPS JVM JKS keystores cannot be used in a FIPS JVM as attempting to use one in order to init a KeyManagerFactory or a TrustManagerFactory is not allowed.( JKS keystore algorithms for private key encryption are not FIPS 140 approved) This commit replaces JKS keystores in our tests with the corresponding PEM encoded key and certificates both for key and trust configurations. Whenever it's not possible to refactor the test, i.e. when we are testing that we can load a JKS keystore, etc. we attempt to mute the test when we are running in FIPS 140 JVM. Testing for the JVM is naive and is based on the name of the security provider as we would control the testing infrastrtucture and so this would be reliable enough. Other cases of tests being muted are the ones that involve custom TrustStoreManagers or KeyStoreManagers, null TLS Ciphers and the SAMLAuthneticator class as we cannot sign XML documents in the way we were doing. SAMLAuthenticator tests in a FIPS JVM can be reenabled with precomputed and signed SAML messages at a later stage. IT will be covered in a subsequent PR	2018-07-17 10:54:10 +03:00
Christoph Büscher	36165265ce	Fix put mappings java API documentation (#31955 ) The current docs of the put-mapping Java API is currently broken. It its current form, it creates an index and uses the whole mapping definition given as a JSON string as the type name. Since we didn't check the index created in the IndicesDocumentationIT so far this went unnoticed. This change adds test to catch this error to the documentation test, changes the documentation so it works correctly now and adds an input validation to PutMappingRequest#buildFromSimplifiedDef() which was used internally to reject calls where no mapping definition is given. Closes #31906	2018-07-17 09:09:03 +02:00
Armin Braun	4b5071f2d0	Add Index UUID to `/_stats` Response (#31871 ) * Add "uuid" field to each index's section in the `/_stats` response * closes #31791	2018-07-17 06:50:21 +02:00
Jim Ferenczi	f699cb9f55	Bypass highlight query terms extraction on empty fields (#32090 ) Dealing with empty fields in the highlight phase can slow down the query because the query terms extraction is done independently on each field. This change shortcuts the highlighting performed by the unified highlighter for fields that are not present in the document. In such cases there is nothing to higlight so we don't need to visit the query to build the highligh builder.	2018-07-17 00:26:01 +02:00
Daniel Mitterdorfer	1fef139c11	Ensure only parent breaker trips in unit test With this commit we raise the limit of the child circuit breaker used in the unit test for the circuit breaker service so it is high enough to trip only the parent circuit breaker. The previous limit was 300 bytes but theoretically (considering overhead) we could reach 346 bytes. Thus any value larger than 300 bytes could trip the child circuit breaker leading to spurious failures. Relates #31767	2018-07-16 13:50:17 +02:00
Jim Ferenczi	fa59bb1099	Fix BWC check after backport Relates #31808	2018-07-16 11:59:59 +02:00
Armin Braun	3679d00a74	Replace Ingest ScriptContext with Custom Interface (#32003 ) * Replace Ingest ScriptContext with Custom Interface * Make org.elasticsearch.ingest.common.ScriptProcessorTests#testScripting more precise * Don't mock script factory in ScriptProcessorTests * Adjust mock script plugin in IT for new API	2018-07-13 23:26:10 +02:00
Jack Conradson	42ca520377	Clean Up Snapshot Create Rest API (#31779 ) Make SnapshotInfo and CreateSnapshotResponse parsers lenient for backwards compatibility. Remove extraneous fields from CreateSnapshotRequest toXContent.	2018-07-13 13:07:26 -07:00
Vladimir Dolzhenko	b1bf643e41	lazy snapshot repository initialization (#31606 ) lazy snapshot repository initialization	2018-07-13 20:05:49 +02:00
Colin Goodheart-Smithe	0edb096eb4	Adds a new auto-interval date histogram (#28993 ) * Adds a new auto-interval date histogram This change adds a new type of histogram aggregation called `auto_date_histogram` where you can specify the target number of buckets you require and it will find an appropriate interval for the returned buckets. The aggregation works by first collecting documents in buckets at second interval, when it has created more than the target number of buckets it merges these buckets into minute interval bucket and continues collecting until it reaches the target number of buckets again. It will keep merging buckets when it exceeds the target until either collection is finished or the highest interval (currently years) is reached. A similar process happens at reduce time. This aggregation intentionally does not support min_doc_count, offest and extended_bounds to keep the already complex logic from becoming more complex. The aggregation accepts sub-aggregations but will always operate in `breadth_first` mode deferring the computation of sub-aggregations until the final buckets from the shard are known. min_doc_count is effectively hard-coded to zero meaning that we will insert empty buckets where necessary. Closes #9572 * Adds documentation * Added sub aggregator test * Fixes failing docs test * Brings branch up to date with master changes * trying to get tests to pass again * Fixes multiBucketConsumer accounting * Collects more buckets than needed on shards This gives us more options at reduce time in terms of how we do the final merge of the buckeets to produce the final result * Revert "Collects more buckets than needed on shards" This reverts commit 993c782d117892af9a3c86a51921cdee630a3ac5. * Adds ability to merge within a rounding * Fixes nonn-timezone doc test failure * Fix time zone tests * iterates on tests * Adds test case and documentation changes Added some notes in the documentation about the intervals that can bbe returned. Also added a test case that utilises the merging of conseecutive buckets * Fixes performance bug The bug meant that getAppropriate rounding look a huge amount of time if the range of the data was large but also sparsely populated. In these situations the rounding would be very low so iterating through the rounding values from the min key to the max keey look a long time (~120 seconds in one test). The solution is to add a rough estimate first which chooses the rounding based just on the long values of the min and max keeys alone but selects the rounding one lower than the one it thinks is appropriate so the accurate method can choose the final rounding taking into account the fact that intervals are not always fixed length. Thee commit also adds more tests * Changes to only do complex reduction on final reduce * merge latest with master * correct tests and add a new test case for 10k buckets * refactor to perform bucket number check in innerBuild * correctly derive bucket setting, update tests to increase bucket threshold * fix checkstyle * address code review comments * add documentation for default buckets * fix typo	2018-07-13 13:08:35 -04:00
Mayya Sharipova	80492cacfc	Add second level of field collapsing (#31808 ) * Put second level collapse under inner_hits Closes #24855	2018-07-13 11:40:03 -04:00
Alan Woodward	f9791cf158	Remove deprecated AnalysisPlugin#requriesAnalysisSettings method (#32037 )	2018-07-13 15:49:26 +01:00
Alan Woodward	a01e26a39b	Correct spelling of AnalysisPlugin#requriesAnalysisSettings (#32025 ) Because this is a static method on a public API, and one that we encourage plugin authors to use, the method with the typo is deprecated in 6.x rather than just renamed.	2018-07-13 13:13:21 +01:00
Daniel Mitterdorfer	f174f72fee	Circuit-break based on real memory usage With this commit we introduce a new circuit-breaking strategy to the parent circuit breaker. Contrary to the current implementation which only accounts for memory reserved via child circuit breakers, the new strategy measures real heap memory usage at the time of reservation. This allows us to be much more aggressive with the circuit breaker limit so we bump it to 95% by default. The new strategy is turned on by default and can be controlled with the new cluster setting `indices.breaker.total.userealmemory`. Note that we turn it off for all integration tests with an internal test cluster because it leads to spurious test failures which are of no value (we cannot fully control heap memory usage in tests). All REST tests, however, will make use of the real memory circuit breaker. Relates #31767	2018-07-13 10:08:28 +02:00
Igor Motov	44f280fc89	Force execution of fetch tasks (#31974 ) Forces fetch tasks to queue even in the event that the queue is already full. The reasoning is that fetch tasks may only be follow-up to query tasks, so the number of additional fetch tasks that may enter the threadpool is expected to be reasonable. Closes #29442	2018-07-12 08:56:06 -07:00
Alexander Reelsen	0b7e7befdd	Tests: Fix SearchFieldsIT.testDocValueFields (#31995 ) This test produced different implementations of joda time classes, depending on if the data was serialized or not (DateTime vs MutableDateTime). This now uses a common base class to extract the milliseconds from the data. Closes #31992	2018-07-12 16:06:56 +02:00
Alexander Reelsen	e3707efe74	Tests: Mute SearchFieldsIT.testDocValueFields() Relates #31992	2018-07-12 12:19:39 +02:00
Alexander Reelsen	ac4e0f1b1d	Tests: Remove use of joda time in some tests (#31922 ) This also extends the dateformatters test to ensure that the printers are acting the same in java time and joda time.	2018-07-12 09:55:17 +02:00
James Baiera	5bcdff73d7	Add Snapshots Status API to High Level Rest Client (#31515 ) This PR adds the Snapshots Status API to the Snapshot Client, as well as additional documentation for the status api.	2018-07-11 12:07:31 -04:00
Nhat Nguyen	25cd835010	Increase logging level for testStressMaybeFlush Relates #31629	2018-07-10 17:44:48 -04:00
Sohaib Iftikhar	88c270d844	Added lenient flag for synonym token filter (#31484 ) * Added lenient flag for synonym-tokenfilter. Relates to #30968 * added docs for synonym-graph-tokenfilter -- Also made lenient final -- changed from !lenient to lenient == false * Changes after review (1) -- Renamed to ElasticsearchSynonymParser -- Added explanation for ElasticsearchSynonymParser::add method -- Changed ElasticsearchSynonymParser::logger instance to static * Added lenient option for WordnetSynonymParser -- also added more documentation * Added additional documentation * Improved documentation	2018-07-10 17:11:50 -04:00
Christoph Büscher	2ac7e49924	Fix broken NaN check in MovingFunctions#stdDev() (#31888 ) The initial check will never be true, because of the special semantics of NaN, where no value is equal to Nan, including NaN. Thus, x == Double.NaN always evaluates to false. The method still works correct because later computations will also return NaN if the avg argument is NaN, but the intended shortcut doesn't work.	2018-07-10 09:34:17 +02:00
Alexander Reelsen	1c32497c44	Date: Add DateFormatters class that uses java.time (#31856 ) A newly added class called DateFormatters now contains java.time based builders for dates, which also intends to be fully backwards compatible, when the name based date formatters are picked. Also a new class named CompoundDateTimeFormatter for being able to parse multiple different formats has been added. A duelling test class has been added that ensures the same dates when parsing java or joda time formatted dates for the name based dates. Note, that java.time and joda time are not fully backwards compatible, which also means that old formats will currently not work with this setup.	2018-07-10 09:28:28 +02:00

1 2 3 4 5 ...

909 Commits