OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-20 03:45:02 +00:00

Author	SHA1	Message	Date
Mark Vieira	63eccb16de	Make LoggerUsageTask cacheable (#42550 ) (cherry picked from commit 0bb46d73cb89016ab9d96e76693bb0d7cee267a1)	2019-05-24 18:30:46 -07:00
Mark Vieira	24cf86a013	Ignore JAR manifests when snapshotting runtime classpaths (#42548 ) (cherry picked from commit d5281fc96f6fb2f022c87699bdad64d88614e04c)	2019-05-24 18:28:47 -07:00
Julie Tibshirani	3a6c2525ca	Deprecate support for chained multi-fields. (#42330 ) This PR contains a straight backport of #41926, and also updates the migration documentation and deprecation info API for 7.x.	2019-05-24 15:55:06 -07:00
Jason Tedor	f2cfd09289	Remove renewal in retention lease recovery test (#42536 ) This commit removes the act of renewing some retention leases during a retention lease recovery test. Having renewal does not add anything extra to this test, but does allow for some situations where the test can fail spuriously (i.e., in a way that does not indicate that production code is broken).	2019-05-24 17:40:59 -05:00
Mark Vieira	5695992094	Use reproducible method of generating properties file for better caching (#42539 ) (cherry picked from commit 9772574f9d0b942a1ee8dba5ff503b4cd286e36c)	2019-05-24 15:07:29 -07:00
Nhat Nguyen	74d771d8f6	Adjust load SplitIndexIT#testSplitIndexPrimaryTerm (#42477 ) SplitIndexIT#testSplitIndexPrimaryTerm sometimes timeout due to relocating many shards. This change adjusts loads and increases the timeout.	2019-05-24 15:47:29 -04:00
Nhat Nguyen	02739d038c	Mute accounting circuit breaker check after test (#42448 ) If we close an engine while a refresh is happening, then we might leak refCount of some SegmentReaders. We need to skip the ram accounting circuit breaker check until we have a new Lucene snapshot which includes the fix for LUCENE-8809. This also adds a test to the engine but left it muted so we won't forget to reenable this check. Closes #30290	2019-05-24 15:42:12 -04:00
David Roberts	48dc0dca57	[ML] Use map and filter instead of flatMap in find_file_structure (#42534 ) Using map and filter avoids the garbage from all the Stream.of calls that flatMap necessitated. Performance is better when there are masses of fields.	2019-05-24 20:12:06 +01:00
David Roberts	34de68b007	[ML] Fix possible race condition when closing an opening job (#42506 ) This change fixes a race condition that would result in an in-memory data structure becoming out-of-sync with persistent tasks in cluster state. If repeated often enough this could result in it being impossible to open any ML jobs on the affected node, as the master node would think the node had capacity to open another job but the chosen node would error during the open sequence due to its in-memory data structure being full. The race could be triggered by opening a job and then closing it a tiny fraction of a second later. It is unlikely a user of the UI could open and close the job that fast, but a script or program calling the REST API could. The nasty thing is, from the externally observable states and stats everything would appear to be fine - the fast open then close sequence would appear to leave the job in the closed state. It's only later that the leftovers in the in-memory data structure might build up and cause a problem.	2019-05-24 20:11:58 +01:00
James Rodewig	d521a88e19	[DOCS] Move callouts to end of line for Asciidoctor migration (#42356 )	2019-05-24 15:03:46 -04:00
Hendrik Muhs	6d47ee9268	[ML-DataFrame] add support for fixed_interval, calendar_interval, remove interval (#42427 ) * add support for fixed_interval, calendar_interval, remove interval * adapt HLRC * checkstyle * add a hlrc to server test * adapt yml test * improve naming and doc * improve interface and add test code for hlrc to server * address review comments * repair merge conflict * fix date patterns * address review comments * remove assert for warning * improve exception message * use constants	2019-05-24 20:30:17 +02:00
Igor Motov	e28a9e99c4	SQL: Moves the JTS-based tests suppression to Before (#42526 ) Moves the test suppression from `ClassRule` to `Before`, where it is properly handled in the CI build. Fixes #42221	2019-05-24 13:58:53 -04:00
David Roberts	09e8910b0f	[DOCS] Adding ML-specific prerequisites to setup docs (#42529 )	2019-05-24 10:49:41 -07:00
James Rodewig	43dd081e22	[DOCS] Fix nested def list for Asciidoctor (#42353 )	2019-05-24 13:39:49 -04:00
Nhat Nguyen	329d1307a5	Add test to verify force primary allocation on closed indices (#42458 ) This change adds a test verifying that we can force primary allocation on closed indices.	2019-05-24 17:23:58 +02:00
Henning Andersen	075fd2a0ac	Shard CLI tool always check shards (#41480 ) The shard CLI tool would not do anything if a corruption marker was not present. But a corruption marker is only added if a corruption is detected during indexing/writing, not if a search or other read fails. Changed the tool to always check shards regardless of corruption marker presence. Related to #41298	2019-05-24 16:49:37 +02:00
Marios Trivyzas	523b5bfdb5	Fix sorting on nested field with unmapped (#42451 ) Previously sorting on a missing nested field would fail with an Exception: `[nested_field] failed to find nested object under path [nested_path]` despite `unmapped_type` being set on the query. Fixes: #33644 (cherry picked from commit 631142d5dd088a10de8dcd939b50a14301173283)	2019-05-24 15:47:41 +02:00
Christoph Büscher	12d5642e93	Small internal AnalysisRegistry changes (#42500 ) Some internal refactorings to the AnalysisRegistry, spin-off from #40782.	2019-05-24 15:27:35 +02:00
David Turner	a5b6ed8d1e	Remove AwaitsFix of #41967 following #42504	2019-05-24 14:26:49 +01:00
David Turner	4d02ca1633	Drain master task queue when stabilising (#42504 ) Today the default stabilisation time is calculated on the assumption that the elected master has no pending tasks to process when it is elected, but this is not a safe assumption to make. This can result in a cluster reaching the end of its stabilisation time without having stabilised. Furthermore in #36943 we increased the probability that each step in `runRandomly()` enqueues another task, vastly increasing the chance that we hit such a situation. This change extends the stabilisation process to allow time for all pending tasks, plus a task that might currently be in flight. Fixes #41967, in which the master entered the stabilisation phase with over 800 tasks to process.	2019-05-24 14:18:02 +01:00
Marios Trivyzas	56677f69cf	Mute testTrustConfigReloadsFileContents Tracked by #42509	2019-05-24 14:03:46 +02:00
weizijun	40348ab726	Use accurate total hits in IndexPrimaryRelocationIT By default, we track total hits up to 10k but we might index more than 10k documents `testPrimaryRelocationWhileIndexing`. With this change, we always request for the accurate total hits in the test. > java.lang.AssertionError: Count is 10000+ hits but 11684 was expected.	2019-05-24 12:47:21 +02:00
Simon Willnauer	46ccfba808	Remove IndexStore and DirectoryService (#42446 ) Both of these classes are basically a bloated wrapper around a simple construct that can simply be a DirectoryFactory interface. This change removes both classes and replaces them with a simple stateless interface that creates a new `Directory` per shard. The concept of `index.store` is preserved since it makes sense from a configuration perspective.	2019-05-24 12:14:56 +02:00
Hendrik Muhs	7cee294acf	[ML-DataFrame]backport dataframe changes from 42202, using client instead of transport (#42468 ) backport dataframe changes from #42202, using client instead of transport	2019-05-24 11:05:30 +02:00
David Roberts	f472186b9f	[ML] Improve file structure finder timestamp format determination (#41948 ) This change contains a major refactoring of the timestamp format determination code used by the ML find file structure endpoint. Previously timestamp format determination was done separately for each piece of text supplied to the timestamp format finder. This had the drawback that it was not possible to distinguish dd/MM and MM/dd in the case where both numbers were 12 or less. In order to do this sensibly it is best to look across all the available timestamps and see if one of the numbers is greater than 12 in any of them. This necessitates making the timestamp format finder an instantiable class that can accumulate evidence over time. Another problem with the previous approach was that it was only possible to override the timestamp format to one of a limited set of timestamp formats. There was no way out if a file to be analysed had a timestamp that was sane yet not in the supported set. This is now changed to allow any timestamp format that can be parsed by a combination of these Java date/time formats: yy, yyyy, M, MM, MMM, MMMM, d, dd, EEE, EEEE, H, HH, h, mm, ss, a, XX, XXX, zzz Additionally S letter groups (fractional seconds) are supported providing they occur after ss and separated from the ss by a dot, comma or colon. Spacing and punctuation is also permitted with the exception of the question mark, newline and carriage return characters, together with literal text enclosed in single quotes. The full list of changes/improvements in this refactor is: - Make TimestampFormatFinder an instantiable class - Overrides must be specified in Java date/time format - Joda format is no longer accepted - Joda timestamp formats in outputs are now derived from the determined or overridden Java timestamp formats, not stored separately - Functionality for determining the "best" timestamp format in a set of lines has been moved from TextLogFileStructureFinder to TimestampFormatFinder, taking advantage of the fact that TimestampFormatFinder is now an instantiable class with state - The functionality to quickly rule out some possible Grok patterns when looking for timestamp formats has been changed from using simple regular expressions to the much faster approach of using the Shift-And method of sub-string search, but using an "alphabet" consisting of just 1 (representing any digit) and 0 (representing non-digits) - Timestamp format overrides are now much more flexible - Timestamp format overrides that do not correspond to a built-in Grok pattern are mapped to a %{CUSTOM_TIMESTAMP} Grok pattern whose definition is included within the date processor in the ingest pipeline - Grok patterns that correspond to multiple Java date/time patterns are now handled better - the Grok pattern is accepted as matching broadly, and the required set of Java date/time patterns is built up considering all observed samples - As a result of the more flexible acceptance of Grok patterns, when looking for the "best" timestamp in a set of lines timestamps are considered different if they are preceded by a different sequence of punctuation characters (to prevent timestamps far into some lines being considered similar to timestamps near the beginning of other lines) - Out-of-the-box Grok patterns that are considered now include %{DATE} and %{DATESTAMP}, which have indeterminate day/month ordering - The order of day/month in formats with indeterminate day/month order is determined by considering all observed samples (plus the server locale if the observed samples still do not suggest an ordering) Relates #38086 Closes #35137 Closes #35132	2019-05-24 09:10:08 +01:00
David Turner	f864f6a740	Cluster state from API should always have a master (#42454 ) Today the `TransportClusterStateAction` ignores the state passed by the `TransportMasterNodeAction` and obtains its state from the cluster applier. This might be inconsistent, showing a different node as the master or maybe even having no master. This change adjusts the action to use the passed-in state directly, and adds tests showing that the state returned is consistent with our expectations even if there is a concurrent master failover. Fixes #38331 Relates #38432	2019-05-24 08:45:22 +01:00
David Turner	528f8cc073	Add stack traces to RetentionLeasesIT failures (#42425 ) Today `RetentionLeaseIT` calls `fail(e.toString())` on some exceptions, losing the stack trace that came with the exception. This commit adjusts this to re-throw the exception wrapped in an `AssertionError` so we can see more details about failures such as #41430.	2019-05-24 08:37:51 +01:00
David Turner	c0974a9813	Add more logging to MockDiskUsagesIT (#42424 ) This commit adds a log message containing the routing table, emitted on each iteration of the failing assertBusy() in #40174. It also modernizes the code a bit.	2019-05-24 08:28:10 +01:00
Adrien Grand	f3c33d6d96	Add 7.1.1 release notes.	2019-05-24 09:26:04 +02:00
Mark Vieira	200b7e1888	Gradle init script for enabling remote build cache	2019-05-23 21:00:08 -07:00
Tim Vernum	567c0d331f	Fix settings prefix for realm truststore password (#42413 ) As part of #30241 realm settings were changed to be true affix settings. In the process of this change, the "ssl." prefix was lost from the realm truststore password. It should be: xpack.security.authc.realms.<type>.<name>.ssl.truststore.password Due to a mismatch between the way we define SSL settings and load SSL contexts, there was no way to define this legacy password setting in a realm config. The settings validation would reject "ssl.truststore.password" but the SSL service would ignore "truststore.password" Backport of: #42336	2019-05-24 13:16:26 +10:00
Jack Conradson	167f391cfd	Bug fix to allow access to top level params in reduce script (#42096 )	2019-05-23 16:00:39 -07:00
Ryan Ernst	a49bafc194	Split document and metadata fields in GetResult (#38373 ) (#42456 ) This commit makes creators of GetField split the fields into document fields and metadata fields. It is part of larger refactoring that aims to remove the calls to static methods of MapperService related to metadata fields, as discussed in #24422.	2019-05-23 14:01:07 -07:00
David Roberts	14f29de2a8	Avoid HashMap construction on Grok non-match (#42444 ) This change moves the construction of the result HashMap in Grok.captures() into the branch that actually needs it. This probably will not make a measurable difference for ingest pipelines, but it is beneficial to the ML find_file_structure endpoint, as it tries out many Grok patterns that will fail to match.	2019-05-23 21:09:33 +01:00
Jake Landis	2b22ceac04	Bulk processor concurrent requests (#41451 ) (#42438 ) `org.elasticsearch.action.bulk.BulkProcessor` is a threadsafe class that allows for simple semantics to deal with sending bulk requests. Once a bulk reaches it's pre-defined size, documents, or flush interval it will execute sending the bulk. One configurable option is the number of concurrent outstanding bulk requests. That concurrency is implemented in `org.elasticsearch.action.bulk.BulkRequestHandler` via a semaphore. However, the only code that currently calls into this code is blocked by `synchronized` methods. This results in the in-ability for the BulkProcessor to behave concurrently despite supporting configurable amounts of concurrent requests. This change removes the `synchronized` method in favor an explicit lock around the non-thread safe parts of the method. The call into `org.elasticsearch.action.bulk.BulkRequestHandler` is no longer blocking, which allows `org.elasticsearch.action.bulk.BulkRequestHandler` to handle it's own concurrency.	2019-05-23 14:22:16 -05:00
Costin Leau	a48125a9f7	Fix FROZEN indices backport	2019-05-23 21:30:41 +03:00
Costin Leau	9fdf4215dd	Docs: Documentation for the upcoming SQL support of frozen indices (#41863 ) (cherry picked from commit a3cc03eb1503df24c1706a721fcc9af38c3b2873) (cherry picked from commit f42dcf2ffd7bd25f3f91aa6127515f393cd1860f)	2019-05-23 21:16:16 +03:00
Costin Leau	d5f04d29c9	SQL: Add support for FROZEN indices (#41558 ) Allow querying of FROZEN indices both through dedicated SQL grammar extension: > SELECT field FROM FROZEN index and also through driver configuration parameter, namely: > index.include.frozen: true/false Fix #39390 Fix #39377 (cherry picked from commit 2445a933915f420c7f51e8505afa0a7978ce6b0f)	2019-05-23 21:16:16 +03:00
emasab	a142e8cfd8	Build local year inside DateFormat lambda bugfix for https://github.com/elastic/elasticsearch/issues/41797 (#42120) This makes sure that the year can change between when the lambda is generated and when it is executed without causing the incorrect year to be used. Resolves #41797	2019-05-23 10:36:11 -06:00
Mark Vieira	c9ed91312a	Mute slow and flaky build-tools integration tests	2019-05-23 09:30:26 -07:00
Simon Willnauer	5a884dac03	Unguice Snapshot / Restore services (#42357 ) This removes the @Inject annotations from the Snapshot/Restore infrastructure classes and registers them manually in Node.java	2019-05-23 17:09:26 +02:00
Jim Ferenczi	a497603219	Disable max score optimization for queries with unbounded max scores (#41361 ) Lucene 8 has the ability to skip blocks of non-competitive documents. However some queries don't track their maximum score (`script_score`, `span`, ...) so they always return Float.POSITIVE_INFINITY as maximum score. This can slow down some boolean queries if other clauses have bounded max scores. This commit disables the max score optimization when we detect a mandatory scoring clause with unbounded max scores. Optional clauses are not checked since they can still skip documents when the unbounded clause is after the current document.	2019-05-23 16:53:57 +02:00
Yannick Welsch	f57fdc57e9	Deprecate max_local_storage_nodes (#42426 ) Allows this setting to be removed in 8.0, see #42428	2019-05-23 15:59:55 +02:00
Christoph Büscher	85ff9543b7	Prevent normalizer from not being closed on exception (#42375 ) Currently AnalysisRegistry#processNormalizerFactory creates a normalizer and only later checks whether it should be added to the normalizer map passed in. In case we throw an exception it isn't closed. This can be prevented by moving the check that throws the exception earlier.	2019-05-23 15:53:55 +02:00
Zachary Tong	6d8a0e36ec	Re-mute all ml_datafeed_crud rolling upgrade tests AwaitsFix https://github.com/elastic/elasticsearch/issues/42258 Thought this was fixed, but throwing deprecation warnings at an unexpected time so putting this back on mute until we figure it out.	2019-05-23 09:50:27 -04:00
David Kyle	a23257ce06	[ML Data Frame] Account for completed data frames in test (#42351 ) When asserting on the checkpoint value if the DF has completed the checkpoint will be 1 else 0. Similarly state may be started or indexing. Closes #42309	2019-05-23 14:05:09 +01:00
markharwood	c2c8d0e637	Test fix - results equality failed because of subtle scoring differences between replicas. (#42366 ) Diverging merge policies means the segments and therefore scores are not the same. Fixed the test by ensuring there are zero replicas. Closes #32492	2019-05-23 12:00:57 +01:00
Jim Ferenczi	b88e80ab89	Upgrade to Lucene 8.1.0 (#42214 ) This commit upgrades to the GA release of Lucene 8.1.0	2019-05-23 11:46:45 +02:00
Jim Ferenczi	4ca5649a0d	Upgrade to lucene 8.1.0-snapshot-e460356abe (#40952 )	2019-05-23 11:45:33 +02:00
Marios Trivyzas	0777223bab	Allow `fields` to be set to `` (#42301 ) Allow for SimpleQueryString, QueryString and MultiMatchQuery to set the `fields` parameter to the wildcard ``. If so, set the leniency to `true`, to achieve the same behaviour as from the `"default_field" : "" setting. Furthermore, check if `` is in the list of the `default_field` but not necessarily as the 1st element. Closes: #39577 (cherry picked from commit e75ff0c748e6b68232c2b08e19ac4a4934918264)	2019-05-23 10:10:48 +02:00

1 2 3 4 5 ...

45952 Commits