OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-19 19:35:02 +00:00

Author	SHA1	Message	Date
Benjamin Trent	12943c5d2c	[ML] Add data frame task state object and field (#40169 ) (#40490 ) * [ML] Add data frame task state object and field * A new state item is added so that the overall task state can be accounted for * A new FAILED state and reason have been added as well so that failures can be shown to the user for optional correction * Addressing PR comments * adjusting after master merge * addressing pr comment * Adjusting auditor usage with failure state * Refactor, renamed state items to task_state and indexer_state * Adding todo and removing redundant auditor call * Address HLRC changes and PR comment * adjusting hlrc IT test	2019-03-27 06:53:58 -05:00
Costin Leau	33737b6b21	SQL: Polish parsing of CAST expression (#40428 ) (cherry picked from commit 9d291aa300bbb827eeae606e7d3e55eeef7cce00)	2019-03-27 12:20:58 +02:00
Hendrik Muhs	f4e56118c2	[ML] generate unique doc ids for data frame (#40382 ) create and use unique, deterministic document ids based on the grouping values. This is a pre-requisite for updating documents as well as preventing duplicates after a hard failure during indexing.	2019-03-27 08:27:05 +01:00
Julie Tibshirani	25954a8dd3	Stop clearing all watches in watcher integration tests. (#39724 )	2019-03-26 13:14:33 -07:00
alex101101	fb8ad0cf30	Add a soft limit to the field name length (#40309 ) Adds an optional limit to the length of field names, throws an IllegalArgumentException if the limit is breached. Closes #33651	2019-03-26 17:58:32 +01:00
Jay Modi	9bd8600c2e	Use ephemeral ports for idp-fixture (#40333 ) This change removes the use of hardcoded port values for the idp-fixture in favor of the mapped ephemeral ports. This should prevent failures due to port conflicts in CI.	2019-03-26 08:44:53 -06:00
David Kyle	1354696db9	[ML] Data Frame HLRC Get Stats API (#40443 )	2019-03-26 11:17:13 +00:00
Costin Leau	7234d78747	SQL: Fix classpath discovery on Java 10+ (#40420 ) (cherry picked from commit 2cef233cb34ee80d8ed9cd014cea76ea5096d206)	2019-03-26 08:16:37 +02:00
Ed Savage	c20ea9a2dd	[ML][TEST] Fix failing test testPersistJobOnGracefulShutdown_givenTimeAdvancedAfterNoNewData (#40363 ) Ensure that there is at least a 1s delay between the time that state is persisted by each of the two jobs in the test. Model snapshot IDs use the current time in epoch seconds to distinguish themselves, hence a snapshot will be overwritten by another if both occur in the same 1s window. Closes #40347	2019-03-25 17:55:10 +00:00
Costin Leau	61f49af497	SQL: Spec tests now use classpath discovery (#40388 ) To avoid having to specify each spec by hand (which can miss specs to be added), the test infrastructure now performs classpath discovery so that each spec added, is automatically considered. Relates #40358 (cherry picked from commit d0f60b4425c731509aa8ca765d55f563f866ef90)	2019-03-25 15:22:52 +02:00
Benjamin Trent	7b4f964708	[ML] make source and dest objects in the transform config (#40337 ) (#40396 ) * [ML] make source and dest objects in the transform config * addressing PR comments * Fixing compilation post merge * adding comment for Arrays.hashCode * addressing changes for moving dest to object * fixing data_frame yml tests * fixing API test	2019-03-25 07:16:41 -05:00
Hendrik Muhs	38afc9f27d	refresh audit index before searching (#40401 ) refresh the audit index before searching	2019-03-25 11:57:57 +01:00
Nhat Nguyen	b9f96a8e1f	Expose external refreshes through the stats API (#38643 ) Right now, the stats API only provides refresh metrics regarding internal refreshes. This isn't very useful and somewhat misleading for cluster administrators since the internal refreshes are not indicative of documents being available for search. In this PR I added a new metric for collecting external refreshes as they occur and exposing them through the stats API. Now, calling an endpoint for stats will yield external refresh metrics as well. Relates #36712	2019-03-24 22:21:00 -04:00
Benjamin Trent	a30bf27b2f	[ML] add auditor to data frame plugin (#40012 ) (#40394 ) * [Data Frame] add auditor * Adjusting Level, Auditor, and message to address pr comments * Addressing PR comments	2019-03-23 18:56:44 -05:00
Benjamin Trent	2dd879abac	[ML] adds support for non-numeric mapped types (#40220 ) (#40380 ) * [ML] adds support for non-numeric mapped types and mapping overrides * correcting hlrc compilation issues after merge * removing mapping_override option * clearing up unnecessary changes	2019-03-23 14:04:14 -05:00
Benjamin Trent	88f510ffc2	[ML] making test more determinate (#40374 ) (#40381 ) * [ML] making test more determinate * unmuting test	2019-03-23 12:15:37 -05:00
Jason Tedor	03839ba1a2	Update feature aware check ASM to 7.1 (#40389 ) This commit updates the feature aware check ASM dependency to ASM 7.1. This gives us JDK 13 compatibility.	2019-03-23 12:57:15 -04:00
Marios Trivyzas	143db10980	SQL: Fix timezone issues with JDBC getDate/getTime (#40360 ) Previously, `getDate(int columnIdx)/getDate(String columnLabel)` were using the legacy `java.util.Calendar` instead of the `java.time.*` classes to reset to the start of day. This resulted in different results for certain timestamps and timezones when calling `getDate(col)` vs `getObject(col, java.sql.Date)` Now only the methods (that must be implemented due to the JDBC spec) `getDate(int columnIdx, Calendar cal)/getDate(String columnLabel, Calendar cal)` are still using the `java.util.Calendar` for those conversions. The same change was applied to `getTime(int columnIdx)/getTime(String columnLabel)` and `getTimestamp(int columnIdx)/getTimestamp(String columnLabel)` Fixes: #40289 (cherry picked from commit 44560671f18397e0c58e3647732880fcb73a5034)	2019-03-23 17:01:08 +01:00
Jason Tedor	10bbb082a4	Only run retention lease actions on active primary (#40386 ) In some cases, a request to perform a retention lease action can arrive on a primary shard before it is active. In this case, the primary shard would not yet be in primary mode, tripping an assertion in the replication tracker. Instead, we should not attempt to perform such actions on an initializing shard. This commit addresses this by not returning the primary shard in the single shard iterator if the primary shard is not yet active.	2019-03-23 09:39:39 -04:00
Marios Trivyzas	17b8b54d5e	SQL: Fix metric aggs on date/time to not return double (#40377 ) Previously metric aggregations on date fields would return a double which caused errors when trying to apply scalar functions on top, e.g.: ``` SELECT YEAR(MAX(date)) FROM test ``` Fixes: #40376 (cherry-picked from commit 41d0a038467fbdbbf67fd9bfdf27623451cae63a)	2019-03-23 14:13:38 +01:00
Costin Leau	558adc0f28	SQL: Add missing handling of IP field in JDBC (#40384 ) Fix #40358 (cherry picked from commit ee286fa4893817637c05d72b93b254b36efc0dae) (cherry picked from commit d2296249499e31bd512390ac3d20bc38009612b3)	2019-03-23 12:58:10 +02:00
Andrei Stefan	150d1332cf	SQL: Fix RLIKE bug and improve testing for RLIKE statement (#40354 ) * Refactor RegexMatch to support both LIKE and RLIKE * Add integration tests for RLIKE * Polish the rest of tests (cherry picked from commit 7562d6eeeb77c04794002649fe726f4b3a9a398b)	2019-03-23 06:37:53 +02:00
Costin Leau	87d3d16c5a	SQL: JLine upgrade and polishing (#40321 ) Upgrade JLine to 3.10.0 Switch to using JLine granular jars instead of the uber-one Remove Jansi dependency (due to errors in closing streams) Pin JNA dependency to our own artifact Fix #40239 (cherry picked from commit 9afa65fa80111f3b68c13373c7b6db13c11dde31)	2019-03-22 23:55:51 +02:00
Costin Leau	496070fda6	SQL: CAST supports both SQL and ES types (#40365 ) Extend CAST to support all data types notations (whether SQL or ES specific) Fix #40282 (cherry picked from commit eb2ee8a344da946920598839a5db76c8bb9bc3fe)	2019-03-22 23:55:51 +02:00
Benjamin Trent	05460cca58	Muting test testExtractIndexCheckpointsInconsistentGlobalCheckpoints (#40370 )	2019-03-22 13:25:48 -05:00
Hendrik Muhs	5a0c32833e	Add a checkpoint service for data frame transforms (#39836 ) Add a checkpoint service for data frame transforms, which allows asking for a checkpoint of the source. In future these checkpoints will be stored in the internal index to - detect upstream changes - update the data frame without a full re-run - allow data frame clients to checkpoint themselves	2019-03-22 10:25:30 +01:00
David Turner	1265a15b75	Mute testPersistJobOnGracefulShutdown_givenTimeAdvancedAfterNoNewData	2019-03-22 08:46:51 +00:00
Costin Leau	980ee14f57	DOC: Expand section on ORDER BY aggs (#40332 ) (cherry picked from commit 99d2f6fc9864ab972259ef5692129ab49e4a7ab8)	2019-03-22 10:04:52 +02:00
Andrei Stefan	f9ab9afcc1	Extract the first value in an array when looking at the returned values (#40318 ) (cherry picked from commit faf02e0f42a101985619abc0d30753851605e01d)	2019-03-22 06:43:37 +02:00
Andrei Stefan	35fe05308e	SQL: rewrite ROUND and TRUNCATE functions with a different optional parameter handling method (#40242 ) * Rewrite Round and Truncate functions to have a slightly different approach to handling the optional parameter in the constructor. Until now the optional parameter was considered 0 if the value was missing and the constructor was filling in this value. The current solution is to have the optional parameter as null right until the actual calculation is done. (cherry picked from commit 3e314f8fa4cb322e67949e80857561ce51268726)	2019-03-22 06:43:37 +02:00
Yogesh Gaikwad	280567da8d	Correct documentation link for authorization engine example (#40261 ) (#40292 ) This commit fixes the link for authorization engine example.	2019-03-22 12:38:03 +11:00
Nhat Nguyen	0e12065b54	Relax max_seq_no_of_updates assertion in follow tests If there's a failover on the follower, then its max_seq_no_of_updates is bootstrapped from its max_seq_no which might be higher than the max_seq_no_of_updates of the leader. We need to relax this check. Relates #40249	2019-03-21 19:41:55 -04:00
Ed Savage	23d5f7babf	[ML] Add integration tests to check persistence (#40272 ) (#40315 ) Additional checks to exercise the behaviour of persistence on graceful close of an anomaly job. Related to elastic/ml-cpp#393 Backports #40272	2019-03-21 17:01:10 +00:00
Lisa Cawley	e6799849d1	[DOCS] Adds placeholder for start and stop data frame transform APIs (#40278 )	2019-03-21 09:39:10 -07:00
Lisa Cawley	caa0129d44	[DOCS] Adds placeholder for create and delete data frame transform APIs (#40233 )	2019-03-21 09:13:50 -07:00
lcawl	0e712d476e	Adds URL for preview data frame transforms	2019-03-21 08:28:23 -07:00
Lisa Cawley	ff2bcc9d11	[DOCS] Adds placeholder for get data frame transform APIs (#40283 )	2019-03-21 07:57:01 -07:00
Albert Zaharovits	2f80b7304f	Refactor Token Service (#39808 ) This refactoring is in the context of the work related to moving security tokens to a new index. In that regard, the Token Service has to work with token documents stored in either of the two indices, albeit only as a transient situation. I reckoned the added complexity as unmanageable, hence this refactoring. This is incomplete, as it fails to address the goal of minimizing .security accesses, but I have stopped because otherwise it would've become a full blown rewrite (if not already). I will follow-up with more targeted PRs. In addition to being a true refactoring, some 400 errors moved to 500. Furthermore, more stringent validation of various return results has been implemented, notably that of the token document creation.	2019-03-21 15:55:56 +02:00
Costin Leau	dd41ce0763	SQL: Preserve original source for cast/convert function (#40271 ) Improve rule for pruning cast to preserve the original source Fix #40239 (cherry picked from commit 7591cb1a1577320b3aec2ec557b0f881b6af744f)	2019-03-21 14:08:15 +02:00
Jason Tedor	1e6941b138	Reduce retention lease sync intervals (#40302 ) This commit adjusts the frequency with which CCR renews retention leases and with which primaries sync retention leases to replicas. This helps Lucene reclaim soft-deleted documents more aggressively, which we have found in some use-cases can help improve performance, and either way will help keep disk space under more control.	2019-03-21 07:37:44 -04:00
Andrei Stefan	1a5ff05870	SQL: fix LIKE function equality by considering its pattern as well (#40260 ) * Define an equals method for Like function so that the pattern used is considered in the equality check. Whenever the functions are resolved this check should be used. (cherry picked from commit 4e5d5af58a140573b8ee19d57c7839db7b779e3b)	2019-03-21 11:44:57 +02:00
David Kyle	a4cb92a300	[ML] Data Frame HLRC Preview API (#40258 )	2019-03-21 09:38:27 +00:00
Andrei Stefan	d485be631b	Moving tests in locale-aware test file (#40254 ) (cherry picked from commit 9beb31fd3c5a8323cb08cc524f1a2268e9c72c24)	2019-03-21 10:57:37 +02:00
Yogesh Gaikwad	5d30df5a60	Fix so non super users can also create API keys (#40028 ) (#40286 ) When creating API keys we check whether an API key with the same key name already exists and fail the request if it does. The check should have been performed with XPackSecurityUser instead of the authenticated user. This caused the request to fail when a non-super user tried to create an API key. This commit fixes this by executing the search action with SECURITY_ORIGIN so it can be executed with XPackSecurityUser. Also fixed the Rest test to avoid using a user with `super_user` role. Closes #40029	2019-03-21 15:53:25 +11:00
Marios Trivyzas	e1eb683c51	SQL: Fix issue with getting DATE type in JDBC (#40207 ) Previously, calling getDate()/getTime()/getTimestamp() and getObject() with the corresponding java.sql class on a column of SQL DATE type from the JDBC result set would throw an Exception.	2019-03-21 01:48:06 +01:00
Benjamin Trent	5ae43855fc	[ML] Refactor GET Transforms API (#40015 ) (#40269 ) * [Data Frame] Refactor GET Transforms API: * Add pagination * comma delimited list expression support GET transforms * Flag troublesome internal code for future refactor * Removing `allow_no_transforms` param, ratcheting down pageparam option * Changing DataFrameFeatureSet#usage to not get all configs * Intermediate commit * Writing test for batch data gatherer * Removing unused import * removing bad println used for debugging * Updating BatchedDataIterator comments and query * addressing pr comments * disallow null scrollId to cause stackoverflow	2019-03-20 19:14:50 -05:00
Marios Trivyzas	f37f2b5d39	SQL: Fix issue with optimization on queries with ORDER BY/LIMIT (#40256 ) Previously, when a trivial plain `SELECT` or a trivial `SELECT` with aggregations also had an `ORDER BY` or a `LIMIT` or both, the optimization to convert it to a `LocalRelation` was skipped, resulting in an exception being thrown. E.g.: ``` SELECT 'foo' FROM test LIMIT 10 ``` or ``` SELECT 'foo' FROM test GROUP BY 1 ORDER BY 1 ``` Fixes: #40211	2019-03-20 23:52:35 +01:00
Marios Trivyzas	bc4c8e53c5	SQL: Fix issue with date columns returned always in UTC (#40163 ) When selecting columns of ES type `date` (SQL's DATETIME) the `FieldHitExtractor` was not using the timezone of the client session but always resorted to UTC. The same behaviour (UTC only) was encountered also for grouping keys (`CompositeKeyExtractor`) and for First/Last functions on dates (`TopHitsAggExtractor`). Fixes: #40152	2019-03-20 20:32:33 +01:00
Like	6f64267626	Make setting index.translog.sync_interval be dynamic (#37382 ) Currently, we cannot update the index setting index.translog.sync_interval while an index is open, because it is not dynamic and can be updated for a closed index only. Closes #32763	2019-03-20 17:12:45 +01:00
Henning Andersen	4c2a8638ca	Cascading primary failure lead to MSU too low (#40249 ) If a replica were first reset due to one primary failover and then promoted (before resync completes), its MSU would not include changes since global checkpoint, leading to errors during translog replay. Fixed by re-initializing MSU before restoring local history.	2019-03-20 14:00:43 +01:00

2896 Commits