OpenSearch/x-pack/plugin
Dimitris Athanasiou f2d4c94a9c
[7.x][ML] Deduplicate multi-fields for data frame analytics (#48799) (#48806)
In the case multi-fields exist in the source index, we pick
all variants of them in our extracted fields detection for
data frame analytics. This means we may have multiple instances
of the same feature. The worse consequence of this is when the
dependent variable (for regression or classification) is also
duplicated which means we train a model on the dependent variable
itself.

Now that #48770 is merged, this commit is adding logic to
only select one variant of multi-fields.

Closes #48756

Backport of #48799
2019-11-01 16:53:05 +02:00
..
analytics Remove eclipse conditionals (#44075) 2019-10-03 11:55:00 +03:00
ccr Restore from Individual Shard Snapshot Files in Parallel (#48110) (#48686) 2019-10-30 14:36:30 +01:00
core Copy http headers to ThreadContext strictly (#45945) (#48675) 2019-10-31 23:05:12 +02:00
deprecation Add migration tool checks for `_field_names` disabling (#46972) 2019-09-25 10:15:10 +02:00
enrich Don't preserve indices between enrich qa tests. 2019-10-31 14:23:56 +01:00
frozen-indices Remove unused transport action from TransportFreezeIndexAction (#47992) 2019-10-14 16:20:37 +02:00
graph Fix XPackPlugin usages in tests (#47252) 2019-10-02 12:36:02 +02:00
ilm Fix TimeSeriesLifecycleActionsIT.testRolloverAlreadyExists (#48747) (#48795) 2019-11-01 12:34:33 +00:00
logstash Remove description from xpack feature sets (#43065) 2019-06-11 09:22:58 -07:00
mapper-flattened Remove eclipse conditionals (#44075) 2019-10-03 11:55:00 +03:00
ml [7.x][ML] Deduplicate multi-fields for data frame analytics (#48799) (#48806) 2019-11-01 16:53:05 +02:00
monitoring [7.x] Validate monitoring username at parse time (#48774) 2019-11-01 09:02:37 -05:00
rollup Ensure random timestamps are within search boundary (#38753) (#47787) 2019-10-10 14:38:01 +02:00
search-business-rules Remove eclipse conditionals (#44075) 2019-10-03 11:55:00 +03:00
security Copy http headers to ThreadContext strictly (#45945) (#48675) 2019-10-31 23:05:12 +02:00
spatial Remove eclipse conditionals (#44075) 2019-10-03 11:55:00 +03:00
sql Cleanup static instance in @AfterClass 2019-10-31 23:24:40 -04:00
src/test Add owner flag parameter to the rest spec (#48500) 2019-10-30 13:07:01 +11:00
transform [ML][Transforms] add wait_for_checkpoint flag to stop (#47935) (#48591) 2019-10-28 13:02:57 -04:00
vectors Refactor unit tests for vector functions. (#48662) 2019-10-30 15:36:06 -07:00
voting-only-node Remove eclipse conditionals (#44075) 2019-10-03 11:55:00 +03:00
watcher Update jakarta mail dependency to 1.6.4 (#47810) 2019-10-11 09:24:11 +02:00
build.gradle Convert RunTask to use testclusers, remove ClusterFormationTasks (#47572) 2019-10-08 14:43:29 +03:00