OpenSearch/x-pack/plugin
Dimitris Athanasiou 873ad3f942
[7.x][ML] Add option to regression to randomize training set (#45969) (#46017)
Adds a parameter `training_percent` to regression. The default
value is `100`. When the parameter is set to a value less than `100`,
from the rows that can be used for training (ie. those that have a
value for the dependent variable) we randomly choose whether to actually
use for training. This enables splitting the data into a training set and
the rest, usually called testing, validation or holdout set, which allows
for validating the model on data that have not been used for training.

Technically, the analytics process considers as training the data that
have a value for the dependent variable. Thus, when we decide a training
row is not going to be used for training, we simply clear the row's
dependent variable.
2019-08-27 17:53:11 +03:00
..
ccr Include leases in ccr errmsg when ops no longer available (#45681) 2019-08-20 10:40:12 -04:00
core [7.x][ML] Add option to regression to randomize training set (#45969) (#46017) 2019-08-27 17:53:11 +03:00
data-frame [ML][Transforms] adjusting when and what to audit (#45876) (#45916) 2019-08-23 13:53:02 -05:00
data-science Add Cumulative Cardinality agg (and Data Science plugin) (#45990) 2019-08-26 16:19:55 -04:00
deprecation Add deprecation check for pidfile setting (#45939) 2019-08-24 17:19:20 -04:00
frozen-indices Ignore translog retention policy if soft-deletes enabled (#45473) 2019-08-22 16:40:06 -04:00
graph Remove Settings from BaseRestRequest Constructor (#45418) (#45429) 2019-08-12 05:14:45 +02:00
ilm [7.x] Use rollover for SLM's history indices (#45686) 2019-08-21 13:42:11 -06:00
logstash Remove description from xpack feature sets (#43065) 2019-06-11 09:22:58 -07:00
mapper-flattened Add support for 'flattened object' fields. (#43762) 2019-07-01 12:08:50 +03:00
ml [7.x][ML] Add option to regression to randomize training set (#45969) (#46017) 2019-08-27 17:53:11 +03:00
monitoring Remove Settings from BaseRestRequest Constructor (#45418) (#45429) 2019-08-12 05:14:45 +02:00
rollup Fixing rollup state tests after onFailure ordering change (#45784) (#45814) 2019-08-21 14:46:09 -05:00
search-business-rules Search enhancement: pinned queries (#44345) (#45657) 2019-08-20 11:38:22 +01:00
security Add `manage_own_api_key` cluster privilege (#45897) (#46023) 2019-08-28 00:44:23 +10:00
spatial Mute ShapeQueryTests#testFieldAlias() 2019-08-21 10:31:13 +01:00
sql Update joda to 2.10.3 (#45495) 2019-08-23 10:39:39 +02:00
src/test [7.x][ML] Add option to regression to randomize training set (#45969) (#46017) 2019-08-27 17:53:11 +03:00
vectors Adds usage stats for vectors (#45023) 2019-07-31 12:32:41 -04:00
voting-only-node Close nodes properly in Coordinator tests (#44967) 2019-07-30 11:47:36 +01:00
watcher Watcher add email warning if CSV attachment contains formulas (#44460) (#45557) 2019-08-26 08:35:33 -05:00
build.gradle Skip feature aware check on JDK 14 (#45928) 2019-08-23 17:38:15 -04:00