OpenSearch

Dimitris Athanasiou 873ad3f942 [7.x][ML] Add option to regression to randomize training set (#45969) (#46017) Adds a parameter `training_percent` to regression. The default value is `100`. When the parameter is set to a value less than `100`, we randomly choose, for each row that can be used for training (i.e. each row that has a value for the dependent variable), whether to actually use it for training. This enables splitting the data into a training set and the rest, usually called the testing, validation or holdout set, which allows for validating the model on data that has not been used for training. Technically, the analytics process treats as training data the rows that have a value for the dependent variable. Thus, when we decide a training row is not going to be used for training, we simply clear the row's dependent variable.	2019-08-27 17:53:11 +03:00
basic-multi-node	Convert testclusters to use distro download plugin (#44253) (#44362)	2019-07-15 17:53:05 -07:00
disabled	Convert testclusters to use distro download plugin (#44253) (#44362)	2019-07-15 17:53:05 -07:00
ml-with-security	[7.x][ML] Add option to regression to randomize training set (#45969) (#46017)	2019-08-27 17:53:11 +03:00
native-multi-node-tests	[7.x][ML] Add option to regression to randomize training set (#45969) (#46017)	2019-08-27 17:53:11 +03:00
no-bootstrap-tests	Testclusters: convert ml qa tests (#43229)	2019-06-18 11:55:11 +03:00
single-node-tests	Convert testclusters to use distro download plugin (#44253) (#44362)	2019-07-15 17:53:05 -07:00
build.gradle	Build: Pull "skip assemble on qa" to common build (#35214)	2018-11-05 16:16:00 -05:00
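
The `training_percent` behavior described in the latest commit message can be illustrated with a minimal sketch. This is not the code from the commit: the function name `select_training_rows`, its parameters, and the dict-based row representation are all hypothetical, chosen only to show the idea that a row is excluded from training by clearing its dependent variable.

```python
import random


def select_training_rows(rows, dependent_variable, training_percent=100.0, seed=None):
    """Return copies of `rows` in which some training rows are held out.

    Hypothetical illustration only. A row counts as a training candidate
    when it has a non-None value for `dependent_variable`. Each candidate
    is kept for training with probability training_percent / 100;
    otherwise its dependent variable is cleared (set to None), so that a
    downstream process would no longer treat it as training data.
    """
    rng = random.Random(seed)
    result = []
    for row in rows:
        row = dict(row)  # copy, so the caller's data is left untouched
        is_candidate = row.get(dependent_variable) is not None
        # rng.random() is in [0, 1), so training_percent=100 keeps every row
        # and training_percent=0 clears every candidate.
        if is_candidate and rng.random() * 100.0 >= training_percent:
            row[dependent_variable] = None
        result.append(row)
    return result
```

For example, `select_training_rows(rows, "price", training_percent=80.0, seed=42)` would keep roughly 80% of the rows that have a `price` value for training and clear `price` on the rest, which then serve as the holdout set. Rows that already lack a value are passed through unchanged.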