OpenSearch/docs/reference/ml/df-analytics/apis/evaluate-dfanalytics.asciidoc

[role="xpack"]
[testenv="platinum"]
[[evaluate-dfanalytics]]
=== Evaluate {dfanalytics} API

[subs="attributes"]
++++
<titleabbrev>Evaluate {dfanalytics}</titleabbrev>
++++

Evaluates the {dfanalytics} for an annotated index.

experimental[]

[[ml-evaluate-dfanalytics-request]]
==== {api-request-title}

`POST _ml/data_frame/_evaluate`

[[ml-evaluate-dfanalytics-prereq]]
==== {api-prereq-title}

* You must have `monitor_ml` privilege to use this API. For more 
information, see {stack-ov}/security-privileges.html[Security privileges] and 
{stack-ov}/built-in-roles.html[Built-in roles].

[[ml-evaluate-dfanalytics-desc]]
==== {api-description-title}

The API packages together commonly used evaluation metrics for various types of 
machine learning features. This has been designed for use on indexes created by 
{dfanalytics}. Evaluation requires both a ground truth field and an analytics 
result field to be present.


[[ml-evaluate-dfanalytics-request-body]]
==== {api-request-body-title}

`index`::
  (Required, object) Defines the `index` in which the evaluation will be
  performed.

`query`::
  (Optional, object) A query clause that retrieves a subset of data from the 
  source index. See <<query-dsl>>.

`evaluation`::
  (Required, object) Defines the type of evaluation you want to perform. See 
  <<ml-evaluate-dfanalytics-resources>>.
+
--
Available evaluation types:

* `binary_soft_classification`
* `regression`
--


////
[[ml-evaluate-dfanalytics-results]]
==== {api-response-body-title}

`binary_soft_classification`::
  (object) If you chose to do binary soft classification, the API returns the
  following evaluation metrics:
  
`auc_roc`::: TBD

`confusion_matrix`::: TBD
  
`precision`::: TBD

`recall`::: TBD
////

[[ml-evaluate-dfanalytics-example]]
==== {api-examples-title}

===== Binary soft classification

[source,console]
--------------------------------------------------
POST _ml/data_frame/_evaluate
{
  "index": "my_analytics_dest_index",
  "evaluation": {
    "binary_soft_classification": {
      "actual_field": "is_outlier",
      "predicted_probability_field": "ml.outlier_score"
    }
  }
}
--------------------------------------------------
// TEST[skip:TBD]

The API returns the following results:

[source,console-result]
----
{
  "binary_soft_classification": {
    "auc_roc": {
      "score": 0.92584757746414444
    },
    "confusion_matrix": {
      "0.25": {
          "tp": 5,
          "fp": 9,
          "tn": 204,
          "fn": 5
      },
      "0.5": {
          "tp": 1,
          "fp": 5,
          "tn": 208,
          "fn": 9
      },
      "0.75": {
          "tp": 0,
          "fp": 4,
          "tn": 209,
          "fn": 10
      }
    },
    "precision": {
        "0.25": 0.35714285714285715,
        "0.5": 0.16666666666666666,
        "0.75": 0
    },
    "recall": {
        "0.25": 0.5,
        "0.5": 0.1,
        "0.75": 0
    }
  }
}
----


===== {regression-cap}

[source,console]
--------------------------------------------------
POST _ml/data_frame/_evaluate
{
  "index": "house_price_predictions", <1>
  "query": {
      "bool": {
        "filter": [
          { "term":  { "ml.is_training": false } } <2>
        ]
      }
  },
  "evaluation": {
    "regression": { 
      "actual_field": "price", <3>
      "predicted_field": "ml.price_prediction", <4>
      "metrics": {  
        "r_squared": {},
        "mean_squared_error": {}                             
      }
    }
  }
}
--------------------------------------------------
// TEST[skip:TBD]

<1> The output destination index from a {dfanalytics} {reganalysis}.
<2> In this example, a test/train split (`training_percent`) was defined for the 
{reganalysis}. This query limits evaluation to be performed on the test split 
only. 
<3> The ground truth value for the actual house price. This is required in order 
to evaluate results.
<4> The predicted value for house price calculated by the {reganalysis}.


The following example calculates the training error:

[source,console]
--------------------------------------------------
POST _ml/data_frame/_evaluate
{
  "index": "student_performance_mathematics_reg",
  "query": {
    "term": {
      "ml.is_training": {
        "value": true <1>
      }
    }
  },
  "evaluation": {
    "regression": { 
      "actual_field": "G3", <2>
      "predicted_field": "ml.G3_prediction", <3>
      "metrics": {  
        "r_squared": {},
        "mean_squared_error": {}                             
      }
    }
  }
}
--------------------------------------------------
// TEST[skip:TBD]

<1> In this example, a test/train split (`training_percent`) was defined for the 
{reganalysis}. This query limits evaluation to be performed on the train split 
only. It means that a training error will be calculated.
<2> The field that contains the ground truth value for the actual student 
performance. This is required in order to evaluate results.
<3> The field that contains the predicted value for student performance 
calculated by the {reganalysis}.


The next example calculates the testing error. The only difference compared with 
the previous example is that `ml.is_training` is set to `false` this time, so 
the query excludes the train split from the evaluation.

[source,console]
--------------------------------------------------
POST _ml/data_frame/_evaluate
{
  "index": "student_performance_mathematics_reg",
  "query": {
    "term": {
      "ml.is_training": {
        "value": false <1>
      }
    }
  },
  "evaluation": {
    "regression": { 
      "actual_field": "G3", <2>
      "predicted_field": "ml.G3_prediction", <3>
      "metrics": {  
        "r_squared": {},
        "mean_squared_error": {}                             
      }
    }
  }
}
--------------------------------------------------
// TEST[skip:TBD]

<1> In this example, a test/train split (`training_percent`) was defined for the 
{reganalysis}. This query limits evaluation to be performed on the test split 
only. It means that a testing error will be calculated.
<2> The field that contains the ground truth value for the actual student 
performance. This is required in order to evaluate results.
<3> The field that contains the predicted value for student performance 
calculated by the {reganalysis}.
[DOCS] Adds data frame analytics APIs to the ML APIs (#43875) This PR adds the reference documentation pages of the data frame analytics APIs (PUT, START, STOP, GET, GET stats, DELETE, Evaluate) to the ML APIs pool. 2019-07-05 07:34:05 -04:00			`[role="xpack"]`
			`[testenv="platinum"]`
			`[[evaluate-dfanalytics]]`
			`=== Evaluate {dfanalytics} API`

			`[subs="attributes"]`
			`++++`
			`<titleabbrev>Evaluate {dfanalytics}</titleabbrev>`
			`++++`

[DOCS] Adds data frame analytics API and evaluate API resource documentation (#43972) This PR adds the resource documentation of the data frame analytics APIs and the evaluate API to the ML API doc pool. 2019-07-11 12:05:05 -04:00			`Evaluates the {dfanalytics} for an annotated index.`
[DOCS] Adds data frame analytics APIs to the ML APIs (#43875) This PR adds the reference documentation pages of the data frame analytics APIs (PUT, START, STOP, GET, GET stats, DELETE, Evaluate) to the ML APIs pool. 2019-07-05 07:34:05 -04:00
[DOCS] Adds data frame analytics API and evaluate API resource documentation (#43972) This PR adds the resource documentation of the data frame analytics APIs and the evaluate API to the ML API doc pool. 2019-07-11 12:05:05 -04:00			`experimental[]`
[DOCS] Adds data frame analytics APIs to the ML APIs (#43875) This PR adds the reference documentation pages of the data frame analytics APIs (PUT, START, STOP, GET, GET stats, DELETE, Evaluate) to the ML APIs pool. 2019-07-05 07:34:05 -04:00
			`[[ml-evaluate-dfanalytics-request]]`
			`==== {api-request-title}`

			`POST _ml/data_frame/_evaluate`

			`[[ml-evaluate-dfanalytics-prereq]]`
			`==== {api-prereq-title}`

			* You must have `monitor_ml` privilege to use this API. For more
			`information, see {stack-ov}/security-privileges.html[Security privileges] and`
			`{stack-ov}/built-in-roles.html[Built-in roles].`

[DOCS] Adds data frame analytics API and evaluate API resource documentation (#43972) This PR adds the resource documentation of the data frame analytics APIs and the evaluate API to the ML API doc pool. 2019-07-11 12:05:05 -04:00			`[[ml-evaluate-dfanalytics-desc]]`
			`==== {api-description-title}`

[DOCS] Adds regression analytics resources and examples to the data frame analytics APIs and the evaluation API (#46176) * [DOCS] Adds regression analytics resources and examples to the data frame analytics APIs. Co-Authored-By: Benjamin Trent <ben.w.trent@gmail.com> Co-Authored-By: Tom Veasey <tveasey@users.noreply.github.com> 2019-09-19 03:10:11 -04:00			`The API packages together commonly used evaluation metrics for various types of`
			`machine learning features. This has been designed for use on indexes created by`
			`{dfanalytics}. Evaluation requires both a ground truth field and an analytics`
			`result field to be present.`
[DOCS] Adds data frame analytics API and evaluate API resource documentation (#43972) This PR adds the resource documentation of the data frame analytics APIs and the evaluate API to the ML API doc pool. 2019-07-11 12:05:05 -04:00

[DOCS] Adds data frame analytics APIs to the ML APIs (#43875) This PR adds the reference documentation pages of the data frame analytics APIs (PUT, START, STOP, GET, GET stats, DELETE, Evaluate) to the ML APIs pool. 2019-07-05 07:34:05 -04:00			`[[ml-evaluate-dfanalytics-request-body]]`
			`==== {api-request-body-title}`

[DOCS] Reformats API parameter details (#44194) 2019-07-12 11:26:31 -04:00			`index`::
			(Required, object) Defines the `index` in which the evaluation will be
			`performed.`
[7.x] Allow the user to specify 'query' in Evaluate Data Frame request (#45775) (#45825) 2019-08-22 05:14:26 -04:00
			`query`::
[DOCS] Adds regression analytics resources and examples to the data frame analytics APIs and the evaluation API (#46176) * [DOCS] Adds regression analytics resources and examples to the data frame analytics APIs. Co-Authored-By: Benjamin Trent <ben.w.trent@gmail.com> Co-Authored-By: Tom Veasey <tveasey@users.noreply.github.com> 2019-09-19 03:10:11 -04:00			`(Optional, object) A query clause that retrieves a subset of data from the`
			`source index. See <<query-dsl>>.`
[7.x] Allow the user to specify 'query' in Evaluate Data Frame request (#45775) (#45825) 2019-08-22 05:14:26 -04:00
[DOCS] Reformats API parameter details (#44194) 2019-07-12 11:26:31 -04:00			`evaluation`::
[DOCS] Adds regression analytics resources and examples to the data frame analytics APIs and the evaluation API (#46176) * [DOCS] Adds regression analytics resources and examples to the data frame analytics APIs. Co-Authored-By: Benjamin Trent <ben.w.trent@gmail.com> Co-Authored-By: Tom Veasey <tveasey@users.noreply.github.com> 2019-09-19 03:10:11 -04:00			`(Required, object) Defines the type of evaluation you want to perform. See`
			`<<ml-evaluate-dfanalytics-resources>>.`
			`+`
			`--`
			`Available evaluation types:`
[DOCS] Fixes typos in the PUT dfa and the evaluate dfa documentation. (#47348) 2019-10-02 03:49:59 -04:00
[DOCS] Adds regression analytics resources and examples to the data frame analytics APIs and the evaluation API (#46176) * [DOCS] Adds regression analytics resources and examples to the data frame analytics APIs. Co-Authored-By: Benjamin Trent <ben.w.trent@gmail.com> Co-Authored-By: Tom Veasey <tveasey@users.noreply.github.com> 2019-09-19 03:10:11 -04:00			* `binary_soft_classification`
			* `regression`
			`--`


[DOCS] Reformats API parameter details (#44194) 2019-07-12 11:26:31 -04:00			`////`
[DOCS] Adds data frame analytics API and evaluate API resource documentation (#43972) This PR adds the resource documentation of the data frame analytics APIs and the evaluate API to the ML API doc pool. 2019-07-11 12:05:05 -04:00			`[[ml-evaluate-dfanalytics-results]]`
			`==== {api-response-body-title}`

			`binary_soft_classification`::
			`(object) If you chose to do binary soft classification, the API returns the`
			`following evaluation metrics:`

			`auc_roc`::: TBD

			`confusion_matrix`::: TBD

			`precision`::: TBD

			`recall`::: TBD
[DOCS] Reformats API parameter details (#44194) 2019-07-12 11:26:31 -04:00			`////`
[DOCS] Adds data frame analytics APIs to the ML APIs (#43875) This PR adds the reference documentation pages of the data frame analytics APIs (PUT, START, STOP, GET, GET stats, DELETE, Evaluate) to the ML APIs pool. 2019-07-05 07:34:05 -04:00
			`[[ml-evaluate-dfanalytics-example]]`
			`==== {api-examples-title}`

[DOCS] Adds regression analytics resources and examples to the data frame analytics APIs and the evaluation API (#46176) * [DOCS] Adds regression analytics resources and examples to the data frame analytics APIs. Co-Authored-By: Benjamin Trent <ben.w.trent@gmail.com> Co-Authored-By: Tom Veasey <tveasey@users.noreply.github.com> 2019-09-19 03:10:11 -04:00			`===== Binary soft classification`

[DOCS] Change // CONSOLE comments to [source,console] (#46440) (#46494) 2019-09-09 12:35:50 -04:00			`[source,console]`
[DOCS] Adds data frame analytics APIs to the ML APIs (#43875) This PR adds the reference documentation pages of the data frame analytics APIs (PUT, START, STOP, GET, GET stats, DELETE, Evaluate) to the ML APIs pool. 2019-07-05 07:34:05 -04:00			`--------------------------------------------------`
			`POST _ml/data_frame/_evaluate`
			`{`
			`"index": "my_analytics_dest_index",`
			`"evaluation": {`
			`"binary_soft_classification": {`
			`"actual_field": "is_outlier",`
			`"predicted_probability_field": "ml.outlier_score"`
			`}`
			`}`
			`}`
			`--------------------------------------------------`
			`// TEST[skip:TBD]`

			`The API returns the following results:`

[DOCS] Replace "// TESTRESPONSE" magic comments with "[source,console-result] (#46295) (#46418) 2019-09-06 09:22:08 -04:00			`[source,console-result]`
[DOCS] Adds data frame analytics APIs to the ML APIs (#43875) This PR adds the reference documentation pages of the data frame analytics APIs (PUT, START, STOP, GET, GET stats, DELETE, Evaluate) to the ML APIs pool. 2019-07-05 07:34:05 -04:00			`----`
			`{`
			`"binary_soft_classification": {`
			`"auc_roc": {`
			`"score": 0.92584757746414444`
			`},`
			`"confusion_matrix": {`
			`"0.25": {`
			`"tp": 5,`
			`"fp": 9,`
			`"tn": 204,`
			`"fn": 5`
			`},`
			`"0.5": {`
			`"tp": 1,`
			`"fp": 5,`
			`"tn": 208,`
			`"fn": 9`
			`},`
			`"0.75": {`
			`"tp": 0,`
			`"fp": 4,`
			`"tn": 209,`
			`"fn": 10`
			`}`
			`},`
			`"precision": {`
			`"0.25": 0.35714285714285715,`
			`"0.5": 0.16666666666666666,`
			`"0.75": 0`
			`},`
			`"recall": {`
			`"0.25": 0.5,`
			`"0.5": 0.1,`
			`"0.75": 0`
			`}`
			`}`
			`}`
			`----`
[DOCS] Adds regression analytics resources and examples to the data frame analytics APIs and the evaluation API (#46176) * [DOCS] Adds regression analytics resources and examples to the data frame analytics APIs. Co-Authored-By: Benjamin Trent <ben.w.trent@gmail.com> Co-Authored-By: Tom Veasey <tveasey@users.noreply.github.com> 2019-09-19 03:10:11 -04:00

			`===== {regression-cap}`

			`[source,console]`
			`--------------------------------------------------`
			`POST _ml/data_frame/_evaluate`
			`{`
			`"index": "house_price_predictions", <1>`
			`"query": {`
			`"bool": {`
			`"filter": [`
			`{ "term": { "ml.is_training": false } } <2>`
			`]`
			`}`
			`},`
			`"evaluation": {`
			`"regression": {`
			`"actual_field": "price", <3>`
			`"predicted_field": "ml.price_prediction", <4>`
			`"metrics": {`
			`"r_squared": {},`
			`"mean_squared_error": {}`
			`}`
			`}`
			`}`
			`}`
			`--------------------------------------------------`
			`// TEST[skip:TBD]`

			`<1> The output destination index from a {dfanalytics} {reganalysis}.`
			<2> In this example, a test/train split (`training_percent`) was defined for the
			`{reganalysis}. This query limits evaluation to be performed on the test split`
			`only.`
			`<3> The ground truth value for the actual house price. This is required in order`
			`to evaluate results.`
			`<4> The predicted value for house price calculated by the {reganalysis}.`
[DOCS] Adds examples to the PUT dfa and the evaluate dfa APIs (#46966) * [DOCS] Adds examples to the PUT dfa and the evaluate dfa APIs. * [DOCS] Removes extra lines from examples. * Update docs/reference/ml/df-analytics/apis/evaluate-dfanalytics.asciidoc Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * Update docs/reference/ml/df-analytics/apis/put-dfanalytics.asciidoc Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * [DOCS] Explains examples. 2019-10-02 04:26:20 -04:00

			`The following example calculates the training error:`

			`[source,console]`
			`--------------------------------------------------`
			`POST _ml/data_frame/_evaluate`
			`{`
			`"index": "student_performance_mathematics_reg",`
			`"query": {`
			`"term": {`
			`"ml.is_training": {`
			`"value": true <1>`
			`}`
			`}`
			`},`
			`"evaluation": {`
			`"regression": {`
			`"actual_field": "G3", <2>`
			`"predicted_field": "ml.G3_prediction", <3>`
			`"metrics": {`
			`"r_squared": {},`
			`"mean_squared_error": {}`
			`}`
			`}`
			`}`
			`}`
			`--------------------------------------------------`
			`// TEST[skip:TBD]`

			<1> In this example, a test/train split (`training_percent`) was defined for the
			`{reganalysis}. This query limits evaluation to be performed on the train split`
			`only. It means that a training error will be calculated.`
			`<2> The field that contains the ground truth value for the actual student`
			`performance. This is required in order to evaluate results.`
			`<3> The field that contains the predicted value for student performance`
			`calculated by the {reganalysis}.`


			`The next example calculates the testing error. The only difference compared with`
			the previous example is that `ml.is_training` is set to `false` this time, so
			`the query excludes the train split from the evaluation.`

			`[source,console]`
			`--------------------------------------------------`
			`POST _ml/data_frame/_evaluate`
			`{`
			`"index": "student_performance_mathematics_reg",`
			`"query": {`
			`"term": {`
			`"ml.is_training": {`
			`"value": false <1>`
			`}`
			`}`
			`},`
			`"evaluation": {`
			`"regression": {`
			`"actual_field": "G3", <2>`
			`"predicted_field": "ml.G3_prediction", <3>`
			`"metrics": {`
			`"r_squared": {},`
			`"mean_squared_error": {}`
			`}`
			`}`
			`}`
			`}`
			`--------------------------------------------------`
			`// TEST[skip:TBD]`

			<1> In this example, a test/train split (`training_percent`) was defined for the
			`{reganalysis}. This query limits evaluation to be performed on the test split`
			`only. It means that a testing error will be calculated.`
			`<2> The field that contains the ground truth value for the actual student`
			`performance. This is required in order to evaluate results.`
			`<3> The field that contains the predicted value for student performance`
			`calculated by the {reganalysis}.`