OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-10 23:15:04 +00:00

History

[ML] adds multi-class feature importance support (#53803 ) (#54024 )

Adds multi-class feature importance calculation. 

Feature importance objects are now mapped as follows
(logistic) Regression:
```
{
   "feature_name": "feature_0",
   "importance": -1.3
}
```
Multi-class [class names are `foo`, `bar`, `baz`]
```
{ 
   “feature_name”: “feature_0”, 
   “importance”: 2.0, // sum(abs()) of class importances
   “foo”: 1.0, 
   “bar”: 0.5, 
   “baz”: -0.5 
},
```

For users to get the full benefit of aggregating and searching for feature importance, they should update their index mapping as follows (before turning this option on in their pipelines)
```
 "ml.inference.feature_importance": {
          "type": "nested",
          "dynamic": true,
          "properties": {
            "feature_name": {
              "type": "keyword"
            },
            "importance": {
              "type": "double"
            }
          }
        }
```
The mapping field name is as follows
`ml.<inference.target_field>.<inference.tag>.feature_importance`
if `inference.tag` is not provided in the processor definition, it is not part of the field path.
`inference.target_field` is defaulted to `ml.inference`.
//cc @lcawl ^ Where should we document this?

If this makes it in for 7.7, there shouldn't be any feature_importance at inference BWC worries as 7.7 is the first version to have it.

2020-03-23 18:49:07 -04:00

basic-multi-node

[ML] Make ml internal indices hidden (#52423 ) (#52509 )

2020-02-19 14:02:32 +01:00

disabled

Apply 2-space indent to all gradle scripts (#49071 )

2019-11-14 11:01:23 +00:00

ml-with-security

[7.x] Optimize which Rest resources are used by the Rest tests… (#53766 )

2020-03-19 12:28:59 -05:00

native-multi-node-tests

[ML] adds multi-class feature importance support (#53803 ) (#54024 )