opensearch-docs-cn/_ml-commons-plugin/model-serving-framework.md

299 lines
10 KiB
Markdown
Raw Normal View History

---
layout: default
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
title: Model-serving framework
Add GPU acceleration documentation (#2384) * Add GPU acceleration documentation Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Address tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Address technical feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Adjust model size sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add optional to neuron step Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add Jeff's feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add copy and customize for Inferntia examples Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Nate Bower <nbower@amazon.com> * Apply suggestions from code review Co-authored-by: Nate Bower <nbower@amazon.com> * Fix link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Apply suggestions from code review Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Fix numbering in final section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add final tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * A couple more suggestion Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Apply suggestions from code review Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Fix Neural Search link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add experimental warning Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Final tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Move OpenSearch to step 2. Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com>
2023-01-18 14:31:52 -05:00
has_children: true
nav_order: 110
---
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
# Model-serving framework
Add GPU acceleration documentation (#2384) * Add GPU acceleration documentation Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Address tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Address technical feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Adjust model size sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add optional to neuron step Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add Jeff's feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add copy and customize for Inferntia examples Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Nate Bower <nbower@amazon.com> * Apply suggestions from code review Co-authored-by: Nate Bower <nbower@amazon.com> * Fix link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Apply suggestions from code review Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Fix numbering in final section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add final tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * A couple more suggestion Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Apply suggestions from code review Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Fix Neural Search link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add experimental warning Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Final tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Move OpenSearch to step 2. Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com>
2023-01-18 14:31:52 -05:00
The model-serving framework is an experimental feature. For updates on the progress of the model-serving framework, or if you want to leave feedback that could help improve the feature, join the discussion in the [Model-serving framework forum](https://forum.opensearch.org/t/feedback-machine-learning-model-serving-framework-experimental-release/11439).
{: .warning}
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
ML Commons allows you to serve custom models and use those models to make inferences. For those who want to run their PyTorch deep learning model inside an OpenSearch cluster, you can upload and run that model with the ML Commons REST API.
This page outlines the steps required to upload a custom model and run it with the ML Commons plugin.
## Prerequisites
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
To upload a custom model to OpenSearch, you need to prepare it outside of your OpenSearch cluster. You can use a pretrained model, like one from [Huggingface](https://huggingface.co/), or train a new model in accordance with your needs.
### Model support
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
As of OpenSearch 2.4, the model-serving framework only supports text embedding models without GPU acceleration.
### Model format
To use a model in OpenSearch, you'll need to export the model into a portable format. As of Version 2.5, OpenSearch only supports the [TorchScript](https://pytorch.org/docs/stable/jit.html) and [ONNX][https://onnx.ai/] formats.
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
Furthermore, files must be saved as zip files before upload. Therefore, to ensure that ML Commons can upload your model, compress your TorchScript file before uploading. You can download an example file [here](https://github.com/opensearch-project/ml-commons/blob/2.x/ml-algorithms/src/test/resources/org/opensearch/ml/engine/algorithms/text_embedding/all-MiniLM-L6-v2_torchscript_sentence-transformer.zip).
### Model size
Add GPU acceleration documentation (#2384) * Add GPU acceleration documentation Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Address tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Address technical feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Adjust model size sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add optional to neuron step Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add Jeff's feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add copy and customize for Inferntia examples Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Nate Bower <nbower@amazon.com> * Apply suggestions from code review Co-authored-by: Nate Bower <nbower@amazon.com> * Fix link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Apply suggestions from code review Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> * Fix numbering in final section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add final tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * A couple more suggestion Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Apply suggestions from code review Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Fix Neural Search link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add experimental warning Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/gpu-acceleration.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Final tech feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Move OpenSearch to step 2. Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: Caroline <113052567+carolxob@users.noreply.github.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com>
2023-01-18 14:31:52 -05:00
Most deep learning models are more than 100 MB, making it difficult to fit them into a single document. OpenSearch splits the model file into smaller chunks to be stored in a model index. When allocating machine learning (ML) or data nodes for your OpenSearch cluster, make sure you correctly size your ML nodes so that you have enough memory when making ML inferences.
## GPU acceleration
To achieve better performance within the model-serving framework, you can take advantage of GPU acceleration on your ML node. For more information, see [GPU acceleration]({{site.url}}{{site.baseurl}}/ml-commons-plugin/gpu-acceleration/).
## Upload model to OpenSearch
Use the URL upload operation for models that already exist on another server, such as GitHub or S3.
```
POST /_plugins/_ml/models/_upload
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
The URL upload method requires the following request fields.
Field | Data type | Description
:--- | :--- | :---
`name`| string | The name of the model. |
`version` | string | The version number of the model. Since OpenSearch does not enforce a specific version schema for models, you can choose any number or format that makes sense for your models. |
`model_format` | string | The portable format of the model file. Currently only supports `TORCH_SCRIPT`. |
2022-11-18 11:08:51 -05:00
[`model_config`](#the-model_config-object) | json object | The model's configuration, including the `model_type`, `embedding_dimension`, and `framework_type`. |
`url` | string | The URL where the model is located. |
2022-11-18 11:08:51 -05:00
### The `model_config` object
| Field | Data type | Description |
2022-11-18 11:08:51 -05:00
| :--- | :--- | :--- |
| `model_type` | string | The model type, such as `bert`. For a Huggingface model, the model type is specified in `config.json`. For an example, see the [`all-MiniLM-L6-v2` Huggingface model `config.json`](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/blob/main/config.json#L15).|
| `embedding_dimension` | integer | The dimension of the model-generated dense vector. For a Huggingface model, the dimension is specified in the model card. For example, in the [`all-MiniLM-L6-v2` Huggingface model card](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2), the statement `384 dimensional dense vector space` specifies 384 as the embedding dimension. |
| `framework_type` | string | The framework the model is using. Currently, we support `sentence_transformers` and `huggingface_transformers` frameworks. The `sentence_transformers` model outputs text embeddings directly, so ML Commons does not perform any post processing. For `huggingface_transformers`, ML Commons performs post processing by applying mean pooling to get text embeddings. See the example [`all-MiniLM-L6-v2` Huggingface model](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) for more details. |
| `all_config` _(Optional)_ | string | This field is used for reference purposes. You can specify all model configurations in this field. For example, if you are using a Huggingface model, you can minify the `config.json` file to one line and save its contents in the `all_config` field. Once the model is uploaded, you can use the get model API operation to get all model configurations stored in this field. |
#### Sample request
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
The following sample request uploads version `1.0.0` of a natural language processing (NLP) sentence transformation model named `all-MiniLM-L6-v2`:
```json
POST /_plugins/_ml/models/_upload
{
"name": "all-MiniLM-L6-v2",
"version": "1.0.0",
"description": "test model",
"model_format": "TORCH_SCRIPT",
"model_config": {
"model_type": "bert",
"embedding_dimension": 384,
"framework_type": "sentence_transformers"
},
"url": "https://github.com/opensearch-project/ml-commons/raw/2.x/ml-algorithms/src/test/resources/org/opensearch/ml/engine/algorithms/text_embedding/all-MiniLM-L6-v2_torchscript_sentence-transformer.zip?raw=true"
}
```
#### Sample response
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
OpenSearch responds with the `task_id` and task `status`:
```json
{
"task_id" : "ew8I44MBhyWuIwnfvDIH",
"status" : "CREATED"
}
```
To see the status of your model upload, pass the `task_id` into the [task API]({{site.url}}{{site.baseurl}}/ml-commons-plugin/api#get-task-information).
## Load the model
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
The load model operation reads the model's chunks from the model index and then creates an instance of the model to load into memory. The bigger the model, the more chunks the model is split into. The more chunks a model index contains, the longer it takes for the model to load into memory.
### Get the `model_id`
To load a model, you need the `model_id`. To find the `model_id`, take the `task_id` from the model's upload operations API response and use the `GET _ml/tasks` API.
This example request uses the `task_id` from the upload example.
```json
GET /_plugins/_ml/tasks/ew8I44MBhyWuIwnfvDIH
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
OpenSearch responds with the `model_id`:
```json
{
"model_id" : "WWQI44MBbzI2oUKAvNUt",
"task_type" : "UPLOAD_MODEL",
"function_name" : "TEXT_EMBEDDING",
"state" : "COMPLETED",
"worker_node" : "KzONM8c8T4Od-NoUANQNGg",
"create_time" : 3455961564003,
"last_update_time" : 3216361373241,
"is_async" : true
}
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
### Load the model from the model index
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
With the `model_id`, you can now load the model from the model's index in order to deploy the model to ML nodes. The load API reads model chunks from the model index, creates an instance of that model, and saves the model instance in the ML node's cache.
Add the `model_id` to the load API:
```json
POST /_plugins/_ml/models/<model_id>/_load
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
By default, the ML Commons setting `plugins.ml_commons.only_run_on_ml_node` is set to `false`. When `false`, models load on ML nodes first. If no ML nodes exist, models load on data nodes. When running ML models in production, set `plugins.ml_commons.only_run_on_ml_node` to `true` so that models only load on ML nodes.
#### Sample request: Load into any available ML node
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
In this example request, OpenSearch loads the model into all available OpenSearch node:
```json
POST /_plugins/_ml/models/WWQI44MBbzI2oUKAvNUt/_load
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
#### Sample request: Load into a specific node
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
If you want to reserve the memory of other ML nodes within your cluster, you can load your model into a specific node(s) by specifying each node's ID in the request body:
```json
POST /_plugins/_ml/models/WWQI44MBbzI2oUKAvNUt/_load
{
"node_ids": ["4PLK7KJWReyX0oWKnBA8nA"]
}
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
#### Sample response
All models load asynchronously. Therefore, the load API responds with a new `task_id` based on the load and responds with a new `status` for the task.
```json
{
"task_id" : "hA8P44MBhyWuIwnfvTKP",
"status" : "CREATED"
}
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
### Check the model load status
With your `task_id` from the load response, you can use the `GET _ml/tasks` API to see the load status of your model. Before a loaded model can be used for inferences, the load task's `state` must be `COMPLETED`.
#### Sample request
```json
GET /_plugins/_ml/tasks/hA8P44MBhyWuIwnfvTKP
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
#### Sample response
```json
{
"model_id" : "WWQI44MBbzI2oUKAvNUt",
"task_type" : "LOAD_MODEL",
"function_name" : "TEXT_EMBEDDING",
"state" : "COMPLETED",
"worker_node" : "KzONM8c8T4Od-NoUANQNGg",
"create_time" : 1665961803150,
"last_update_time" : 1665961815959,
"is_async" : true
}
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
## Use the loaded model for inferences
After the model has been loaded, you can enter the `model_id` into the [predict API]({{site.url}}{{site.baseurl}}/ml-commons-plugin/api#predict) to perform inferences.
```json
POST /_plugins/_ml/models/<model_id>/_predict
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
### Sample request
```json
POST /_plugins/_ml/_predict/text_embedding/WWQI44MBbzI2oUKAvNUt
{
"text_docs":[ "today is sunny"],
"return_number": true,
"target_response": ["sentence_embedding"]
}
```
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
### Sample response
```json
{
"inference_results" : [
{
"output" : [
{
"name" : "sentence_embedding",
"data_type" : "FLOAT32",
"shape" : [
384
],
"data" : [
-0.023315024,
0.08975691,
0.078479774,
...
]
}
]
}
]
}
```
## Unload the model
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
If you're done making predictions with your model, use the unload operation to remove the model from your memory cache. The model will remain accessible in the model index.
```json
POST /_plugins/_ml/models/<model_id>/_unload
```
### Sample request
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
```json
POST /_plugins/_ml/models/MGqJhYMBbbh0ushjm8p_/_unload
```
### Sample response
Editorial review for Model Framework and Neural Search (#1956) * Fix header in ML commons Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Remove redundant file Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Yaliang Wu <ylwu@amazon.com> * Add lines for easier editing Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/api.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add ML editorial feedback Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _ml-commons-plugin/model-serving-framework.md Co-authored-by: Nate Bower <nbower@amazon.com> * Add Neural Search section (#1882) * Add Neural Search section Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add getting started instructions Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add example responses to Neural Search Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add neural request fields Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add review tweaks Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Add forum link Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update _neural-search-plugin/index.md Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Update index.md * Fix second sentence Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> * Small typos in Neural Search (#1959) * Small typos in Neural Search * Update index.md * Update index.md * Update plugin documentation to reference the CAT API (#1962) * Update plugin documentation to reference the CAT API Signed-off-by: ariamarble <armarble@amazon.com> * made editorial changes Signed-off-by: ariamarble <armarble@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> * Add ML commons plugin Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Change model serving format Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Fix plugin cap Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Neural Search editorial Signed-off-by: Naarcha-AWS <naarcha@amazon.com> * Update index.md Incorporate Chris' feedback * Update model-serving-framework.md Signed-off-by: Naarcha-AWS <naarcha@amazon.com> Signed-off-by: ariamarble <armarble@amazon.com> Co-authored-by: Yaliang Wu <ylwu@amazon.com> Co-authored-by: Nate Bower <nbower@amazon.com> Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com> Co-authored-by: Aria Marble <111301581+ariamarble@users.noreply.github.com>
2022-11-17 14:42:24 -05:00
```json
{
"s5JwjZRqTY6nOT0EvFwVdA": {
"stats": {
"MGqJhYMBbbh0ushjm8p_": "deleted"
}
}
}
```