opensearch-docs-cn/_search-plugins/knn/knn-index.md

---
layout: default
title: k-NN Index
nav_order: 1
parent: k-NN
has_children: false
---

# k-NN Index

## knn_vector data type

The k-NN plugin introduces a custom data type, the `knn_vector`, that allows users to ingest their k-NN vectors
into an OpenSearch index and perform different kinds of k-NN search. The `knn_vector` field is highly configurable and can serve many different k-NN workloads. In general, a `knn_vector` field can be built either by providing a method definition or specifying a model id.

Method definitions are used when the underlying Approximate k-NN algorithm does not require training. For example, the following `knn_vector` field specifies that *nmslib*'s implementation of *hnsw* should be used for Approximate k-NN search. During indexing, *nmslib* will build the corresponding *hnsw* segment files.

```json
"my_vector": {
  "type": "knn_vector",
  "dimension": 4,
  "method": {
  "name": "hnsw",
  "space_type": "l2",
  "engine": "nmslib",
  "parameters": {
    "ef_construction": 128,
    "m": 24
  }
  }
}
```

Model IDs are used when the underlying Approximate k-NN algorithm requires a training step. As a prerequisite, the
model has to be created with the [Train API]({{site.url}}{{site.baseurl}}/search-plugins/knn/api#train-model). The
model contains the information needed to initialize the native library segment files.

```json
  "type": "knn_vector",
  "model_id": "my-model"
}
```

However, if you intend to just use painless scripting or a k-NN score script, you only need to pass the dimension.
 ```json
   "type": "knn_vector",
   "dimension": 128
 }
 ```

## Method Definitions

A method definition refers to the underlying configuration of the Approximate k-NN algorithm you want to use. Method definitions are used to either create a `knn_vector` field (when the method does not require training) or [create a model during training]({{site.url}}{{site.baseurl}}/search-plugins/knn/api#train-model) that can then be used to [create a `knn_vector` field]({{site.url}}{{site.baseurl}}/search-plugins/knn/approximate-knn/#building-a-k-nn-index-from-a-model).

A method definition will always contain the name of the method, the space_type the method is built for, the engine
(the native library) to use, and a map of parameters.

Mapping Parameter | Required | Default | Updatable | Description
:--- | :--- | :--- | :--- | :---
`name` | true | n/a | false | The identifier for the nearest neighbor method.
`space_type` | false | "l2" | false | The vector space used to calculate the distance between vectors.
`engine` | false | "nmslib" | false | The approximate k-NN library to use for indexing and search. Either "faiss" or "nmslib".
`parameters` | false | null | false | The parameters used for the nearest neighbor method.

### Supported nmslib methods

Method Name | Requires Training? | Supported Spaces | Description
:--- | :--- | :--- | :---
`hnsw` | false | "l2", "innerproduct", "cosinesimil", "l1", "linf" | Hierarchical proximity graph approach to Approximate k-NN search. For more details on the algorithm, [checkout this paper](https://arxiv.org/abs/1603.09320)!

#### HNSW Parameters

Paramater Name | Required | Default | Updatable | Description
:--- | :--- | :--- | :--- | :---
`ef_construction` | false | 512 | false | The size of the dynamic list used during k-NN graph creation. Higher values lead to a more accurate graph, but slower indexing speed.
`m` | false | 16 | false | The number of bidirectional links that the plugin creates for each new element. Increasing and decreasing this value can have a large impact on memory consumption. Keep this value between 2-100.

**Note** --- For *nmslib*, *ef_search* is set in the [index settings](#index-settings).

### Supported faiss methods

Method Name | Requires Training? | Supported Spaces | Description
:--- | :--- | :--- | :---
`hnsw` | false | "l2", "innerproduct"* | Hierarchical proximity graph approach to Approximate k-NN search.
`ivf` | true | "l2", "innerproduct" | Bucketing approach where vectors are assigned different buckets based on clustering and, during search, only a subset of the buckets are searched.

**Note** --- For *hnsw*, "innerproduct" is not available when PQ is used.

#### HNSW Parameters

Paramater Name | Required | Default | Updatable | Description
:--- | :--- | :--- | :--- | :---
`ef_search` | false | 512 | false | The size of the dynamic list used during k-NN searches. Higher values lead to more accurate but slower searches.
`ef_construction` | false | 512 | false | The size of the dynamic list used during k-NN graph creation. Higher values lead to a more accurate graph, but slower indexing speed.
`m` | false | 16 | false | The number of bidirectional links that the plugin creates for each new element. Increasing and decreasing this value can have a large impact on memory consumption. Keep this value between 2-100.
`encoder` | false | flat | false | Encoder definition for encoding vectors. Encoders can reduce the memory footprint of your index, at the expense of search accuracy.

#### IVF Parameters

Paramater Name | Required | Default | Updatable | Description
:--- | :--- | :--- | :--- | :---
`nlists` | false | 4 | false | Number of buckets to partition vectors into. Higher values may lead to more accurate searches, at the expense of memory and training latency. For more information about choosing the right value, refer to [*faiss*'s documentation](https://github.com/facebookresearch/faiss/wiki/Guidelines-to-choose-an-index).
`nprobes` | false | 1 | false | Number of buckets to search over during query. Higher values lead to more accurate but slower searches.
`encoder` | false | flat | false | Encoder definition for encoding vectors. Encoders can reduce the memory footprint of your index, at the expense of search accuracy.

For more information about setting these parameters, please refer to [*faiss*'s documentation](https://github.com/facebookresearch/faiss/wiki/Faiss-indexes).

#### IVF training requirements

The IVF algorithm requires a training step. To create an index that uses IVF, you need to train a model with the
[Train API]({{site.url}}{{site.baseurl}}/search-plugins/knn/api#train-model), passing the IVF method definition. IVF requires that, at a minimum, there should be `nlist` training
data points, but it is [recommended to use more](https://github.com/facebookresearch/faiss/wiki/Guidelines-to-choose-an-index#how-big-is-the-dataset).
Training data can either the same data that is going to be ingested or a separate set of data.

### Supported faiss encoders

You can use encoders to reduce the memory footprint of a k-NN index at the expense of search accuracy. *faiss* has
several encoder types, but currently, the plugin only supports *flat* and *pq* encoding.

An example method definition that specifies an encoder may look something like this:

```json
"method": {
  "name":"hnsw",
  "engine":"faiss",
  "parameters":{
    "encoder":{
      "name":"pq",
      "parameters":{
        "code_size": 8,
        "m": 8
      }
    }
  }
}
```

Encoder Name | Requires Training? | Description
:--- | :--- | :---
`flat` | false | Encode vectors as floating point arrays. This encoding does not reduce memory footprint.
`pq` | true | Short for product quantization, it is a lossy compression technique that encodes a vector into a fixed size of bytes using clustering, with the goal of minimizing the drop in k-NN search accuracy. From a high level, vectors are broken up into `m` subvectors, and then each subvector is represented by a `code_size` code obtained from a code book produced during training. For more details on product quantization, here is a [great blog post](https://medium.com/dotstar/understanding-faiss-part-2-79d90b1e5388)!

#### PQ Parameters

Paramater Name | Required | Default | Updatable | Description
:--- | :--- | :--- | :--- | :---
`m` | false | 1 | false |  Determine how many many sub-vectors to break the vector into. sub-vectors are encoded independently of each other. This dimension of the vector must be divisible by `m`. Max value is 1024.
`code_size` | false | 8 | false | Determines the number of bits to encode a sub-vector into. Max value is 8. **Note** --- for IVF, this value must be less than or equal to 8. For HNSW, this value can only be 8.

### Choosing the right method

There are a lot of options to choose from when building your `knn_vector` field. To determine the correct methods and parameters to choose, you should first understand what requirements you have for your workload and what trade-offs you are willing to make. Factors to consider are (1) query latency, (2) query quality, (3) memory limits, (4) indexing latency.

If memory is not a concern, HNSW offers a very strong query latency/query quality tradeoff.

If you want to use less memory and index faster than HNSW, while maintaining similar query quality, you should evaluate IVF.

If memory is a concern, consider adding a PQ encoder to your HNSW or IVF index. Because PQ is a lossy encoding, query quality will drop.

### Memory Estimation

In a typical OpenSearch cluster, a certain portion of RAM is set aside for the JVM heap. The k-NN plugin allocates
native library indices to a portion of the remaining RAM. This portion's size is determined by
the `circuit_breaker_limit` cluster setting. By default, the limit is set at 50%.

Having a replica doubles the total number of vectors.
{: .note }

#### HNSW memory estimation

The memory required for HNSW is estimated to be `1.1 * (4 * dimension + 8 * M)` bytes/vector.

As an example, assume you have a million vectors with a dimension of 256 and M of 16. The memory requirement can be estimated as follows:

```
1.1 * (4 * 256 + 8 * 16) * 1,000,000 ~= 1.267 GB
```

#### IVF memory estimation

The memory required for IVF is estimated to be `1.1 * (((4 * dimension) * num_vectors) + (4 * nlist * d))` bytes.

As an example, assume you have a million vectors with a dimension of 256 and nlist of 128. The memory requirement can be estimated as follows:

```
1.1 * (((4 * 256) * 1,000,000) + (4 * 128 * 256))  ~= 1.126 GB

```

## Index settings

Additionally, the k-NN plugin introduces several index settings that can be used to configure the k-NN structure as well.

At the moment, several parameters defined in the settings are in the deprecation process. Those parameters should be set in the mapping instead of the index settings. Parameters set in the mapping will override the parameters set in the index settings. Setting the parameters in the mapping allows an index to have multiple `knn_vector` fields with different parameters.

Setting | Default | Updateable | Description
:--- | :--- | :--- | :---
`index.knn` | false | false | Whether the index should build native library indices for the `knn_vector` fields. If set to false, the `knn_vector` fields will be stored in doc values, but Approximate k-NN search functionality will be disabled.
`index.knn.algo_param.ef_search` | 512 | true | The size of the dynamic list used during k-NN searches. Higher values lead to more accurate but slower searches. Only available for *nmslib*.
`index.knn.algo_param.ef_construction` | 512 | false | (Deprecated in 1.0.0. Use the mapping parameters to set this value instead.) Only available for *nmslib*. Refer to mapping definition.
`index.knn.algo_param.m` | 16 | false | (Deprecated in 1.0.0. Use the mapping parameters to set this value instead.) Only available for *nmslib*. Refer to mapping definition.
`index.knn.space_type` | "l2" | false | (Deprecated in 1.0.0. Use the mapping parameters to set this value instead.) Only available for *nmslib*. Refer to mapping definition.
Update knn documentation for rc1 Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-05-28 15:34:56 -04:00			`---`
			`layout: default`
			`title: k-NN Index`
			`nav_order: 1`
			`parent: k-NN`
			`has_children: false`
			`---`

			`# k-NN Index`

Minor KNN tweaks 2021-05-28 16:54:13 -04:00			`## knn_vector data type`

Update knn documentation for rc1 Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-05-28 15:34:56 -04:00			The k-NN plugin introduces a custom data type, the `knn_vector`, that allows users to ingest their k-NN vectors
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			into an OpenSearch index and perform different kinds of k-NN search. The `knn_vector` field is highly configurable and can serve many different k-NN workloads. In general, a `knn_vector` field can be built either by providing a method definition or specifying a model id.
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			Method definitions are used when the underlying Approximate k-NN algorithm does not require training. For example, the following `knn_vector` field specifies that nmslib's implementation of hnsw should be used for Approximate k-NN search. During indexing, nmslib will build the corresponding hnsw segment files.
Update knn documentation for rc1 Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-05-28 15:34:56 -04:00
			```json
			`"my_vector": {`
			`"type": "knn_vector",`
			`"dimension": 4,`
			`"method": {`
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`"name": "hnsw",`
			`"space_type": "l2",`
			`"engine": "nmslib",`
			`"parameters": {`
			`"ef_construction": 128,`
			`"m": 24`
			`}`
Update knn documentation for rc1 Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-05-28 15:34:56 -04:00			`}`
			`}`
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			```

			`Model IDs are used when the underlying Approximate k-NN algorithm requires a training step. As a prerequisite, the`
			`model has to be created with the [Train API]({{site.url}}{{site.baseurl}}/search-plugins/knn/api#train-model). The`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`model contains the information needed to initialize the native library segment files.`

			```json
			`"type": "knn_vector",`
			`"model_id": "my-model"`
			`}`
			```

			`However, if you intend to just use painless scripting or a k-NN score script, you only need to pass the dimension.`
			```json
			`"type": "knn_vector",`
			`"dimension": 128`
			`}`
			```

			`## Method Definitions`

Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			A method definition refers to the underlying configuration of the Approximate k-NN algorithm you want to use. Method definitions are used to either create a `knn_vector` field (when the method does not require training) or [create a model during training]({{site.url}}{{site.baseurl}}/search-plugins/knn/api#train-model) that can then be used to [create a `knn_vector` field]({{site.url}}{{site.baseurl}}/search-plugins/knn/approximate-knn/#building-a-k-nn-index-from-a-model).
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`A method definition will always contain the name of the method, the space_type the method is built for, the engine`
			`(the native library) to use, and a map of parameters.`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
			`Mapping Parameter \| Required \| Default \| Updatable \| Description`
			`:--- \| :--- \| :--- \| :--- \| :---`
			`name` \| true \| n/a \| false \| The identifier for the nearest neighbor method.
			`space_type` \| false \| "l2" \| false \| The vector space used to calculate the distance between vectors.
			`engine` \| false \| "nmslib" \| false \| The approximate k-NN library to use for indexing and search. Either "faiss" or "nmslib".
			`parameters` \| false \| null \| false \| The parameters used for the nearest neighbor method.

			`### Supported nmslib methods`

			`Method Name \| Requires Training? \| Supported Spaces \| Description`
			`:--- \| :--- \| :--- \| :---`
			`hnsw` \| false \| "l2", "innerproduct", "cosinesimil", "l1", "linf" \| Hierarchical proximity graph approach to Approximate k-NN search. For more details on the algorithm, [checkout this paper](https://arxiv.org/abs/1603.09320)!

			`#### HNSW Parameters`

			`Paramater Name \| Required \| Default \| Updatable \| Description`
			`:--- \| :--- \| :--- \| :--- \| :---`
			`ef_construction` \| false \| 512 \| false \| The size of the dynamic list used during k-NN graph creation. Higher values lead to a more accurate graph, but slower indexing speed.
			`m` \| false \| 16 \| false \| The number of bidirectional links that the plugin creates for each new element. Increasing and decreasing this value can have a large impact on memory consumption. Keep this value between 2-100.

			`Note --- For nmslib, ef_search is set in the [index settings](#index-settings).`

			`### Supported faiss methods`

			`Method Name \| Requires Training? \| Supported Spaces \| Description`
			`:--- \| :--- \| :--- \| :---`
Call out some issues with faiss hnsw pq Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-18 14:11:19 -05:00			`hnsw` \| false \| "l2", "innerproduct"* \| Hierarchical proximity graph approach to Approximate k-NN search.
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`ivf` \| true \| "l2", "innerproduct" \| Bucketing approach where vectors are assigned different buckets based on clustering and, during search, only a subset of the buckets are searched.

Call out some issues with faiss hnsw pq Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-18 14:11:19 -05:00			`Note --- For hnsw, "innerproduct" is not available when PQ is used.`

Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`#### HNSW Parameters`

			`Paramater Name \| Required \| Default \| Updatable \| Description`
			`:--- \| :--- \| :--- \| :--- \| :---`
			`ef_search` \| false \| 512 \| false \| The size of the dynamic list used during k-NN searches. Higher values lead to more accurate but slower searches.
			`ef_construction` \| false \| 512 \| false \| The size of the dynamic list used during k-NN graph creation. Higher values lead to a more accurate graph, but slower indexing speed.
			`m` \| false \| 16 \| false \| The number of bidirectional links that the plugin creates for each new element. Increasing and decreasing this value can have a large impact on memory consumption. Keep this value between 2-100.
			`encoder` \| false \| flat \| false \| Encoder definition for encoding vectors. Encoders can reduce the memory footprint of your index, at the expense of search accuracy.

			`#### IVF Parameters`

			`Paramater Name \| Required \| Default \| Updatable \| Description`
			`:--- \| :--- \| :--- \| :--- \| :---`
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`nlists` \| false \| 4 \| false \| Number of buckets to partition vectors into. Higher values may lead to more accurate searches, at the expense of memory and training latency. For more information about choosing the right value, refer to [faiss's documentation](https://github.com/facebookresearch/faiss/wiki/Guidelines-to-choose-an-index).
			`nprobes` \| false \| 1 \| false \| Number of buckets to search over during query. Higher values lead to more accurate but slower searches.
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`encoder` \| false \| flat \| false \| Encoder definition for encoding vectors. Encoders can reduce the memory footprint of your index, at the expense of search accuracy.

			`For more information about setting these parameters, please refer to [faiss's documentation](https://github.com/facebookresearch/faiss/wiki/Faiss-indexes).`

			`#### IVF training requirements`

Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`The IVF algorithm requires a training step. To create an index that uses IVF, you need to train a model with the`
			[Train API]({{site.url}}{{site.baseurl}}/search-plugins/knn/api#train-model), passing the IVF method definition. IVF requires that, at a minimum, there should be `nlist` training
			`data points, but it is [recommended to use more](https://github.com/facebookresearch/faiss/wiki/Guidelines-to-choose-an-index#how-big-is-the-dataset).`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`Training data can either the same data that is going to be ingested or a separate set of data.`

			`### Supported faiss encoders`

Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`You can use encoders to reduce the memory footprint of a k-NN index at the expense of search accuracy. faiss has`
			`several encoder types, but currently, the plugin only supports flat and pq encoding.`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
			`An example method definition that specifies an encoder may look something like this:`
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			```json
			`"method": {`
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`"name":"hnsw",`
			`"engine":"faiss",`
			`"parameters":{`
			`"encoder":{`
			`"name":"pq",`
			`"parameters":{`
			`"code_size": 8,`
			`"m": 8`
			`}`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`}`
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`}`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`}`
Update knn documentation for rc1 Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-05-28 15:34:56 -04:00			```

Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`Encoder Name \| Requires Training? \| Description`
			`:--- \| :--- \| :---`
			`flat` \| false \| Encode vectors as floating point arrays. This encoding does not reduce memory footprint.
Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`pq` \| true \| Short for product quantization, it is a lossy compression technique that encodes a vector into a fixed size of bytes using clustering, with the goal of minimizing the drop in k-NN search accuracy. From a high level, vectors are broken up into `m` subvectors, and then each subvector is represented by a `code_size` code obtained from a code book produced during training. For more details on product quantization, here is a [great blog post](https://medium.com/dotstar/understanding-faiss-part-2-79d90b1e5388)!
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
			`#### PQ Parameters`

			`Paramater Name \| Required \| Default \| Updatable \| Description`
Update knn documentation for rc1 Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-05-28 15:34:56 -04:00			`:--- \| :--- \| :--- \| :--- \| :---`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`m` \| false \| 1 \| false \| Determine how many many sub-vectors to break the vector into. sub-vectors are encoded independently of each other. This dimension of the vector must be divisible by `m`. Max value is 1024.
Call out some issues with faiss hnsw pq Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-18 14:11:19 -05:00			`code_size` \| false \| 8 \| false \| Determines the number of bits to encode a sub-vector into. Max value is 8. Note --- for IVF, this value must be less than or equal to 8. For HNSW, this value can only be 8.
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
			`### Choosing the right method`

Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			There are a lot of options to choose from when building your `knn_vector` field. To determine the correct methods and parameters to choose, you should first understand what requirements you have for your workload and what trade-offs you are willing to make. Factors to consider are (1) query latency, (2) query quality, (3) memory limits, (4) indexing latency.
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
			`If memory is not a concern, HNSW offers a very strong query latency/query quality tradeoff.`

			`If you want to use less memory and index faster than HNSW, while maintaining similar query quality, you should evaluate IVF.`

Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`If memory is a concern, consider adding a PQ encoder to your HNSW or IVF index. Because PQ is a lossy encoding, query quality will drop.`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
			`### Memory Estimation`

Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			`In a typical OpenSearch cluster, a certain portion of RAM is set aside for the JVM heap. The k-NN plugin allocates`
			`native library indices to a portion of the remaining RAM. This portion's size is determined by`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			the `circuit_breaker_limit` cluster setting. By default, the limit is set at 50%.

			`Having a replica doubles the total number of vectors.`
			`{: .note }`

			`#### HNSW memory estimation`

			The memory required for HNSW is estimated to be `1.1 * (4 * dimension + 8 * M)` bytes/vector.

			`As an example, assume you have a million vectors with a dimension of 256 and M of 16. The memory requirement can be estimated as follows:`

			```
Mistake in memory estimation for IVF k-NN Signed-off-by: John Mazanec <jmazane@amazon.com> 2022-01-19 13:32:51 -05:00			`1.1 * (4 * 256 + 8 * 16) * 1,000,000 ~= 1.267 GB`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			```

			`#### IVF memory estimation`

			The memory required for IVF is estimated to be `1.1 * (((4 * dimension) * num_vectors) + (4 * nlist * d))` bytes.

			`As an example, assume you have a million vectors with a dimension of 256 and nlist of 128. The memory requirement can be estimated as follows:`

			```
Mistake in memory estimation for IVF k-NN Signed-off-by: John Mazanec <jmazane@amazon.com> 2022-01-19 13:32:51 -05:00			`1.1 * (((4 * 256) * 1,000,000) + (4 * 128 * 256)) ~= 1.126 GB`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00
			```
Update knn documentation for rc1 Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-05-28 15:34:56 -04:00
			`## Index settings`

			`Additionally, the k-NN plugin introduces several index settings that can be used to configure the k-NN structure as well.`

Language tweaks Signed-off-by: keithhc2 <keithhc2@users.noreply.github.com> 2022-01-25 17:54:58 -05:00			At the moment, several parameters defined in the settings are in the deprecation process. Those parameters should be set in the mapping instead of the index settings. Parameters set in the mapping will override the parameters set in the index settings. Setting the parameters in the mapping allows an index to have multiple `knn_vector` fields with different parameters.
Update knn documentation for rc1 Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-05-28 15:34:56 -04:00
			`Setting \| Default \| Updateable \| Description`
			`:--- \| :--- \| :--- \| :---`
Complete v1 of faiss feature docs Signed-off-by: John Mazanec <jmazane@amazon.com> 2021-11-16 18:08:02 -05:00			`index.knn` \| false \| false \| Whether the index should build native library indices for the `knn_vector` fields. If set to false, the `knn_vector` fields will be stored in doc values, but Approximate k-NN search functionality will be disabled.
			`index.knn.algo_param.ef_search` \| 512 \| true \| The size of the dynamic list used during k-NN searches. Higher values lead to more accurate but slower searches. Only available for nmslib.
			`index.knn.algo_param.ef_construction` \| 512 \| false \| (Deprecated in 1.0.0. Use the mapping parameters to set this value instead.) Only available for nmslib. Refer to mapping definition.
			`index.knn.algo_param.m` \| 16 \| false \| (Deprecated in 1.0.0. Use the mapping parameters to set this value instead.) Only available for nmslib. Refer to mapping definition.
			`index.knn.space_type` \| "l2" \| false \| (Deprecated in 1.0.0. Use the mapping parameters to set this value instead.) Only available for nmslib. Refer to mapping definition.