OpenSearch/docs/reference/mapping/params/term-vector.asciidoc

[[term-vector]]
=== `term_vector`

Term vectors contain information about the terms produced by the
<<analysis,analysis>> process, including:

* a list of terms.
* the position (or order) of each term.
* the start and end character offsets mapping the term to its
  origin in the original string.

These term vectors can be stored so that they can be retrieved for a
particular document.

The `term_vector` setting accepts:

[horizontal]
`no`::                      No term vectors are stored. (default)
`yes`::                     Just the terms in the field are stored.
`with_positions`::          Terms and positions are stored.
`with_offsets`::            Terms and character offsets are stored.
`with_positions_offsets`::  Terms, positions, and character offsets are stored.

The fast vector highlighter requires `with_positions_offsets`.  The term
vectors API can retrieve whatever is stored.

WARNING:  Setting `with_positions_offsets` will double the size of a field's
index.

[source,js]
--------------------------------------------------
PUT my_index?include_type_name=true
{
  "mappings": {
    "_doc": {
      "properties": {
        "text": {
          "type":        "text",
          "term_vector": "with_positions_offsets"
        }
      }
    }
  }
}

PUT my_index/_doc/1
{
  "text": "Quick brown fox"
}

GET my_index/_search
{
  "query": {
    "match": {
      "text": "brown fox"
    }
  },
  "highlight": {
    "fields": {
      "text": {} <1>
    }
  }
}
--------------------------------------------------
// CONSOLE
<1> The fast vector highlighter will be used by default for the `text` field
    because term vectors are enabled.
Docs: Mapping docs completely rewritten for 2.0 2015-08-06 17:24:29 +02:00			`[[term-vector]]`
			=== `term_vector`

			`Term vectors contain information about the terms produced by the`
			`<<analysis,analysis>> process, including:`

			`* a list of terms.`
			`* the position (or order) of each term.`
			`* the start and end character offsets mapping the term to its`
			`origin in the original string.`

			`These term vectors can be stored so that they can be retrieved for a`
			`particular document.`

			The `term_vector` setting accepts:

			`[horizontal]`
			`no`:: No term vectors are stored. (default)
			`yes`:: Just the terms in the field are stored.
			`with_positions`:: Terms and positions are stored.
			`with_offsets`:: Terms and character offsets are stored.
			`with_positions_offsets`:: Terms, positions, and character offsets are stored.

			The fast vector highlighter requires `with_positions_offsets`. The term
			`vectors API can retrieve whatever is stored.`

			WARNING: Setting `with_positions_offsets` will double the size of a field's
			`index.`

			`[source,js]`
			`--------------------------------------------------`
Update the default for include_type_name to false. (#37285) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same appraoch as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs. 2019-01-14 13:08:01 -08:00			`PUT my_index?include_type_name=true`
Docs: Mapping docs completely rewritten for 2.0 2015-08-06 17:24:29 +02:00			`{`
			`"mappings": {`
Allow `_doc` as a type. (#27816) Allowing `_doc` as a type will enable users to make the transition to 7.0 smoother since the index APIs will be `PUT index/_doc/id` and `POST index/_doc`. This also moves most of the documentation to `_doc` as a type name. Closes #27750 Closes #27751 2017-12-14 17:47:53 +01:00			`"_doc": {`
Docs: Mapping docs completely rewritten for 2.0 2015-08-06 17:24:29 +02:00			`"properties": {`
			`"text": {`
Document 5.0 mapping changes. 2016-03-18 17:01:27 +01:00			`"type": "text",`
Docs: Mapping docs completely rewritten for 2.0 2015-08-06 17:24:29 +02:00			`"term_vector": "with_positions_offsets"`
			`}`
			`}`
			`}`
			`}`
			`}`

Allow `_doc` as a type. (#27816) Allowing `_doc` as a type will enable users to make the transition to 7.0 smoother since the index APIs will be `PUT index/_doc/id` and `POST index/_doc`. This also moves most of the documentation to `_doc` as a type name. Closes #27750 Closes #27751 2017-12-14 17:47:53 +01:00			`PUT my_index/_doc/1`
Docs: Mapping docs completely rewritten for 2.0 2015-08-06 17:24:29 +02:00			`{`
			`"text": "Quick brown fox"`
			`}`

			`GET my_index/_search`
			`{`
			`"query": {`
			`"match": {`
			`"text": "brown fox"`
			`}`
			`},`
			`"highlight": {`
			`"fields": {`
			`"text": {} <1>`
			`}`
			`}`
			`}`
			`--------------------------------------------------`
Renamed all AUTOSENSE snippets to CONSOLE (#18210) 2016-05-09 15:42:23 +02:00			`// CONSOLE`
Docs: Mapping docs completely rewritten for 2.0 2015-08-06 17:24:29 +02:00			<1> The fast vector highlighter will be used by default for the `text` field
			`because term vectors are enabled.`