--
:api: analyze
:request: AnalyzeRequest
:response: AnalyzeResponse
--

[id="{upid}-{api}"]
=== Analyze API

[id="{upid}-{api}-request"]
==== Analyze Request

An +{request}+ contains the text to analyze, and one of several options to
specify how the analysis should be performed.
The simplest version uses a built-in analyzer:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-builtin-request]
---------------------------------------------------
<1> A built-in analyzer
<2> The text to include. Multiple strings are treated as a multi-valued field
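
The tagged snippet lives in the client's documentation tests, so purely as an
illustration, here is a minimal sketch of this style of request, assuming a static
`AnalyzeRequest.withGlobalAnalyzer(String analyzer, String... text)` factory method;
the analyzer name and sample text are made up:

["source","java"]
---------------------------------------------------
// Sketch: analyze two strings with the built-in "english" analyzer.
AnalyzeRequest request = AnalyzeRequest.withGlobalAnalyzer("english",
    "Some text to analyze", "Some more text to analyze");
---------------------------------------------------
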
You can configure a custom analyzer:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-custom-request]
---------------------------------------------------
<1> Configuration for a custom token filter
<2> Configure the tokenizer
<3> Configure char filters
<4> Add a built-in token filter
<5> Add the custom token filter
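
For illustration only, a minimal sketch of such a request, assuming an
`AnalyzeRequest.buildCustomAnalyzer` builder with `addCharFilter`/`addTokenFilter`
methods; the filter settings shown are just an example:

["source","java"]
---------------------------------------------------
// Sketch: a custom analysis chain built from a tokenizer, a char filter,
// a built-in token filter, and a custom token filter defined inline.
Map<String, Object> stopFilter = new HashMap<>();       // custom token filter settings
stopFilter.put("type", "stop");
stopFilter.put("stopwords", new String[]{ "to" });
AnalyzeRequest request = AnalyzeRequest.buildCustomAnalyzer("standard") // the tokenizer
    .addCharFilter("html_strip")     // a char filter
    .addTokenFilter("lowercase")     // a built-in token filter
    .addTokenFilter(stopFilter)      // the custom token filter
    .build("<b>Some text to analyze</b>");
---------------------------------------------------
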
You can also build a custom normalizer by including only char filters and
token filters:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-custom-normalizer-request]
---------------------------------------------------
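
Again as a sketch only, assuming a `buildCustomNormalizer` factory on +{request}+,
such a request could look like this:

["source","java"]
---------------------------------------------------
// Sketch: a normalizer is an analysis chain without a tokenizer,
// so only char filters and token filters are added.
AnalyzeRequest request = AnalyzeRequest.buildCustomNormalizer()
    .addTokenFilter("lowercase")
    .build("<b>BaR</b>");
---------------------------------------------------
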
You can analyze text using an analyzer defined in an existing index:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-index-request]
---------------------------------------------------
<1> The index containing the mappings
<2> The analyzer defined on this index to use
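
A sketch of this variant, assuming a `withIndexAnalyzer` factory method; the index
and analyzer names are hypothetical:

["source","java"]
---------------------------------------------------
// Sketch: run the analyzer "my_analyzer" defined on the index "my_index".
AnalyzeRequest request = AnalyzeRequest.withIndexAnalyzer(
    "my_index",       // the index containing the analyzer definition
    "my_analyzer",    // the analyzer defined on that index
    "some text to analyze");
---------------------------------------------------
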
Or you can use a normalizer:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-index-normalizer-request]
---------------------------------------------------
<1> The index containing the mappings
<2> The normalizer defined on this index to use
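
A similar sketch, assuming a `withNormalizer` factory method; the names are again
hypothetical:

["source","java"]
---------------------------------------------------
// Sketch: run the normalizer "my_normalizer" defined on the index "my_index".
AnalyzeRequest request = AnalyzeRequest.withNormalizer(
    "my_index",         // the index containing the normalizer definition
    "my_normalizer",    // the normalizer defined on that index
    "some text to analyze");
---------------------------------------------------
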
You can analyze text using the mappings for a particular field in an index:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-field-request]
---------------------------------------------------
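
For reference, a minimal sketch of this form, assuming a `withField` factory method;
the index and field names are hypothetical:

["source","java"]
---------------------------------------------------
// Sketch: analyze text with whatever analyzer the mapping of "my_field" uses.
AnalyzeRequest request = AnalyzeRequest.withField("my_index", "my_field",
    "some text to analyze");
---------------------------------------------------
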
==== Optional arguments
The following arguments can also optionally be provided:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-request-explain]
---------------------------------------------------
<1> Setting `explain` to `true` will add further details to the response
<2> Setting `attributes` allows you to return only the token attributes that you are
interested in
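
For example, a sketch of both options, assuming `explain` and `attributes` setters
on +{request}+; the attribute names are illustrative:

["source","java"]
---------------------------------------------------
request.explain(true);                  // add a detailed breakdown to the response
request.attributes("keyword", "type");  // only return these token attributes
---------------------------------------------------
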
include::../execution.asciidoc[]

[id="{upid}-{api}-response"]
==== Analyze Response

The returned +{response}+ allows you to retrieve details of the analysis as
follows:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-response-tokens]
---------------------------------------------------
<1> `AnalyzeToken` holds information about the individual tokens produced by analysis
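
As an example, a sketch of iterating over the tokens, assuming getters such as
`getTokens()` on +{response}+ and `getTerm()`/`getStartOffset()`/`getEndOffset()` on
`AnalyzeToken`:

["source","java"]
---------------------------------------------------
// Sketch: print each token produced by the analysis.
for (AnalyzeResponse.AnalyzeToken token : response.getTokens()) {
    System.out.println(token.getTerm() + " "
        + token.getStartOffset() + "-" + token.getEndOffset());
}
---------------------------------------------------
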
If `explain` was set to `true`, then the information is instead returned from the
`detail()` method:
["source","java",subs="attributes,callouts,macros"]
---------------------------------------------------
include-tagged::{doc-tests-file}[{api}-response-detail]
---------------------------------------------------
<1> `DetailAnalyzeResponse` holds more detailed information about tokens produced by
the various substeps in the analysis chain.
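
As a sketch, assuming `detail()` returns a `DetailAnalyzeResponse` with per-step
accessors such as `tokenizer()`, and that each step exposes its name and tokens:

["source","java"]
---------------------------------------------------
// Sketch: inspect the output of one step of the analysis chain.
DetailAnalyzeResponse detail = response.detail();
if (detail.tokenizer() != null) {
    System.out.println("tokenizer: " + detail.tokenizer().getName());
}
---------------------------------------------------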