spring-data-elasticsearch/src/main/asciidoc/reference/elasticsearch-misc.adoc

[[elasticsearch.misc]]
= Miscellaneous Elasticsearch Operation Support

This chapter covers additional support for Elasticsearch operations that cannot be directly accessed via the repository interface.
It is recommended to add those operations as custom implementation as described in <<repositories.custom-implementations>> .

[[elasticsearc.misc.index.settings]]
== Index settings

When creating Elasticsearch indices with Spring Data Elasticsearch different index settings can be defined by using the `@Setting` annotation.
The following arguments are available:

* `useServerConfiguration` does not send any settings parameters, so the Elasticsearch server configuration determines them.
* `settingPath` refers to a JSON file defining the settings that must be resolvable in the classpath
* `shards` the number of shards to use, defaults to _1_
* `replicas` the number of replicas, defaults to _1_
* `refreshIntervall`, defaults to _"1s"_
* `indexStoreType`, defaults to _"fs"_


It is as well possible to define https://www.elastic.co/guide/en/elasticsearch/reference/7.11/index-modules-index-sorting.html[index sorting] (check the linked Elasticsearch documentation for the possible field types and values):

====
[source,java]
----
@Document(indexName = "entities")
@Setting(
  sortFields = { "secondField", "firstField" },                                  <.>
  sortModes = { Setting.SortMode.max, Setting.SortMode.min },                    <.>
  sortOrders = { Setting.SortOrder.desc, Setting.SortOrder.asc },
  sortMissingValues = { Setting.SortMissing._last, Setting.SortMissing._first })
class Entity {
    @Nullable
    @Id private String id;

    @Nullable
    @Field(name = "first_field", type = FieldType.Keyword)
    private String firstField;

    @Nullable @Field(name = "second_field", type = FieldType.Keyword)
    private String secondField;

    // getter and setter...
}
----

<.> when defining sort fields, use the name of the Java property (_firstField_), not the name that might be defined for Elasticsearch (_first_field_)
<.> `sortModes`, `sortOrders` and `sortMissingValues` are optional, but if they are set, the number of entries must match the number of `sortFields` elements
====

[[elasticsearch.misc.mappings]]
== Index Mapping

When Spring Data Elasticsearch creates the index mapping with the `IndexOperations.createMapping()` methods, it uses the annotations described in <<elasticsearch.mapping.meta-model.annotations>>, especially the `@Field` annotation.
In addition to that it is possible to add the `@Mapping` annotation to a class.
This annotation has the following properties:

* `mappingPath` a classpath resource in JSON format; if this is not empty it is used as the mapping, no other mapping processing is done.
* `enabled`  when set to false, this flag is written to the mapping and no further processing is done.
* `dateDetection` and `numericDetection` set the corresponding properties in the mapping when not set to `DEFAULT`.
* `dynamicDateFormats` when this String array is not empty, it defines the date formats used for automatic date detection.
* `runtimeFieldsPath` a classpath resource in JSON format containing the definition of runtime fields which is written to the index mappings, for example:

====
[source,json]
----
{
  "day_of_week": {
    "type": "keyword",
    "script": {
      "source": "emit(doc['@timestamp'].value.dayOfWeekEnum.getDisplayName(TextStyle.FULL, Locale.ROOT))"
    }
  }
}
----
====

[[elasticsearch.misc.filter]]
== Filter Builder

Filter Builder improves query speed.

====
[source,java]
----
private ElasticsearchOperations operations;

IndexCoordinates index = IndexCoordinates.of("sample-index");

SearchQuery searchQuery = new NativeSearchQueryBuilder()
  .withQuery(matchAllQuery())
  .withFilter(boolFilter().must(termFilter("id", documentId)))
  .build();

Page<SampleEntity> sampleEntities = operations.searchForPage(searchQuery, SampleEntity.class, index);
----
====

[[elasticsearch.scroll]]
== Using Scroll For Big Result Set

Elasticsearch has a scroll API for getting big result set in chunks.
This is internally used by Spring Data Elasticsearch to provide the implementations of the `<T> SearchHitsIterator<T> SearchOperations.searchForStream(Query query, Class<T> clazz, IndexCoordinates index)` method.

====
[source,java]
----
IndexCoordinates index = IndexCoordinates.of("sample-index");

SearchQuery searchQuery = new NativeSearchQueryBuilder()
  .withQuery(matchAllQuery())
  .withFields("message")
  .withPageable(PageRequest.of(0, 10))
  .build();

SearchHitsIterator<SampleEntity> stream = elasticsearchTemplate.searchForStream(searchQuery, SampleEntity.class, index);

List<SampleEntity> sampleEntities = new ArrayList<>();
while (stream.hasNext()) {
  sampleEntities.add(stream.next());
}

stream.close();
----
====

There are no methods in the `SearchOperations` API to access the scroll id, if it should be necessary to access this, the following methods of the `ElasticsearchRestTemplate` can be used:

====
[source,java]
----

@Autowired ElasticsearchRestTemplate template;

IndexCoordinates index = IndexCoordinates.of("sample-index");

SearchQuery searchQuery = new NativeSearchQueryBuilder()
  .withQuery(matchAllQuery())
  .withFields("message")
  .withPageable(PageRequest.of(0, 10))
  .build();

SearchScrollHits<SampleEntity> scroll = template.searchScrollStart(1000, searchQuery, SampleEntity.class, index);

String scrollId = scroll.getScrollId();
List<SampleEntity> sampleEntities = new ArrayList<>();
while (scroll.hasSearchHits()) {
  sampleEntities.addAll(scroll.getSearchHits());
  scrollId = scroll.getScrollId();
  scroll = template.searchScrollContinue(scrollId, 1000, SampleEntity.class);
}
template.searchScrollClear(scrollId);
----
====

To use the Scroll API with repository methods, the return type must defined as `Stream` in the Elasticsearch Repository.
The implementation of the method will then use the scroll methods from the ElasticsearchTemplate.

====
[source,java]
----
interface SampleEntityRepository extends Repository<SampleEntity, String> {

    Stream<SampleEntity> findBy();

}
----
====

[[elasticsearch.misc.sorts]]
== Sort options

In addition to the default sort options described in <<repositories.paging-and-sorting>>, Spring Data Elasticsearch provides the class `org.springframework.data.elasticsearch.core.query.Order` which derives from `org.springframework.data.domain.Sort.Order`.
It offers additional parameters that can be sent to Elasticsearch when specifying the sorting of the result (see https://www.elastic.co/guide/en/elasticsearch/reference/7.15/sort-search-results.html).

There also is the  `org.springframework.data.elasticsearch.core.query.GeoDistanceOrder` class which can be used to have the result of a search operation ordered by geographical distance.

If the class to be retrieved has a `GeoPoint` property named _location_, the following `Sort` would sort the results by distance to the given point:

====
[source,java]
----
Sort.by(new GeoDistanceOrder("location", new GeoPoint(48.137154, 11.5761247)))
----
====
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00			`[[elasticsearch.misc]]`
			`= Miscellaneous Elasticsearch Operation Support`

Add routing parameter to ElasticsearchOperations. Original Pull Request #562 Closes #1218 2021-01-18 23:54:55 +01:00			`This chapter covers additional support for Elasticsearch operations that cannot be directly accessed via the repository interface.`
			`It is recommended to add those operations as custom implementation as described in <<repositories.custom-implementations>> .`
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`[[elasticsearc.misc.index.settings]]`
			`== Index settings`

Custom Order class with specific parameters for Elasticsearch. Original Pull Request #1955 Closes #1911 2021-10-08 13:38:22 +02:00			When creating Elasticsearch indices with Spring Data Elasticsearch different index settings can be defined by using the `@Setting` annotation.
			`The following arguments are available:`
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00
			* `useServerConfiguration` does not send any settings parameters, so the Elasticsearch server configuration determines them.
			* `settingPath` refers to a JSON file defining the settings that must be resolvable in the classpath
			* `shards` the number of shards to use, defaults to _1_
			* `replicas` the number of replicas, defaults to _1_
			* `refreshIntervall`, defaults to _"1s"_
			* `indexStoreType`, defaults to _"fs"_


			`It is as well possible to define https://www.elastic.co/guide/en/elasticsearch/reference/7.11/index-modules-index-sorting.html[index sorting] (check the linked Elasticsearch documentation for the possible field types and values):`

			`====`
			`[source,java]`
			`----`
			`@Document(indexName = "entities")`
			`@Setting(`
			`sortFields = { "secondField", "firstField" }, <.>`
			`sortModes = { Setting.SortMode.max, Setting.SortMode.min }, <.>`
			`sortOrders = { Setting.SortOrder.desc, Setting.SortOrder.asc },`
			`sortMissingValues = { Setting.SortMissing._last, Setting.SortMissing._first })`
			`class Entity {`
			`@Nullable`
			`@Id private String id;`

			`@Nullable`
			`@Field(name = "first_field", type = FieldType.Keyword)`
			`private String firstField;`

			`@Nullable @Field(name = "second_field", type = FieldType.Keyword)`
			`private String secondField;`

			`// getter and setter...`
			`}`
			`----`
Custom Order class with specific parameters for Elasticsearch. Original Pull Request #1955 Closes #1911 2021-10-08 13:38:22 +02:00
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`<.> when defining sort fields, use the name of the Java property (_firstField_), not the name that might be defined for Elasticsearch (_first_field_)`
			<.> `sortModes`, `sortOrders` and `sortMissingValues` are optional, but if they are set, the number of entries must match the number of `sortFields` elements
			`====`

datatype detection support in mapping. Original Pull Request #1810 Closes #638 2021-05-13 10:26:24 +02:00			`[[elasticsearch.misc.mappings]]`
			`== Index Mapping`

Custom Order class with specific parameters for Elasticsearch. Original Pull Request #1955 Closes #1911 2021-10-08 13:38:22 +02:00			When Spring Data Elasticsearch creates the index mapping with the `IndexOperations.createMapping()` methods, it uses the annotations described in <<elasticsearch.mapping.meta-model.annotations>>, especially the `@Field` annotation.
			In addition to that it is possible to add the `@Mapping` annotation to a class.
			`This annotation has the following properties:`
datatype detection support in mapping. Original Pull Request #1810 Closes #638 2021-05-13 10:26:24 +02:00
Add runtime fields to index mapping. Original Pull Request: #1820 Closes: #1816 2021-05-19 21:38:48 +02:00			* `mappingPath` a classpath resource in JSON format; if this is not empty it is used as the mapping, no other mapping processing is done.
datatype detection support in mapping. Original Pull Request #1810 Closes #638 2021-05-13 10:26:24 +02:00			* `enabled` when set to false, this flag is written to the mapping and no further processing is done.
			* `dateDetection` and `numericDetection` set the corresponding properties in the mapping when not set to `DEFAULT`.
			* `dynamicDateFormats` when this String array is not empty, it defines the date formats used for automatic date detection.
Add runtime fields to index mapping. Original Pull Request: #1820 Closes: #1816 2021-05-19 21:38:48 +02:00			* `runtimeFieldsPath` a classpath resource in JSON format containing the definition of runtime fields which is written to the index mappings, for example:
Custom Order class with specific parameters for Elasticsearch. Original Pull Request #1955 Closes #1911 2021-10-08 13:38:22 +02:00
Add runtime fields to index mapping. Original Pull Request: #1820 Closes: #1816 2021-05-19 21:38:48 +02:00			`====`
			`[source,json]`
			`----`
			`{`
			`"day_of_week": {`
			`"type": "keyword",`
			`"script": {`
			`"source": "emit(doc['@timestamp'].value.dayOfWeekEnum.getDisplayName(TextStyle.FULL, Locale.ROOT))"`
			`}`
			`}`
			`}`
			`----`
			`====`
datatype detection support in mapping. Original Pull Request #1810 Closes #638 2021-05-13 10:26:24 +02:00
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00			`[[elasticsearch.misc.filter]]`
			`== Filter Builder`

			`Filter Builder improves query speed.`

			`====`
			`[source,java]`
			`----`
DATAES-676 - Fix documentation to reflect the changes in API restructuring. Original PR: #362 2019-12-24 09:23:23 +01:00			`private ElasticsearchOperations operations;`

			`IndexCoordinates index = IndexCoordinates.of("sample-index");`
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00
			`SearchQuery searchQuery = new NativeSearchQueryBuilder()`
DATAES-497 - Update reference documentation. Original pull request: #291. 2019-06-28 21:40:59 +02:00			`.withQuery(matchAllQuery())`
			`.withFilter(boolFilter().must(termFilter("id", documentId)))`
			`.build();`
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00
DATAES-676 - Fix documentation to reflect the changes in API restructuring. Original PR: #362 2019-12-24 09:23:23 +01:00			`Page<SampleEntity> sampleEntities = operations.searchForPage(searchQuery, SampleEntity.class, index);`
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00			`----`
			`====`

DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00			`[[elasticsearch.scroll]]`
			`== Using Scroll For Big Result Set`
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00
Add routing parameter to ElasticsearchOperations. Original Pull Request #562 Closes #1218 2021-01-18 23:54:55 +01:00			`Elasticsearch has a scroll API for getting big result set in chunks.`
			This is internally used by Spring Data Elasticsearch to provide the implementations of the `<T> SearchHitsIterator<T> SearchOperations.searchForStream(Query query, Class<T> clazz, IndexCoordinates index)` method.
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`====`
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00			`[source,java]`
			`----`
DATAES-676 - Fix documentation to reflect the changes in API restructuring. Original PR: #362 2019-12-24 09:23:23 +01:00			`IndexCoordinates index = IndexCoordinates.of("sample-index");`

DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00			`SearchQuery searchQuery = new NativeSearchQueryBuilder()`
DATAES-497 - Update reference documentation. Original pull request: #291. 2019-06-28 21:40:59 +02:00			`.withQuery(matchAllQuery())`
			`.withFields("message")`
			`.withPageable(PageRequest.of(0, 10))`
			`.build();`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00			`SearchHitsIterator<SampleEntity> stream = elasticsearchTemplate.searchForStream(searchQuery, SampleEntity.class, index);`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00
			`List<SampleEntity> sampleEntities = new ArrayList<>();`
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00			`while (stream.hasNext()) {`
			`sampleEntities.add(stream.next());`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00			`}`
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00
			`stream.close();`
DATAES-114 - Migrate to Asciidoctor for reference documentation. Converted existing docbook material to Asciidoctor and applied Spring styling. 2014-08-07 12:24:32 -05:00			`----`
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`====`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00			There are no methods in the `SearchOperations` API to access the scroll id, if it should be necessary to access this, the following methods of the `ElasticsearchRestTemplate` can be used:
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`====`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00			`[source,java]`
			`----`
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00
			`@Autowired ElasticsearchRestTemplate template;`

DATAES-676 - Fix documentation to reflect the changes in API restructuring. Original PR: #362 2019-12-24 09:23:23 +01:00			`IndexCoordinates index = IndexCoordinates.of("sample-index");`

DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00			`SearchQuery searchQuery = new NativeSearchQueryBuilder()`
DATAES-497 - Update reference documentation. Original pull request: #291. 2019-06-28 21:40:59 +02:00			`.withQuery(matchAllQuery())`
			`.withFields("message")`
			`.withPageable(PageRequest.of(0, 10))`
			`.build();`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00			`SearchScrollHits<SampleEntity> scroll = template.searchScrollStart(1000, searchQuery, SampleEntity.class, index);`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00			`String scrollId = scroll.getScrollId();`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00			`List<SampleEntity> sampleEntities = new ArrayList<>();`
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00			`while (scroll.hasSearchHits()) {`
			`sampleEntities.addAll(scroll.getSearchHits());`
			`scrollId = scroll.getScrollId();`
			`scroll = template.searchScrollContinue(scrollId, 1000, SampleEntity.class);`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00			`}`
DATAES-766 - Replace CloseableIterator with SearchHitsIterator in stream operations. Original PR: #412 fix documentation 2020-03-27 17:19:56 +01:00			`template.searchScrollClear(scrollId);`
DATAES-445 - Updated scroll API example. Original pull request: #218 2018-08-31 12:40:51 +03:00			`----`
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`====`
DATAES-734 - Add Sort implementation that allows geo distance sorts. Original PR: #382 2020-01-23 18:03:37 +01:00
Add routing parameter to ElasticsearchOperations. Original Pull Request #562 Closes #1218 2021-01-18 23:54:55 +01:00			To use the Scroll API with repository methods, the return type must defined as `Stream` in the Elasticsearch Repository.
			`The implementation of the method will then use the scroll methods from the ElasticsearchTemplate.`
DATAES-802 - Update documentation for using scroll API with repository methods. Original PR: #440 2020-04-26 16:40:30 +01:00
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`====`
DATAES-802 - Update documentation for using scroll API with repository methods. Original PR: #440 2020-04-26 16:40:30 +01:00			`[source,java]`
			`----`
			`interface SampleEntityRepository extends Repository<SampleEntity, String> {`

			`Stream<SampleEntity> findBy();`

			`}`
			`----`
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`====`
DATAES-802 - Update documentation for using scroll API with repository methods. Original PR: #440 2020-04-26 16:40:30 +01:00
DATAES-734 - Add Sort implementation that allows geo distance sorts. Original PR: #382 2020-01-23 18:03:37 +01:00			`[[elasticsearch.misc.sorts]]`
			`== Sort options`

Custom Order class with specific parameters for Elasticsearch. Original Pull Request #1955 Closes #1911 2021-10-08 13:38:22 +02:00			In addition to the default sort options described in <<repositories.paging-and-sorting>>, Spring Data Elasticsearch provides the class `org.springframework.data.elasticsearch.core.query.Order` which derives from `org.springframework.data.domain.Sort.Order`.
			`It offers additional parameters that can be sent to Elasticsearch when specifying the sorting of the result (see https://www.elastic.co/guide/en/elasticsearch/reference/7.15/sort-search-results.html).`

			There also is the `org.springframework.data.elasticsearch.core.query.GeoDistanceOrder` class which can be used to have the result of a search operation ordered by geographical distance.
DATAES-734 - Add Sort implementation that allows geo distance sorts. Original PR: #382 2020-01-23 18:03:37 +01:00
			If the class to be retrieved has a `GeoPoint` property named _location_, the following `Sort` would sort the results by distance to the given point:

Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`====`
DATAES-734 - Add Sort implementation that allows geo distance sorts. Original PR: #382 2020-01-23 18:03:37 +01:00			`[source,java]`
			`----`
			`Sort.by(new GeoDistanceOrder("location", new GeoPoint(48.137154, 11.5761247)))`
			`----`
Configure index settings with @Setting annotation. Original Pull Request #1748 Closes #1719 2021-03-28 13:24:52 +02:00			`====`