OpenSearch/docs/reference/index-modules/merge.asciidoc

[[index-modules-merge]]
== Merge

experimental[All of the settings exposed in the `merge` module are expert only and may be removed in the future]

A shard in elasticsearch is a Lucene index, and a Lucene index is broken
down into segments. Segments are internal storage elements in the index
where the index data is stored, and are immutable up to delete markers.
Segments are, periodically, merged into larger segments to keep the
index size at bay and expunge deletes.

[float]
[[policy]]
=== Policy

The index merge policy module allows one to control which segments of a
shard index are to be merged. There are several types of policies with
the default set to `tiered`.

[float]
[[tiered]]
==== tiered

Merges segments of approximately equal size, subject to an allowed
number of segments per tier. This is similar to `log_bytes_size` merge
policy, except this merge policy is able to merge non-adjacent segment,
and separates how many segments are merged at once from how many
segments are allowed per tier. This merge policy also does not
over-merge (i.e., cascade merges).

This policy has the following settings:

`index.merge.policy.expunge_deletes_allowed`::

	When expungeDeletes is called, we only merge away a segment if its delete
	percentage is over this threshold. Default is `10`.

`index.merge.policy.floor_segment`::

	Segments smaller than this are "rounded up" to this size, i.e. treated as
	equal (floor) size for merge selection. This is to prevent frequent
	flushing of tiny segments, thus preventing a long tail in the index. Default
	is `2mb`.

`index.merge.policy.max_merge_at_once`::

	Maximum number of segments to be merged at a time during "normal" merging.
	Default is `10`.

`index.merge.policy.max_merge_at_once_explicit`::

	Maximum number of segments to be merged at a time, during optimize or
	expungeDeletes. Default is `30`.

`index.merge.policy.max_merged_segment`::

	Maximum sized segment to produce during normal merging (not explicit
	optimize). This setting is approximate: the estimate of the merged segment
	size is made by summing sizes of to-be-merged segments (compensating for
	percent deleted docs). Default is `5gb`.

`index.merge.policy.segments_per_tier`::

	Sets the allowed number of segments per tier. Smaller values mean more
	merging but fewer segments. Default is `10`. Note, this value needs to be
	>= than the `max_merge_at_once` otherwise you'll force too many merges to
	occur.

`index.merge.policy.reclaim_deletes_weight`::

	Controls how aggressively merges that reclaim more deletions are favored.
	Higher values favor selecting merges that reclaim deletions. A value of
	`0.0` means deletions don't impact merge selection. Defaults to `2.0`.

For normal merging, this policy first computes a "budget" of how many
segments are allowed to be in the index. If the index is over-budget,
then the policy sorts segments by decreasing size (proportionally considering percent
deletes), and then finds the least-cost merge. Merge cost is measured by
a combination of the "skew" of the merge (size of largest seg divided by
smallest seg), total merge size and pct deletes reclaimed, so that
merges with lower skew, smaller size and those reclaiming more deletes,
are favored.

If a merge will produce a segment that's larger than
`max_merged_segment` then the policy will merge fewer segments (down to
1 at once, if that one has deletions) to keep the segment size under
budget.

Note, this can mean that for large shards that holds many gigabytes of
data, the default of `max_merged_segment` (`5gb`) can cause for many
segments to be in an index, and causing searches to be slower. Use the
indices segments API to see the segments that an index has, and
possibly either increase the `max_merged_segment` or issue an optimize
call for the index (try and aim to issue it on a low traffic time).

[float]
[[log-byte-size]]
==== log_byte_size

A merge policy that merges segments into levels of exponentially
increasing *byte size*, where each level has fewer segments than the
value of the merge factor. Whenever extra segments (beyond the merge
factor upper bound) are encountered, all segments within the level are
merged.

This policy has the following settings:

[cols="<,<",options="header",]
|=======================================================================
|Setting |Description
|index.merge.policy.merge_factor |Determines how often segment indices
are merged by index operation. With smaller values, less RAM is used
while indexing, and searches on unoptimized indices are faster, but
indexing speed is slower. With larger values, more RAM is used during
indexing, and while searches on unoptimized indices are slower, indexing
is faster. Thus larger values (greater than 10) are best for batch index
creation, and smaller values (lower than 10) for indices that are
interactively maintained. Defaults to `10`.

|index.merge.policy.min_merge_size |A size setting type which sets the
minimum size for the lowest level segments. Any segments below this size
are considered to be on the same level (even if they vary drastically in
size) and will be merged whenever there are mergeFactor of them. This
effectively truncates the "long tail" of small segments that would
otherwise be created into a single level. If you set this too large, it
could greatly increase the merging cost during indexing (if you flush
many small segments). Defaults to `1.6mb`

|index.merge.policy.max_merge_size |A size setting type which sets the
largest segment (measured by total byte size of the segment's files)
that may be merged with other segments. Defaults to unbounded.

|index.merge.policy.max_merge_docs |Determines the largest segment
(measured by document count) that may be merged with other segments.
Defaults to unbounded.
|=======================================================================

[float]
[[log-doc]]
==== log_doc

A merge policy that tries to merge segments into levels of exponentially
increasing *document count*, where each level has fewer segments than
the value of the merge factor. Whenever extra segments (beyond the merge
factor upper bound) are encountered, all segments within the level are
merged.

[cols="<,<",options="header",]
|=======================================================================
|Setting |Description
|index.merge.policy.merge_factor |Determines how often segment indices
are merged by index operation. With smaller values, less RAM is used
while indexing, and searches on unoptimized indices are faster, but
indexing speed is slower. With larger values, more RAM is used during
indexing, and while searches on unoptimized indices are slower, indexing
is faster. Thus larger values (greater than 10) are best for batch index
creation, and smaller values (lower than 10) for indices that are
interactively maintained. Defaults to `10`.

|index.merge.policy.min_merge_docs |Sets the minimum size for the lowest
level segments. Any segments below this size are considered to be on the
same level (even if they vary drastically in size) and will be merged
whenever there are mergeFactor of them. This effectively truncates the
"long tail" of small segments that would otherwise be created into a
single level. If you set this too large, it could greatly increase the
merging cost during indexing (if you flush many small segments).
Defaults to `1000`.

|index.merge.policy.max_merge_docs |Determines the largest segment
(measured by document count) that may be merged with other segments.
Defaults to unbounded.
|=======================================================================

[float]
[[scheduling]]
=== Scheduling

The merge scheduler (ConcurrentMergeScheduler) controls the execution of
merge operations once they are needed (according to the merge policy).  Merges
run in separate threads, and when the maximum number of threads is reached,
further merges will wait until a merge thread becomes available.  The merge
scheduler supports this setting:

`index.merge.scheduler.max_thread_count`::

The maximum number of threads that may be merging at once. Defaults to
`Math.max(1, Math.min(4, Runtime.getRuntime().availableProcessors() / 2))`
which works well for a good solid-state-disk (SSD).  If your index is on
spinning platter drives instead, decrease this to 1.

`index.merge.scheduler.auto_throttle`::

If this is true (the default), then the merge scheduler will
rate-limit IO (writes) for merges to an adaptive value depending on
how many merges are requested over time.  An application with a low
indexing rate that unluckily suddenly requires a large merge will see
that merge aggressively throttled, while an application doing heavy
indexing will see the throttle move higher to allow merges to keep up
with ongoing indexing.  This is a dynamic setting (you can <<../indices/update-settings,change it
at any time on a running index>>).

[float]
==== SerialMergeScheduler

This is accepted for backwards compatibility, but just uses
ConcurrentMergeScheduler with index.merge.scheduler.max_thread_count
set to 1 so that only 1 merge may run at a time.
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`[[index-modules-merge]]`
			`== Merge`

Docs: Updated the experimental annotations in the docs as follows: * Removed the docs for `index.compound_format` and `index.compound_on_flush` - these are expert settings which should probably be removed (see https://github.com/elastic/elasticsearch/issues/10778) * Removed the docs for `index.index_concurrency` - another expert setting * Labelled the segments verbose output as experimental * Marked the `compression`, `precision_threshold` and `rehash` options as experimental in the cardinality and percentile aggs * Improved the experimental text on `significant_terms`, `execution_hint` in the terms agg, and `terminate_after` param on count and search * Removed the experimental flag on the `geobounds` agg * Marked the settings in the `merge` and `store` modules as experimental, rather than the modules themselves Closes #10782 2015-04-26 18:49:15 +02:00			experimental[All of the settings exposed in the `merge` module are expert only and may be removed in the future]
Docs: Use the new experimental annotation. We now have a very useful annotation to mark features or parameters as experimental. Let's use it! This commit replaces some custom text warnings with this annotation and adds this annotation to some existing features/parameters: - inner_hits (unreleased yet) - terminate_after (released in 1.4) - per-bucket doc count errors in the terms agg (released in 1.4) I also tagged with this annotation settings which should either be not needed (like the ability to evict entries from the filter cache based on time) or that are too deep into the way that Elasticsearch works like the Directory implementation or merge settings. Close #9563 2015-02-04 15:43:22 +01:00
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`A shard in elasticsearch is a Lucene index, and a Lucene index is broken`
			`down into segments. Segments are internal storage elements in the index`
			`where the index data is stored, and are immutable up to delete markers.`
			`Segments are, periodically, merged into larger segments to keep the`
			`index size at bay and expunge deletes.`

			`[float]`
Add more anchor links to documentation Related to #3679 2013-09-25 10:17:40 -06:00			`[[policy]]`
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`=== Policy`

			`The index merge policy module allows one to control which segments of a`
			`shard index are to be merged. There are several types of policies with`
			the default set to `tiered`.

			`[float]`
Add more anchor links to documentation Related to #3679 2013-09-25 10:17:40 -06:00			`[[tiered]]`
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`==== tiered`

			`Merges segments of approximately equal size, subject to an allowed`
			number of segments per tier. This is similar to `log_bytes_size` merge
			`policy, except this merge policy is able to merge non-adjacent segment,`
			`and separates how many segments are merged at once from how many`
			`segments are allowed per tier. This merge policy also does not`
			`over-merge (i.e., cascade merges).`

			`This policy has the following settings:`

[DOCS] Added compound_on_flush docs and updated compound_format docs to include note about accepting a float 2013-10-15 13:30:56 +02:00			`index.merge.policy.expunge_deletes_allowed`::

			`When expungeDeletes is called, we only merge away a segment if its delete`
			percentage is over this threshold. Default is `10`.

			`index.merge.policy.floor_segment`::

			`Segments smaller than this are "rounded up" to this size, i.e. treated as`
			`equal (floor) size for merge selection. This is to prevent frequent`
[DOCS] Multiple doc fixes Closes #5047 2014-03-07 14:21:45 +01:00			`flushing of tiny segments, thus preventing a long tail in the index. Default`
[DOCS] Added compound_on_flush docs and updated compound_format docs to include note about accepting a float 2013-10-15 13:30:56 +02:00			is `2mb`.

			`index.merge.policy.max_merge_at_once`::

			`Maximum number of segments to be merged at a time during "normal" merging.`
			Default is `10`.

			`index.merge.policy.max_merge_at_once_explicit`::

			`Maximum number of segments to be merged at a time, during optimize or`
			expungeDeletes. Default is `30`.

			`index.merge.policy.max_merged_segment`::

			`Maximum sized segment to produce during normal merging (not explicit`
			`optimize). This setting is approximate: the estimate of the merged segment`
			`size is made by summing sizes of to-be-merged segments (compensating for`
			percent deleted docs). Default is `5gb`.

			`index.merge.policy.segments_per_tier`::

			`Sets the allowed number of segments per tier. Smaller values mean more`
			merging but fewer segments. Default is `10`. Note, this value needs to be
[DOCS] Multiple doc fixes Closes #5047 2014-03-07 14:21:45 +01:00			>= than the `max_merge_at_once` otherwise you'll force too many merges to
[DOCS] Added compound_on_flush docs and updated compound_format docs to include note about accepting a float 2013-10-15 13:30:56 +02:00			`occur.`

Docs: Fix misnamed setting The settings is `index.merge.policy.reclaim_deletes_weight` not `index.reclaim_deletes_weight`. Closes #7676 2014-09-10 14:56:23 -04:00			`index.merge.policy.reclaim_deletes_weight`::
[DOCS] Added compound_on_flush docs and updated compound_format docs to include note about accepting a float 2013-10-15 13:30:56 +02:00
			`Controls how aggressively merges that reclaim more deletions are favored.`
			`Higher values favor selecting merges that reclaim deletions. A value of`
			`0.0` means deletions don't impact merge selection. Defaults to `2.0`.

Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`For normal merging, this policy first computes a "budget" of how many`
[DOCS] Multiple doc fixes Closes #5047 2014-03-07 14:21:45 +01:00			`segments are allowed to be in the index. If the index is over-budget,`
			`then the policy sorts segments by decreasing size (proportionally considering percent`
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`deletes), and then finds the least-cost merge. Merge cost is measured by`
			`a combination of the "skew" of the merge (size of largest seg divided by`
			`smallest seg), total merge size and pct deletes reclaimed, so that`
			`merges with lower skew, smaller size and those reclaiming more deletes,`
			`are favored.`

			`If a merge will produce a segment that's larger than`
			`max_merged_segment` then the policy will merge fewer segments (down to
			`1 at once, if that one has deletions) to keep the segment size under`
			`budget.`

			`Note, this can mean that for large shards that holds many gigabytes of`
			data, the default of `max_merged_segment` (`5gb`) can cause for many
			`segments to be in an index, and causing searches to be slower. Use the`
[DOCS] Multiple doc fixes Closes #5047 2014-03-07 14:21:45 +01:00			`indices segments API to see the segments that an index has, and`
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			possibly either increase the `max_merged_segment` or issue an optimize
			`call for the index (try and aim to issue it on a low traffic time).`

			`[float]`
Add more anchor links to documentation Related to #3679 2013-09-25 10:17:40 -06:00			`[[log-byte-size]]`
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`==== log_byte_size`

			`A merge policy that merges segments into levels of exponentially`
			`increasing byte size, where each level has fewer segments than the`
			`value of the merge factor. Whenever extra segments (beyond the merge`
			`factor upper bound) are encountered, all segments within the level are`
			`merged.`

			`This policy has the following settings:`

			`[cols="<,<",options="header",]`
			`\|=======================================================================`
			`\|Setting \|Description`
			`\|index.merge.policy.merge_factor \|Determines how often segment indices`
			`are merged by index operation. With smaller values, less RAM is used`
			`while indexing, and searches on unoptimized indices are faster, but`
			`indexing speed is slower. With larger values, more RAM is used during`
			`indexing, and while searches on unoptimized indices are slower, indexing`
			`is faster. Thus larger values (greater than 10) are best for batch index`
			`creation, and smaller values (lower than 10) for indices that are`
			interactively maintained. Defaults to `10`.

			`\|index.merge.policy.min_merge_size \|A size setting type which sets the`
			`minimum size for the lowest level segments. Any segments below this size`
			`are considered to be on the same level (even if they vary drastically in`
			`size) and will be merged whenever there are mergeFactor of them. This`
			`effectively truncates the "long tail" of small segments that would`
			`otherwise be created into a single level. If you set this too large, it`
			`could greatly increase the merging cost during indexing (if you flush`
			many small segments). Defaults to `1.6mb`

			`\|index.merge.policy.max_merge_size \|A size setting type which sets the`
			`largest segment (measured by total byte size of the segment's files)`
			`that may be merged with other segments. Defaults to unbounded.`

			`\|index.merge.policy.max_merge_docs \|Determines the largest segment`
			`(measured by document count) that may be merged with other segments.`
			`Defaults to unbounded.`
			`\|=======================================================================`

			`[float]`
Add more anchor links to documentation Related to #3679 2013-09-25 10:17:40 -06:00			`[[log-doc]]`
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`==== log_doc`

			`A merge policy that tries to merge segments into levels of exponentially`
			`increasing document count, where each level has fewer segments than`
			`the value of the merge factor. Whenever extra segments (beyond the merge`
			`factor upper bound) are encountered, all segments within the level are`
			`merged.`

			`[cols="<,<",options="header",]`
			`\|=======================================================================`
			`\|Setting \|Description`
			`\|index.merge.policy.merge_factor \|Determines how often segment indices`
			`are merged by index operation. With smaller values, less RAM is used`
			`while indexing, and searches on unoptimized indices are faster, but`
			`indexing speed is slower. With larger values, more RAM is used during`
			`indexing, and while searches on unoptimized indices are slower, indexing`
			`is faster. Thus larger values (greater than 10) are best for batch index`
			`creation, and smaller values (lower than 10) for indices that are`
			interactively maintained. Defaults to `10`.

			`\|index.merge.policy.min_merge_docs \|Sets the minimum size for the lowest`
			`level segments. Any segments below this size are considered to be on the`
			`same level (even if they vary drastically in size) and will be merged`
			`whenever there are mergeFactor of them. This effectively truncates the`
			`"long tail" of small segments that would otherwise be created into a`
			`single level. If you set this too large, it could greatly increase the`
			`merging cost during indexing (if you flush many small segments).`
			Defaults to `1000`.

			`\|index.merge.policy.max_merge_docs \|Determines the largest segment`
			`(measured by document count) that may be merged with other segments.`
			`Defaults to unbounded.`
			`\|=======================================================================`

			`[float]`
Add more anchor links to documentation Related to #3679 2013-09-25 10:17:40 -06:00			`[[scheduling]]`
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00			`=== Scheduling`

#6120 Remove SerialMergeScheduler (master only) It's dangerous to expose SerialMergeScheduler as an option: since it only allows one merge at a time, it can easily cause merging to fall behind. Closes #6120 2014-05-12 14:06:20 -04:00			`The merge scheduler (ConcurrentMergeScheduler) controls the execution of`
			`merge operations once they are needed (according to the merge policy). Merges`
			`run in separate threads, and when the maximum number of threads is reached,`
			`further merges will wait until a merge thread becomes available. The merge`
			`scheduler supports this setting:`
Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00
[DOCS] Multiple doc fixes Closes #5047 2014-03-07 14:21:45 +01:00			`index.merge.scheduler.max_thread_count`::

Docs: fix merge docs to match the code (the max_thread_count default is 'aggressive' (favor SSDs)) 2014-07-11 07:00:57 -04:00			`The maximum number of threads that may be merging at once. Defaults to`
expose current CMS throttle in merge stats; fix tests, docs; also log per-merge stop/throttle/rate 2015-01-11 05:52:43 -05:00			`Math.max(1, Math.min(4, Runtime.getRuntime().availableProcessors() / 2))`
Docs: fix merge docs to match the code (the max_thread_count default is 'aggressive' (favor SSDs)) 2014-07-11 07:00:57 -04:00			`which works well for a good solid-state-disk (SSD). If your index is on`
			`spinning platter drives instead, decrease this to 1.`
Upgrade to Lucene 4.8.1 This commit upgrades to the latest Lucene 4.8.1 release including the following bugfixes: * An IndexThrottle now kicks in when merges start falling behind limiting index threads to 1 until merges caught up. Closes #6066 * RateLimiter now kicks in at the configured rate where previously the limiter was limiting at ~8MB/sec almost all the time. Closes #6018 2014-05-17 12:19:07 +02:00
expose current CMS throttle in merge stats; fix tests, docs; also log per-merge stop/throttle/rate 2015-01-11 05:52:43 -05:00			`index.merge.scheduler.auto_throttle`::

			`If this is true (the default), then the merge scheduler will`
			`rate-limit IO (writes) for merges to an adaptive value depending on`
			`how many merges are requested over time. An application with a low`
			`indexing rate that unluckily suddenly requires a large merge will see`
			`that merge aggressively throttled, while an application doing heavy`
			`indexing will see the throttle move higher to allow merges to keep up`
			`with ongoing indexing. This is a dynamic setting (you can <<../indices/update-settings,change it`
			`at any time on a running index>>).`

Upgrade to Lucene 4.8.1 This commit upgrades to the latest Lucene 4.8.1 release including the following bugfixes: * An IndexThrottle now kicks in when merges start falling behind limiting index threads to 1 until merges caught up. Closes #6066 * RateLimiter now kicks in at the configured rate where previously the limiter was limiting at ~8MB/sec almost all the time. Closes #6018 2014-05-17 12:19:07 +02:00			`[float]`
			`==== SerialMergeScheduler`

Docs: explain that SerialMergeScheduler just maps to CMS for back compat Closes #6878 2014-07-15 11:38:43 -04:00			`This is accepted for backwards compatibility, but just uses`
			`ConcurrentMergeScheduler with index.merge.scheduler.max_thread_count`
			`set to 1 so that only 1 merge may run at a time.`