OpenSearch

Commit Graph

Author	SHA1	Message	Date
Simon Willnauer	84ce9f3618	Remove the ability to fsync on every operation and only schedule fsync task if really needed This commit limits the `index.translog.sync_interval` to a value not less than `100ms` and removes the support for fsync on every operation which used to be enabled if `index.translog.sync_interval` was set to `0s` Now this pr also only schedules an async fsync if the durability is set to `async`. By default not async task is scheduled. Closes #16152	2016-01-27 12:28:38 +01:00
Simon Willnauer	fcfd98e9e8	Drop support for simple translog and hard-wire buffer to 8kb Today we have two variants of translogs for indexing. We only recommend the buffered one which also has a 20% advantage in indexing speed. This commit removes the option and defaults to the buffered case. It also hard-wires the translog buffer to 8kb instead of 64kb. We used to adjust that buffer based on if the shard is active or not, this code has also been removed and instead we just keep an 8kb buffer arround.	2015-12-21 16:44:35 +01:00
Simon Willnauer	afc1cc19af	Simplify translog-based flush settings This commit removes `index.translog.flush_threshold_ops` and `index.translog.disable_flush` in favor of `index.translog.flush_threshold_size`. The number of operations is meaningless by itself and can easily be turned into a size value with knowledge of the data. Disabling the flush is only useful in tests and we can set the size value to a really high value. If users really need to do this they can also apply a very high value like `1PB`.	2015-12-21 15:15:00 +01:00
Jason O'Donnell	73f620907d	Fixing typo	2015-10-26 16:43:25 -04:00
Simon Willnauer	75e816400c	Remove TranslogService and fold it into synchronous IndexShard API This commit moves the size and ops based flush into a synchronous API into IndexShard and removes the time-based flush alltogether since it' basically covered by the inactive async flush API we have today. The functionality doesn't need to be covered by scheduled task and async APIs while we can actually make all the decisions in a sync manner which is way easier to control and to test. Closes #13707	2015-09-23 12:39:06 +02:00
Clinton Gormley	aaf1d14b21	Docs: Fixed bad links	2015-07-07 16:08:10 +02:00
Clinton Gormley	93fe8f8910	Docs: Updated the translog docs to reflect the new behaviour/settings in master Closes #11287	2015-06-30 19:08:31 +02:00
Clinton Gormley	603a0c193b	Docs: More translog doc improvements	2015-05-05 22:01:58 +02:00
Clinton Gormley	a60251068c	Docs: Improved the translog docs	2015-05-05 21:32:52 +02:00
Boaz Leskes	d596f5cc45	Decouple recoveries from engine flush In order to safely complete recoveries / relocations we have to keep all operation done since the recovery start at available for replay. At the moment we do so by preventing the engine from flushing and thus making sure that the operations are kept in the translog. A side effect of this is that the translog keeps on growing until the recovery is done. This is not a problem as we do need these operations but if the another recovery starts concurrently it may have an unneededly long translog to replay. Also, if we shutdown the engine for some reason at this point (like when a node is restarted) we have to recover a long translog when we come back. To void this, the translog is changed to be based on multiple files instead of a single one. This allows recoveries to keep hold to the files they need while allowing the engine to flush and do a lucene commit (which will create a new translog files bellow the hood). Change highlights: - Refactor Translog file management to allow for multiple files. - Translog maintains a list of referenced files, both by outstanding recoveries and files containing operations not yet committed to Lucene. - A new Translog.View concept is introduced, allowing recoveries to get a reference to all currently uncommitted translog files plus all future translog files created until the view is closed. They can use this view to iterate over operations. - Recovery phase3 is removed. That phase was replaying operations while preventing new writes to the engine. This is unneeded as standard indexing also send all operations from the start of the recovery to the recovering shard. Replay all ops in the view acquired in recovery start is enough to guarantee no operation is lost. - IndexShard now creates the translog together with the engine. The translog is closed by the engine on close. ShadowIndexShards do not open the translog. - Moved the ownership of translog fsyncing to the translog it self, changing the responsible setting to `index.translog.sync_interval` (was `index.gateway.local.sync`) Closes #10624	2015-04-30 23:42:50 +03:00
Michael McCandless	3c0d2081cf	Core: change default xlog size from 200 MB to 512 MB Closes #9341	2015-01-19 15:52:29 -05:00
Clinton Gormley	4b0a89d4fb	Update translog.asciidoc Documented `index.gateway.local.sync`	2014-07-31 14:06:24 +02:00
Simon Willnauer	1cf62e7782	Use unlimited flush_threshold_ops for translog Currently we use 5k operations as a flush threshold. Indexing 5k documents per second is rather common which would cause the index to be committed on the lucene level each time the flush logic runs which is 5 seconds by default. We should rather use a size based threshold similar to the lucene index writer that doesn't cause such agressive commits which can slow down indexing significantly especially since they cause the underlying devices to fsync their data.	2014-04-22 16:37:07 +02:00
Shay Banon	4aa5ef139e	randomize flush interval so multiple shards won't flush at the sam time - also, allow to update interval using update settings on an index	2014-01-07 19:58:28 +01:00
Clinton Gormley	822043347e	Migrated documentation into the main repo	2013-08-29 01:24:34 +02:00

15 Commits