OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	36a5cf8f35	Automatically early terminate search query based on index sorting (#24864 ) This commit refactors the query phase in order to be able to automatically detect queries that can be early terminated. If the index sort matches the query sort, the top docs collection is early terminated on each segment and the computing of the total number of hits that match the query is delegated to a simple TotalHitCountCollector. This change also adds a new parameter to the search request called `track_total_hits`. It indicates if the total number of hits that match the query should be tracked. If false, queries sorted by the index sort will not try to compute this information and and will limit the collection to the first N documents per segment. Aggregations are not impacted and will continue to see every document even when the index sort matches the query sort and `track_total_hits` is false. Relates #6720	2017-06-08 12:10:46 +02:00
Adrien Grand	bbdf50f6bd	Docs: More search speed advices. (#24802 )	2017-06-01 17:23:22 +02:00
Jim Ferenczi	f05af0a382	Enable index-time sorting (#24055 ) This change adds an index setting to define how the documents should be sorted inside each Segment. It allows any numeric, date, boolean or keyword field inside a mapping to be used to sort the index on disk. It is not allowed to use a `nested` fields inside an index that defines an index sorting since `nested` fields relies on the original sort of the index. This change does not add early termination capabilities in the search layer. This will be added in a follow up. Relates #6720	2017-04-19 14:36:11 +02:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Igor Motov	93b5e55660	Restores the original default format of search slow log In 5.0, the search slow log switched to the multi-line format with no option to get back to the origin single-line format that was used prior to 5.0 by default. This commit removes the reformat option from the search slow log and returns the search slow log back to the single-line format. Closes #21711	2016-12-09 12:38:28 -05:00
Loek van Gool	1a23739211	Update store.asciidoc (#21353 ) * Update store.asciidoc * Update store.asciidoc * Update store.asciidoc	2016-11-05 14:58:16 +01:00
Adriel Dean-Hall	b72a708c0d	Add docs with up to date instructions on updating default similarity (#21242 ) * Add docs with up to date instructions on updating default similarity The default similarity can no longer be set in the configuration file (you will get an error on startup). Update the docs with the method that works. * Add instructions for changing similarity on index creation	2016-11-01 16:14:20 -04:00
Jason Tedor	96aa5e33ce	Fix slowlog docs This commit fixes two issues with the slow log docs: - clarifies that these settings are per index - updates index slow log configuration for Log4j 2 Relates #20976	2016-10-17 10:50:32 -04:00
Pascal Borreli	fcb01deb34	Fixed typos (#20843 )	2016-10-10 14:51:47 -06:00
Jason Tedor	750033dc4b	Update docs for Log4j 2 This commit updates the logging docs for Elasticsearch to reflect the migration to Log4j 2.	2016-08-31 15:51:52 -04:00
Lee Hinman	0ade5a207d	Add documentation for the 'elasticsearch-translog' tool This adds documentation to the translog page for the CLI truncation tool.	2016-08-02 16:26:28 -06:00
Sakthipriyan Vairamani	8d5a5e500a	file is -> file name (#18994 )	2016-06-21 13:20:56 +02:00
Jim Ferenczi	423291b6bc	Change default similarity to BM25 The default similarity was set to `classic` which refers to TFIDF and has not been moved after the upgrade to Lucene 6. Though moving to BM25 could have some downside for queries that relies on coordination factor (match_query, multi_match_query) ? relates #18944	2016-06-21 11:29:36 +02:00
Adrien Grand	93415d4506	Expose MMapDirectory.preLoad(). #18880 The MMapDirectory has a switch that allows the content of files to be loaded into the filesystem cache upon opening. This commit exposes it with the new `index.store.pre_load` setting.	2016-06-20 13:42:56 +02:00
eratio08	26aacfff72	default values for BM25 Similarity (#18778 ) assuming elasticsearch uses the lucene default values	2016-06-13 18:57:44 +02:00
trangvh	c0da8e4060	Fix some typos (#18746 ) * Update java-doc of SearchResponse.getProfileResults() * Fix a trivial typo in Reference document	2016-06-07 16:41:39 +02:00
Nik Everett	72eb621bce	Docs: Replace [source,json] with [source,js] The syntax highlighter only supports [source,js]. Also adds a check to the rest test generator that runs during the build that'll fail the build if it sees `[source,json]`.	2016-05-24 11:17:27 -04:00
eratio08	7e00a1c1a3	Added Type name for DFI (#18480 )	2016-05-20 11:02:06 +02:00
Jason Tedor	c257e2c51f	Remove settings and system properties entanglement Today when parsing settings during bootstrap, we add a system property for every Elasticsearch setting. Additionally, settings can be set via system properties. This commit simplifies this situation. - settings are no longer propogated to system properties - system properties can not be used to set settings - the "es." prefix on settings is no longer required (nor permitted) - test logging has a dedicated system property (tests.logger.level) Relates #18198	2016-05-19 14:08:08 -04:00
Clinton Gormley	3f594089c2	Renamed all AUTOSENSE snippets to CONSOLE (#18210 )	2016-05-09 15:42:23 +02:00
Nik Everett	4b1c116461	Generate and run tests from the docs Adds infrastructure so `gradle :docs:check` will extract tests from snippets in the documentation and execute the tests. This is included in `gradle check` so it should happen on CI and during a normal build. By default each `// AUTOSENSE` snippet creates a unique REST test. These tests are executed in a random order and the cluster is wiped between each one. If multiple snippets chain together into a test you can annotate all snippets after the first with `// TEST[continued]` to have the generated tests for both snippets joined. Snippets marked as `// TESTRESPONSE` are checked against the response of the last action. See docs/README.asciidoc for lots more. Closes #12583. That issue is about catching bugs in the docs during build. This catches some bugs in the docs during build which is a good start.	2016-05-05 13:58:03 -04:00
Adrien Grand	51a53c55cb	Update store documentation after #17616 .	2016-05-04 08:53:11 +02:00
Simon Willnauer	2514681f66	updateing filtering.asciidoc to also use 'node.attr' namespace	2016-03-30 14:11:59 +02:00
Adrien Grand	b42f66c8ac	Document 5.0 mapping changes.	2016-03-22 16:22:58 +01:00
Jason Tedor	8a05c2a2be	Bootstrap does not set system properties Today, certain bootstrap properties are set and read via system properties. This action-at-distance way of managing these properties is rather confusing, and completely unnecessary. But another problem exists with setting these as system properties. Namely, these system properties are interpreted as Elasticsearch settings, not all of which are registered. This leads to Elasticsearch failing to startup if any of these special properties are set. Instead, these properties should be kept as local as possible, and passed around as method parameters where needed. This eliminates the action-at-distance way of handling these properties, and eliminates the need to register these non-setting properties. This commit does exactly that. Additionally, today we use the "-D" command line flag to set the properties, but this is confusing because "-D" is a special flag to the JVM for setting system properties. This creates confusion because some "-D" properties should be passed via arguments to the JVM (so via ES_JAVA_OPTS), and some should be passed as arguments to Elasticsearch. This commit changes the "-D" flag for Elasticsearch settings to "-E".	2016-03-13 20:09:15 -04:00
Clinton Gormley	6d7e8814d6	Redocument the `index.merge.scheduler.max_thread_count` setting Closes #16961	2016-03-05 16:28:43 +01:00
Boaz Leskes	4a7980f96c	Merge pull request #16766 from rstruber/patch-1 fix grammar in Total Shards Per Node docs	2016-02-22 08:42:42 -08:00
Dongjoon Hyun	21ea552070	Fix typos in docs.	2016-02-09 02:07:32 -08:00
Robert Muir	d5dc05f69e	Upgrade to lucene 5.5.0-snapshot-1725675	2016-02-02 22:53:39 -05:00
Simon Willnauer	84ce9f3618	Remove the ability to fsync on every operation and only schedule fsync task if really needed This commit limits the `index.translog.sync_interval` to a value not less than `100ms` and removes the support for fsync on every operation which used to be enabled if `index.translog.sync_interval` was set to `0s` Now this pr also only schedules an async fsync if the durability is set to `async`. By default not async task is scheduled. Closes #16152	2016-01-27 12:28:38 +01:00
Robert Muir	6e7e3a2274	Update lucene to r1725675 Adds DFI (divergence from independence) provider. Fixes test bugs passing invalid values for BM25 parameters.	2016-01-20 03:32:51 -05:00
Jim Ferenczi	992ffac509	Merge pull request #15446 from jimferenczi/classic_similarity Renames `default` similarity into `classic`	2015-12-30 08:42:20 -08:00
Simon Willnauer	fcfd98e9e8	Drop support for simple translog and hard-wire buffer to 8kb Today we have two variants of translogs for indexing. We only recommend the buffered one which also has a 20% advantage in indexing speed. This commit removes the option and defaults to the buffered case. It also hard-wires the translog buffer to 8kb instead of 64kb. We used to adjust that buffer based on if the shard is active or not, this code has also been removed and instead we just keep an 8kb buffer arround.	2015-12-21 16:44:35 +01:00
Jim Ferenczi	81fd2169cf	Renames "default" similarity into "classic". Replaces deprecated DefaultSimilarity by ClassicSimilarity. Fixes #15102	2015-12-21 16:22:53 +01:00
Simon Willnauer	afc1cc19af	Simplify translog-based flush settings This commit removes `index.translog.flush_threshold_ops` and `index.translog.disable_flush` in favor of `index.translog.flush_threshold_size`. The number of operations is meaningless by itself and can easily be turned into a size value with knowledge of the data. Disabling the flush is only useful in tests and we can set the size value to a really high value. If users really need to do this they can also apply a very high value like `1PB`.	2015-12-21 15:15:00 +01:00
Clinton Gormley	f20f41e02e	Merge pull request #15405 from alexg-dev/patch-1 More detailed explanation of some similarity types	2015-12-14 14:28:43 +01:00
William	e042e06a5a	Update similarity.asciidoc	2015-11-19 16:41:29 -08:00
Yannick Welsch	2084df825f	Simplify delayed shard allocation - moves calculation of the delay to a single place (ReplicaShardAllocator) - reduces coupling between GatewayAllocator and RoutingService - in master failover situations, elapsed delay time is forgotten Closes #14808	2015-11-19 09:53:07 +01:00
Lee Hinman	145374b762	Add cluster-wide setting for total shard limit This adds the `cluster.routing.allocation.total_shards_per_node` setting, which limits the total number of shards across all indices on each node. It defaults to -1 and can be dynamically configured. Resolves #14456	2015-11-09 11:03:07 -07:00
Jason O'Donnell	c7060c1b63	Fixing typo	2015-10-26 16:48:20 -04:00
Jason O'Donnell	73f620907d	Fixing typo	2015-10-26 16:43:25 -04:00
Simon Willnauer	75e816400c	Remove TranslogService and fold it into synchronous IndexShard API This commit moves the size and ops based flush into a synchronous API into IndexShard and removes the time-based flush alltogether since it' basically covered by the inactive async flush API we have today. The functionality doesn't need to be covered by scheduled task and async APIs while we can actually make all the decisions in a sync manner which is way easier to control and to test. Closes #13707	2015-09-23 12:39:06 +02:00
David Pilato	35049a05c3	Allocation: add support for filtering by transport IP address Allocation filtering by IP only works today using the node host address. But in some cases, you might want to filter using the publish address which could be different.	2015-09-09 15:15:53 +02:00
Michael McCandless	1c85b68674	Don't document expert segment merge settings	2015-08-29 17:21:46 -04:00
Nik Everett	79d9f5b775	Logging: Log less source in slowlog Instead of logging the entire `_source` in the indexing slowlog we log by default just the first 1000 characters - this is controlled by the `index.indexing.slowlog.source` settings and can be set to `true` to log the whole `_source`, `false` to log none of it, and a number to log at most that many characters. Closes #4485	2015-08-11 13:16:04 -07:00
Clinton Gormley	c22e179e87	Docs: Documented cancelation of shard recovery Relates to #12421	2015-08-07 19:44:34 +02:00
Clinton Gormley	ac2b8951c6	Docs: Mapping docs completely rewritten for 2.0	2015-08-06 17:24:51 +02:00
Clinton Gormley	c56ce0e242	Docs: Refactored the mapping meta-fields docs	2015-07-20 01:26:27 +02:00
Clinton Gormley	dbc0b45896	Docs: Documented index prioritization	2015-07-15 18:05:42 +02:00
Clinton Gormley	2b512f1f29	Docs: Use "js" instead of "json" and "sh" instead of "shell" for source highlighting	2015-07-14 18:14:09 +02:00
Shay Banon	e598f16b58	Default delayed allocation timeout to 1m from 0 Change the default delayed allocation timeout from 0 (no delayed allocation) to 1m. The value came from a test of having a node with 50 shards being indexed into (so beefy translog requiring flush on shutdown), then shutting it down and starting it back up and waiting for it to join the cluster. This took, on a slow machine, about 30s. The value is conservatively low and does not try to address a virtual machine / OS restart for now, in order to not have the affect of node going away and users being concerned that shards are not being allocated to the rest of the cluster as a result of that. The setting can always be changed in order to increase the delayed allocation if needed. closes #12166	2015-07-14 11:31:16 +02:00
Clinton Gormley	aaf1d14b21	Docs: Fixed bad links	2015-07-07 16:08:10 +02:00
Clinton Gormley	93fe8f8910	Docs: Updated the translog docs to reflect the new behaviour/settings in master Closes #11287	2015-06-30 19:08:31 +02:00
Clinton Gormley	84acb65ca1	Docs: Documented delayed allocation settings Relates to: #11712	2015-06-30 13:53:04 +02:00
Clinton Gormley	f123a53d72	Docs: Refactored modules and index modules sections	2015-06-22 23:49:45 +02:00
Adrien Grand	14c9c239bc	Remove non-default fielddata formats. Now that doc values are the default for fielddata, specialized in-memory formats are becoming an esoteric option. This commit removes such formats: - `fst` on string fields, - `compressed` on geo points. I also removed documentation and tests that the fielddata cache is shared if you change the format, since this is only true for in-memory fielddata formats (given that for doc values, the caching is done directly in Lucene).	2015-06-15 14:05:23 +02:00
Clinton Gormley	0216dfd3b6	Docs: Removed left over table header from merge.asciidoc	2015-06-11 13:26:34 +02:00
Simon Willnauer	f77804dad3	Bake in TieredMergePolicy Today we provide the ability to plug in MergePolicy and we provide the once lucene ships with. We do not recommend to change the default and even only a small number of expert users would ever touch this. This commit removes the ancient log byte size and log doc count merge policy providers, simplifies the MergePolicy wiring and makes the tiered MP the one and only default. All notions of a merge policy has been removed from the docs and should be deprecated in the previous version. Closes #11588	2015-06-11 11:58:30 +02:00
Simon Willnauer	657d6dd9cf	Remove MergeScheduler pluggability Nobody should really plug in a different merge scheduler for elasticsearch. This is too expert and might cause catastrophic failures.	2015-06-10 20:28:30 +02:00
Clinton Gormley	60c7e0eb91	Update merge.asciidoc Corrected typo in merge docs	2015-06-08 16:45:59 +02:00
Martijn van Groningen	359d9ac0d0	docs: added missing ids	2015-05-29 22:45:01 +02:00
Clinton Gormley	603a0c193b	Docs: More translog doc improvements	2015-05-05 22:01:58 +02:00
Clinton Gormley	a60251068c	Docs: Improved the translog docs	2015-05-05 21:32:52 +02:00
Simon Willnauer	fe5a35b68e	Merge branch 'master' into pr-10624 Conflicts: src/main/java/org/elasticsearch/index/shard/IndexShard.java	2015-05-05 11:46:02 +02:00
Pascal Borreli	af6d890ad5	Docs: Fixed typos Closes #10973	2015-05-05 10:38:05 +02:00
Simon Willnauer	7e5f9d5628	Merge branch 'master' into pr-10624 Conflicts: src/main/java/org/elasticsearch/index/engine/EngineConfig.java src/main/java/org/elasticsearch/index/shard/IndexShard.java src/test/java/org/elasticsearch/index/engine/InternalEngineTests.java src/test/java/org/elasticsearch/index/engine/ShadowEngineTests.java	2015-05-04 11:37:54 +02:00
Ryan Ernst	4ef9f3ca63	Mappings: Remove file based default mappings Using files that must be specified on each node is an anti-pattern from the API based goal of ES. This change removes the ability to specify the default mapping with a file on each node. closes #10620	2015-04-30 13:50:35 -07:00
Boaz Leskes	d596f5cc45	Decouple recoveries from engine flush In order to safely complete recoveries / relocations we have to keep all operation done since the recovery start at available for replay. At the moment we do so by preventing the engine from flushing and thus making sure that the operations are kept in the translog. A side effect of this is that the translog keeps on growing until the recovery is done. This is not a problem as we do need these operations but if the another recovery starts concurrently it may have an unneededly long translog to replay. Also, if we shutdown the engine for some reason at this point (like when a node is restarted) we have to recover a long translog when we come back. To void this, the translog is changed to be based on multiple files instead of a single one. This allows recoveries to keep hold to the files they need while allowing the engine to flush and do a lucene commit (which will create a new translog files bellow the hood). Change highlights: - Refactor Translog file management to allow for multiple files. - Translog maintains a list of referenced files, both by outstanding recoveries and files containing operations not yet committed to Lucene. - A new Translog.View concept is introduced, allowing recoveries to get a reference to all currently uncommitted translog files plus all future translog files created until the view is closed. They can use this view to iterate over operations. - Recovery phase3 is removed. That phase was replaying operations while preventing new writes to the engine. This is unneeded as standard indexing also send all operations from the start of the recovery to the recovering shard. Replay all ops in the view acquired in recovery start is enough to guarantee no operation is lost. - IndexShard now creates the translog together with the engine. The translog is closed by the engine on close. ShadowIndexShards do not open the translog. - Moved the ownership of translog fsyncing to the translog it self, changing the responsible setting to `index.translog.sync_interval` (was `index.gateway.local.sync`) Closes #10624	2015-04-30 23:42:50 +03:00
Clinton Gormley	37ed61807f	Docs: Updated the experimental annotations in the docs as follows: * Removed the docs for `index.compound_format` and `index.compound_on_flush` - these are expert settings which should probably be removed (see https://github.com/elastic/elasticsearch/issues/10778) * Removed the docs for `index.index_concurrency` - another expert setting * Labelled the segments verbose output as experimental * Marked the `compression`, `precision_threshold` and `rehash` options as experimental in the cardinality and percentile aggs * Improved the experimental text on `significant_terms`, `execution_hint` in the terms agg, and `terminate_after` param on count and search * Removed the experimental flag on the `geobounds` agg * Marked the settings in the `merge` and `store` modules as experimental, rather than the modules themselves Closes #10782	2015-04-26 18:49:15 +02:00
Lee Hinman	a4f98e7400	[DOCS] Add example of setting disk threshold decider settings Fixes #10686	2015-04-22 11:53:19 -06:00
Adrien Grand	a608db122d	Search: Remove the `count` search type. This commit brings the benefits of the `count` search type to search requests that have a `size` of 0: - a single round-trip to shards (no fetch phase) - ability to use the query cache Since `count` now provides no benefits over `query_then_fetch`, it has been deprecated. Close #7630	2015-03-31 11:31:49 +02:00
Joshua Rich	db2caa54cd	Small grammar fix.	2015-03-17 11:27:13 -07:00
Adrien Grand	95f46f1212	Docs: Use the new experimental annotation. We now have a very useful annotation to mark features or parameters as experimental. Let's use it! This commit replaces some custom text warnings with this annotation and adds this annotation to some existing features/parameters: - inner_hits (unreleased yet) - terminate_after (released in 1.4) - per-bucket doc count errors in the terms agg (released in 1.4) I also tagged with this annotation settings which should either be not needed (like the ability to evict entries from the filter cache based on time) or that are too deep into the way that Elasticsearch works like the Directory implementation or merge settings. Close #9563	2015-02-05 15:29:45 +01:00
Masaru Hasegawa	b4f7d26723	Fielddata: Change threshold value of fielddata.filter.frequency.max/min Make it consider 1.0 as 100% instead of aboslute count 1. Closes: #9327	2015-02-05 13:27:42 +09:00
Michael McCandless	3c0d2081cf	Core: change default xlog size from 200 MB to 512 MB Closes #9341	2015-01-19 15:52:29 -05:00
Michael McCandless	def2d34f80	don't mention fixed throttling in the docs	2015-01-14 10:13:10 -05:00
Michael McCandless	107099affa	put back fixed throttling, but off by default	2015-01-14 05:35:09 -05:00
Michael McCandless	1aad275c55	expose current CMS throttle in merge stats; fix tests, docs; also log per-merge stop/throttle/rate	2015-01-11 05:52:43 -05:00
Michael McCandless	31e6acf3f2	first cut	2015-01-10 16:38:56 -05:00
Itamar Syn-Hershko	cb042cd662	Fixing typo Closes #8713	2014-12-01 10:52:00 +01:00
Simon Willnauer	0fcb466555	[STORE] Remove `memory`/ `ram` store The RAM store is discuraged for production usage anyway and we don't test it in our randomized infrastructure. This commit removes it for `2.0`	2014-11-20 14:47:19 +01:00
Israel Tsadok	7590629531	Docs: note about confusing disk threshold settings	2014-11-12 09:24:03 +01:00
Henrik Nordvik	fdbb62b1ab	Docs: Fix curl statements in query-cache.asciidoc Closes #7989	2014-10-15 13:16:20 +02:00
Suyog Rao	82b16ae0ad	Doc: Clarify that index.routing.allocation.total_shards_per_node means both primary and replica shards Closes #8002	2014-10-13 18:08:23 -07:00
Adrien Grand	491a48e55b	Docs: Remove the note that fielddata doesn't support filtering. This particular note was about fielddata filtering but could cause confusion that fields that have doc values enabled cannot be used for filtering (as in a `filtered_query`).	2014-10-10 10:50:47 +02:00
Clinton Gormley	cb00d4a542	Docs: Removed all the added/deprecated tags from 1.x	2014-09-26 21:04:42 +02:00
Lee Hinman	4185566e93	Add option to take currently relocating shards' sizes into account When using the DiskThresholdDecider, it's possible that shards could already be marked as relocating to the node being evaluated. This commit adds a new setting `cluster.routing.allocation.disk.include_relocations` which adds the size of the shards currently being relocated to this node to the node's used disk space. This new option defaults to `true`, however it's possible to over-estimate the usage for a node if the relocation is already partially complete, for instance: A node with a 10gb shard that's 45% of the way through a relocation would add 10gb + (.45 * 10) = 14.5gb to the node's disk usage before examining the watermarks to see if a new shard can be allocated. Fixes #7753 Relates to #6168	2014-09-19 12:36:51 +02:00
Nik Everett	2bc58d5f77	Docs: Fix misnamed setting The settings is `index.merge.policy.reclaim_deletes_weight` not `index.reclaim_deletes_weight`. Closes #7676	2014-09-11 10:41:23 +02:00
Tanvir Alam	c7d0c3ea18	Docs: fixed typo Closes #7544	2014-09-08 10:50:59 +02:00
Alexander Reelsen	f2aa4a38bc	Docs: Added link to clarify meaning of filtering in fielddata context	2014-08-26 12:00:06 +02:00
Adrien Grand	ea96359d82	Facets: Removal from master. Close #7337	2014-08-21 10:34:39 +02:00
Adrien Grand	a242a63817	[DOCS] Remove the section about codecs. This documentation was dangerous because it felt like it was possible to gain substantial performance by just switching the codec of the index. However, non-default codecs are dangerous to use since they are not supported in terms of backward compatibility, and most improvements that they bring have been folded into the default codec anyway (for example, the default codec "pulses" postings lists that contain a single document).	2014-08-07 11:24:44 +02:00
Clinton Gormley	e7f1aa4f4f	Documented the query cache module Related to #7161 and #7167	2014-08-06 11:55:11 +02:00
Clinton Gormley	4b0a89d4fb	Update translog.asciidoc Documented `index.gateway.local.sync`	2014-07-31 14:06:24 +02:00
Lee Hinman	6abe4c951d	Add HierarchyCircuitBreakerService Adds a breaker for request BigArrays, which are used for parent/child queries as well as some aggregations. Certain operations like Netty HTTP responses and transport responses increment the breaker, but will not trip. This also changes the output of the nodes' stats endpoint to show the parent breaker as well as the fielddata and request breakers. There are a number of new settings for breakers now: `indices.breaker.total.limit`: starting limit for all memory-use breaker, defaults to 70% `indices.breaker.fielddata.limit`: starting limit for fielddata breaker, defaults to 60% `indices.breaker.fielddata.overhead`: overhead for fielddata breaker estimations, defaults to 1.03 (the fielddata breaker settings also use the backwards-compatible setting `indices.fielddata.breaker.limit` and `indices.fielddata.breaker.overhead`) `indices.breaker.request.limit`: starting limit for request breaker, defaults to 40% `indices.breaker.request.overhead`: request breaker estimation overhead, defaults to 1.0 The breaker service infrastructure is now generic and opens the path to adding additional circuit breakers in the future. Fixes #6129 Conflicts: src/main/java/org/elasticsearch/index/fielddata/IndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/IndexFieldDataService.java src/main/java/org/elasticsearch/index/fielddata/RamAccountingTermsEnum.java src/main/java/org/elasticsearch/index/fielddata/ordinals/GlobalOrdinalsBuilder.java src/main/java/org/elasticsearch/index/fielddata/ordinals/InternalGlobalOrdinalsBuilder.java src/main/java/org/elasticsearch/index/fielddata/plain/AbstractIndexOrdinalsFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/DisabledIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/IndexIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/NonEstimatingEstimator.java src/main/java/org/elasticsearch/index/fielddata/plain/PackedArrayIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/ParentChildIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/SortedSetDVOrdinalsIndexFieldData.java src/main/java/org/elasticsearch/node/internal/InternalNode.java src/test/java/org/elasticsearch/index/aliases/IndexAliasesServiceTests.java src/test/java/org/elasticsearch/index/codec/CodecTests.java src/test/java/org/elasticsearch/index/fielddata/AbstractFieldDataTests.java src/test/java/org/elasticsearch/index/fielddata/IndexFieldDataServiceTests.java src/test/java/org/elasticsearch/index/mapper/MapperTestUtils.java src/test/java/org/elasticsearch/index/query/IndexQueryParserFilterCachingTests.java src/test/java/org/elasticsearch/index/query/SimpleIndexQueryParserTests.java src/test/java/org/elasticsearch/index/query/guice/IndexQueryParserModuleTests.java src/test/java/org/elasticsearch/index/search/FieldDataTermsFilterTests.java src/test/java/org/elasticsearch/index/search/child/ChildrenConstantScoreQueryTests.java src/test/java/org/elasticsearch/index/similarity/SimilarityTests.java	2014-07-28 11:27:33 +02:00
mikemccand	96ecec34d1	Docs: fix documentation for bloom filter defaults	2014-07-27 18:39:29 -04:00
Simon Willnauer	5bfea56457	[DOCS] move all coming tags to added in master	2014-07-23 16:37:19 +02:00
Peter Johnson @insertcoffee	9a4abc2620	Docs: typo example fails in bash Closes #6977	2014-07-23 12:43:43 +02:00
mikemccand	cc4d7c6272	Core: don't load bloom filters by default This change just changes the default for index.codec.bloom.load to false: with recent performance improvements to ID lookup, such as #6298, bloom filters don't give much of a performance gain anymore, and they can consume non-trivial RAM when there are many tiny documents. For now, we still index the bloom filters, so if a given app wants them back, it can just update the index.codec.bloom.load to true. Closes #6959	2014-07-23 05:58:41 -04:00
mikemccand	63cab559e3	Docs: explain that SerialMergeScheduler just maps to CMS for back compat Closes #6878	2014-07-15 11:38:43 -04:00

1 2 3 4 5

211 Commits