OpenSearch

Commit Graph

Author	SHA1	Message	Date
Michael McCandless	b9358ccca8	Core: switch to auto IO throttle for merges This adds a new boolean (index.merge.scheduler.auto_throttle) dynamic setting, default true (matching Lucene), to adaptively set the IO rate limit for merges over time. This is more flexible than the previous fixed rate throttling because it responds depending on the incoming merge rate, so search-heavy applications that are not doing much indexing will see merges heavily throttled while indexing-heavy cases will lighten the throttle so merges can keep up within incoming indexing. The fixed rate throttling is still available as a fallback if things go horribly wrong. Closes #9243 Closes #9133	2015-01-16 13:00:08 -05:00
Clinton Gormley	c644c377ab	Update api-conventions.asciidoc Corrected explanation of fuzzy AUTO Related to #9278	2015-01-16 14:26:50 +01:00
Clinton Gormley	f5b91c374a	Update upgrade.asciidoc Upgrade request needs pretty and human for the demonstrated output. Closes #9313	2015-01-16 13:55:22 +01:00
David Haney	395960feef	Docs: Updated standard token filter docs to indicate true behavior: doing nothing Closes #9300	2015-01-15 21:33:29 +01:00
Michael McCandless	def2d34f80	don't mention fixed throttling in the docs	2015-01-14 10:13:10 -05:00
Michael McCandless	107099affa	put back fixed throttling, but off by default	2015-01-14 05:35:09 -05:00
Paul Echeverri	4f938ad37e	Updates the command to add the repo to not use add-apt-repository, which automatically adds a non-working deb-src line to sources.list. Command now uses echo to write the correct line to sources.list instead. Fixes #9261	2015-01-12 21:18:00 +00:00
Tomoya Hirano	15d46988dc	Fix typo in sample json Fixes #9253	2015-01-12 15:58:16 +00:00
David Pilato	052645903a	Rest: remove status code from main action Today we give the HTTP status back within the HTTP response itself and within the JSON response as well: ```sh curl localhost:9200/ ``` ```js { "status" : 200, "name" : "Red Wolf", "version" : { "number" : "2.0.0", "build_hash" : "6837a61d8a646a2ac7dc8da1ab3c4ab85d60882d", "build_timestamp" : "2014-08-19T13:55:56Z", "build_snapshot" : true, "lucene_version" : "4.9" }, "tagline" : "You Know, for Search" } ```	2015-01-12 12:37:46 +01:00
David Pilato	fc7a0d3a4a	[Docs] fix three to four	2015-01-12 12:13:23 +01:00
Michael McCandless	1aad275c55	expose current CMS throttle in merge stats; fix tests, docs; also log per-merge stop/throttle/rate	2015-01-11 05:52:43 -05:00
Michael McCandless	31e6acf3f2	first cut	2015-01-10 16:38:56 -05:00
Christoph Büscher	04cb09f44c	[TEST] Add missing docs and tests for '_cat/segments' The '_cat/segments' api was missing docs and a rest test which are added here. Closes #5856	2015-01-09 12:29:11 +01:00
Ryan Ernst	060f963a8e	Mappings: Remove allow_type_wrapper setting Before Elasticsearch 1.0, the type was allowed to be passed as the root element when uploading a document. However, this was ambiguous if the mappings also contained a field with the same name as the type. The behavior was changed in 1.0 to not allow this, but a setting was added for backwards compatibility. This change removes the setting for 2.0.	2015-01-08 09:13:40 -08:00
Martijn van Groningen	ca4f27f40e	Core: Added `_shards` header to all write responses. The header indicates to how many shard copies (primary and replicas shards) a write was supposed to go to, to how many shard copies to write succeeded and potentially captures shard failures if writing into a replica shard fails. For async writes it also includes the number of shards a write is still pending. Closes #7994	2015-01-08 18:10:08 +01:00
Martijn van Groningen	dedaf9387e	Core: Also check if indices resolved via aliases resolution aren't closed and deal with this according to IndicesOptions. Closes #9057	2015-01-08 16:45:34 +01:00
Martijn van Groningen	20f7be378b	Removed parent parameter from update request, because it is just sets the routing. The routing option should be used instead. The parent a child document points to can't be updated. Closes #4538	2015-01-07 10:26:20 +01:00
Ryan Ernst	f7f99b8dbf	Stats: Added verbose option to segments api, with full ram tree as first additional element per segment. This commit adds a verbose flag to the _segments api. Currently the only additional information returned when set to true is the full ram tree from lucene for each segment.	2015-01-06 10:04:52 -08:00
Adrien Grand	bc86796592	Core: Remove terms filter cache. This is our only cache which is not 'exact' and might allow for stalled results. Additionally, a similar cache that we have and needs to perform lookups in other indices in order to run queries is the script index, and for this index we rely on the filesystem cache, so we should probably do the same with terms filters lookups. Close #9056	2015-01-06 17:21:20 +01:00
Simon Willnauer	236e2491b4	[ALLOCATION] Remove primary balance factor The `cluster.routing.allocation.balance.primary` setting has caused a lot of confusion in the past while it has very little benefit form a shard allocatioon point of view. Users tend to modify this value to evently distribute primaries across the nodes which is dangerous since a prmiary flag on it's own can trigger relocations. The primary flag for a shard is should not have any impact on cluster performance unless the high level feature suffereing from primary hotspots is buggy. Yet, this setting was intended to be a tie-breaker which is not necessary anymore since the algorithm is deterministic. This commit removes this setting entriely.	2015-01-06 16:43:39 +01:00
Simon Willnauer	4900f52619	[ALLOCATION] Weight deltas must be absolute deltas In some situations the shard balanceing weight delta becomes negative. Yet, a negative delta is always treated as `well balanced` which is wrong. I wasn't able to reproduce the issue in any way other than useing the real world data from issue #9023. This commit adds a fix for absolute deltas as well as a base test class that allows to build tests or simulations from the cat API output. Closes #9023	2015-01-06 15:48:44 +01:00
Clinton Gormley	75cc7077c7	Update plugins.asciidoc Added entity resolution plugin for duplication detection Related to #9131	2015-01-05 12:53:37 +01:00
Mikhail Korobov	707025fb7a	[Docs] fix curl examples in Nodes Stats docs Closes #9118	2014-12-31 14:01:37 +01:00
Clinton Gormley	f83909f7ae	Docs: The regexp query defaults to the `ALL` flag, and removed the `AUTOMATON` flag which is not used in Elasticsearch. Closes #6180	2014-12-30 19:53:31 +01:00
Clinton Gormley	904f20a41b	Update setup.asciidoc Add a note about using the same JVM version on all nodes and clients	2014-12-30 17:40:51 +01:00
dtpeacock	582d5e8d3c	Doc has store "false" not store "true" Came from `3465e69e83` due to changing "yes" to "false". Closes #9075	2014-12-29 11:59:22 +01:00
Martijn van Groningen	d8054ec299	inner_hits: Added another more compact syntax for inner hits. Closes #8770	2014-12-24 17:41:35 +01:00
Ryan Ernst	39b3613420	Fix date histogram docs grammar.	2014-12-23 10:19:55 -08:00
Nicholas Knize	77a7ef28b3	[GEO] Add optional left/right parameter to GeoJSON This feature adds an optional orientation parameter to the GeoJSON document and geo_shape mapping enabling users to explicitly define how they want Elasticsearch to interpret vertex ordering. The default uses the right-hand rule (counterclockwise for outer ring, clockwise for inner ring) complying with OGC Simple Feature Access standards. The parameter can be explicitly specified for an entire index using the geo_shape mapping by adding "orientation":{"left"\|"right"\|"cw"\|"ccw"\|"clockwise"\|"counterclockwise"} and/or overridden on each insert by adding the same parameter to the GeoJSON document. closes #8764	2014-12-22 12:09:45 -06:00
Adrien Grand	fb6c3b7c29	[Docs] Improve documentation of the new caching policy for filters.	2014-12-22 17:14:47 +01:00
Adrien Grand	ce11e0ee6d	Filter cache: add a `_cache: auto` option and make it the default. Up to now, all filters could be cached using the `_cache` flag that could be set to `true` or `false` and the default was set depending on the type of the `filter`. For instance, `script` filters are not cached by default while `terms` are. For some filters, the default is more complicated and eg. date range filters are cached unless they use `now` in a non-rounded fashion. This commit adds a 3rd option called `auto`, which becomes the default for all filters. So for all filters a cache wrapper will be returned, and the decision will be made at caching time, per-segment. Here is the default logic: - if there is already a cache entry for this filter in the current segment, then return the cache entry. - else if the doc id set cannot iterate (eg. script filter) then do not cache. - else if the doc id set is already cacheable and it has been used twice or more in the last 1000 filters then cache it. - else if the filter is costly (eg. multi-term) and has been used twice or more in the last 1000 filters then cache it. - else if the doc id set is not cacheable and it has been used 5 times or more in the last 1000 filters, then load it into a cacheable set and cache it. - else return the uncached set. So for instance geo-distance filters and script filters are going to use this new default and are not going to be cached because of their iterators. Similarly, date range filters are going to use this default all the time, but it is very unlikely that those that use `now` in a not rounded fashion will get reused so in practice they won't be cached. `terms`, `range`, ... filters produce cacheable doc id sets with good iterators so they will be cached as soon as they have been used twice. Filters that don't produce cacheable doc id sets such as the `term` filter will need to be used 5 times before being cached. This ensures that we don't spend CPU iterating over all documents matching such filters unless we have good evidence of reuse. One last interesting point about this change is that it also applies to compound filters. So if you keep on repeating the same `bool` filter with the same underlying clauses, it will be cached on its own while up to now it used to never be cached by default. `_cache: true` has been changed to only cache on large segments, in order to not pollute the cache since small segments should not be the bottleneck anyway. However `_cache: false` still has the same semantics. Close #8449	2014-12-18 15:51:36 +01:00
Michael McCandless	242e631e95	Core: ignore known idle threads by default in /_nodes/hot_threads Add a new ignore_idle_threads boolean option (default true) to /_nodes/hot_threads, to filter out threads in known idle places like waiting on a socket select or on pulling the next task from an empty queue. Closes #8985 Closes #8908	2014-12-17 11:59:31 -05:00
Yasir Bamarni	5059d6fe1c	Update percolate.asciidoc wrong type used in the -GET request Closes #8942	2014-12-17 14:05:27 +01:00
Pablo Díaz-López	adb1a5b43b	Update getting-started.asciidoc Missing -X flag at the curl template Closes #8977	2014-12-17 14:03:38 +01:00
Peter Johnson a.k.a. insertcoffee	4b5e6b2de0	[docs] pedantry Closes #8982	2014-12-17 13:46:39 +01:00
Nicholas Knize	ac0e37449e	Adding unit test for self intersecting polygons. Relevant to #7751 even/odd discussion Updating documentation to describe polygon ambiguity and vertex ordering.	2014-12-16 10:54:39 -06:00
Ryan Ernst	37287284e6	Settings: Remove `mapping.date.round_ceil` setting for date math parsing The setting `mapping.date.round_ceil` (and the undocumented setting `index.mapping.date.parse_upper_inclusive`) affect how date ranges using `lte` are parsed. In #8556 the semantics of date rounding were solidified, eliminating the need to have different parsing functions whether the date is inclusive or exclusive. This change removes these legacy settings and improves the tests for the date math parser (now at 100% coverage!). It also removes the unnecessary function `DateMathParser.parseTimeZone` for which the existing `DateTimeZone.forID` handles all use cases. Any user previously using these settings can refer to the changed semantics and change their query accordingly. This is a breaking change because even dates without datemath previously used the different parsing functions depending on context. closes #8598 closes #8889	2014-12-15 13:13:45 -08:00
Timothy Perisho	ceafde41e9	Docs: typo on "frequent" I replaced "high frequent terms" with "high frequency terms" and "low frequent terms" with "low frequency terms". Alternatively, we could write, "highly frequent terms" and "minimally frequent terms" (or just "rare terms"). Closes #8962	2014-12-15 19:59:50 +01:00
Clinton Gormley	fcb83055de	Update repositories.asciidoc Update formatting of PGP key	2014-12-15 18:04:17 +01:00
Simon Willnauer	1247774ff1	Remove Gateway abstraction We only have a single gatweway since es 1.3. There is no need to keep all these abstractsion and nested packages. We can fold most of it into simpler structures.	2014-12-15 15:53:02 +01:00
spapin	ad747ba67f	Docs: fix a typo in cluster stats documentation example Closes #8898	2014-12-15 14:14:38 +01:00
Ayush	23dbecf3e7	Update percolate.asciidoc Updating the `associated` spelling Closes #8907	2014-12-15 14:12:03 +01:00
Alexander Reelsen	544ef8cb17	Packaging: Add java7/8 java-package paths to debian init script If you use the java-package tool to create java packages, those paths also should be added to the debian init script. Also updated the docs, that it is ok to install java8. Closes #7383	2014-12-11 16:15:00 +01:00
Peter Fabian Mitchell	b2bab05c29	HTTP: Add 'http.publish_port' setting to the HTTP module This change adds a 'http.publish_port' setting to the HTTP module to configure the port which HTTP clients should use when communicating with the node. This is useful when running on a bridged network interface or when running behind a proxy or firewall. Closes #8807 Closes #8137	2014-12-11 16:10:07 +01:00
Robert Muir	a2ffe494ae	[core] add best_compression option for Lucene 5.0 Upgrades lucene to latest, and supports the BEST_COMPRESSION parameter now supported (with backwards compatibility, etc) in Lucene. This option uses deflate, tuned for highly compressible data. index.codec:: The default value compresses stored data with LZ4 compression, but this can be set to best_compression for a higher compression ratio, at the expense of slower stored fields performance. IMO its safest to implement as a named codec here, because ES already has logic to handle this correctly, and because its unrealistic to have a plethora of options to Lucene's default codec... we are practically limited in Lucene to what we can support with back compat, so I don't think we should overengineer this and add additional unnecessary plumbing. See also: https://issues.apache.org/jira/browse/LUCENE-5914 https://issues.apache.org/jira/browse/LUCENE-6089 https://issues.apache.org/jira/browse/LUCENE-6090 https://issues.apache.org/jira/browse/LUCENE-6100 Closes #8863	2014-12-10 22:13:09 -05:00
Alexander Clausen	633905161a	Docs: use https to download the gpg public key Closes #8818	2014-12-10 18:14:07 +01:00
Adam Menges	3a3030e217	Docs: Fix the wording for inner hits a bit Closes #8747	2014-12-09 13:36:26 +01:00
Ashraf Sarhan	24f8807cb5	Docs: Update repositories.asciidoc 1. Enable the repository using "add-apt-repository" to avoid this error "No command 'deb' found". 2. Adding "sudo" to update and install command. Closes #8691	2014-12-09 13:23:16 +01:00
Kevin Kluge	63ac4614f4	docs: add pgp key to repositories page	2014-12-08 15:41:09 +01:00
Jun Ohtani	d78d2ff93d	Docs: add randomizedtesting-runner to testing-framework.asciidoc Close #8450	2014-12-07 01:30:58 +09:00

1 2 3 4 5 ...

1023 Commits