OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Jim Ferenczi	891d3bd9c3	Expose the Lucene Korean analyzer module in a plugin (#30397 ) This change adds a new plugin called `analysis-nori` that exposes Korean text analysis in es using the new Lucene Korean analyzer module named (`nori`). The plugin adds: * a Korean analyzer: `nori` * a Korean tokenizer: `nori_tokenizer` * a part of speech stop filter: `nori_part_of_speech` * a filter that can replace Hanja characters with their Hangul transcription: `nori_readingform`	2018-05-04 20:46:13 +02:00
Jason Tedor	c12c2a6cc9	Rename the bulk thread pool to write thread pool (#29593 ) This commit renames the bulk thread pool to the write thread pool. This is to better reflect the fact that the underlying thread pool is used to execute any document write request (single-document index/delete/update requests, and bulk requests). With this change, we add support for fallback settings thread_pool.bulk.* which will be supported until 7.0.0. We also add a system property so that the display name of the thread pool remains as "bulk" if needed to avoid breaking users.	2018-04-19 08:18:58 -04:00
Jason Tedor	2b47d67d95	Remove the index thread pool (#29556 ) Now that single-document indexing requests are executed on the bulk thread pool the index thread pool is no longer needed. This commit removes this thread pool from Elasticsearch.	2018-04-18 09:18:08 -04:00
Jason Tedor	faa7fe86c5	Introduce analyze thread pool (#29541 ) We want to remove the index thread pool as it is no longer needed since single-document indexing requests are executed as bulk requests now. Analyze requests are also executed on the index thread pool though and they need a thread pool to execute on. The bulk thread does not seem like the right thread pool, let us keep that thread pool conceptually for bulk requests and free for bulk requests. None of the existing thread pools make sense for analyze requests either. The generic thread pool would be a terrible choice since it has an unbounded queue and that is a bad idea for user-facing APIs. This commit introduces a small by default (size=1, queue_size=16) thread pool for analyze requests.	2018-04-17 06:46:15 -04:00
Jason Tedor	8fdca6a89a	Align cat thread pool info to thread pool config (#29195 ) Today we report thread pool info using a common object. This means that we use a shared set of terminology that is not consistent with the terminology used to the configure thread pools. This holds in particular for the minimum and maximum number of threads in the thread pool where we use the following terminology: thread pool info \| fixed \| scaling min core size max max size A previous change addressed this for the nodes info API. This commit changes the display of thread pool info in the cat thread pool API too to be dependent on the type of the thread pool so that we can align the terminology in the output of thread pool info with the terminology used to configure a thread pool.	2018-04-03 17:27:26 -04:00
Tanguy Leroux	be74f11517	Replace jvm-example by two plugin examples (#28339 ) This pull request replaces the jvm-example plugin (from the jvm/site plugins era) by two new plugins: a custom-settings that shows how to register and use custom settings (including secured settings) in a plugin, and rest-handler plugin that shows how to register a rest handler. The two plugins now reside in the plugins/examples project. They can serve as sample plugins for users, a special attention has been put on documentation. The packaging tests have been adapted to use the custom-settings plugin.	2018-01-26 17:34:24 +01:00
Christoph Büscher	0d11b9fe34	[Docs] Unify spelling of Elasticsearch (#27567 ) Removes occurences of "elasticsearch" or "ElasticSearch" in favour of "Elasticsearch" where appropriate.	2017-11-29 09:44:25 +01:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451 ) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routinng shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificually changing the number of buckets in the consistent hashign space document might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
Jason Tedor	8eba1fa17c	Add docs on full_id parameter in cat nodes API This commit adds a note to the docs on the full_id parameter in the cat nodes API. This is a useful parameter but was not previously documented anywhere. Relates #27009	2017-10-13 13:49:25 -04:00
Tanguy Leroux	c16c653c3e	[Test] Fix reference/cat/allocation/line_8 test failure In this test, 260b is replaced by the regexp \d+b but the test sometimes produces results like 1.1kb so this commit adapts the regexp to match values with decimals	2017-09-18 10:46:19 +02:00
Nik Everett	6d2c40e546	Enforce that responses in docs are valid json (#26249 ) All of the snippets in our docs marked with `// TESTRESPONSE` are checked against the response from Elasticsearch but, due to the way they are implemented they are actually parsed as YAML instead of JSON. Luckilly, all valid JSON is valid YAML! Unfurtunately that means that invalid JSON has snuck into the exmples! This adds a step during the build to parse them as JSON and fail the build if they don't parse. But no! It isn't quite that simple. The displayed text of some of these responses looks like: ``` { ... "aggregations": { "range": { "buckets": [ { "to": 1.4436576E12, "to_as_string": "10-2015", "doc_count": 7, "key": "-10-2015" }, { "from": 1.4436576E12, "from_as_string": "10-2015", "doc_count": 0, "key": "10-2015-" } ] } } } ``` Note the `...` which isn't valid json but we like it anyway and want it in the output. We use substitution rules to convert the `...` into the response we expect. That yields a response that looks like: ``` { "took": $body.took,"timed_out": false,"_shards": $body._shards,"hits": $body.hits, "aggregations": { "range": { "buckets": [ { "to": 1.4436576E12, "to_as_string": "10-2015", "doc_count": 7, "key": "-10-2015" }, { "from": 1.4436576E12, "from_as_string": "10-2015", "doc_count": 0, "key": "10-2015-" } ] } } } ``` That is what the tests consume but it isn't valid JSON! Oh no! We don't want to go update all the substitution rules because that'd be huge and, ultimately, wouldn't buy much. So we quote the `$body.took` bits before parsing the JSON. Note the responses that we use for the `_cat` APIs are all converted into regexes and there is no expectation that they are valid JSON. Closes #26233	2017-08-17 09:02:10 -04:00
Clinton Gormley	0170e0e8d3	Remove usage of multi-types from the docs and added a page explaining type removal (#25543 ) Closes #25401	2017-07-05 12:30:19 +02:00
Andreas Gebhardt	a156ccd80e	Expand `/_cat/nodes` to return information about hard drive (#21775 ) Expand `/_cat/nodes` with already present information about available disk space `diskAvail` (alias: `d`, `disk`) by: * `diskTotal` (alias `dt`): total disk space * `diskUsed` (alias `du`): used disk space (`diskTotal - diskAvail`) * `diskUsedPercent` (alias `dup`): used disk space percentage Note: The available disk space is the number of bytes available to the node's Java virtual machine. The size might be smaller than the real one. That means the used disk space (percentage) is larger. Closes #21679	2017-06-28 18:20:20 +02:00
Pandiyan Murugan	34c3d1d5bf	Fix typo in shards.asciidoc (#25143 )	2017-06-09 12:45:43 +02:00
Andrey Groshev	e4fd8485ce	Made the same length of opening and closing lines (#23583 )	2017-06-09 00:50:43 -07:00
Nik Everett	45dd3780e2	CONSOLEify remaining _cat docs Relates to #18160	2017-05-03 20:59:27 -04:00
Nik Everett	ae0290bae9	Doc test: use propery regex for file size The _cat/shards docs asserted that one of the columns looked like a propery byte size but used a regex like `\d+\.\d+.*` which doesn't match `0b` which is a possible value. Instead this uses `\d(\.\d+)?[kmg]?b`.	2017-05-01 15:49:00 -04:00
Nik Everett	94e3796908	Docs tests: cat/health can have max_task_wait_time Make the doc test assertions ok with a non `-` value for `max_task_wait_time`. These are rare, but possible: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=oraclelinux/900/consoleFull	2017-04-28 09:58:53 -04:00
Guillaume Le Floch	382a617d34	Handle multiple aliases in _cat/aliases api (#23698 ) The alias parameter was documented as a list in our rest-spec, yet only the first value out of a list was getting read and processed. This commit adds support for multiple aliases to _cat/aliases Closes #23661	2017-04-28 15:21:44 +02:00
Sakthipriyan Vairamani	dd3bbfb153	doc: highlight that doc counts come from lucene (#23522 ) The docs don't clearly explain that the deleted doc count also comes from lucene. IMHO, it is worth highlighting this information separately, as a Note. Apart from that, there should be an official recommended alternative as well.	2017-04-17 21:52:29 -04:00
Nik Everett	718e332c64	Docs: Be ok with long recovery times The _cat docs were asserting that an index took only some number of milliseconds to recovery. In this build it took a whole second: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.3+multijob-intake/192/consoleFull So this changes the assertion to be ok with a second.	2017-04-17 16:56:12 -04:00
Nik Everett	0b20a59391	Docs test: defend against round numbers If a shard has a nice, round number the test in the `_cat/shards` reference file would fail. They should be ok with it. A failure: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.3+multijob-unix-compatibility/os=fedora/93/console	2017-04-05 15:31:11 -04:00
Alexander Reelsen	6781c4320c	Documentation: Consoleify cat shards/recovery API docs (#23116 ) Relates #23001	2017-02-22 09:18:10 +01:00
Andreas Roussos	788c64848b	[DOCS] Fixed various typos in the 'cat APIs' section (#23216 )	2017-02-16 20:41:42 +01:00
Patryk Krawaczyński	42c0e8947f	Fix duplicates from search.query (#22701 ) search.query_current, search.query_time and search.query_total have wrong aliases.	2017-01-20 18:45:10 +01:00
Daniel Mitterdorfer	aece89d6a1	Make boolean conversion strict (#22200 ) This PR removes all leniency in the conversion of Strings to booleans: "true" is converted to the boolean value `true`, "false" is converted to the boolean value `false`. Everything else raises an error.	2017-01-19 07:59:18 +01:00
Nik Everett	c79371fd5b	Remove lang-python and lang-javascript (#20734 ) They were deprecated in 5.0. We are concentrating on making Painless awesome rather than supporting every language possible. Closes #20698	2016-11-21 22:13:25 -05:00
Adrien Grand	6db683a4bd	Fix recurring doc test failures with the cat API. (#21561 ) This failure is due to the fact that we sort on store size, which is cached. So it might happen that the store size that is taken into account is not the right one, which makes the indices sorted in the wrong order. This changes the doc example to sort on the number of docs instead. Closes #21062	2016-11-15 16:00:44 +01:00
Nik Everett	a4b3a95f5a	Move flush in _cat/indices docs tests (#21117 ) Moves the `_flush` in the `_cat/indices` snippets testing framework to the very first test. We need to flush super early because index size is cached for a few seconds so we really need to read a consistent size on the first read so we can sort by it properly. Closes #21062	2016-11-04 10:32:07 -04:00
Christoph Büscher	1f5adaa824	Docs: Adding Ukrainian analyzer	2016-10-31 18:20:39 +01:00
Nik Everett	44c3b04bef	Convert more docs to // CONSOLE Converts docs for `_cat/segments`, `_cat/plugins` and `_cat/repositories` from `curl` to `// CONSOLE` so they are tested as part of the build and are cleaner to use in Console. They should work fine with `curl` with the `COPY AS CURL` link. Also swaps the `source` type of the response from `js` to `txt` because that is more correct. The syntax highlighter doesn't care. It looks at the text to figure out the language. So it looks a little funny for `_cat` responses regardless. Relates to #18160	2016-10-25 11:17:24 -04:00
Nik Everett	4fbe1a8819	CONSOLEify _cat/pending_tasks docs Relates to #18160	2016-10-14 14:12:35 -04:00
Nik Everett	68ed183381	CONSOLEify a few more _cat docs `_cat/master`, `_cat/nodeattrs`, `_cat/nodes`.	2016-10-13 16:43:06 -04:00
Nik Everett	279baa0284	Add a flush to test in _cat/indices.asciidoc We test that sorting by `store.size` works but sometimes the sizes aren't what we expect. At least in CI: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=opensuse/101/console https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+multijob-unix-compatibility/os=centos/100/console I haven't been able to reproduce it locally but adding a `_flush` won't hurt and might make the inconsistency vanish.	2016-10-13 13:21:57 -04:00
Nik Everett	42a7a554b1	Don't mind pending cluster tasks in docs build This removes an assertion that the cluster doesn't have any pending cluster state tasks from the `_cat/health` docs. Relates to #18160	2016-10-11 17:44:50 -04:00
Nik Everett	298cf1cf21	CONSOLEify _cat/indices docs Relates to #18160 Uses the new sorting (#20658) in the `_cat` API to support all use cases natively. We can still resort to piping things through `sort` if we need to, but we don't have to for basic stuff like sorting!	2016-10-11 14:55:37 -04:00
Nik Everett	06049283a0	CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160	2016-10-07 16:30:45 -04:00
Nik Everett	d9781bd069	Fix broken regex in doc tests Regexes are hard.	2016-10-06 13:53:20 -04:00
Nik Everett	d7d5df8863	CONSOLEify some _cat docs Added `// NOTCONSOLE` to some `_cat` docs that rely on `sort` or are otherwise too difficult for us to test at this point. Relates to #20717	2016-10-06 13:37:21 -04:00
Alexander Lin	d31a8e6558	Provides a cat api endpoint for templates. (#20545 ) Adds a cat api endpoint: /_cat/templates and its more specific version, /_cat/templates/{name}. It looks something like: $ curl "localhost:9200/_cat/templates?v" name template order version sushi_california_roll avocado 1 1 pizza_hawaiian pineapples 1 pizza_pepperoni pepperoni 1 The specified version (only allows * globs) looks like: $ curl "localhost:9200/_cat/templates/pizza" name template order version pizza_hawaiian pineapples* 1 pizza_pepperoni pepperoni 1 Partially specified columns: $ curl "localhost:9200/_cat/templates/pizza?v=true&h=name,template" name template pizza_hawaiian pineapples* pizza_pepperoni pepperoni The help text: $ curl "localhost:9200/_cat/templates/pizza*?help" name \| n \| template name template \| t \| template pattern string order \| o \| template application order number version \| v \| version Closes #20467	2016-09-20 10:40:23 +02:00
Tanguy Leroux	656596c2a9	[DOC] Remove obsolete node names from documentation Funny node names have been removed in #19456 and replaced by UUID. This commit removes these obsolete node names and replace them by real UUIDs in the documentation. closes #20065	2016-09-19 11:56:28 +02:00
Tanguy Leroux	1894489832	[DOC] Update /_cat/nodes doc closes #20162	2016-09-19 09:31:48 +02:00
Clinton Gormley	960efe6202	Fixed typo in cat indices docs	2016-09-14 18:36:58 +01:00
Jason Tedor	c7bfbe3e69	Add health status parameter to cat indices API This commit adds a health status parameter to the cat indices API for filtering on indices that match the specified status (green\|yellow\|red). Relates #20393	2016-09-13 07:57:18 -04:00
Jason Tedor	533412e36f	Improve cat thread pool API Today, when listing thread pools via the cat thread pool API, thread pools are listed in a column-delimited format. This is unfriendly to command-line tools, and inconsistent with other cat APIs. Instead, thread pools should be listed in a row-delimited format. Additionally, the cat thread pool API is limited to a fixed list of thread pools that excludes certain built-in thread pools as well as all custom thread pools. These thread pools should be available via the cat thread pool API. This commit improves the cat thread pool API by listing all thread pools (built-in or custom), and by listing them in a row-delimited format. Finally, for each node, the output thread pools are sorted by thread pool name. Relates #19721	2016-08-03 23:02:13 -04:00
Tanguy Leroux	737db98bd7	/_cat/shards should support wilcards for indices closes #19634	2016-08-01 11:09:48 +02:00
David Pilato	8c6c00ff15	Update documentation for cat/plugins API Cat API for plugins doesn't display anymore url or jvm/site flag	2016-06-30 13:57:43 +02:00
Mike McCandless	5c525e6606	Remove index_writer_max_memory stat from segment stats	2016-05-31 06:29:29 -04:00
Lee Hinman	1c54033e92	Merge branch 'pr/18068'	2016-05-10 08:27:43 -06:00

1 2 3

110 Commits