OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-05-23 09:05:40 +00:00

Author	SHA1	Message	Date
Christoph Büscher	95a6951f78	Use new bulk API endpoint in the docs (#37698 ) This change switches to using the typeless bulk API endpoint in the documentation snippets where possible	2019-01-23 09:46:28 +01:00
Jason Tedor	18a3e48a4a	Change file descriptor limit to 65535 (#37537 ) Some systems default to a nofile ulimit of 65535. To reduce the pain of deploying Elasticsearch to such systems, this commit lowers the required limit from 65536 to 65535.	2019-01-16 17:19:12 -05:00
Boaz Leskes	e356b8cb95	Add doc's sequence number + primary term to GetResult and use it for updates (#36680 ) This commit adds the last sequence number and primary term of the last operation that have modified a document to `GetResult` and uses it to power the Update API. Relates #36148 Relates #10708	2018-12-17 15:22:13 +01:00
Lisa Cawley	fb697c7442	[DOCS] Add missing anchors (#36288 )	2018-12-06 10:14:37 -08:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Wilder Pereira	5af6ae564d	Change "REST Verb" to "HTTP Verb" (#34195 )	2018-10-02 17:09:54 -04:00
Lisa Cawley	949e4e9d1a	[DOCS] Synchronizes captialization in top-level titles (#33605 )	2018-09-27 08:36:18 -07:00
debadair	1c579646d3	[DOCS] Removed Homebrew installation section (#34011 )	2018-09-24 10:29:12 -07:00
Nik Everett	26c4f1fb6c	Core: Default node.name to the hostname (#33677 ) Changes the default of the `node.name` setting to the hostname of the machine on which Elasticsearch is running. Previously it was the first 8 characters of the node id. This had the advantage of producing a unique name even when the node name isn't configured but the disadvantage of being unrecognizable and not being available until fairly late in the startup process. Of particular interest is that it isn't available until after logging is configured. This forces us to use a volatile read whenever we add the node name to the log. Using the hostname is available immediately on startup and is generally recognizable but has the disadvantage of not being unique when run on machines that don't set their hostname or when multiple elasticsearch processes are run on the same host. I believe that, taken together, it is better to default to the hostname. 1. Running multiple copies of Elasticsearch on the same node is a fairly advanced feature. We do it all the as part of the elasticsearch build for testing but we make sure to set the node name then. 2. That the node.name defaults to some flavor of "localhost" on an unconfigured box feels like it isn't going to come up too much in production. I expect most production deployments to at least set the hostname. As a bonus, production deployments need no longer set the node name in most cases. At least in my experience most folks set it to the hostname anyway.	2018-09-19 15:21:29 -04:00
Luca Cavanna	393eec1482	Set maxScore for empty TopDocs to Nan rather than 0 (#32938 ) We used to set `maxScore` to `0` within `TopDocs` in situations where there is really no score as the size was set to `0` and scores were not even tracked. In such scenarios, `Float.Nan` is more appropriate, which gets converted to `max_score: null` on the REST layer. That's also more consistent with lucene which set `maxScore` to `Float.Nan` when merging empty `TopDocs` (see `TopDocs#merge`).	2018-08-22 17:23:54 +02:00
Tim Ryan	dce72c7985	Fix some small issues in the getting started docs (#30346 ) * Modified a reference to real time to match the previous line reference of realtime. * Modified eg to e.g. as it's an abbreviation for the latin exempli gratia * Added missing pronoun to `_executing_filters` section.	2018-08-20 07:54:03 +01:00
markwalkom	ae38cfbaec	[DOCS] Update getting-started.asciidoc (#29518 ) Highlighted that you can change shard counts using `_shrink` and `_split`.	2018-08-17 11:09:22 +02:00
ahmedakef	5b65e2ccbf	DOCS: put LIMIT 10 to the SQL query (#32065 ) Provides a more precise equivalent SQL query for the aggregation example in the getting started guide.	2018-07-16 16:09:56 -04:00
debadair	ff8260c234	Tweaked Elasticsearch Service links for SEO	2018-07-16 12:58:08 -07:00
Clinton Gormley	bc1284eb28	Docs: Restyled cloud link in getting started	2018-07-13 15:48:14 +02:00
Clinton Gormley	aedbfc63cd	Docs: Added note about cloud service to installation and getting started	2018-07-11 20:17:18 +02:00
Jonathan Pool	297e99c4c2	[Docs] Extend Homebrew installation instructions (#28902 ) Adding a note about proceeding after a successful homebrew installation.	2018-06-20 14:20:51 +02:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Ragnar Nevries	5adfe2d8e3	[Docs] Update getting-started.asciidoc (#29294 ) Description after first use of search API with body does not refer to verb POST.	2018-04-03 14:57:42 +02:00
Chun On Lee	4ec179b2ba	Update "_doc" to "account" type for bulk example (#28786 ) * Change 'account' to '_doc' as types are deprecated	2018-03-13 15:51:59 -07:00
Alex Moros Marco	261fb6a29e	[Docs] Fix explanation for `from` and `size` example (#28320 )	2018-01-25 11:40:39 +01:00
Boudewijn	3a96518b58	Update getting-started.asciidoc (#28145 ) Replaced single quotation marks with double quotation marks surrounding localhost address	2018-01-12 10:21:16 -05:00
Adrien Grand	1b660821a2	Allow `_doc` as a type. (#27816 ) Allowing `_doc` as a type will enable users to make the transition to 7.0 smoother since the index APIs will be `PUT index/_doc/id` and `POST index/_doc`. This also moves most of the documentation to `_doc` as a type name. Closes #27750 Closes #27751	2017-12-14 17:47:53 +01:00
K. Daniel Newton	365dda8748	Correct usage of "an" to "a" in getting started docs This commit corrects a word usage error in the getting started docs. Since pronunciation is what determines when to use either "a" or "an" and the word "ubiquitous" is pronounced /yo͞oˈbikwədəs/, it should be preceded by "a." Relates #27420	2017-11-18 07:36:43 -05:00
Alexander Reelsen	08037eebff	Tests: Improve size regex in documentation test (#26879 ) The regex has been changed to not only be able to deal with something like `260b`, but also support `1.1kb`.	2017-11-13 10:21:53 +01:00
Boris Tyukin	8e9b30417c	Update to support bulk updates by query (#27172 ) Getting started doc stated that bulk updates by query are not supported but they are now	2017-11-06 17:32:20 +01:00
Christoph Büscher	8e62314ce4	[Docs] Remove first person "I" from getting started (#27155 ) Avoid first person style and consistently switch to an unpersonal style in the getting started docs.	2017-10-30 10:45:50 +01:00
İsmail Arılık	71f5e2ce6b	Fix a typo. (#27043 ) `=== Instalation with Homebrew` should be `=== Installation with Homebrew`.	2017-10-18 09:46:53 -04:00
Divyum Rastogi	984731f36b	[DOCS] better formatting of ES cluster status (#26838 ) * better formatting of ES cluster status * change phrase missing data	2017-10-18 01:40:21 -06:00
vurple	b3e9aa89dc	Add Homebrew instructions to getting started This commit adds instructions for installing Elasticsearch via Homebrew to the Getting Started guide. Relates #26847	2017-10-10 06:21:33 -04:00
Nik Everett	6d2c40e546	Enforce that responses in docs are valid json (#26249 ) All of the snippets in our docs marked with `// TESTRESPONSE` are checked against the response from Elasticsearch but, due to the way they are implemented they are actually parsed as YAML instead of JSON. Luckilly, all valid JSON is valid YAML! Unfurtunately that means that invalid JSON has snuck into the exmples! This adds a step during the build to parse them as JSON and fail the build if they don't parse. But no! It isn't quite that simple. The displayed text of some of these responses looks like: ``` { ... "aggregations": { "range": { "buckets": [ { "to": 1.4436576E12, "to_as_string": "10-2015", "doc_count": 7, "key": "-10-2015" }, { "from": 1.4436576E12, "from_as_string": "10-2015", "doc_count": 0, "key": "10-2015-" } ] } } } ``` Note the `...` which isn't valid json but we like it anyway and want it in the output. We use substitution rules to convert the `...` into the response we expect. That yields a response that looks like: ``` { "took": $body.took,"timed_out": false,"_shards": $body._shards,"hits": $body.hits, "aggregations": { "range": { "buckets": [ { "to": 1.4436576E12, "to_as_string": "10-2015", "doc_count": 7, "key": "-10-2015" }, { "from": 1.4436576E12, "from_as_string": "10-2015", "doc_count": 0, "key": "10-2015-" } ] } } } ``` That is what the tests consume but it isn't valid JSON! Oh no! We don't want to go update all the substitution rules because that'd be huge and, ultimately, wouldn't buy much. So we quote the `$body.took` bits before parsing the JSON. Note the responses that we use for the `_cat` APIs are all converted into regexes and there is no expectation that they are valid JSON. Closes #26233	2017-08-17 09:02:10 -04:00
Deb Adair	23c810b334	[DOCS] Changes xrefs to cross doc links to enable building GS "mini-docs"	2017-07-18 13:52:38 -07:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is not document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and it's upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though the most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards are higher than a the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
olcbean	2ba9fd2aec	Remove deprecated created and found from index, delete and bulk (#25516 ) The created and found fields in index and delete responses became obsolete after the introduction of the result field in index, update and delete responses (#19566). After deprecating the created and found fields in 5.x (#19633), now they are removed. Fixes #19630	2017-07-07 13:58:46 -04:00
Clinton Gormley	0170e0e8d3	Remove usage of multi-types from the docs and added a page explaining type removal (#25543 ) Closes #25401	2017-07-05 12:30:19 +02:00
Lisa Cawley	03f952a838	[DOCS] Update docs to use shared attribute file (#25403 ) * [DOCS] Update docs to use shared attribute file * [DOCS] Add shared attributes to Versions.asciidoc	2017-06-27 08:33:28 -07:00
Russ Cam	9eca380a76	Add MSI installation to documentation (#25213 ) * Add MSI installation to documentation Move installation documentation for Windows with the .zip archive into the zip and tar installation documentation, and clearly indicate any differences for installing on macOS/Linux and Windows. * Separate out installation with .zip on Windows	2017-06-19 14:52:47 +10:00
Jason Tedor	4796557a30	Add primary term to doc write response This commit adds the primary term to the doc write response. Relates #24171	2017-04-19 14:44:22 -04:00
Nik Everett	efc62f193f	Docs: fix health response test I managed to test the wrong snippet before pushing the last commit.... This fixes the error in it.	2017-03-28 08:29:10 -04:00
Nik Everett	d505c20373	Docs: Clean up response test in getting_started It had drifted from a fix applied to 5.x. The 5.x fix was more readable. This applies the 5.x fix.	2017-03-28 08:29:10 -04:00
Jürgen van Dijk	7609d67c7d	Typo (#23344 )	2017-02-28 16:33:09 +01:00
Lee Hinman	6c9b89b882	[TEST] Fix incorrect test cluster name in cluster health doc tests	2017-02-22 17:18:11 -07:00
Christopher Best	eeaa0ccec2	Update getting-started.asciidoc (#23296 )	2017-02-22 11:06:27 +01:00
Nik Everett	a6cf170384	Add content-type header to curl in getting-started Relates to #23001	2017-02-15 18:14:25 -05:00
sungjunyoung	ec686b52c1	Update getting-started.asciidoc (#23180 ) The "sort" and "_score" fields are contained within the "hits" field.	2017-02-15 10:55:09 +01:00
Clinton Gormley	e181a020a9	Replaced absolute URLs in docs with attributes	2017-02-04 12:05:03 +01:00
Folusho Oladipo	daf1f184d4	Corrected a plural verb to a singular one. (#22681 )	2017-01-20 19:00:10 +01:00
Ali Beyad	099d229138	[TEST] fix documentation checking tests to account for possible pending tasks in the cluster state	2017-01-17 17:29:14 -05:00
Colin Goodheart-Smithe	aa0c8ce895	[DOCS] Fiexes typo in getting started doc	2017-01-16 09:32:07 +00:00
Colin Goodheart-Smithe	04f19bbaa4	[DOCS]cMake it clear that bulk API actions are processed sequentially on each shard rather than sequentially overall (#22550 )	2017-01-16 09:15:24 +00:00

1 2 3

113 Commits