OpenSearch

Commit Graph

Author	SHA1	Message	Date
Kartika Prasad	8ab0c1b4a0	Update indexing-speed.asciidoc (#59347 ) typo fix	2020-07-13 12:19:43 +01:00
Lisa Cawley	db5bf92acf	[7.x][DOCS] Replace docdir attribute with es-repo-dir (#57489 ) (#57494 )	2020-06-01 16:42:53 -07:00
Paweł Krześniak	e6dce13bda	DOCS: minor formatting (#56263 ) Removed extra back ticks. Please cherry-pick to other branches.	2020-05-06 13:41:47 -04:00
Vanessa Bell	9b005f70ef	[Docs] Grammar improvements in disk-usage.asciidoc (#55620 )	2020-04-30 19:31:56 +02:00
James Rodewig	6f9513915d	[DOCS] Add 'how to' doc about avoiding oversharding (#55480 ) Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>	2020-04-22 10:44:16 -04:00
James Rodewig	9569a8eb13	[DOCS] Add example to "avoid scripts" advice (#54719 ) Adds a detailed example to the "Avoid scripts" section of the "Tune for search speed" docs. The detail outlines how a script used to transform indexed data can be moved to ingest. The update also removes an outdated reference to supported script languages.	2020-04-07 15:25:10 -04:00
Adrien Grand	cb868d2f5e	Introduce a `constant_keyword` field. (#49713 ) (#53024 ) This field is a specialization of the `keyword` field for the case when all documents have the same value. It typically performs more efficiently than keywords at query time by figuring out whether all or none of the documents match at rewrite time, like `term` queries on `_index`. The name is up for discussion. I liked including `keyword` in it, so that we still have room for a `singleton_numeric` in the future. However I'm unsure whether to call it `singleton`, `constant` or something else, any opinions? For this field there is a choice between 1. accepting values in `_source` when they are equal to the value configured in mappings, but rejecting mapping updates 2. rejecting values in `_source` but then allowing updates to the value that is configured in the mapping This commit implements option 1, so that it is possible to reindex from/to an index that has the field mapped as a keyword with no changes to the source. Backport of #49713	2020-03-03 16:01:47 +01:00
Adrien Grand	9b0ddc1c03	Clarify the resiliency trade-off of disabling replicas to speed up indexing. (#52714 ) We should be more explicit about the downsides of disabling replicas and explain that users should be ready to re-do the entire load in case of issues mid-way.	2020-02-25 08:54:10 +01:00
Adrien Grand	5ce66b8b3c	Document how CCR may be used to speed up indexing. (#52717 ) One architecture that we have recommended to several users to speed up indexing involved using CCR to prevent searching from stealing resources from indexing.	2020-02-25 08:54:10 +01:00
Grzegorz Banasiak	87b126bbfc	[DOCS] Fix index_prefixes link in 'faster prefix queries' docs (#51833 ) Fixes a link in 'faster prefix queries' which incorrectly redirects to index_phrases mapping parameter description instead of index_prefixes.	2020-02-04 08:40:18 -05:00
James Rodewig	ef467cc6f5	[DOCS] Remove unneeded redirects (#50476 ) The docs/reference/redirects.asciidoc file stores a list of relocated or deleted pages for the Elasticsearch Reference documentation. This prunes several older redirects that are no longer needed and don't require work to fix broken links in other repositories.	2019-12-26 08:29:28 -05:00
James Rodewig	726c35dfd0	[DOCS] Add identifier mapping tip to numeric and keyword datatype docs (#49933 ) Users often mistakenly map numeric IDs to numeric datatypes. However, this is often slow for the `term` and other term-level queries. The "Tune for search speed" docs includes advice for mapping numeric IDs to `keyword` fields. However, this tip is not included in the `numeric` or `keyword` field datatype doc pages. This rewords the tip in the "Tune for search speed" docs, relocates it to the `numeric` field docs, and reuses it using tagged regions.	2019-12-17 09:34:32 -05:00
James Rodewig	0b4fb05540	[DOCS] Reformat refresh API docs (#46667 ) (#47589 )	2019-10-04 13:50:09 -04:00
James Rodewig	0c575dc1e8	[DOCS] Correct link to `index.store.preload` setting (#47145 )	2019-09-26 08:57:09 -04:00
James Rodewig	b59ecde041	[DOCS] [2 of 5] Change // CONSOLE comments to [source,console] (#46353 ) (#46502 )	2019-09-09 13:38:14 -04:00
James Rodewig	f04573f8e8	[DOCS] [5 of 5] Change // TESTRESPONSE comments to [source,console-results] (#46449 ) (#46459 )	2019-09-06 16:09:09 -04:00
David Turner	8516fb0f3b	Expand docs on force-merge and global ordinals (#44684 ) Some small clarifications about force-merging and global ordinals, particularly that global ordinals are cheap on a single-segment index and how this relates to frozen indices. Fixes #41687	2019-07-23 07:33:33 +01:00
James Rodewig	d46545f729	[DOCS] Update anchors and links for Elasticsearch API relocation (#44500 )	2019-07-19 09:18:23 -04:00
Tanguy Buchier	078efc9ec4	[DOCS] Clarify refresh_interval new behavior (#43726 ) Update indexing-speed.asciidoc to clarify refresh_interval new behavior	2019-07-16 14:53:46 +02:00
markharwood	b17fbe2933	Docs enhancement for quote_field_suffix. (#43093 ) * Docs enhancement for quote_field_suffix. Mentions the use of a fall-back field when specified field is missing. Closes #40778	2019-06-11 16:33:12 +01:00
swstepp	4181c5ccf5	Fix grammar problem in stemming reference. (#42148 )	2019-05-22 09:50:30 -07:00
James Rodewig	53702efddd	[DOCS] Add anchors for Asciidoctor migration (#41648 )	2019-04-30 10:20:17 -04:00
James Rodewig	08c5d3b912	[DOCS] Explicitly set section IDs for Asciidoctor migration (#41547 ) * [DOCS] Explicitly set section ID for faster phrase queries * [DOCS] Explicitly set section ID for faster prefix queries	2019-04-25 15:07:52 -04:00
Adrien Grand	965e311094	Update indexing speed recommendations around the refresh interval. (#40690 ) We now need to update recommendations now that we have introduced the concept of "search idle" shards.	2019-04-02 11:19:22 +02:00
Adrien Grand	466864710a	Update the how-to section of the docs for 7.0: (#37717 ) - new `rank_feature`/`script_score` queries - new `index_phrases`/`index_prefixes` options - disabling `_field_names` doesn't help anymore - adaptive replica selection is on by default	2019-03-12 08:24:39 +01:00
Christoph Büscher	34f2d2ec91	Remove remaining occurances of "include_type_name=true" in docs (#37646 )	2019-01-22 15:13:52 +01:00
Julie Tibshirani	36a3b84fc9	Update the default for include_type_name to false. (#37285 ) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same appraoch as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs.	2019-01-14 13:08:01 -08:00
lcawley	00997b4f60	[DOCS] Fixes broken links	2019-01-04 17:41:28 +10:00
Peter Dyson	7839cec301	subsequent fix to edit in recent cherry-pick	2019-01-04 17:34:24 +10:00
Peter Dyson	7cc9754d94	fix to edit in recent cherry-pick	2019-01-04 17:26:42 +10:00
Peter Dyson	0ff2707c9f	Add Profile API to search speed tuning howto (#29489 ) * Add Profile API to search speed tuning howto Seemed useful to mention the Profile API in the context of tuning for search speed.	2019-01-04 16:49:12 +10:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Julie Tibshirani	f854330e06	Make sure to use the type _doc in the REST documentation. (#34662 ) * Replace custom type names with _doc in REST examples. * Avoid using two mapping types in the percolator docs. * Rename doc -> _doc in the main repository README. * Also replace some custom type names in the HLRC docs.	2018-10-22 11:54:04 -07:00
Jan Jíša	822b067a3e	Docs: Corrected typo in how to (#33910 ) max_context_length -> max_content_length	2018-09-20 16:13:46 -04:00
Jim Ferenczi	7ad71f906a	Upgrade to a Lucene 8 snapshot (#33310 ) The main benefit of the upgrade for users is the search optimization for top scored documents when the total hit count is not needed. However this optimization is not activated in this change, there is another issue opened to discuss how it should be integrated smoothly. Some comments about the change: * Tests that can produce negative scores have been adapted but we need to forbid them completely: #33309 Closes #32899	2018-09-06 14:42:06 +02:00
Christoph Büscher	978d1ed257	[Docs] Improve tuning for speed advice (#33315 ) This change merges two sections in the "Tune for search speed" documentation that recommend mapping numeric identifiers as keywords. Both sections contain mostly the same advice, so they can be merged. Closes #32733	2018-09-03 11:09:30 +02:00
DeDe Morton	ecd05d5be4	Use correct formatting for links (#29460 )	2018-07-16 21:11:24 +02:00
Adrien Grand	21fe6159d4	Docs: remove notes on sparsity. (#30905 ) Sparsity is less of a concern since 6.0. Closes #30833	2018-06-05 08:58:52 +02:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224 ) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows to remove quite some code, which in-turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
Sue Gallagher	3530a676e0	[Docs]Corrected spelling errors. (#28976 )	2018-03-19 10:22:40 -07:00
Adrien Grand	89b4485511	Document how copy-to can help speed up queries by querying fewer fields. (#28373 )	2018-01-31 15:03:54 +01:00
Adrien Grand	1b660821a2	Allow `_doc` as a type. (#27816 ) Allowing `_doc` as a type will enable users to make the transition to 7.0 smoother since the index APIs will be `PUT index/_doc/id` and `POST index/_doc`. This also moves most of the documentation to `_doc` as a type name. Closes #27750 Closes #27751	2017-12-14 17:47:53 +01:00
Christoph Büscher	0d11b9fe34	[Docs] Unify spelling of Elasticsearch (#27567 ) Removes occurences of "elasticsearch" or "ElasticSearch" in favour of "Elasticsearch" where appropriate.	2017-11-29 09:44:25 +01:00
Adrien Grand	4e1ff8d086	Add documentation about disabling `_field_names`. (#26813 ) This field has significant index-time overhead. Closes #26779	2017-10-06 16:49:15 +02:00
Lee Hinman	cff904bf97	Enable adaptive replica selection by default (#26522 ) Relates to #24915	2017-09-07 09:25:05 -06:00
Lee Hinman	4157eead22	[DOCS] Add documentation for adaptive replica selection This adds a blurb for adaptive replica selection since it was previously undocumented. Relates to #24915	2017-09-01 09:53:22 -06:00
Jim Ferenczi	86d97971a4	Remove the _all metadata field (#26356 ) * Remove the _all metadata field This change removes the `_all` metadata field. This field is deprecated in 6 and cannot be activated for indices created in 6 so it can be safely removed in the next major version (e.g. 7).	2017-08-28 17:43:59 +02:00
Christoph Wurm	0120448f76	Expand How to tune for disk usage (#25562 )	2017-08-21 12:07:54 -07:00
Clinton Gormley	25a89e613a	Broke recipes into separate pages	2017-07-17 18:21:39 +02:00

1 2

68 Commits