OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-06 13:08:29 +00:00

Author	SHA1	Message	Date
markharwood	70d80a3d09	Docs enhancement: added reference to cluster-level setting `search.default_allow_partial_results` (#32810 ) Closes #32809	2018-08-16 10:21:37 +01:00
Andy Bristol	8bfb0f3f8d	serialize suggestion responses as named writeables (#30284 ) Suggestion responses were previously serialized as streamables which made writing suggesters in plugins with custom suggestion response types impossible. This commit makes them serialized as named writeables and provides a facility for registering a reader for suggestion responses when registering a suggester. This also makes Suggestion responses abstract, requiring a suggester implementation to provide its own types. Suggesters which do not need anything additional to what is defined in Suggest.Suggestion should provide a minimal subclass. The existing plugin suggester integration tests are removed and replaced with an equivalent implementation as an example plugin.	2018-08-07 13:31:00 -07:00
David Roberts	733f84c49b	[DOCS] Add missing docs include	2018-08-07 17:49:48 +01:00
David Roberts	2608012422	Add temporary directory cleanup workarounds (#32615 ) On some Linux distributions tmpfiles.d cleans files and directories under /tmp if they haven't been accessed for 10 days. This can cause problems for ML as ML is currently the only component that uses the temp directory more than a few seconds after startup. If you didn't open an ML job for 10 days and then tried to open one then the temp directory would have been deleted. This commit prevents the problem occurring in the case of Elasticsearch being managed by systemd, as systemd private temp directories are not subject to periodic cleanup (by default). Additionally there are now some docs to warn people about the risk and suggest a manual mitigation for .tar.gz users.	2018-08-07 16:59:56 +01:00
simonzheng	ab81078949	[Docs] Correct a small typo (#32655 )	2018-08-07 10:34:55 +02:00
DeDe Morton	e01e4393a8	[Docs] Light edit to info about docker images (#32376 )	2018-08-06 12:00:07 -07:00
Jake Landis	3d4c84f7ca	ingest: doc: move Dot Expander Processor doc to correct position (#31743 ) No changes to the content.	2018-08-03 07:21:05 -07:00
Shaunak Kashyap	0a83968650	Add cluster UUID to Cluster Stats API response (#32206 ) * Make cluster stats response contain cluster UUID * Updating constructor usage in Monitoring tests * Adding cluster_uuid field to Cluster Stats API reference doc * Adding rest api spec test for expecting cluster_uuid in cluster stats response * Adding missing newline * Indenting do section properly * Missed a spot! * Fixing the test cluster ID	2018-08-02 17:14:19 -07:00
Matthias Sieber	a39f6f09f4	fixed elements in array of produced terms (#32519 )	2018-08-02 11:12:15 -04:00
Ryan Ernst	478f6d6cf1	Scripting: Conditionally use java time api in scripting (#31441 ) This commit adds a boolean system property, `es.scripting.use_java_time`, which controls the concrete return type used by doc values within scripts. The return type of accessing doc values for a date field is changed to Object, essentially duck typing the type to allow co-existence during the transition from joda time to java time.	2018-08-01 08:58:49 -07:00
Albert Zaharovits	2d87287c0d	[DOCS] Reloadable Secure Settings (#31713 ) Docs on reloadable secure settings for plugins #29135 .	2018-08-01 12:07:23 +03:00
Nik Everett	22459576d7	Logging: Make node name consistent in logger (#31588 ) First, some background: we have 15 different methods to get a logger in Elasticsearch but they can be broken down into three broad categories based on what information is provided when building the logger. Just a class like: ``` private static final Logger logger = ESLoggerFactory.getLogger(ActionModule.class); ``` or: ``` protected final Logger logger = Loggers.getLogger(getClass()); ``` The class and settings: ``` this.logger = Loggers.getLogger(getClass(), settings); ``` Or more information like: ``` Loggers.getLogger("index.store.deletes", settings, shardId) ``` The goal of the "class and settings" variant is to attach the node name to the logger. Because we don't always have the settings available, we often use the "just a class" variant and get loggers without node names attached. There isn't any real consistency here. Some loggers get the node name because it is convenient and some do not. This change makes the node name available to all loggers all the time. Almost. There are some caveats around testing that I'll get to. But in production code the node name is now available to all loggers. This means we can stop using the "class and settings" variants to fetch loggers which was the real goal here, but a pleasant side effect is that the node name is now consistent on every log line and optional by editing the logging pattern. This is all powered by setting the node name statically on a logging formatter very early in initialization. Now to tests: tests can't set the node name statically because subclasses of `ESIntegTestCase` run many nodes in the same jvm, even in the same class loader. Also, lots of tests don't run with a real node so they don't have a node name at all. To support multiple nodes in the same JVM tests suss out the node name from the thread name which works surprisingly well and is easy to test in a nice way. For those threads that are not part of an `ESIntegTestCase` node we stick whatever useful information we can get from the thread name in the place of the node name. This allows us to keep the logger format consistent.	2018-07-31 10:54:24 -04:00
Colm O'Shea	97b379e0d4	fix no=>not typo (#32463 ) Found a tiny typo while reading the docs	2018-07-31 13:33:23 +01:00
Tal Levy	1e0fcebfe1	update rollover to leverage write-alias semantics (#32216 ) Rollover should not swap aliases when `is_write_index` is set to `true`. Instead, both the new and old indices should have the rollover alias, with the newly created index as the new write index. Updates Rollover to leverage the ability to preserve aliases and swap which is the write index. Historically, Rollover would swap which index had the designated alias for writing documents against. This required users to keep a separate read-alias that enabled reading against both rolled over and newly created indices, while the write-alias was being re-assigned at every rollover. With the ability for aliases to designate a write index, Rollover can be a bit more flexible with its use of aliases. Updates include: - Rollover validates that the target alias has a write index (the index that is being rolled over). This means that the restriction that aliases only point to one index is no longer necessary. - Rollover explicitly (and atomically) swaps which index is the write-index by explicitly assigning the existing index to have `is_write_index: false` and have the newly created index have its rollover alias as `is_write_index: true`. This is only done when `is_write_index: true` on the write index. Default behavior of removing the alias from the rolled over index stays when `is_write_index` is not explicitly set. Relevant things that are staying the same: - Rollover is rejected if there exist any templates that match the newly-created index and configure the rollover-alias - I think this existed to prevent the situation where an alias pointed to two indices for a short while. Although this can technically be relaxed, the specific cases that are safe are really particular and difficult to reason about, so leaving the broad restriction sounds good	2018-07-30 14:32:55 -07:00
w-bonelli	072c0be8af	Update Fuzzy Query docs to clarify default behavior re max_expansions (#30819 ) Stating that the Fuzzy Query generates "all possible" matching terms is misleading, given that the query's default behavior is to generate a maximum of 50 matching terms. (cherry picked from commit 345a0071a2a41fd7f80ae9ef8a39a2cb4991aedd)	2018-07-30 13:19:26 -07:00
Jason Tedor	588db621ac	Remove reference to non-existent store type (#32418 ) We removed the default_fs store type yet the docs still contain a reference to them. This commit addresses that by removing this reference, and changing a reference to this section of the docs to instead refer to mmapfs.	2018-07-27 11:24:03 -04:00
Jason Tedor	3ac57f0ba3	Fix ordering of bootstrap checks in docs (#32417 ) In the section of the bootstrap checks docs for the maximum map count check, we refer to max size virtual memory check and explicitly call out the maximum size virtual memory check as being the previous point. However, this is not correct as the previous point is currently the max file size check. It does make sense for these two checks to be proximate to each other in the docs so this commit reorders the checks so that the maximum size virtual memory check indeed comes before the maximum map count check. This makes the sense in the maximum map count check correct.	2018-07-27 10:40:16 -04:00
Sandeep Kanabar	7ad16ffd84	Docs: Correcting a typo in tophits (#32359 )	2018-07-26 13:30:01 -04:00
mzbeck	63a0436764	Update update-settings.asciidoc (#31378 )	2018-07-26 17:16:53 +02:00
Lisa Cawley	8067250fbe	[DOCS] Adds recommendation for xpack.security.enabled (#32345 )	2018-07-25 09:51:40 -07:00
Christoph Büscher	c1cc0cef61	Add ERR to ranking evaluation documentation (#32314 ) This change adds a section about the Expected Reciprocal Rank metric (ERR) to the Ranking Evaluation documentation.	2018-07-24 19:58:34 +02:00
Zachary Tong	6ba144ae31	Add WeightedAvg metric aggregation (#31037 ) Adds a new single-value metrics aggregation that computes the weighted average of numeric values that are extracted from the aggregated documents. These values can be extracted from specific numeric fields in the documents. When calculating a regular average, each datapoint has an equal "weight"; it contributes equally to the final value. In contrast, weighted averages scale each datapoint differently. The amount that each datapoint contributes to the final value is extracted from the document, or provided by a script. As a formula, a weighted average is the `∑(value * weight) / ∑(weight)` A regular average can be thought of as a weighted average where every value has an implicit weight of `1`. Closes #15731	2018-07-23 18:33:15 -04:00
Christoph Büscher	fe6bb75eb4	Rename ranking evaluation `quality_level` to `metric_score` (#32168 ) The notion of "quality" is an overloaded term in the search ranking evaluation context. It's usually used to describe certain levels of "good" vs. "bad" of a search result with respect to the user's information need. We currently report the result of the ranking evaluation as `quality_level` which is a bit misleading. This changes the response parameter name to `metric_score` which fits better.	2018-07-23 22:25:02 +02:00
debadair	5525cb1615	[DOCS] Clarified that you must remove X-Pack plugin when upgrading from pre-6.3. (#32016 )	2018-07-20 14:17:48 -07:00
Armin Braun	7aa8a0a927	INGEST: Extend KV Processor (#31789 ) (#32232 ) * INGEST: Extend KV Processor (#31789) Added more capabilities supported by LS to the KV processor: * Stripping of brackets and quotes from values (`include_brackets` in corresponding LS filter) * Adding key prefixes * Trimming specified chars from keys and values Refactored the way the filter is configured to avoid conditionals during execution. Refactored Tests a little to not have to add more redundant getters for new parameters. Relates #31786 * Add documentation	2018-07-20 22:32:50 +02:00
Daniel Mitterdorfer	6afb661ca5	Remove indices stats timeout from monitoring docs With this commit we remove the documentation for the setting `xpack.monitoring.collection.indices.stats.timeout` which has already been removed in code. Closes #32133 Relates #32229	2018-07-20 12:55:20 +02:00
Christoph Büscher	5cbd9ad177	Rename ranking evaluation response section (#32166 ) Currently the ranking evaluation response contains a 'unknown_docs' section for each search use case in the evaluation set. It contains document ids for results in the search hits that currently don't have a quality rating. This change renames it to `unrated_docs`, which better reflects its purpose.	2018-07-20 11:43:46 +02:00
Luca Cavanna	00a6ad0e9e	Remove aliases resolution limitations when security is enabled (#31952 ) Resolving wildcards in aliases expression is challenging as we may end up with no aliases to replace the original expression with, but if we replace with an empty array that means _all which is quite the opposite. Now that we support and serialize the original requested aliases, whenever aliases are replaced we will be able to know what was initially requested. `MetaData#findAliases` can then be updated to not return anything in case it gets empty aliases, but the original aliases were not empty. That means that empty aliases are interpreted as _all only if they were originally requested that way. Relates to #31516	2018-07-20 09:23:32 +02:00
Julie Tibshirani	0f0068b91c	Ensure that field aliases cannot be used in multi-fields. (#32219 )	2018-07-20 00:18:54 -07:00
David Turner	f6d7854f76	Remove indication of future multi-homing support (#32187 ) We do not support intra-cluster connections on multiple interfaces, but the documentation indicates that we will in future. In fact there is currently no plan to support this, so the forward-looking documentation is misleading. This commit - removes the misleading sentence - fixes that a transport profile affects outbound connections, not inbound ones - tidies up some nearby text	2018-07-19 11:33:46 +01:00
Julie Tibshirani	15ff3da653	Add support for field aliases. (#32172 ) * Add basic support for field aliases in index mappings. (#31287) * Allow for aliases when fetching stored fields. (#31411) * Add tests around accessing field aliases in scripts. (#31417) * Add documentation around field aliases. (#31538) * Add validation for field alias mappings. (#31518) * Return both concrete fields and aliases in DocumentFieldMappers#getMapper. (#31671) * Make sure that field-level security is enforced when using field aliases. (#31807) * Add more comprehensive tests for field aliases in queries + aggregations. (#31565) * Remove the deprecated method DocumentFieldMappers#getFieldMapper. (#32148)	2018-07-18 09:33:09 -07:00
David Turner	380b45b965	Improve docs for search preferences (#32159 ) Today it is unclear what guarantees are offered by the search preference feature, and we claim a guarantee that is stronger than what we really offer: > A custom value will be used to guarantee that the same shards will be used > for the same custom value. This commit clarifies this documentation. Forward-port of #32098 to `master`.	2018-07-18 12:58:17 +01:00
Christoph Büscher	61486680a2	Add exclusion option to `keep_types` token filter (#32012 ) Currently the `keep_types` token filter includes all token types specified using its `types` parameter. Lucene's TypeTokenFilter also provides a second mode where instead of keeping the specified tokens (include) they are filtered out (exclude). This change exposes this option as a new `mode` parameter that can either take the values `include` (the default, if not specified) or `exclude`. Closes #29277	2018-07-17 09:04:41 +02:00
ahmedakef	5b65e2ccbf	DOCS: put LIMIT 10 to the SQL query (#32065 ) Provides a more precise equivalent SQL query for the aggregation example in the getting started guide.	2018-07-16 16:09:56 -04:00
debadair	ff8260c234	Tweaked Elasticsearch Service links for SEO	2018-07-16 12:58:08 -07:00
debadair	cccc3f7a64	Tweaked Elasticsearch Service links for SEO	2018-07-16 12:47:57 -07:00
DeDe Morton	ecd05d5be4	Use correct formatting for links (#29460 )	2018-07-16 21:11:24 +02:00
Paul Sanwald	feb07559aa	fix typo	2018-07-13 14:59:11 -04:00
Colin Goodheart-Smithe	0edb096eb4	Adds a new auto-interval date histogram (#28993 ) * Adds a new auto-interval date histogram This change adds a new type of histogram aggregation called `auto_date_histogram` where you can specify the target number of buckets you require and it will find an appropriate interval for the returned buckets. The aggregation works by first collecting documents in buckets at one-second intervals; when it has created more than the target number of buckets it merges these buckets into minute-interval buckets and continues collecting until it reaches the target number of buckets again. It will keep merging buckets when it exceeds the target until either collection is finished or the highest interval (currently years) is reached. A similar process happens at reduce time. This aggregation intentionally does not support min_doc_count, offset and extended_bounds to keep the already complex logic from becoming more complex. The aggregation accepts sub-aggregations but will always operate in `breadth_first` mode deferring the computation of sub-aggregations until the final buckets from the shard are known. min_doc_count is effectively hard-coded to zero meaning that we will insert empty buckets where necessary. Closes #9572 * Adds documentation * Added sub aggregator test * Fixes failing docs test * Brings branch up to date with master changes * trying to get tests to pass again * Fixes multiBucketConsumer accounting * Collects more buckets than needed on shards This gives us more options at reduce time in terms of how we do the final merge of the buckets to produce the final result * Revert "Collects more buckets than needed on shards" This reverts commit 993c782d117892af9a3c86a51921cdee630a3ac5. * Adds ability to merge within a rounding * Fixes non-timezone doc test failure * Fix time zone tests * iterates on tests * Adds test case and documentation changes Added some notes in the documentation about the intervals that can be returned. Also added a test case that utilises the merging of consecutive buckets * Fixes performance bug The bug meant that getAppropriate rounding took a huge amount of time if the range of the data was large but also sparsely populated. In these situations the rounding would be very low so iterating through the rounding values from the min key to the max key took a long time (~120 seconds in one test). The solution is to add a rough estimate first which chooses the rounding based just on the long values of the min and max keys alone but selects the rounding one lower than the one it thinks is appropriate so the accurate method can choose the final rounding taking into account the fact that intervals are not always fixed length. The commit also adds more tests * Changes to only do complex reduction on final reduce * merge latest with master * correct tests and add a new test case for 10k buckets * refactor to perform bucket number check in innerBuild * correctly derive bucket setting, update tests to increase bucket threshold * fix checkstyle * address code review comments * add documentation for default buckets * fix typo	2018-07-13 13:08:35 -04:00
Mayya Sharipova	80492cacfc	Add second level of field collapsing (#31808 ) * Put second level collapse under inner_hits Closes #24855	2018-07-13 11:40:03 -04:00
Clinton Gormley	bc1284eb28	Docs: Restyled cloud link in getting started	2018-07-13 15:48:14 +02:00
Clinton Gormley	9a928756e9	Docs: Change formatting of Cloud options	2018-07-13 15:40:38 +02:00
Alan Woodward	a01e26a39b	Correct spelling of AnalysisPlugin#requriesAnalysisSettings (#32025 ) Because this is a static method on a public API, and one that we encourage plugin authors to use, the method with the typo is deprecated in 6.x rather than just renamed.	2018-07-13 13:13:21 +01:00
Daniel Mitterdorfer	f174f72fee	Circuit-break based on real memory usage With this commit we introduce a new circuit-breaking strategy to the parent circuit breaker. Contrary to the current implementation which only accounts for memory reserved via child circuit breakers, the new strategy measures real heap memory usage at the time of reservation. This allows us to be much more aggressive with the circuit breaker limit so we bump it to 95% by default. The new strategy is turned on by default and can be controlled with the new cluster setting `indices.breaker.total.userealmemory`. Note that we turn it off for all integration tests with an internal test cluster because it leads to spurious test failures which are of no value (we cannot fully control heap memory usage in tests). All REST tests, however, will make use of the real memory circuit breaker. Relates #31767	2018-07-13 10:08:28 +02:00
Jimi Ford	e955ffc38d	Docs: fix typo in datehistogram (#31972 )	2018-07-11 15:04:57 -04:00
Clinton Gormley	aedbfc63cd	Docs: Added note about cloud service to installation and getting started	2018-07-11 20:17:18 +02:00
Lisa Cawley	efcfd0d827	[DOCS] Removes alternative docker pull example (#31934 )	2018-07-11 09:08:32 -07:00
Sohaib Iftikhar	88c270d844	Added lenient flag for synonym token filter (#31484 ) * Added lenient flag for synonym-tokenfilter. Relates to #30968 * added docs for synonym-graph-tokenfilter -- Also made lenient final -- changed from !lenient to lenient == false * Changes after review (1) -- Renamed to ElasticsearchSynonymParser -- Added explanation for ElasticsearchSynonymParser::add method -- Changed ElasticsearchSynonymParser::logger instance to static * Added lenient option for WordnetSynonymParser -- also added more documentation * Added additional documentation * Improved documentation	2018-07-10 17:11:50 -04:00
Jim Ferenczi	584fa261cc	Remove the ability to index or query context suggestions without context (#31007 ) This is a follow up of #30712 that removes the ability to index or query a context-enabled completion field without context. Relates #30712	2018-07-09 16:01:01 +02:00
Armin Braun	e46ed73379	Ingest: Add ignore_missing option to RemoveProc (#31693 ) Added `ignore_missing` setting to the RemoveProcessor to fix #23086	2018-07-09 10:24:34 +02:00

4509 Commits