OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	22459576d7	Logging: Make node name consistent in logger (#31588 ) First, some background: we have 15 different methods to get a logger in Elasticsearch but they can be broken down into three broad categories based on what information is provided when building the logger. Just a class like: ``` private static final Logger logger = ESLoggerFactory.getLogger(ActionModule.class); ``` or: ``` protected final Logger logger = Loggers.getLogger(getClass()); ``` The class and settings: ``` this.logger = Loggers.getLogger(getClass(), settings); ``` Or more information like: ``` Loggers.getLogger("index.store.deletes", settings, shardId) ``` The goal of the "class and settings" variant is to attach the node name to the logger. Because we don't always have the settings available, we often use the "just a class" variant and get loggers without node names attached. There isn't any real consistency here. Some loggers get the node name because it is convenient and some do not. This change makes the node name available to all loggers all the time. Almost. There are some caveats are testing that I'll get to. But in production code the node name is node available to all loggers. This means we can stop using the "class and settings" variants to fetch loggers which was the real goal here, but a pleasant side effect is that the ndoe name is now consitent on every log line and optional by editing the logging pattern. This is all powered by setting the node name statically on a logging formatter very early in initialization. Now to tests: tests can't set the node name statically because subclasses of `ESIntegTestCase` run many nodes in the same jvm, even in the same class loader. Also, lots of tests don't run with a real node so they don't have a node name at all. To support multiple nodes in the same JVM tests suss out the node name from the thread name which works surprisingly well and easy to test in a nice way. For those threads that are not part of an `ESIntegTestCase` node we stick whatever useful information we can get form the thread name in the place of the node name. This allows us to keep the logger format consistent.	2018-07-31 10:54:24 -04:00
Colm O'Shea	97b379e0d4	fix no=>not typo (#32463 ) Found a tiny typo while reading the docs	2018-07-31 13:33:23 +01:00
Yannick Welsch	11f6983d69	HLRC: Add delete watch action (#32337 ) Adds the "delete watch" API to the High-Level Rest Client. Relates #29827	2018-07-31 10:29:22 +02:00
Tal Levy	1e0fcebfe1	update rollover to leverage write-alias semantics (#32216 ) Rollover should not swap aliases when `is_write_index` is set to `true`. Instead, both the new and old indices should have the rollover alias, with the newly created index as the new write index Updates Rollover to leverage the ability to preserve aliases and swap which is the write index. Historically, Rollover would swap which index had the designated alias for writing documents against. This required users to keep a separate read-alias that enabled reading against both rolled over and newly created indices, whiles the write-alias was being re-assigned at every rollover. With the ability for aliases to designate a write index, Rollover can be a bit more flexible with its use of aliases. Updates include: - Rollover validates that the target alias has a write index (the index that is being rolled over). This means that the restriction that aliases only point to one index is no longer necessary. - Rollover explicitly (and atomically) swaps which index is the write-index by explicitly assigning the existing index to have `is_write_index: false` and have the newly created index have its rollover alias as `is_write_index: true`. This is only done when `is_write_index: true` on the write index. Default behavior of removing the alias from the rolled over index stays when `is_write_index` is not explicitly set Relevant things that are staying the same: - Rollover is rejected if there exist any templates that match the newly-created index and configure the rollover-alias - I think this existed to prevent the situation where an alias pointed to two indices for a short while. Although this can technically be relaxed, the specific cases that are safe are really particular and difficult to reason, so leaving the broad restriction sounds good	2018-07-30 14:32:55 -07:00
w-bonelli	072c0be8af	Update Fuzzy Query docs to clarify default behavior re max_expansions (#30819 ) Stating that the Fuzzy Query generates "all possible" matching terms is misleading, given that the query's default behavior is to generate a maximum of 50 matching terms. (cherry picked from commit 345a0071a2a41fd7f80ae9ef8a39a2cb4991aedd)	2018-07-30 13:19:26 -07:00
Jason Tedor	588db621ac	Remove reference to non-existent store type (#32418 ) We removed the default_fs store type yet the docs still contain a reference to them. This commit addresses that by removing this reference, and changing a reference to this section of the docs to instead refer to mmapfs.	2018-07-27 11:24:03 -04:00
Jason Tedor	3ac57f0ba3	Fix ordering of bootstrap checks in docs (#32417 ) In the section of the bootstrap checks docs for the maximum map count check, we refer to max size virtual memory check and explicitly call out the maximum size virtual memory check as being the previous point. However, this is not correct as the previous point is currently the max file size check. It does make sense for these two checks to be proximate to each other in the docs so this commit reorders the checks so that the maximum size virtual memory check indeed comes before the maximum map count check. This makes the sense in the maximum map count check correct.	2018-07-27 10:40:16 -04:00
Jim Ferenczi	53ff06e621	Upgrade to Lucene-7.5.0-snapshot-608f0277b0 (#32390 ) The main highlight is the removal of the reclaim_deletes_weight in the TieredMergePolicy. The es setting index.merge.policy.reclaim_deletes_weight is deprecated in this commit and the value is ignored. The new merge policy setting setDeletesPctAllowed should be added in a follow up.	2018-07-27 08:28:51 +02:00
Sandeep Kanabar	7ad16ffd84	Docs: Correcting a typo in tophits (#32359 )	2018-07-26 13:30:01 -04:00
Igor Motov	3b050194f9	Docs: Fix callouts in put license HL REST docs (#32363 )	2018-07-26 09:00:35 -07:00
mzbeck	63a0436764	Update update-settings.asciidoc (#31378 )	2018-07-26 17:16:53 +02:00
Jack Conradson	10bfedeb53	Painless: Fix documentation links to use existing refs (#32335 ) Uses ref and xpack-ref instead of custom ones in the Painless docs for long-term maintainability.	2018-07-25 10:59:25 -07:00
Lisa Cawley	8067250fbe	[DOCS] Adds recommendation for xpack.security.enabled (#32345 )	2018-07-25 09:51:40 -07:00
debadair	32eb4d41b6	[DOCS] Removed extraneous callout number.	2018-07-24 11:51:52 -07:00
Igor Motov	29c802f88e	Rest HL client: Add put license action (#32214 ) In the HL REST client we replace the License object with a string, because of complexity of this class. It is also not really needed on the client side since end-users are not interacting with the license besides passing it as a string to the server. Relates #29827	2018-07-24 11:19:07 -07:00
Christoph Büscher	c1cc0cef61	Add ERR to ranking evaluation documentation (#32314 ) This change adds a section about the Expected Reciprocal Rank metric (ERR) to the Ranking Evaluation documentation.	2018-07-24 19:58:34 +02:00
Daniel Mitterdorfer	73a38895fd	Add Restore Snapshot High Level REST API With this commit we add the restore snapshot API to the Java high level REST client. Relates #27205 Relates #32155	2018-07-24 16:17:09 +02:00
Russ Cam	e2b665c2e6	Consistent encoder names (#29492 ) This commit updates encoder names to be consistent within documentation and align with snake casing convention.	2018-07-24 09:21:43 +10:00
Zachary Tong	6ba144ae31	Add WeightedAvg metric aggregation (#31037 ) Adds a new single-value metrics aggregation that computes the weighted average of numeric values that are extracted from the aggregated documents. These values can be extracted from specific numeric fields in the documents. When calculating a regular average, each datapoint has an equal "weight"; it contributes equally to the final value. In contrast, weighted averages scale each datapoint differently. The amount that each datapoint contributes to the final value is extracted from the document, or provided by a script. As a formula, a weighted average is the `∑(value * weight) / ∑(weight)` A regular average can be thought of as a weighted average where every value has an implicit weight of `1`. Closes #15731	2018-07-23 18:33:15 -04:00
Christoph Büscher	fe6bb75eb4	Rename ranking evaluation `quality_level` to `metric_score` (#32168 ) The notion of "quality" is an overloaded term in the search ranking evaluation context. Its usually used to decribe certain levels of "good" vs. "bad" of a seach result with respect to the users information need. We currently report the result of the ranking evaluation as `quality_level` which is a bit missleading. This changes the response parameter name to `metric_score` which fits better.	2018-07-23 22:25:02 +02:00
debadair	5525cb1615	[DOCS] Clarified that you must remove X-Pack plugin when upgrading from pre-6.3. (#32016 )	2018-07-20 14:17:48 -07:00
Armin Braun	7aa8a0a927	INGEST: Extend KV Processor (#31789 ) (#32232 ) * INGEST: Extend KV Processor (#31789) Added more capabilities supported by LS to the KV processor: * Stripping of brackets and quotes from values (`include_brackets` in corresponding LS filter) * Adding key prefixes * Trimming specified chars from keys and values Refactored the way the filter is configured to avoid conditionals during execution. Refactored Tests a little to not have to add more redundant getters for new parameters. Relates #31786 * Add documentation	2018-07-20 22:32:50 +02:00
Nick Peihl	ac63408655	Add region ISO code to GeoIP Ingest plugin (#31669 )	2018-07-20 11:23:29 -07:00
Daniel Mitterdorfer	6afb661ca5	Remove indices stats timeout from monitoring docs With this commit we remove the documentation for the setting `xpack.monitoring.collection.indices.stats.timeout` which has already been removed in code. Closes #32133 Relates #32229	2018-07-20 12:55:20 +02:00
Christoph Büscher	5cbd9ad177	Rename ranking evaluation response section (#32166 ) Currently the ranking evaluation response contains a 'unknown_docs' section for each search use case in the evaluation set. It contains document ids for results in the search hits that currently don't have a quality rating. This change renames it to `unrated_docs`, which better reflects its purpose.	2018-07-20 11:43:46 +02:00
Luca Cavanna	00a6ad0e9e	Remove aliases resolution limitations when security is enabled (#31952 ) Resolving wildcards in aliases expression is challenging as we may end up with no aliases to replace the original expression with, but if we replace with an empty array that means _all which is quite the opposite. Now that we support and serialize the original requested aliases, whenever aliases are replaced we will be able to know what was initially requested. `MetaData#findAliases` can then be updated to not return anything in case it gets empty aliases, but the original aliases were not empty. That means that empty aliases are interpreted as _all only if they were originally requested that way. Relates to #31516	2018-07-20 09:23:32 +02:00
Julie Tibshirani	0f0068b91c	Ensure that field aliases cannot be used in multi-fields. (#32219 )	2018-07-20 00:18:54 -07:00
Mayya Sharipova	4c68dfe001	Handle missing values in painless (#32207 ) Throw an exception for doc['field'].value if this document is missing a value for the field. After deprecation changes have been backported to 6.x, make this a default behaviour in 7.0 Closes #29286	2018-07-19 17:41:06 -04:00
Vladimir Dolzhenko	7c0fc209bf	ECS Task IAM profile credentials ignored in repository-s3 plugin (#31864 ) ECS Task IAM profile credentials ignored in repository-s3 plugin (#31864) Closes #26913	2018-07-19 12:54:38 +02:00
David Turner	f6d7854f76	Remove indication of future multi-homing support (#32187 ) We do not support intra-cluster connections on multiple interfaces, but the documentation indicates that we will in future. In fact there is currently no plan to support this, so the forward-looking documentation is misleading. This commit - removes the misleading sentence - fixes that a transport profile affects outbound connections, not inbound ones - tidies up some nearby text	2018-07-19 11:33:46 +01:00
Alexander Reelsen	202894b832	Rest HL client: Add put watch action (#32026 ) Relates #29827 This implementation behaves like the current transport client, that you basically cannot configure a Watch POJO representation as an argument to the put watch API, but only a bytes reference. You can use the the `WatchSourceBuilder` from the `org.elasticsearch.plugin:x-pack-core` dependency to build watches. This commit also changes the license type to trial, so that watcher is available in high level rest client tests. /cc @hub-cap	2018-07-19 10:40:54 +02:00
Julie Tibshirani	15ff3da653	Add support for field aliases. (#32172 ) * Add basic support for field aliases in index mappings. (#31287) * Allow for aliases when fetching stored fields. (#31411) * Add tests around accessing field aliases in scripts. (#31417) * Add documentation around field aliases. (#31538) * Add validation for field alias mappings. (#31518) * Return both concrete fields and aliases in DocumentFieldMappers#getMapper. (#31671) * Make sure that field-level security is enforced when using field aliases. (#31807) * Add more comprehensive tests for field aliases in queries + aggregations. (#31565) * Remove the deprecated method DocumentFieldMappers#getFieldMapper. (#32148)	2018-07-18 09:33:09 -07:00
David Turner	380b45b965	Improve docs for search preferences (#32159 ) Today it is unclear what guarantees are offered by the search preference feature, and we claim a guarantee that is stronger than what we really offer: > A custom value will be used to guarantee that the same shards will be used > for the same custom value. This commit clarifies this documentation. Forward-port of #32098 to `master`.	2018-07-18 12:58:17 +01:00
Martijn van Groningen	1924f5d07c	Add more contexts to painless execute api (#30511 ) This change adds two contexts the execute scripts against: * SEARCH_SCRIPT: Allows to run scripts in a search script context. This context is used in `function_score` query's script function, script fields, script sorting and `terms_set` query. * FILTER_SCRIPT: Allows to run scripts in a filter script context. This context is used in the `script` query. In both contexts a index name needs to be specified and a sample document. The document is needed to create an in-memory index that the script can access via the `doc[...]` and other notations. The index name is needed because a mapping is needed to index the document. Examples: ``` POST /_scripts/painless/_execute { "script": { "source": "doc['field'].value.length()" }, "context" : { "search_script": { "document": { "field": "four" }, "index": "my-index" } } } ``` Returns: ``` { "result": 4 } ``` POST /_scripts/painless/_execute { "script": { "source": "doc['field'].value.length() <= params.max_length", "params": { "max_length": 4 } }, "context" : { "filter_script": { "document": { "field": "four" }, "index": "my-index" } } } Returns: ``` { "result": true } ``` Also changed PainlessExecuteAction.TransportAction to use TransportSingleShardAction instead of HandledAction, because now in case score or filter contexts are used the request needs to be redirected to a node that has an active IndexService for the index being referenced (a node with a shard copy for that index).	2018-07-18 12:42:07 +02:00
Christoph Büscher	36165265ce	Fix put mappings java API documentation (#31955 ) The current docs of the put-mapping Java API is currently broken. It its current form, it creates an index and uses the whole mapping definition given as a JSON string as the type name. Since we didn't check the index created in the IndicesDocumentationIT so far this went unnoticed. This change adds test to catch this error to the documentation test, changes the documentation so it works correctly now and adds an input validation to PutMappingRequest#buildFromSimplifiedDef() which was used internally to reject calls where no mapping definition is given. Closes #31906	2018-07-17 09:09:03 +02:00
Christoph Büscher	61486680a2	Add exclusion option to `keep_types` token filter (#32012 ) Currently the `keep_types` token filter includes all token types specified using its `types` parameter. Lucenes TypeTokenFilter also provides a second mode where instead of keeping the specified tokens (include) they are filtered out (exclude). This change exposes this option as a new `mode` parameter that can either take the values `include` (the default, if not specified) or `exclude`. Closes #29277	2018-07-17 09:04:41 +02:00
ahmedakef	5b65e2ccbf	DOCS: put LIMIT 10 to the SQL query (#32065 ) Provides a more precise equivalent SQL query for the aggregation example in the getting started guide.	2018-07-16 16:09:56 -04:00
debadair	ff8260c234	Tweaked Elasticsearch Service links for SEO	2018-07-16 12:58:08 -07:00
debadair	cccc3f7a64	Tweaked Elasticsearch Service links for SEO	2018-07-16 12:47:57 -07:00
DeDe Morton	ecd05d5be4	Use correct formatting for links (#29460 )	2018-07-16 21:11:24 +02:00
Jack Conradson	42ca520377	Clean Up Snapshot Create Rest API (#31779 ) Make SnapshotInfo and CreateSnapshotResponse parsers lenient for backwards compatibility. Remove extraneous fields from CreateSnapshotRequest toXContent.	2018-07-13 13:07:26 -07:00
Paul Sanwald	feb07559aa	fix typo	2018-07-13 14:59:11 -04:00
Colin Goodheart-Smithe	0edb096eb4	Adds a new auto-interval date histogram (#28993 ) * Adds a new auto-interval date histogram This change adds a new type of histogram aggregation called `auto_date_histogram` where you can specify the target number of buckets you require and it will find an appropriate interval for the returned buckets. The aggregation works by first collecting documents in buckets at second interval, when it has created more than the target number of buckets it merges these buckets into minute interval bucket and continues collecting until it reaches the target number of buckets again. It will keep merging buckets when it exceeds the target until either collection is finished or the highest interval (currently years) is reached. A similar process happens at reduce time. This aggregation intentionally does not support min_doc_count, offest and extended_bounds to keep the already complex logic from becoming more complex. The aggregation accepts sub-aggregations but will always operate in `breadth_first` mode deferring the computation of sub-aggregations until the final buckets from the shard are known. min_doc_count is effectively hard-coded to zero meaning that we will insert empty buckets where necessary. Closes #9572 * Adds documentation * Added sub aggregator test * Fixes failing docs test * Brings branch up to date with master changes * trying to get tests to pass again * Fixes multiBucketConsumer accounting * Collects more buckets than needed on shards This gives us more options at reduce time in terms of how we do the final merge of the buckeets to produce the final result * Revert "Collects more buckets than needed on shards" This reverts commit 993c782d117892af9a3c86a51921cdee630a3ac5. * Adds ability to merge within a rounding * Fixes nonn-timezone doc test failure * Fix time zone tests * iterates on tests * Adds test case and documentation changes Added some notes in the documentation about the intervals that can bbe returned. Also added a test case that utilises the merging of conseecutive buckets * Fixes performance bug The bug meant that getAppropriate rounding look a huge amount of time if the range of the data was large but also sparsely populated. In these situations the rounding would be very low so iterating through the rounding values from the min key to the max keey look a long time (~120 seconds in one test). The solution is to add a rough estimate first which chooses the rounding based just on the long values of the min and max keeys alone but selects the rounding one lower than the one it thinks is appropriate so the accurate method can choose the final rounding taking into account the fact that intervals are not always fixed length. Thee commit also adds more tests * Changes to only do complex reduction on final reduce * merge latest with master * correct tests and add a new test case for 10k buckets * refactor to perform bucket number check in innerBuild * correctly derive bucket setting, update tests to increase bucket threshold * fix checkstyle * address code review comments * add documentation for default buckets * fix typo	2018-07-13 13:08:35 -04:00
Ryan Ernst	2c3ea43f45	HLRC: Add xpack usage api (#31975 ) This commit adds the _xpack/usage api to the high level rest client. Currently in the transport api, the usage data is exposed in a limited fashion, at most giving one level of helper methods for the inner keys of data, but then exposing thos subobjects as maps of objects. Rather than making parsers for every set of usage data from each feature, this PR exposes the entire set of usage data as a map of maps.	2018-07-13 09:33:27 -07:00
Mayya Sharipova	80492cacfc	Add second level of field collapsing (#31808 ) * Put second level collapse under inner_hits Closes #24855	2018-07-13 11:40:03 -04:00
Clinton Gormley	bc1284eb28	Docs: Restyled cloud link in getting started	2018-07-13 15:48:14 +02:00
Clinton Gormley	9a928756e9	Docs: Change formatting of Cloud options	2018-07-13 15:40:38 +02:00
Alan Woodward	a01e26a39b	Correct spelling of AnalysisPlugin#requriesAnalysisSettings (#32025 ) Because this is a static method on a public API, and one that we encourage plugin authors to use, the method with the typo is deprecated in 6.x rather than just renamed.	2018-07-13 13:13:21 +01:00
Daniel Mitterdorfer	f174f72fee	Circuit-break based on real memory usage With this commit we introduce a new circuit-breaking strategy to the parent circuit breaker. Contrary to the current implementation which only accounts for memory reserved via child circuit breakers, the new strategy measures real heap memory usage at the time of reservation. This allows us to be much more aggressive with the circuit breaker limit so we bump it to 95% by default. The new strategy is turned on by default and can be controlled with the new cluster setting `indices.breaker.total.userealmemory`. Note that we turn it off for all integration tests with an internal test cluster because it leads to spurious test failures which are of no value (we cannot fully control heap memory usage in tests). All REST tests, however, will make use of the real memory circuit breaker. Relates #31767	2018-07-13 10:08:28 +02:00
Nik Everett	9822a6f911	Docs: Explain closing the high level client It looks like we weren't clear on when and why you should close the high level client and folks were closing it after every request which is not efficient. This explains why you should close the client and when so this shouldn't be as common. Closes #32001	2018-07-12 16:22:42 -04:00

1 2 3 4 5 ...

5382 Commits