OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	b4502dbf74	LLClient: Add setJsonEntity (#30447 ) Adds `Request#setJsonEntity(String)` which short circuits the process of sending a json string which is super common.	2018-05-09 18:33:03 -04:00
Yu	106bed90c7	Add `coordinating_only` node selector (#30313 ) Today we can execute cluster API actions on only master, data or ingest nodes using the `master:true`, `data:true` and `ingest:true` filters, but it is not so easy to select coordinating-only nodes (i.e. those nodes that are neither master nor data nor ingest nodes). This change fixes this by adding support for a `coordinating_only` filter such that `coordinating_only:true` adds all coordinating-only nodes to the set of selected nodes, and `coordinating_only:false` deletes them. Resolves #28831.	2018-05-09 12:14:07 +01:00
Ke Li	0c6789bc72	Use date format in `date_range` mapping before fallback to default (#29310 ) If the date format is not forced in query, use the format in mapping before fallback to the default format. Closes #29282	2018-05-09 09:41:44 +02:00
Alexander Reelsen	f00890ee38	Watcher: Increase HttpClient parallel sent requests (#30130 ) The HTTPClient used in watcher is based on the apache http client. The current client is using a lot of defaults - which are not always optimal. Two of those defaults are the maximum number of total connections and the maximum number of connections to a single route. If one of those limits is reached, the HTTPClient waits for a connection to be finished thus acting in a blocking fashion. In order to prevent this when many requests are being executed, we increase the limit of total connections as well as the connections per route (a route is basically an endpoint, which also contains proxy information, not containing an URL, just hosts). On top of that an additional option has been set to evict long running connections, which can potentially be reused after some time. As this requires an additional background thread, this required some changes to ensure that the httpclient is closed properly. Also the timeout for this can be configured.	2018-05-09 09:37:47 +02:00
Nik Everett	b062ce5634	Client: Deprecate many argument performRequest (#30315 ) Deprecate the many arguments versions of `performRequest` and `performRequestAsync` in favor of the `Request` object flavored variants introduced in #29623. We'll be dropping the many arguments variants in 7.0 because they make it difficult to add new features in a backwards compatible way and they create a ton of intellisense noise.	2018-05-08 14:38:55 -04:00
aditya-agrawal	27ddb4ffea	Avoid NPE in `more_like_this` when field has zero tokens (#30365 ) Fixes and edge case when using `more_like_this` where TermVectorsWriter could throw an NPE when a field produced zero tokens after analysis. This changes the implementation to use an empty list of tokens in this case. Closes #30148	2018-05-08 15:13:07 +02:00
Yannick Welsch	82b251adcf	Auto-expand replicas when adding or removing nodes (#30423 ) Auto-expands replicas in the same cluster state update (instead of a follow-up reroute) where nodes are added or removed. Closes #1873, fixing an issue where nodes drop their copy of auto-expanded data when coming up, only to sync it again later.	2018-05-07 22:26:31 +02:00
Igor Motov	44c6dcf5be	Docs: fix changelog merge	2018-05-07 14:25:03 -04:00
Igor Motov	6fb189ce47	Add stricter geohash parsing (#30376 ) Adds verification that geohashes are not empty and contain only valid characters. It fixes the issue when en empty geohash is treated as [-180, -90] and geohashes with non-geohash character are getting resolved into invalid coordinates. Closes #23579	2018-05-07 13:56:39 -04:00
Tanguy Leroux	1987d6261f	Do not fail snapshot when deleting a missing snapshotted file (#30332 ) When deleting or creating a snapshot for a given shard, elasticsearch usually starts by listing all the existing snapshotted files in the repository. Then it computes a diff and deletes the snapshotted files that are not needed anymore. During this deletion, an exception is thrown if the file to be deleted does not exist anymore. This behavior is challenging with cloud based repository implementations like S3 where a file that has been deleted can still appear in the bucket for few seconds/minutes (because the deletion can take some time to be fully replicated on S3). If the deleted file appears in the listing of files, then the following deletion will fail with a NoSuchFileException and the snapshot will be partially created/deleted. This pull request makes the deletion of these files a bit less strict, ie not failing if the file we want to delete does not exist anymore. It introduces a new BlobContainer.deleteIgnoringIfNotExists() method that can be used at some specific places where not failing when deleting a file is considered harmless. Closes #28322	2018-05-07 09:35:55 +02:00
Nhat Nguyen	eed8a3b585	Add put index template api to high level rest client (#30400 ) Relates #27205	2018-05-06 09:47:36 -04:00
Jason Tedor	10fcf30ce1	Move respect accept header on no handler to 6.3.1 This commit moves the changelog entry for the change to respect the accept header on no handler from the 7.0.0 section of the docs to 6.3.1.	2018-05-04 20:50:30 -04:00
Jason Tedor	beee5fe004	Respect accept header on no handler (#30383 ) Today when processing a request for a URL path for which we can not find a handler we send back a plain-text response. Yet, we have the accept header in our hand and can respect the accepted media type of the request. This commit addresses this.	2018-05-04 18:13:50 -04:00
Jim Ferenczi	891d3bd9c3	Expose the Lucene Korean analyzer module in a plugin (#30397 ) This change adds a new plugin called `analysis-nori` that exposes Korean text analysis in es using the new Lucene Korean analyzer module named (`nori`). The plugin adds: * a Korean analyzer: `nori` * a Korean tokenizer: `nori_tokenizer` * a part of speech stop filter: `nori_part_of_speech` * a filter that can replace Hanja characters with their Hangul transcription: `nori_readingform`	2018-05-04 20:46:13 +02:00
Zachary Tong	1c0d339904	[Rollup] Validate timezone in range queries (#30338 ) When validating the search request, we make sure any date_histogram aggregations have timezones that match the jobs. But we didn't do any such validation on range queries. While it wouldn't produce incorrect results, it would be confusing to the user as no documents would match the aggregation (because we add a filter clause on the timezone for the agg). Now the user gets an exception up front, and some helpful text about why the range query didnt match, and which timezones are acceptable	2018-05-04 10:45:16 -07:00
lcawley	137ce702a4	[DOCS] Added coming qualifiers in changelog	2018-05-03 19:38:19 -07:00
debadair	19624466e8	[DOCS] Commented out empty sections in the changelog to fix the doc build. (#30372 )	2018-05-03 16:31:32 -07:00
Jay Modi	aa0d7c73f8	Security: reduce garbage during index resolution (#30180 ) The IndexAndAliasesResolver resolves the indices and aliases for each request and also handles local and remote indices. The current implementation uses the ResolvedIndices class to hold the resolved indices and aliases. While evaluating the indices and aliases against the user's permissions, the final value for ResolvedIndices is constructed. Prior to this change, this was done by creating a ResolvedIndices for the first set of indices and for each additional addition, a new ResolvedIndices object is created and merged with the existing one. With a small number of indices and aliases this does not pose a large problem; however as the number of indices/aliases grows more list allocations and array copies are needed resulting in a large amount of garbage and severely impacted performance. This change introduces a builder for ResolvedIndices that appends to mutable lists until the final value has been constructed, which will ultimately reduce the amount of garbage generated by this code.	2018-05-03 12:48:23 -06:00
Dimitris Athanasiou	3b260dcfc1	[ML] Account for gaps in data counts after job is reopened (#30294 ) This commit fixes an issue with the data diagnostics were empty buckets are not reported even though they should. Once a job is reopened, the diagnostics do not get initialized from the current data counts (especially the latest record timestamp). The result is that if the data that is sent have a time gap compared to the previous ones, that gap is not accounted for in the empty bucket count. This commit fixes that by initializing the diagnostics with the current data counts. Closes #30080	2018-05-03 15:08:24 +01:00
Alexander Reelsen	2c38d12e23	Watcher: Make start/stop cycle more predictable and synchronous (#30118 ) The current implementation starts/stops watcher using an executor. This can result in our of order operations. This commit reduces those executor calls to an absolute minimum in order to be able to do state changes within the cluster state listener method, which runs in sequence. When a state change occurs that forces the watcher service to pause (like no watcher index, no master node, no local shards), the service is now in a paused state. Pausing is a super lightweight operation, which marks the ExecutionService as paused and waits for the currently executing watches to finish in the background via an executor. The same applies for stopping, the potentially long running operation is outsourced in to an executor, as waiting for executed watches is decoupled from the current state. The only other long running operation is starting, where watches need to be loaded. This is also done via an executor, but has an additional protection by checking the cluster state version it was started with. If another cluster state version was trying to load the watches, then this loading will not take effect. This PR also cleans up some unused states, like the a simple boolean in the HistoryStore/TriggeredWatchStore marking it as started or stopped, as this can now be caught in the execution service. Another advantage of this approach is the fact, that now only triggered watches are not getting executed, while watches that are run via the Execute Watch API will still be executed regardless if watcher is stopped or not. Lastly the TickerScheduleTriggerEngine thread now only starts on data nodes.	2018-05-03 09:47:12 +02:00
Zachary Tong	3c2d2a7d4a	Fix NPE when CumulativeSum agg encounters null/empty bucket (#29641 ) Fix NPE when CumulativeSum agg encounters null/empty bucket If the cusum agg encounters a null value, it's because the value is missing (like the first value from a derivative agg), the path is not valid, or the bucket in the path was empty. Previously cusum would just explode on the null, but this changes it so we only increment the sum if the value is non-null and finite. This is safe because even if the cusum encounters all null or empty buckets, the cumulative sum is still zero (like how the sum agg returns zero even if all the docs were missing values) I went ahead and tweaked AggregatorTestCase to allow testing pipelines, so that I could delete the IT test and reimplement it as AggTests. Closes #27544	2018-05-02 12:22:55 -07:00
Ryan Ernst	fb0aa562a5	Network: Remove http.enabled setting (#29601 ) This commit removes the http.enabled setting. While all real nodes (started with bin/elasticsearch) will always have an http binding, there are many tests that rely on the quickness of not actually needing to bind to 2 ports. For this case, the MockHttpTransport.TestPlugin provides a dummy http transport implementation which is used by default in ESIntegTestCase. closes #12792	2018-05-02 11:42:05 -07:00
Ryan Ernst	fba2f00a73	Packaging: Unmark systemd service file as a config file (#29004 ) Systemd overrides should happen through /etc/systemd/system, not directly editing the service file. This commit removes marking the service file as configuration for rpm and deb packages.	2018-05-02 09:48:49 -07:00
Ryan Ernst	62f2918abc	Added changelog entry for deb prerelease version change (#30184 ) This commit adds a changelog entry for the change in #29000.	2018-05-02 09:00:35 -07:00
Adrien Grand	7358946bda	Add a new `_ignored` meta field. (#29658 ) This adds a new `_ignored` meta field which indexes and stores fields that have been ignored at index time because of the `ignore_malformed` option. It makes malformed documents easier to identify by using `exists` or `term(s)` queries on the `_ignored` field. Closes #29494	2018-05-02 10:47:02 +02:00
Lisa Cawley	092dd6cb89	[DOCS] Removes X-Pack Elasticsearch release notes (#30272 )	2018-05-01 16:05:23 -07:00
Lisa Cawley	c5dc60718f	[DOCS] Fix 6.4-specific link in changelog (#30314 )	2018-05-01 13:20:19 -07:00
Nik Everett	0be443c5bb	REST Client: Add Request object flavored methods (#29623 ) Adds two new methods to `RestClient` that take a `Request` object. These methods will allows us to add more per-request customizable options without creating more and more and more overloads of the `performRequest` and `performRequestAsync` methods. These new methods look like: ``` Response performRequest(Request request) ``` and ``` void performRequestAsync(Request request, ResponseListener responseListener) ``` This change doesn't add any actual features but enables adding things like per request timeouts and per request node selectors. This change does rework the `HighLevelRestClient` and its tests to use these new `Request` objects and it does update the docs.	2018-05-01 14:31:23 -04:00
Lisa Cawley	5b5c98c96b	[DOCS] Adds changelog to Elasticsearch Reference (#30271 )	2018-05-01 10:34:26 -07:00
Jason Tedor	5de6f4ff7b	Adjust copy settings on resize BWC version This commit adjusts the BWC version for copy settings on resize operations after the behavior was backported to 6.x.	2018-05-01 08:49:16 -04:00
Jason Tedor	50535423ff	Allow copying source settings on resize operation (#30255 ) Today when an index is created from shrinking or splitting an existing index, the target index inherits almost none of the source index settings. This is surprising and a hassle for operators managing such indices. Given this is the default behavior, we can not simply change it. Instead, we start by introducing the ability to copy settings. This flag can be set on the REST API or on the transport layer and it has the behavior that it copies all settings from the source except non-copyable settings (a property of a setting introduced in this change). Additionally, settings on the request will always override. This change is the first step in our adventure: - this flag is added here in 7.0.0 and immediately deprecated - this flag will be backported to 6.4.0 and remain deprecated - then, we will remove the ability to set this flag to false in 7.0.0 - finally, in 8.0.0 we will remove this flag and the only behavior will be for settings to be copied	2018-05-01 08:48:19 -04:00
Paul Sanwald	e11070bcfa	Fix macros in changelog (#30269 ) remove comments for macros which caused macros not to work correctly	2018-04-30 14:09:32 -07:00
Jason Tedor	811f5b4efc	Do not ignore request analysis/similarity on resize (#30216 ) Today when a resize operation is performed, we copy the analysis, similarity, and sort settings from the source index. It is possible for the resize request to include additional index settings including analysis, similarity, and sort settings. We reject sort settings when validating the request. However, we silently ignore analysis and similarity settings on the request that are already set on the source index. Since it is possible to change the analysis and similarity settings on an existing index, this should be considered a bug and the sort of leniency that we abhor. This commit addresses this bug by allowing the request analysis/similarity settings to override the existing analysis/similarity settings on the target.	2018-04-30 07:31:36 -04:00
Julie Tibshirani	f5978d6d33	In the field capabilities API, remove support for providing fields in the request body. (#30185 )	2018-04-27 16:14:11 -07:00
Jason Tedor	4494565d8e	Bump changelog version to 6.4 (#30217 ) This commit bumps the changelog version to 6.4 as now that 6.3 is feature frozen there would be no additional entries in the changelog for 6.3.0.	2018-04-27 16:22:27 -04:00
Tanguy Leroux	63148dd9ba	Fail snapshot operations early on repository corruption (#30140 ) A NullPointerException is thrown when trying to create or delete a snapshot in a repository that has been written to by an older Elasticsearch after writing to it with a newer Elasticsearch version. This is because the way snapshots are formatted in the repository snapshots index file changed in #24477. This commit changes the parsing of the repository index file so that it now detects a corrupted index file and fails early the snapshot operation. closes #29052	2018-04-27 16:29:59 +02:00
Jason Tedor	2c3e71f116	Remove the suggest metric from stats APIs (#29635 ) This metric previously existed for backwards compatibility reasons although the suggest stats were folded into search stats. This metric was deprecated in 6.3.0 and this commit removes them for 7.0.0.	2018-04-24 19:03:48 -04:00
Jason Tedor	5d767e449a	Remove bulk fallback for write thread pool (#29609 ) The name of the bulk thread pool was renamed to "write" with "bulk" as a fallback name. This change was made in 6.x for BWC reasons yet in 7.0.0 we are removing this fallback. This commit removes this fallback for the write thread pool.	2018-04-19 16:59:58 -04:00
Jason Tedor	a28c0d2271	Remove extra spaces from changelog This commit removes to extra spaces at the end of lines in the changelog.	2018-04-19 15:10:19 -04:00
Paul Sanwald	3e7fccddaf	Add a CHANGELOG file for release notes. (#29450 ) * Add a CHANGELOG file for 7.x release notes. * update file to include 6.x * remove confusing comment and small edit to section title * moving CHANGELOG file under docs directory, as it pertains to release notes.	2018-04-18 07:42:05 -07:00

40 Commits