OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jason Tedor	e467e67fd4	Enhance license detection for various licenses (#31198 ) This commit enhances the license detection that we have for various licenses. Here we improve the detection for all licenses (especially the Apache 2.0 License), the BSD 2-clause license, the MIT (with attribution) license, and we add detection for the BSD 3-clause license. One way that we achieved this improvement is by changing how the license files are read so that rather than reading them as a multi-line string which ended up represented as "[line1, line2, line3, ...]" internally, we read the full bytes of the license text and replace all whitespace with a single space so the license text is now loaded as "line1 line2 line3". For the MIT license we add the actual license text and remove the "MIT" string as not all copies of the license clearly indicate that the text is the MIT license. We take a similar strategy for the BSD-2 and BSD-3 clause licenses. With this change, we reduce the number of "custom" licenses in the codebase from 31 to 2. The two remaining appear to be truly custom licenses, not carrying licenses identifiable by SPDX. A follow-up will address "unknown" licenses.	2018-06-08 08:55:10 -04:00
David Turner	8d4f09f7f2	[DOCS] Add note about long-lived idle connections (#30990 ) Clarify that we expect to have idle inter-node connections within the cluster, and that the network needs to be configured not to disrupt these.	2018-06-08 13:36:19 +01:00
Martijn van Groningen	07a57cc131	Move number of language analyzers to analysis-common module (#31143 ) The following analyzers were moved from server module to analysis-common module: `snowball`, `arabic`, `armenian`, `basque`, `bengali`, `brazilian`, `bulgarian`, `catalan`, `chinese`, `cjk`, `czech`, `danish`, `dutch`, `english`, `finnish`, `french`, `galician` and `german`. Relates to #23658	2018-06-08 08:58:46 +02:00
Simon Willnauer	435a825a53	Default max concurrent search req. numNodes * 5 (#31171 ) We moved to 1 shard by default which caused some issues in how many concurrent shard requests we allow by default. For instance searching a 5 shard index on a single node will now be executed serially per shard while we want these cases to have a good concurrency out of the box. This change moves to `numNodes * 5` which corresponds to the default we used to have in the previous version. Relates to #30783 Closes #30994	2018-06-08 08:33:01 +02:00
Hendrik Muhs	253b998681	flush job to ensure all results have been written (#31187 ) flush ml job to ensure all results have been written fixes #31173	2018-06-08 07:51:45 +02:00
Jack Conradson	d6a4c14e1b	Painless: Restructure/Clean Up of Spec Documentation (#31013 ) Full restructure of the spec into new sections for operators, statements, scripts, functions, lambdas, and regexes. Split of operators into 6 sections, a table, reference, array, numeric, boolean, and general. Clean up of all operators sections. Sporadic clean up else where.	2018-06-07 17:11:56 -07:00
Igor Motov	972dcbc0ad	Update ignore_unmapped serialization after backport Update the serialization version of ignore_unmapped flag after backport to 6.4 Relates #31153	2018-06-07 17:44:12 -04:00
Jason Tedor	d49c85d2e8	Add back dropped substitution on merge This was dropped accidentally during merge conflict resolution. This commit adds back the substitution for elasticsearch-cli.	2018-06-07 17:40:47 -04:00
Paul Sanwald	e82e5cc2e8	high level REST api: cancel task (#30745 ) * Initial commit of rest high level exposure of cancel task * fix javadocs * address some code review comments * update branch to use tasks namespace instead of cluster * High-level client: list tasks failure to not lose nodeId This commit reworks testing for `ListTasksResponse` so that random fields insertion can be tested and xcontent equivalence can be checked too. Proper exclusions need to be configured, and failures need to be tested separately. This helped finding a little problem, whenever there is a node failure returned, the nodeId was lost as it was never printed out as part of the exception toXContent. * added comment * merge from master * re-work CancelTasksResponseTests to separate XContent failure cases from non-failure cases * remove duplication of logic in parser creation * code review changes * refactor TasksClient to support RequestOptions * add tests for parent task id * address final PR review comments, mostly formatting and such	2018-06-07 14:02:23 -07:00
Jason Tedor	e481b860a1	Enable engine factory to be pluggable (#31183 ) This commit enables the engine factory to be pluggable based on index settings used when creating the index service for an index.	2018-06-07 17:01:06 -04:00
Jason Tedor	d8c0a39c15	Remove vestiges of animal sniffer (#31178 ) We no longer need animal sniffer because we use JDK functionality (introduced in JDK 9) to target older versions of the JDK for compilation. This functionality means that the JDK handles the problem of ensuring that we do not use JDK APIs from the version that we are compiling from that are not available in the version that we are compiling to. A previous commit removed this for the REST client (where we target JDK 7) but a few traces were left behind.	2018-06-07 17:00:22 -04:00
Jason Tedor	5296c11e4f	Rename elasticsearch-nio to nio (#31186 ) This commit renames :libs:elasticsearch-nio to :libs:nio.	2018-06-07 17:00:00 -04:00
Jason Tedor	94be9b471f	Rename elasticsearch-core to core (#31185 ) This commit renames :libs:elasticsearch-core to :libs:core.	2018-06-07 16:50:21 -04:00
Jason Tedor	b32cbc1baa	Move cli sub-project out of server to libs (#31184 ) This commit moves the cli sub-project out of server to libs where it makes more sense.	2018-06-07 16:35:34 -04:00
lcawl	5dc9e87bad	[DOCS] Fixes broken link in auditing settings	2018-06-07 10:49:22 -07:00
Nik Everett	dfcc939ef8	QA: Better seed nodes for rolling restart Use all running nodes as unicast seeds in the rolling restart tests to avoid a race between pinging and the tests. Without this if the tests are too fast then when a new node comes up and pings its single configured seed node that node might not have a ping from the other running node.	2018-06-07 13:30:37 -04:00
lcawl	1de38a2488	[DOCS] Moves ML content to stack-docs	2018-06-07 09:26:00 -07:00
Lisa Cawley	d0f35d204e	[DOCS] Clarifies recommendation for audit index output type (#31146 )	2018-06-07 08:55:14 -07:00
Tim Brooks	237f9b8930	Add nio-transport as option for http smoke tests (#31162 ) This is related to #27260 and #28898. This commit adds the transport-nio plugin as a random option when running the http smoke tests. As part of this PR, I identified an issue where cors support was not properly enabled causing these tests to fail when using transport-nio. This commit also fixes that issue.	2018-06-07 09:46:36 -06:00
Nik Everett	56207ea43d	QA: Set better node names on rolling restart tests These should help with debugging failures.	2018-06-07 11:25:41 -04:00
Igor Motov	7a9d9b0abf	Add support for ignore_unmapped to geo sort (#31153 ) Adds support for `ignore_unmapped` parameter in geo distance sorting, which is functionally equivalent to specifying an `unmapped_type` in the field sort. Closes #28152	2018-06-07 11:11:13 -04:00
Christoph Büscher	c352ff1615	Share common parser in some AcknowledgedResponses (#31169 ) Several AcknowledgedResponse implementations only parse the boolean acknowledged flag and then create an instance of their class using that flag. This can be simplified by adding this basic parser to the superclass, provide a common helper method and call the appropriate ctor in the fromXContent methods.	2018-06-07 13:52:10 +02:00
Jim Ferenczi	280a2f55d6	Fix random failure on SearchQueryIT#testTermExpansionExceptionOnSpanFailure This change moves an integration test that relies on setting the value of a static variable (boolean max clause count) to an unit test where we are sure that the same jvm is used to access the static variable.	2018-06-07 13:43:17 +02:00
David Turner	6ad7217656	Remove reference to multiple fields with one name (#31127 ) If there is only one type per index then each field's name is unique.	2018-06-07 12:38:57 +01:00
Tanguy Leroux	b5f05f676c	Remove BlobContainer.move() method (#31100 ) closes #30680	2018-06-07 10:48:31 +02:00
Rafał Bigaj	749d39061a	[Docs] Correct minor typos in templates.asciidoc (#31167 )	2018-06-07 10:44:57 +02:00
Adrien Grand	458bca11bc	Add a `feature_vector` field. (#31102 ) This field is similar to the `feature` field but is better suited to index sparse feature vectors. A use-case for this field could be to record topics associated with every documents alongside a metric that quantifies how well the topic is connected to this document, and then boost queries based on the topics that the logged user is interested in. Relates #27552	2018-06-07 10:05:37 +02:00
Nirmal Chidambaram	75a676c70b	Fail `span_multi` queries that exceeds boolean max clause limit (#30913 ) By default span_multi query will limit term expansions = boolean max clause. This will limit high heap usage in case of high cardinality term expansions. This applies only if top_terms_N is not used in inner multi query.	2018-06-07 09:34:39 +02:00
Jim Ferenczi	b30aa3137d	Reject long regex in query_string (#31136 ) This change applies the existing `index.max_regex_length` to regex queries produced by the `query_string` query. Relates #28344	2018-06-07 09:29:26 +02:00
Jason Tedor	8be1361579	Adjust indentation in CLI scripts This commit adjusts the indentation in the CLI scripts to give a clear visual indication that the line being indented is a continuation of the previous line.	2018-06-06 22:52:50 -04:00
Tim Vernum	bd3aabac97	[TEST] Make SSL restrictions update atomic (#31050 ) SSLTrustRestrictionsTests updates the restrictions YML file during the test run to change the set of restrictions. This update was small, but it wasn't atomic. If the yml file is reloaded while empty or invalid, then it causes all SSL certificates to be considered invalid (until it is reloaded again), which could break the sniffing/administrative client that runs underneath the tests.	2018-06-07 12:03:19 +10:00
Jason Tedor	01b5a46c24	Pass main class by environment variable on Windows (#31156 ) A previous refactoring of the CLI scripts migrated all of the CLI tools to shell to a common script, elasticsearch-cli. This approach is fine in Bash where it is easy to tear arguments apart but it doesn't work so well on Windows where quoting is insane. To avoid having to tear the arguments apart to separate the first argument to elasticsearch-cli from the remaining arguments, we instead choose a strategy where we can avoid tearing the arguments apart. To do this, we will instead pass the main class by an environment variable and then we can pass the arguments straight through. This will let us avoid awful quoting issues on Windows. This is the Windows side of that effort and the Bash side was in a previous commit.	2018-06-06 21:57:58 -04:00
Jason Tedor	95795c8935	Pass main class by environment variable (#31149 ) A previous refactoring of the CLI scripts migrated all of the CLI tools to shell to a common script, elasticsearch-cli. This approach is fine in Bash where it is easy to tear arguments apart but it doesn't work so well on Windows where quoting is insane. To avoid having to tear the arguments apart to separate the first argument to elasticsearch-cli from the remaining arguments, we instead choose a strategy where we can avoid tearing the arguments apart. To do this, we will instead pass the main class by an environment variable and then we can pass the arguments straight through. This will let us avoid awful quoting issues on Windows. This is the non-Windows side of that effort and the Windows side will be in a follow-up.	2018-06-06 21:56:52 -04:00
Lisa Cawley	7f0c2e89c2	[DOCS] Moves X-Pack setup to docs (#31145 )	2018-06-06 14:46:20 -07:00
Tim Brooks	4158387554	Cleanup nio http thread names (#31148 ) This is related to #28898. This commit adds the acceptor thread name to the method checking if this thread is a transport thread. Additionally, it modifies the nio http transport to use the same worker name as the netty4 http server transport.	2018-06-06 15:36:13 -06:00
Luca Cavanna	be4a101ea1	Add high-level client methods that accept RequestOptions (#31069 ) With #30490 we have introduced a new way to provide request options whenever sending a request using the high-level REST client. Before you could provide headers as the last argument varargs of each API method, now you can provide `RequestOptions` that in the future will allow to provide more options which can be specified per request. This commit deprecates all of the client methods that accept a `Header` varargs argument in favour of new methods that accept `RequestOptions` instead. For some API we don't even go through deprecation given that they were not released since they were added, hence in that case we can just move them to the new method.	2018-06-06 23:17:45 +02:00
Lisa Cawley	68827fc046	[DOCS] Enables testing for monitoring examples (#31119 )	2018-06-06 13:25:36 -07:00
Lisa Cawley	b4514d3cc1	[DOCS] Moves ML node info to docs (#31142 )	2018-06-06 12:39:24 -07:00
Tim Brooks	67e73b4df4	Combine accepting selector and socket selector (#31115 ) This is related to #27260. This commit combines the AcceptingSelector and SocketSelector classes into a single NioSelector. This change allows the same selector to handle both server and socket channels. This is valuable as we do not necessarily want a dedicated thread running for accepting channels. With this change, this commit removes the configuration for dedicated accepting selectors for the normal transport class. The accepting workload for new node connections is likely low, meaning that there is no need to dedicate a thread to this process.	2018-06-06 11:59:54 -06:00
Nik Everett	dc4bb62a78	QA: Remove mistaken timeout I pushed a test that `assertBusy`s for a whole hour accidentally. I was testing something and forgot to revert my local hack but caught it on backport. This removes it.	2018-06-06 13:51:54 -04:00
Lisa Cawley	45537c59e5	[DOCS] Moves X-Pack settings to docs folder (#31120 )	2018-06-06 10:05:32 -07:00
Nik Everett	7c59e7690e	QA: Switch xpack rolling upgrades to three nodes (#31112 ) This is much more realistic and can find more issues. This causes the "mixed cluster" tests to be run twice so I had to fix the tests to work in that case. In most cases I did as little as possible to get them working but in a few cases I went a little beyond that to make them easier for me to debug while getting them to work. My test changes: 1. Remove the "basic indexing" tests and replace them with a copy of the tests used in the OSS. We have no way of sharing code between these two projects so for now I copy. 2. Skip the a few tests in the "one third" upgraded scenario: * creating a scroll to be reused when the cluster is fully upgraded * creating some ml data to be used when the cluster is fully ugpraded 3. Drop many "assert yellow and that the cluster has two nodes" assertions. These assertions duplicate those made by the wait condition and they fail now that we have three nodes. 4. Switch many "assert green and that the cluster has two nodes" to 3 nodes. These assertions are unique from the wait condition and, while I imagine they aren't required in all cases, now is not the time to find that out. Thus, I made them work. 5. Rework the index audit trail test so it is more obvious that it is the same test expecting different numbers based on the shape of the cluster. The conditions for which number are expected are fairly complex because the index audit trail is shut down until the template for it is upgraded and the template is upgraded when a master node is elected that has the new version of the software. 6. Add some more information to debug the index audit trail test because it helped me figure out what was going on. I also dropped the `waitCondition` from the `rolling-upgrade-basic` tests because it wasn't needed. Closes #25336	2018-06-06 11:59:16 -04:00
Lisa Cawley	6fd4eb52b8	[DOCS] Moves commands to docs folder (#31114 )	2018-06-06 07:49:15 -07:00
Adrien Grand	e9fe371e41	Give the engine the whole index buffer size on init. (#31105 ) Currently the engine is initialized with a hardcoded 256MB of RAM. Elasticsearch may never use more than that for a given shard, `IndexingMemoryController` only has the power to flush segments to disk earlier in case multiple shards are actively indexing and use too much memory. While this amount of memory is enough for an index with few fields and larger RAM buffers are not expected to improve indexing speed, this might actually be little for an index that has many fields. Kudos to @bleskes for finding it out when looking into a user who was reporting a much slower indexing speed when upgrading from 2.x to 5.6 with an index that has about 20,000 fields.	2018-06-06 16:46:11 +02:00
Yannick Welsch	1dca00deb9	Remove extra checks from HdfsBlobContainer (#31126 ) This commit saves one network roundtrip when reading or deleting files from an HDFS repository.	2018-06-06 16:38:37 +02:00
Yannick Welsch	515a23360d	Do not check for S3 blob to exist before writing (#31128 ) In #19749 an extra check was added before writing each blob to ensure that we would not be overriding an existing blob. Due to S3's weak consistency model, this check was best effort. To make matters worse, however, this resulted in a HEAD request to be done before every PUT, in particular also when PUTTING a new object. The approach taken in #19749 worsened our consistency guarantees for follow-up snapshot actions, as it made it less likely for new files that had been written to be available for reads. This commit therefore removes this extra check. Due to the weak consistency model, this check was a best effort thing anyway, and there's currently no way to prevent accidental overrides on S3.	2018-06-06 16:38:06 +02:00
Jay Modi	8aa58887e2	Security: make native realm usage stats accurate (#30824 ) The native realm's usage stats were previously pulled from the cache, which only contains the number of users that had authenticated in the past 20 minutes. This commit changes this so that we pull the current value from the security index by executing a search request. In order to support this, the usage stats for realms is now asynchronous so that we do not block while waiting on the search to complete.	2018-06-06 08:18:56 -06:00
Luca Cavanna	f4a412fe21	Remove RestGetAllMappingsAction (#31129 ) We currently have a specific REST action to retrieve all indices and types mappings, which used internally the get index API. This doesn't seem to be required anymore though as the existing RestGetMappingAction could as well take the requests with no indices and types specified. This commit removes the RestGetAllMappingsAction in favour of using RestGetMappingAction also for requests that don't specify indices nor types.	2018-06-06 16:13:02 +02:00
Yannick Welsch	a9af5ca638	[TEST] Reenable UnicastZenPingTests.testSimplePings	2018-06-06 14:33:17 +02:00
David Kyle	3767bdc98d	[ML][DOCS] Add example of top N derivative aggregation (#31109 ) Add example of top N derivative aggregation to the ML datafeed docs	2018-06-06 13:21:16 +01:00

1 2 3 4 5 ...

39393 Commits All Branches Search

39393 Commits

All Branches