OpenSearch

Commit Graph

Author	SHA1	Message	Date
Josh Soref	d3e98278c3	Spelling: replace cachable with cacheable (#37047 )	2019-01-02 14:10:30 +01:00
Nhat Nguyen	7580d9d925	Make SourceToParse immutable (#36971 ) Today the routing of a SourceToParse is assigned in a separate step after the object is created. We can easily forget to set the routing. With this commit, the routing must be provided in the constructor of SourceToParse. Relates #36921	2018-12-24 14:06:50 -05:00
Jason Tedor	1f574bd17a	Package ingest-user-agent as a module (#36956 ) This commit moves ingest-user-agent from being a plugin to being a module that is packaged with Elasticsearch distributions.	2018-12-22 20:20:53 -05:00
Jason Tedor	e1717df0ac	Package ingest-geoip as a module (#36898 ) This commit moves ingest-geoip from being a plugin to being a module that is packaged with Elasticsearch distributions.	2018-12-22 07:21:49 -05:00
Jack Conradson	c13a7bc04a	[Painless] Add String Casting Tests (#36945 ) This adds additional standard casting tests for String as the original type. This also cleans up the error messages in the String to char cast method.	2018-12-21 13:42:07 -08:00
Julie Tibshirani	fba710469a	Refactor the REST actions to clarify what endpoints are deprecated. (#36869 )	2018-12-20 18:06:41 -08:00
Michael Basnight	a64fea10e2	Enable IPv6 URIs in reindex from remote (#36874 ) Reindex from remote was using a custom regex to dermine what URIs were valid. This commit removes the custom regex and uses the java.net.URI class instead, allowing IPv6 support without changing the existing validation around a URI in reindex from remote.	2018-12-20 13:48:35 -06:00
Jack Conradson	be573ab5e7	[Painless] Casting Tests for Object and Number (#36804 ) This adds more casting tests with the original type as Object and then Number. Covers the entire set of possible numeric cases for these two types.	2018-12-20 09:42:33 -08:00
Andrey Ershov	ca92d74e7e	[Zen2] Change unsafe bootstrap nodes count to nodes list in tests (#36559 ) This commit modifies ESSingleNodeTestCase and ESIntegTestCase and several concrete test classes to use node names when bootstrapping the cluster. Today ClusterBootstrapService.INITIAL_MASTER_NODE_COUNT_SETTING setting is used to bootstrap clusters in tests. Instead, we want to use ClusterBootrstapService.INITIAL_MASTER_NODES_SETTING and get rid of the former setting eventually. There were two main problems when refactoring InternalTestCluster: 1. Nodes are created one-by-one in buildNode method. And node.name is created in this method as well. It's not suitable for bootstrapping, because we need to have the names of all master eligible nodes in advance, before creating the node with bootstrapping configuration set. We address this issue by separating buildNode into two methods: getNodeSettings and buildNode. We first iterate over all nodes to get nodes settings, then change the setting for the bootstrapping node and then proceed with building the node. 2. If autoManageMinMasterNodes = false, there is no way for the test to set the list of bootstrapping nodes because node names are not known in advance. This problem is solved by adding updateNodesSettings method to NodeConfigurationSource and ESIntegTestCase (which could be overridden by concrete integration test class). Once we have the list of settings for all nodes, the integration test class is allowed to update it. In our case, we update the ClusterBootrstapService.INITIAL_MASTER_NODES_SETTING setting.	2018-12-20 15:20:33 +01:00
Alan Woodward	344917efab	Add script filter to intervals (#36776 ) This commit adds the ability to filter out intervals based on their start and end position, and internal gaps: ``` POST _search { "query": { "intervals" : { "my_text" : { "match" : { "query" : "hot porridge", "filter" : { "script" : { "source" : "interval.start > 10 && interval.end < 20 && interval.gaps == 0" } } } } } } } ```	2018-12-19 11:12:18 +00:00
Alpar Torok	e9ef5bdce8	Converting randomized testing to create a separate unitTest task instead of replacing the builtin test task (#36311 ) - Create a separate unitTest task instead of Gradle's built in - convert all configuration to use the new task - the built in task is now disabled	2018-12-19 08:25:20 +02:00
Jack Conradson	7de85f55e3	[Painless] Add tests for boxed return types (#36747 ) Adds tests for each primitive/boxed and def type to be implicitly cast to an appropriate boxed return type from a method.	2018-12-18 10:14:48 -08:00
Mayya Sharipova	f884b2b1cd	Deprecate types in index API (#36575 ) * Deprecate types in index API - deprecate type-based constructors of IndexRequest - update tests to use typeless IndexRequest constructors - no yaml tests as they have been already added in #35790 Relates to #35190	2018-12-18 08:53:49 -05:00
Alan Woodward	af57575838	Allow word_delimiter_graph_filter to not adjust internal offsets (#36699 ) This commit adds an adjust_offsets parameter to the word_delimiter_graph token filter, defaulting to true. Most of the time you'd want sub-tokens emitted by this filter to have offsets that are adjusted to their real position in the token stream; however, some token filters can change the length or starting position of a token (eg trim) without changing their offset attributes, and this can lead to word_delimiter_graph emitting illegal offsets. Setting adjust_offsets to false in these cases will allow indexing again. Fixes #34741, #33710	2018-12-18 13:20:51 +00:00
Christoph Büscher	2f5300e3a6	Deprecate types in get_source and exist_source (#36426 ) This change adds a new untyped endpoint `{index}/_source/{id}` for both the GET and the HEAD methods to get the source of a document or check for its existance. It also adds deprecation warnings to RestGetSourceAction that emit a warning when the old deprecated "type" parameter is still used. Also updating documentation and tests where appropriate. Relates to #35190	2018-12-18 00:57:42 +01:00
Jake Landis	384757deff	ingest: support default pipelines + bulk upserts (#36618 ) This commit adds support to enable bulk upserts to use an index's default pipeline. Bulk upsert, doc_as_upsert, and script_as_upsert are all supported. However, bulk script_as_upsert has slightly surprising behavior since the pipeline is executed _before_ the script is evaluated. This means that the pipeline only has access the data found in the upsert field of the script_as_upsert. The non-bulk script_as_upsert (existing behavior) runs the pipeline _after_ the script is executed. This commit does _not_ attempt to consolidate the bulk and non-bulk behavior for script_as_upsert. This commit also adds additional testing for the non-bulk behavior, which remains unchanged with this commit. fixes #36219	2018-12-17 16:25:11 -06:00
Jake Landis	7bf822bbbb	ingest: fix on_failure with Drop processor (#36686 ) This commit allows a document to be dropped when a Drop processor is used in the on_failure fork of the processor chain. Fixes #36151	2018-12-17 14:10:13 -06:00
Jack Conradson	a0e7e571e4	[Painless] Add boxed type to boxed type casts for method/return (#36571 ) This adds implicit boxed type to boxed types casts for non-def types to create asymmetric casting relative to the def type when calling methods or returning values. This means that a user calling a method taking an Integer can call it with a Byte, Short, etc. legally which matches the way def works. This creates consistency in the casting model that did not previously exist.	2018-12-17 10:50:19 -08:00
Boaz Leskes	e356b8cb95	Add doc's sequence number + primary term to GetResult and use it for updates (#36680 ) This commit adds the last sequence number and primary term of the last operation that have modified a document to `GetResult` and uses it to power the Update API. Relates #36148 Relates #10708	2018-12-17 15:22:13 +01:00
Tim Brooks	3065300434	Unify transport settings naming (#36623 ) This commit updates our transport settings for 7.0. It generally takes a few approaches. First, for normal transport settings, it usestransport. instead of transport.tcp. Second, it uses transport.tcp, http.tcp, or network.tcp for all settings that are proxies for OS level socket settings. Third, it marks the network.tcp.connect_timeout setting for removal. Network service level settings are only settings that apply to both the http and transport modules. There is no connect timeout in http. Fourth, it moves all the transport settings to a single class TransportSettings similar to the HttpTransportSettings class. This commit does not actually remove any settings. It just adds the new renamed settings and adds todos for settings that will be deprecated.	2018-12-14 14:41:04 -07:00
Michael Basnight	dae422fb2b	Update joda compat methods to use compat class (#36654 ) The existing joda compat methods isEquals isAfter and isBefore all took in a ZonedDateTime, but since all of the scripting is now using the new JodaCompatZonedDateTime, these are changed to take that in instead.	2018-12-14 15:38:51 -06:00
Alan Woodward	c7ac9ef826	Upgrade to lucene snapshot 774e9aefbc (#36637 ) Includes LUCENE-8607: improvement to MatchAllDocsQuery	2018-12-14 20:30:07 +00:00
Luca Cavanna	8f04536a35	Add copy constructor to SearchRequest (#36641 ) For cross cluster search alternate execution mode (see #32125), we will need to take a search request that spans across multiple clusters (based on index prefixes e.g. cluster1:index, cluster2:index etc.) and split it into multiple search requests to be sent to each cluster. A copy constructor added to `SearchRequest` would make that easy and well maintainable in the future. Something along the same lines already happens in `BulkByScrollParallelizationHelper`, but the corresponding code went outdated as some new fields were added to `SearchRequest` which were not added to the bulk by scroll code. A copy constructor helps making the task of copying a search request maintainable over time.	2018-12-14 18:30:29 +01:00
Jeff Hajewski	f1f3b28f5c	Delete deprecated getValues from ScriptDocValues (#36183 ) * Adds deprecation logging to ScriptDocValues#getValues. First commit addressing issue #22919. `ScriptDocValues#getValues` was added for backwards compatibility but no longer needed. Scripts using the syntax `doc['foo'].values` when `doc['foo']` is a list should be using `doc['foo']` instead. * Fixes two build errors in #34279 * Removes unused import in ScriptDocValuesDatesTest * Removes used of `.values` in example in diversified-sampler-aggregation.asciidoc * Removes use of .values from painless test. Part of #34279 * Updates tests to use `doc[foo]` syntax rather than `doc[foo].values`. * Removes use of `getValues()` and replaces use of `doc[foo].values` with `doc[foo]`. * Indentation fix. * Remove unnecessary list construction at previous `getValues()` callsite in ScriptDocValues.GeoPoints. * Update migration doc and add link to `getValue` in ScriptDocValues javadoc. * Fix compile * Fix javadoc issue * Removes ScriptDocValues#getValues usage from painless whitelist.	2018-12-14 07:56:47 -05:00
Mayya Sharipova	b5d532f9e3	Vector field (#33022 ) 1. Dense vector PUT dindex { "mappings": { "_doc": { "properties": { "my_vector": { "type": "dense_vector" }, "my_text" : { "type" : "keyword" } } } } } PUT dinex/_doc/1 { "my_text" : "text1", "my_vector" : [ 0.5, 10, 6 ] } 2. Sparse vector PUT sindex { "mappings": { "_doc": { "properties": { "my_vector": { "type": "sparse_vector" }, "my_text" : { "type" : "keyword" } } } } } PUT sindex/_doc/1 { "my_text" : "text1", "my_vector" : {"1": 0.5, "99": -0.5, "5": 1} }	2018-12-12 21:20:53 -05:00
Alan Woodward	9ac7359643	Update lucene to snapshot-7e4555a2fd (#36563 ) Includes the following: * Reversion of doc-values changes in LUCENE-8374; we are interested in seeing if this has an effect on benchmarks for node-stats and index-stats * More improvements to docvalues updates	2018-12-12 20:18:32 +00:00
Julie Tibshirani	33152f648f	Fix some inconsistencies in the types deprecation code. (#36517 ) * Make sure to test conversion for both typed and typeless HLRC requests. * Update a few more statements to deprecatedAndMaybeLog. * Make sure Rest*SearchTemplateActionTests extend RestActionTestCase.	2018-12-12 10:38:02 -08:00
Tim Brooks	e63d52af63	Move page size constants to PageCacheRecycler (#36524 ) `PageCacheRecycler` is the class that creates and holds pages of arrays for various uses. `BigArrays` is just one user of these pages. This commit moves the constants that define the page sizes for the recycler to be on the recycler class.	2018-12-12 07:00:50 -07:00
David Turner	aa43e0b2cc	[Zen2] Migrate no-master-block integration tests (#36502 ) This change follows up on #36478 by migrating the affected integration tests to use Zen2.	2018-12-12 12:52:34 +00:00
Nhat Nguyen	3fb5a12b30	Upgrade to Lucene-8.0.0-snapshot-61e448666d (#36518 ) Includes: - LUCENE-8602: Share TermsEnum if possible while applying DV updates	2018-12-12 06:47:40 +01:00
Tim Brooks	797f985067	Add version to handshake requests (#36171 ) Currently our handshake requests do not include a version. This is unfortunate as we cannot rely on the stream version since it is not the sending node's version. Instead it is the minimum compatibility version. The handshake request is currently empty and we do nothing with it. This should allow us to add data to the request without breaking backwards compatibility. This commit adds the version to the handshake request. Additionally, it allows "future data" to be added to the request. This allows nodes to craft a version compatible response. And will properly handle additional data in future handshake requests. The proper handling of "future data" is useful as this is the only request where we do not know the other node's version. Finally, it renames the TcpTransportHandshaker to TransportHandshaker.	2018-12-11 16:09:28 -07:00
Mayya Sharipova	2f18325384	Deprecate types in update_by_query and delete_by_query (#36365 ) Relates to #35190	2018-12-11 17:09:59 -05:00
Jack Conradson	8e988f6c06	[Painless] Add def to boxed type casts (#36506 ) This adds casts for the def type to all standard boxed types. Prior to this certain casts such as def [long/Long] -> Double would fail which does not follow the goals of the Painless casting model to remove the need for explicit boxing. This also creates symmetry with the casts for the newly created bridge methods being called at run-time.	2018-12-11 14:06:38 -08:00
Tim Brooks	790f8102e9	Modify `BigArrays` to take name of circuit breaker (#36461 ) This commit modifies BigArrays to take a circuit breaker name and the circuit breaking service. The default instance of BigArrays that is passed around everywhere always uses the request breaker. At the network level, we want to be using the inflight request breaker. So this change will allow that. Additionally, as this change moves away from a single instance of BigArrays, the class is modified to not be a Releasable anymore. Releasing big arrays was always dispatching to the PageCacheRecycler, so this change makes the PageCacheRecycler the class that needs to be managed and torn-down. Finally, this commit closes #31435 be making the serialization of transport messages use the inflight request breaker. With this change, we no longer push the global BigArrays instnace to the network level.	2018-12-11 11:55:41 -07:00
Jack Conradson	13b1f19772	[Painless] Add extensive tests for def to primitive casts (#36455 ) This adds tests for each possible cast of def to a primitive type both implicit and explicit. This also fixes a minor bug where we were only checking the type of a def to be Number in some explicit casts. This does not work because it allows possible unintended casts from BigInteger and BigDecimal to primitive types since they both extend Number but are not included as part of the Painless casting model.	2018-12-11 08:03:08 -08:00
Julie Tibshirani	87831051dc	Deprecate types in explain requests. (#35611 ) The following updates were made: - Add a new untyped endpoint `{index}/_explain/{id}`. - Add deprecation warnings to RestAction, plus tests in RestActionTests. - For each REST yml test, make sure there is one version without types, and another legacy version that retains types (called *_with_types.yml). - Deprecate relevant methods on the Java HLRC requests/ responses. - Update documentation (for both the REST API and Java HLRC).	2018-12-10 19:45:13 -08:00
Ryan Ernst	a0da390df2	Scripting: Switch watcher to use joda bwc time objects (#35966 ) This commit converts the watcher execution context to use the joda compat java time objects. It also again removes the joda methods from the painless whitelist.	2018-12-10 17:29:25 -08:00
Nhat Nguyen	2a7edca59f	Upgrade to Lucene-8.0.0-snapshot-ef61b547b1 (#36450 ) Includes: - LUCENE-8598: Improve field updates packed values - LUCENE-8599: Use sparse bitset to store docs in SingleValueDocValuesFieldUpdates	2018-12-10 16:33:49 -05:00
Julie Tibshirani	b15d1aebcf	For msearch templates, make sure to use the right name for deprecation logging. (#36344 )	2018-12-07 14:50:47 -08:00
Julie Tibshirani	51e1d40dca	Small improvements related to types deprecation. (#36328 ) * Make sure to use deprecatedAndMaybeLog for types deprecation messages. * Introduce a common base class for Rest*Action tests.	2018-12-07 11:21:24 -08:00
Jack Conradson	2df4bd1f81	[Painless] Generate Bridge Methods (#36097 ) We use MethodHandles.asType to cast argument types into the appropriate parameter types for method calls when the target of the call is a def type at runtime. Currently, certain implicit casts using the def type are asymmetric. It is possible to cast Integer -> float as an argument to parameter, but not from int -> Float (boxed to primitive with upcasting is okay, but primitive to boxed with upcasting is not). This PR introduces a solution to the issue by generating bridge methods for all whitelisted methods that have at least a single boxed type as an argument. The bridge method will conduct appropriate casts and then call the original method. This adds a bit of overhead for correctness. It should not be used often as Painless avoids boxed types as much as possible. Note that a large portion of this change is adding methods to do the appropriate def to boxed type casts and a few mechanical changes as well. The most important method for review is generateBridgeMethod in PainlessLookupBuilder.	2018-12-07 09:32:27 -08:00
Nhat Nguyen	10feb75eb7	Upgrade to Lucene-8.0.0-snapshot-aaa64d70159 (#36335 ) Includes: LUCENE-8594: DV update are broken for updates on new field LUCENE-8590: Optimize DocValues update datastructures LUCENE-8593: Specialize single value numeric DV updates Relates #36286	2018-12-06 20:33:25 -05:00
Yannick Welsch	ee05ef1312	Merge branch 'zen2'	2018-12-06 08:31:46 +01:00
Jake Landis	190ac8e9bf	ingest: support default pipeline through an alias (#36231 ) This commit allows writes that go through an alias to use the default pipeline defined on the backing index. Fixes #35817	2018-12-05 16:25:50 -06:00
Yannick Welsch	a0ae1cc987	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 23:13:12 +01:00
Yannick Welsch	03d0ea91ef	Zen2: Rename tombstones to exclusions (#36226 ) Renames the withdrawal / tombstones APIs to voting configuration exclusions.	2018-12-05 23:12:28 +01:00
Andrey Ershov	5d6602120f	[Zen2] Hide not recovered state (#36224 ) This commit hides ClusterStates that have a STATE_NOT_RECOVERED_BLOCK from ClusterStateAppliers. This is needed, because some appliers, such as IngestService, rely on the fact, that cluster states with STATE_NOT_RECOVERED_BLOCK won't contain anything useful. Once the state is recovered it's fully available for the appliers. This commit also switches many of the remaining tests that require state persistence/recovery from Zen1 to Zen2.	2018-12-05 23:11:20 +01:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Yannick Welsch	b20497560c	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 14:06:38 +01:00
Alan Woodward	73ceaad03a	Update to lucene-8.0.0-snapshot-c78429a554 (#36212 ) Includes: * A fix for a bug in Intervals.or() (https://issues.apache.org/jira/browse/LUCENE-8586) * The ability to disable offset mangling in WordDelimiterGraphFilter (https://issues.apache.org/jira/browse/LUCENE-8509) * BM25Similarity no longer multiplies scores by k1 + 1	2018-12-05 12:43:56 +00:00

1 2 3 4 5 ...

5014 Commits