OpenSearch

Commit Graph

Author	SHA1	Message	Date
Boaz Leskes	7beaaaef62	Discovery: publishing timeout to log at WARN and indicate pending nodes When the master publishes a new cluster state it waits (by default) for up to 30s for all nodes to respond. If they do not, it continues to process other pending tasks. At the moment, this timeout is logged under DEBUG but it typically represents a serious issue with one or more of the nodes. We should log it in WARN and give the nodes that failed to respond in a timely fashion Closes #9551	2015-02-04 16:39:01 +01:00
Adrien Grand	8b76cd76f9	Internal: Avoid unnecessary utf8 conversion when creating ScriptDocValues for a string field. This regression was introduced in #6908: the conversion from RandomAccessOrds to SortedBinaryDocValues goes through Strings while both impls actually work on BytesRef, so the SortedBinaryDocValues instance could directly return the BytesRefs returned by the RandomAccessOrds. Close #9306	2015-02-04 09:53:34 +01:00
javanna	74c7b5a197	Internal: add AliasesRequest interface to mark requests that manage aliases We currently have the IndicesRequest interface to mark indices related requests and be able to retrieve the indices they relate to in a generic way. This commit introduces a similar abstraction for requests that manage aliases, to be able to retrieve/replace the aliases they relate to. Also, IndicesAliasesRequest becomes a CompositeIndicesRequest, as it allows to perform multiple operations (e.g. add/remove multiple aliases). Each single operation (AliasActions) implements now the newly introduced AliasesRequest. AliasesRequest is also implemented by GetAliasesRequest, which allows to retrieve aliases information. Closes #9460	2015-02-04 07:59:33 +01:00
Boaz Leskes	896e8657ea	Discovery: check index uuid when merging incoming cluster state into local In big deployments ClusterState can be large. To make sure we keep reusing objects that were promoted to the Old Gen, ZenDiscovery has an optimization where it tries to reuse existing IndexMetaData objects (containing among other things the mappings) from the current cluster state if they didn't change. The comparison currently uses the index name and the metadata version. This is however not enough and we should also check the index uuid. In extreme cases, where cluster state processing is slow and the index in question is deleted and recreated and these operations are batch processed together, we can use the wrong meta data if the version is also identical. This can happen if people create the index with all meta data predefined and no settings were changed. Closes #9489 Closes #9541	2015-02-03 21:36:05 +01:00
Adrien Grand	6cdde31e64	Search: Reuse Lucene's MultiCollector. We could reuse Lucene's MultiCollector instead of implementing our own. Close #9549	2015-02-03 18:12:15 +01:00
Adrien Grand	13b64cc362	Aggs: Make the nested aggregation call sub aggregators with doc IDs in order. Close #9547	2015-02-03 16:51:36 +01:00
javanna	ebb7ecb00e	[TEST] RestClient to use a non static pooling connection manager When closing an instance of RestClient, the connection manager gets shutdown, which makes it not usable anymore. If that is static, like it is now, no RestClient will work anymore from that moment on. Each instance of RestClient should have its own instance of connection manager	2015-02-03 16:46:54 +01:00
Adrien Grand	8540a863aa	Search: Avoid calling DocIdSets.toSafeBits. This method is heavy as it builds a bitset out of a DocIdSet in order to be able to provide random-access. Now that Lucene has removed out-of-order scoring, true random-access is very rarely needed and we could instead return a Bits instance that wraps the iterator. Ideally, we would use the DISI API directly but I have to admit that the Bits API is more friendly. Close #9546	2015-02-03 16:16:19 +01:00
javanna	e5b174ff77	[TEST] Move SimpleNettyTransportTests to expected exception Replaced try catch with expected exception, since no additional check was done on the exception thrown.	2015-02-03 15:51:51 +01:00
javanna	338766fd4d	[TEST] Remove needless ClusterScope annotation from NettyTransportMultiPortTests NettyTransportMultiPortTests is not an integration test, it doesn't rely on the test cluster thus the ClusterScope annotation doesn't have any effect.	2015-02-03 15:51:44 +01:00
javanna	0e67dda15d	[TEST] Make sure that match assertion throws error if run against an object We had a REST test that relied on matching a json response against a regex. It worked but the match wasn't done against the actual json object, but its java map representation converted into a string by calling `toString`. Since all other clients test runners don't work in this case, as they try to match a json object against a regex, we should do the same and prevent it from working.	2015-02-03 10:18:18 +01:00
javanna	dfe67da013	[TEST] support stashed values within property names in our REST tests Closes #9533	2015-02-03 10:17:50 +01:00
Boaz Leskes	4342237acf	Test: reduce load in RecoveryWhileUnderLoadTests	2015-02-03 09:32:42 +01:00
Robert Muir	027730006b	core: add 'checksum' option for index.shard.check_on_startup The current "checkindex" on startup is very very expensive. This is like running one of the old school hard drive diagnostic checkers and usually not a good idea. But we can do a CRC32 verification of files. We don't even need to open an indexreader to do this, it's much more lightweight. This option (as well as the existing true/false) is randomized in tests to find problems. Also fix bug where use of the current option would always leak an indexwriter lock. Closes #9183	2015-02-03 00:10:08 -05:00
Ryan Ernst	6079d88d43	Mappings: Remove type prefix support from field names in queries This is the first part of #8872.	2015-02-02 13:10:56 -08:00
Lee Hinman	0f405e9710	Merge branch 'pr/8795'	2015-02-02 11:49:45 -07:00
Michael McCandless	e29cf903c8	Core: upgrade to Lucene snapshot r1656366 * IndexWriter deadlock and DV update concurrency fix * BytesRef reuse bug with SortedSetDVTermsEnum * Int overflow skip data corruption bug * Compound file API cleanups * IndexWriter doesn't accept per-doc Analyzer anymore Closes #9524	2015-02-02 13:37:45 -05:00
Christoph Büscher	44193e7ba5	Aggregations: Add 'offset' option to histogram aggregation Histogram aggregation supports an 'offset' option to move bucket boundaries. In a histogram with buckets of size X these can be moved from 0, X, 2X, 3X,... by an offset value of Y to Y, X+Y, 2X+Y, 3X+Y... by using the 'offset' option. The previous 'pre_offset' and 'post_offset' options are removed in favour of the simplified 'offset' option. Closes #9417 Closes #9505	2015-02-02 18:23:01 +01:00
Lee Hinman	6c27f1242a	Update Groovy dependency to 2.4.0	2015-02-02 09:27:41 -07:00
Clinton Gormley	988f35a7da	REST: Add skip "stash_in_path" feature to nodes.info/20_transport test Required for clients that don't yet do stash lookups when resolving fieldnames like "nodes.$master.transport"	2015-02-02 17:11:04 +01:00
Clinton Gormley	e7ac5f296e	REST-spec: Allow stashed values to be used in property names as well Fix the nodes.info/20_transport test to use the master node, rather than to rely on applying a regex to the whole $body	2015-02-02 16:12:41 +01:00
Alexander Reelsen	a55476bf70	Tests: Ensure no use of potentially resolving internal ips	2015-02-02 09:45:42 +01:00
Boaz Leskes	79c8621a47	Test: add trace logging to testNodeFailuresAreProcessedOnce	2015-02-02 09:32:53 +01:00
Alexander Reelsen	59f8c0951a	Netty Transport: Add profiles to transport infos Until now, there was no possibility to expose infos about configured transport profiles. This commit adds the ability to expose that information in the TransportInfo class. The channel as well as the netty pipeline handler now also contain the profile they were configured for, as this information cannot be extracted elsewhere. In addition, each profile now can set its own publish host and port, which might be needed in case of port forwarding or using docker. Closes #9134	2015-02-02 08:17:55 +01:00
Martijn van Groningen	3ce05b6919	inner hits: Fix bug that resolves parent docs properly as inner hit when inner hit is defined on has_parent query.	2015-02-01 22:29:21 +01:00
Martijn van Groningen	d038f372d4	cleanup: Move catching of IOException higher up the stack to reduce the number of try-catch clauses.	2015-02-01 22:27:00 +01:00
Lee Hinman	25f944009c	Remove unneeded null checks from IndicesClusterStateService	2015-02-01 12:13:57 -07:00
Simon Willnauer	42bb5deca2	Revert "[ENGINE] Fail engine if Lucene commit fails" This reverts commit `dda7242848`.	2015-01-31 23:48:34 +01:00
Simon Willnauer	dda7242848	[ENGINE] Fail engine if Lucene commit fails This is similar to refresh, if we fail to commit the data we have to fail the engine since in-ram data is likely discarded. Yet, it's still in translog and might be recoverable when the node is restarted but we have to treat the engine as failed.	2015-01-31 16:45:38 +01:00
Lee Hinman	9557625ae7	Disallow method pointer expressions in Groovy scripting	2015-01-30 15:55:19 -07:00
Lee Hinman	9fe84062a1	Add `beforeIndexAddedToCluster` callback This callback is executed only once, on the master node during an index's creation. An exception thrown during this listener will cancel the index creation. This also adds checks in `IndicesClusterStateService` for the indexService being null as well as if the `indicesService.createIndex` throws an exception on data nodes after an index has already been created.	2015-01-30 15:25:58 -07:00
Adrien Grand	b2010f788d	[TESTS] IndicesQueryCacheTests: Ensure that shards are searchable before starting to query them.	2015-01-30 23:22:27 +01:00
Boaz Leskes	eabc3cde98	Recovery: update access time of ongoing recoveries #8720 introduced a timeout mechanism for ongoing recoveries, based on a last access time variable. In the many iterations on that PR the update of the access time was lost. This adds it back, including a test that should have been there in the first place. Closes #9506	2015-01-30 21:06:28 +01:00
Adrien Grand	00d54fabb2	Search: Remove query-cache serialization optimization. The query-cache has an optimization to not deserialize the bytes at the shard level. However this is a bit fragile since it assumes that serialized streams can be concatenated (which is not the case with shared strings) and also does not update the QueryResult object that is held by the SearchContext. So you need to make sure to use the right one. With this change, the query cache just deserializes bytes into the QueryResult object from the context. Close #9500	2015-01-30 20:02:18 +01:00
Simon Willnauer	fb377d48bd	Remove dead code	2015-01-30 13:52:26 +01:00
Simon Willnauer	380fcd1d02	Reset MergePolicyProvider settings only if the value actually changed Due to some unreleased refactorings we lost the persistence of previously set values in MergePolicyProvider. This commit adds this back and adds a simple unittest. Closes #8890	2015-01-30 13:24:08 +01:00
Ryan Ernst	1ebc95ee28	Tests: Add type-unrestricted version of field mapper getter to SearchContext. This fixes an NPE when using TestSearchContext in SignificanceHeuristicTests.	2015-01-29 13:42:07 -08:00
Michael McCandless	ecc8b702d3	also remove force option from logger.trace	2015-01-29 16:18:21 -05:00
Clinton Gormley	eea22d7731	Docs: Fixed asciidoc error in snapshots.asciidoc	2015-01-29 20:57:12 +01:00
J Charitopoulos	be8d8d658c	Docs: minor syntax Closes #9481	2015-01-29 20:27:20 +01:00
Glen Smith	3d5fbfb997	Docs: Update pattern-replace-charfilter.asciidoc Remove invalid trailing comma from json Closes #9477	2015-01-29 20:24:08 +01:00
Ryan Ernst	4e0e5e7328	Aggs: Remove limitation on field access within aggs to the types provided in the search Currently, doing a field lookup within a terms agg will restrict the fields available to those within the types passed into the search request. However, when doing sub aggs within a children agg, the fields available should not be restricted to those of the search. This change makes the field lookup use the index level mapper service.	2015-01-29 10:49:38 -08:00
David Pilato	878e46d7f9	[Docs] fix missing space	2015-01-29 19:17:41 +01:00
Simon Willnauer	c0fa60eb26	Remove HandlesStreamInput/Output The optimization we do in the HandlesStreamInput / Output adds a lot of complexity with a rather unknown benefit. It tries to compress commonly used strings and write ids instead. This should rather be done on a lower level if at all necessary for the small message we send over the network.	2015-01-29 17:43:32 +01:00
Simon Willnauer	1d77c3af82	Fix compilation	2015-01-29 17:41:53 +01:00
Simon Willnauer	03f1fcc85e	[ENGINE] Remove dirty flag and force boolean for refresh Today we have a dirty flag indicating that a refresh must be executed. We also allow users to bypass this by setting a force=true boolean on the refresh request / command. All these flags are unneeded since the SearcherManager has all the information to do the right thing if it's dirty or not.	2015-01-29 17:30:00 +01:00
Simon Willnauer	b275e917b7	[CACHE] Use a smaller expected size when serializing query results BytesStreamOutput allows to pass the expected size but by default uses BigArrays.PAGE_SIZE_IN_BYTES which is 16k. A common cached result ie. a date histogram with 3 buckets is ~100byte so 16k might be very wasteful since we don't shrink to the actual size once we are done serializing. By passing 512 as the expected size we will resize the byte array in the stream slowly until we hit the page size and don't waste too much memory for small query results.	2015-01-29 17:27:08 +01:00
Britta Weber	0a07ce8916	core: disable auto gen id optimization This pr removes the optimization for auto generated ids. Previously, when ids were auto generated by elasticsearch then there was no check to see if a document with same id already existed and instead the new document was only appended. However, due to lucene improvements this optimization does not add much value. In addition, under rare circumstances it might cause duplicate documents: When an indexing request is retried (due to connection lost, node closed etc), then a flag 'canHaveDuplicates' is set to true for the indexing request that is sent a second time. This was to make sure that even when an indexing request for a document with autogenerated id comes in we do not have to update unless this flag is set and instead only append. However, it might happen that for a retry or for the replication the indexing request that has the canHaveDuplicates set to true (the retried request) arrives at the destination before the original request that does have it set false. In this case both requests add a document and we have a duplicated document. This commit adds a workaround: remove the optimization for auto generated ids and always update the document. The assumption is that this will not slow down indexing more than 10 percent, see: http://benchmarks.elasticsearch.org/ closes #8788 closes #9468	2015-01-29 16:26:04 +01:00
Oliver	e412dab63a	Docs: Fix sample query Closes #9472	2015-01-29 15:56:24 +01:00
Simon Willnauer	15a766084d	[CACHE] Use correct number of bytes in query cache accounting Today we use the length of the BytesReference which is misleading since the reference is paged such that the length != ramBytesUsed. This can lead to a way higher memory consumption than expected if query results are tiny since each query result requires at least 16kb. Yet, we should rethink this strategy for query results that are very small ie. less than 20% of the ramBytesUsed but this commit first tries to make the accounting correct.	2015-01-29 10:59:36 +01:00
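
Note on commit 44193e7ba5 (histogram 'offset'): the bucket shift it describes, from 0, X, 2X, ... to Y, X+Y, 2X+Y, ..., can be illustrated with a small sketch. This is only an illustration under the assumption that a bucket key is obtained by flooring the shifted value; `bucket_key` is a hypothetical helper written for this note, not code from the commit.

```python
import math


def bucket_key(value, interval, offset=0):
    """Return the lower boundary of the histogram bucket containing `value`.

    Hypothetical helper: assumes buckets of width `interval` whose
    boundaries sit at offset, interval + offset, 2 * interval + offset, ...
    """
    if interval <= 0:
        raise ValueError("interval must be positive")
    # Shift by the offset, snap down to a multiple of the interval,
    # then shift back so boundaries land on k * interval + offset.
    return math.floor((value - offset) / interval) * interval + offset


if __name__ == "__main__":
    # With interval X = 10 and offset Y = 3, boundaries are ..., -7, 3, 13, 23, ...
    for v in (2, 12, 13, 25):
        print(v, "->", bucket_key(v, 10, 3))
```

With no offset the helper falls back to the usual 0, X, 2X, ... boundaries, which matches the behaviour the commit message describes as the starting point.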


10769 Commits

All Branches