OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-24 17:09:48 +00:00

Author	SHA1	Message	Date
Alexander Reelsen	a55476bf70	Tests: Ensure no use of potentially resolving internal ips	2015-02-02 09:45:42 +01:00
Boaz Leskes	79c8621a47	Test: add trace logging to testNodeFailuresAreProcessedOnce	2015-02-02 09:32:53 +01:00
Alexander Reelsen	59f8c0951a	Netty Transport: Add profiles to transport infos Until now, there was no possibility to expose infos about configured transport profiles. This commit adds the ability to expose those information in the TransportInfo class. The channel was well as the netty pipeline handler now also contain the profile they were configured for, as this information cannot be extracted elsewhere. In addition, each profile now can set its own publish host and port, which might be needed in case of portforwarding or using docker. Closes #9134	2015-02-02 08:17:55 +01:00
Martijn van Groningen	3ce05b6919	inner hits: Fix bug that resolves parent docs properly as inner hit when inner hit is defined on has_parent query.	2015-02-01 22:29:21 +01:00
Martijn van Groningen	d038f372d4	cleanup: Move catching of IOException higher op the stack to reduce the number of try-catch clauses.	2015-02-01 22:27:00 +01:00
Lee Hinman	25f944009c	Remove unneeded null checks from IndicesClusterStateService	2015-02-01 12:13:57 -07:00
Simon Willnauer	42bb5deca2	Revert "[ENGINE] Fail engine if Lucene commit fails" This reverts commit dda72428484b4a5d79b6d9c97d56365851a79543.	2015-01-31 23:48:34 +01:00
Simon Willnauer	dda7242848	[ENGINE] Fail engine if Lucene commit fails This is similar to refresh, if we fail to commit the data we have to fail the engine since in-ram data is likely discarded. Yet, it's still in translog and might be recoverable when the node is restarted but we have to treat the engine as failed.	2015-01-31 16:45:38 +01:00
Lee Hinman	9557625ae7	Disallow method pointer expressions in Groovy scripting	2015-01-30 15:55:19 -07:00
Lee Hinman	9fe84062a1	Add `beforeIndexAddedToCluster` callback This callback is executed only once, on the master node during an index's creation. An exception thrown during this listener will cancel the index creation. This also adds checks in `IndicesClusterStateService` for the indexService being null as well as if the `indicesService.createIndex` throws an exception on data nodes after an index has already been created.	2015-01-30 15:25:58 -07:00
Adrien Grand	b2010f788d	[TESTS] IndicesQueryCacheTests: Ensure that shards are searchable before starting to query them.	2015-01-30 23:22:27 +01:00
Boaz Leskes	eabc3cde98	Recovery: update access time of ongoing recoveries #8720 introduced a timeout mechanism for ongoing recoveries, based on a last access time variable. In the many iterations on that PR the update of the access time was lost. This adds it back, including a test that should have been there in the first place. Closes #9506	2015-01-30 21:06:28 +01:00
Adrien Grand	00d54fabb2	Search: Remove query-cache serialization optimization. The query-cache has an optimization to not deserialize the bytes at the shard level. However this is a bit fragile since it assumes that serialized streams can be concatenanted (which is not the case with shared strings) and also does not update the QueryResult object that is held by the SearchContext. So you need to make sure to use the right one. With this change, the query cache just deserializes bytes into the QueryResult object from the context. Close #9500	2015-01-30 20:02:18 +01:00
Simon Willnauer	fb377d48bd	Remove dead code	2015-01-30 13:52:26 +01:00
Simon Willnauer	380fcd1d02	Reset MergePolicProvider settings only if the value actually changed Due to some unreleased refactorings we lost the persitence of a perviously set values in MergePolicyProvider. This commit adds this back and adds a simple unittest. Closes #8890	2015-01-30 13:24:08 +01:00
Ryan Ernst	1ebc95ee28	Tests: Add type-unrestricted version of field mapper getter to SearchContext. This fixes an NPE when using TestSearchContext in SignificanceHeuristicTests.	2015-01-29 13:42:07 -08:00
Michael McCandless	ecc8b702d3	also remove force option from logger.trace	2015-01-29 16:18:21 -05:00
Ryan Ernst	4e0e5e7328	Aggs: Remove limitation on field access within aggs to the types provided in the search Currently, doing a field lookup within a terms agg will restrict the fields available to those within the types passed into the search request. However, when doing sub aggs within a children agg, the fields available should not be restricted to those of the search. This change makes the field lookup use the index level mapper service.	2015-01-29 10:49:38 -08:00
Simon Willnauer	c0fa60eb26	Remove HandlesStreamInput/Output The optimization we do in the HandlesStreamInput / Output adds a lot of complexity with a rather unknown benefit. It tries to compress commonly used strings and write ids instead. This should rather be done on a lower level if at all necessary for the small message we send over the network.	2015-01-29 17:43:32 +01:00
Simon Willnauer	1d77c3af82	Fix compilation	2015-01-29 17:41:53 +01:00
Simon Willnauer	03f1fcc85e	[ENGINE] Remove dirty flag and force boolean for refresh Today we have a dirty flag indicating that a refresh must be executed. We also allow users to bypass this by setting a force=true boolean on the refresh request / command. All these flags are unneeded since the SearcherManager has all the information to do the right thing if it's dirty or not.	2015-01-29 17:30:00 +01:00
Simon Willnauer	b275e917b7	[CACHE] Use a smaller expected size when serializing query results BytesStreamOutput allows to pass the expected size but by default uses BigArrays.PAGE_SIZE_IN_BYTES which is 16k. A common cached result ie. a date histogram with 3 buckets is ~100byte so 16k might be very wasteful since we don't shrink to the actual size once we are done serializing. By passing 512 as the expected size we will resize the byte array in the stream slowly until we hit the page size and don't waste too much memory for small query results.	2015-01-29 17:27:08 +01:00
Britta Weber	0a07ce8916	core: disable auto gen id optimization This pr removes the optimization for auto generated ids. Previously, when ids were auto generated by elasticsearch then there was no check to see if a document with same id already existed and instead the new document was only appended. However, due to lucene improvements this optimization does not add much value. In addition, under rare circumstances it might cause duplicate documents: When an indexing request is retried (due to connect lost, node closed etc), then a flag 'canHaveDuplicates' is set to true for the indexing request that is send a second time. This was to make sure that even when an indexing request for a document with autogenerated id comes in we do not have to update unless this flag is set and instead only append. However, it might happen that for a retry or for the replication the indexing request that has the canHaveDuplicates set to true (the retried request) arrives at the destination before the original request that does have it set false. In this case both request add a document and we have a duplicated a document. This commit adds a workaround: remove the optimization for auto generated ids and always update the document. The asumtion is that this will not slow down indexing more than 10 percent, see: http://benchmarks.elasticsearch.org/ closes #8788 closes #9468	2015-01-29 16:26:04 +01:00
Simon Willnauer	15a766084d	[CACHE] Use correct number of bytes in query cache accounting today we use the length of the BytesReference which is misleading since the reference is paged such that the length != ramBytesUsed. This can lead to a way higher memory consuption than expected if query results are tiny since each query result requires at least 16kb. Yet, we should rethink this strategy for query results that are very small ie. less than 20% of the ramBytesUsed but this commit first tries to make the acocunting correct.	2015-01-29 10:59:36 +01:00
Simon Willnauer	4917121de2	Remove Unused code and remove unnecessary abstraction HashedBytesArray is not used anymore and Releable makes only sense on Paged implementation such that the marker interface is unneeded.	2015-01-29 09:51:14 +01:00
Lee Hinman	86e52c30a1	Make `script.groovy.sandbox.method_blacklist_patch` truly append-only Additionally, this setting can be specified in elasticsearch.yml if desired, to pre-populate the list of methods to be added to the default blacklist. When making a change to this setting dynamically, the entire blacklist is logged as well.	2015-01-28 17:09:27 -07:00
Ryan Ernst	afcedb94ed	Mappings: Remove `index_analyzer` setting to simplify analyzer logic The `analyzer` setting is now the base setting, and `search_analyzer` is simply an override of the search time analyzer. When setting `search_analyzer`, `analyzer` must be set. closes #9371	2015-01-28 13:43:15 -08:00
Lee Hinman	cc461a837f	Avoid NullPointerException if optional Groovy jar is removed	2015-01-28 13:49:50 -07:00
Lee Hinman	c610524392	Make groovy sandbox method blacklist dynamically additive Using the `script.groovy.sandbox.method_blacklist_patch` setting, the blacklist can be dynamically added to by specifying a comma-separated list of methods (for example, "toString,size" would add .toString and .size to the blacklist). When the `script.groovy.sandbox.method_blacklist_patch` setting is changed, the script cache is cleared to force new scripts to be recompiled. Additionally the on-disk cache is cleared so that scripts in the `config/scripts` directory are re-compiled as well. This also fixes an issue where script engines were injected more than once, which can cause multiple instances of the script engine per node.	2015-01-28 12:26:09 -07:00
Zachary Tong	a4eb1d5505	Aggregations: Add standard deviation bounds to extended_stats Extended_stats now displays the upper and lower bounds on standard deviations (e.g. avg +/- std). Default is to show 2 std above/below, but can be changed using the `sigma` parameter. Accepts non-negative doubles Closes #9356	2015-01-28 11:47:20 -05:00
gmarz	3e4fc2659d	Nodes Stats: Fix open file descriptors count on Windows Closes #1563	2015-01-28 10:30:02 -05:00
Nicholas Knize	9622f78fe6	Revert "[GEO] Update GeoPolygonFilter to handle ambiguous polygons" This reverts commit 06667c6aa898895acd624b8a71a6e00ff7ae32b8 which introduces an undesireable dependency on JTS.	2015-01-28 08:03:26 -06:00
Colin Goodheart-Smithe	29c24d75e7	Aggregations: Unify histogram implementations This change makes InternalHistogram the only InternalAggregation used by the Histogram Aggregator. There is still a separate Bucket implementation and Factory implementation. All buckets are created through the factory passed into the InternalHistogram meaning and the correct factory implementation is serialised as part of the aggregation to make sure the correct bucket types are always generate. This is needed by the Transformers (namely the derivative transformer) to allow it to generate buckets of the right type without having to know what the underlying bucket implementation is.	2015-01-28 10:45:28 +00:00
Boaz Leskes	1695f76f68	Test: testOldIndexes should disable merging It verifies some segments need to be upgraded, but if they are merged away, there are upgraded implicitly	2015-01-28 11:34:58 +01:00
Boaz Leskes	22a576d5ba	Recovery: flush immediately after a remote recovery finishes (unless there are ongoing ones) To properly replicate, we currently stop flushing during recovery so we can repay the translog once copying files are done. Once recovery is done, the translog will be flushed by a background thread that, by default, kicks in every 5s. In case of a recovery failure and a quick re-assignment of a new shard copy, we may fail to flush before starting a new recovery, causing it to deal with potentially even longer translog. This commit makes sure we flush immediately when the ongoing recovery count goes to 0. I also added a simple recovery benchmark. Closes #9439	2015-01-28 09:14:23 +01:00
Igor Motov	13ef7d73b9	Snapshot/Restore: better handling of index deletion during snapshot If an index is deleted during initial state of the snapshot operation, the entire snapshot can fail with NPE. This commit improves handling of this situation and allows snapshot to continue if partial snapshots are allowed. Closes #9024	2015-01-27 21:06:29 -05:00
Boaz Leskes	3512860956	Test: always use replicas in testClusterInfoServiceInformationClearOnError It assume the local node always has a shard	2015-01-28 00:23:03 +01:00
Nicholas Knize	06667c6aa8	[GEO] Update GeoPolygonFilter to handle ambiguous polygons PR #8672 addresses ambiguous polygons - those that either cross the dateline or span the map - by complying with the OGC standard right-hand rule. Since ```GeoPolygonFilter``` is self contained logic, the fix in #8672 did not address the issue for the ```GeoPolygonFilter```. This was identified in issue #5968 This fixes the ambiguous polygon issue in ```GeoPolygonFilter``` by moving the dateline crossing code from ```ShapeBuilder``` to ```GeoUtils``` and reusing the logic inside the ```pointInPolygon``` method. Unit tests are added to ensure support for coordinates specified in either standard lat/lon or great-circle coordinate systems. closes #5968 closes #9304	2015-01-27 15:45:05 -06:00
Boaz Leskes	9ac6d78308	Internal: ClusterInfoService should wipe local cache upon unknown exceptions The InternalClusterInfoService reaches out to the nodes to get information about their disk usage and shard store size. Upon a node level error we currently remove the node info from the local cache. We should also clear the cache when we run into an error on the action level (excluding any info from all nodes). This also adds settings for the timeout used when waiting for nodes. Closes #9449	2015-01-27 22:38:08 +01:00
simaov	1ca8404674	#9444 join lines Fixes #9445	2015-01-27 18:14:56 +00:00
simaov	f3e1a66133	#9444 throw StrictDynamicMappingException exception if dynamic is 'strict' and undeclared field value is NULL, test for this Fixes #9445	2015-01-27 18:14:56 +00:00
Lee Hinman	39c064ce8b	[TEST] remove AwaitsFix from DeleteByQuery test	2015-01-27 10:15:44 -07:00
Ryan Ernst	cff0ec3972	Mappings: Remove type level default analyzers closes #8874	2015-01-27 08:30:51 -08:00
Colin Goodheart-Smithe	6f894b1d2c	[TEST] Fix HistogramTests Fixed histogram tests for value scripts as it was picking the wrong buckets form the bucket list following the removal of the getBucketByKey method	2015-01-27 12:10:38 +00:00
Martijn van Groningen	7e6e9dbb96	Aggs: nested agg needs to reset root doc between segments. Closes #9437 Closes #9436	2015-01-27 12:53:47 +01:00
javanna	93bf737f34	Internal: fix shard state tranport action names When we renamed all of the transport actions in #7105, shard started and failed were flipped around by mistake. This commit fixes their naming. Closes #9440	2015-01-27 12:38:16 +01:00
Colin Goodheart-Smithe	285ef0f06d	Aggregations: Clean up response API for Aggregations This change makes the response API object for Histogram Aggregations the same for all types of Histogram, and does the same for all types of Ranges. The change removes getBucketByKey() from all aggregations except filters and terms. It also reduces the methods on the Bucket class to just getKey() and getKeyAsString(). The getKey() method returns Object and the actual Type is returns will be appropriate for the type of aggregation being run. e.g. date_histogram will return a DateTime for this method and Histogram will return a Number.	2015-01-27 10:53:44 +00:00
Sourav Mitra	78c52d559d	Minor hygiene, Removed Redundant inheritance Close #9427	2015-01-27 11:02:43 +01:00
Lee Hinman	0143d835d4	[TEST] Add `ensureGreen` to indices created in TopHitsTests	2015-01-26 18:45:04 -07:00
Lee Hinman	8fc58dc00a	[TEST] Add `ensureGreen` where needed in NestedTests	2015-01-26 18:26:04 -07:00

1 2 3 4 5 ...

5933 Commits