OpenSearch

Commit Graph

Author	SHA1	Message	Date
Boaz Leskes	4342237acf	Test: reduce load in RecoveryWhileUnderLoadTests	2015-02-03 09:32:42 +01:00
Robert Muir	027730006b	core: add 'checksum' option for index.shard.check_on_startup The current "checkindex" on startup is very very expensive. This is like running one of the old school hard drive diagnostic checkers and usually not a good idea. But we can do a CRC32 verification of files. We don't even need to open an indexreader to do this, its much more lightweight. This option (as well as the existing true/false) are randomized in tests to find problems. Also fix bug where use of the current option would always leak an indexwriter lock. Closes #9183	2015-02-03 00:10:08 -05:00
Ryan Ernst	6079d88d43	Mappings: Remove type prefix support from field names in queries This is the first part of #8872.	2015-02-02 13:10:56 -08:00
Lee Hinman	0f405e9710	Merge branch 'pr/8795'	2015-02-02 11:49:45 -07:00
Michael McCandless	e29cf903c8	Core: upgrade to Lucene snapshot r1656366 * IndexWriter deadlock and DV update concurrency fix * BytesRef reuse bug with SortedSetDVTermsEnum * Int overflow skip data corruption bug * Compound file API cleanups * IndexWriter doesn't accept per-doc Analyzer anymore Closes #9524	2015-02-02 13:37:45 -05:00
Christoph Büscher	44193e7ba5	Aggregations: Add 'offset' option to histogram aggregation Histogram aggregation supports an 'offset' option to move bucket boundaries. In a histogram with buckets of size X these can be moved from 0, X, 2X, 3X,... by an offset value of Y to Y, X+Y, 2X+Y, 3X+Y... by using the 'offset' option. The previous 'pre_offset' and 'post_offset' options are removed in favour of the simplified 'offset' option. Closes #9417 Closes #9505	2015-02-02 18:23:01 +01:00
Lee Hinman	6c27f1242a	Update Groovy dependency to 2.4.0	2015-02-02 09:27:41 -07:00
Clinton Gormley	988f35a7da	REST: Add skip "stash_in_path" feature to nodes.info/20_transport test Required for clients that don't yet do stash lookups when resolving fieldnames like "nodes.$master.transport"	2015-02-02 17:11:04 +01:00
Clinton Gormley	e7ac5f296e	REST-spec: Allow stashed values to be used in property names as well Fix the nodes.info/20_transport test to use the master node, rather than to rely on applying a regex to the whole $body	2015-02-02 16:12:41 +01:00
Alexander Reelsen	a55476bf70	Tests: Ensure no use of potentially resolving internal ips	2015-02-02 09:45:42 +01:00
Boaz Leskes	79c8621a47	Test: add trace logging to testNodeFailuresAreProcessedOnce	2015-02-02 09:32:53 +01:00
Alexander Reelsen	59f8c0951a	Netty Transport: Add profiles to transport infos Until now, there was no possibility to expose infos about configured transport profiles. This commit adds the ability to expose those information in the TransportInfo class. The channel was well as the netty pipeline handler now also contain the profile they were configured for, as this information cannot be extracted elsewhere. In addition, each profile now can set its own publish host and port, which might be needed in case of portforwarding or using docker. Closes #9134	2015-02-02 08:17:55 +01:00
Martijn van Groningen	3ce05b6919	inner hits: Fix bug that resolves parent docs properly as inner hit when inner hit is defined on has_parent query.	2015-02-01 22:29:21 +01:00
Martijn van Groningen	d038f372d4	cleanup: Move catching of IOException higher op the stack to reduce the number of try-catch clauses.	2015-02-01 22:27:00 +01:00
Lee Hinman	25f944009c	Remove unneeded null checks from IndicesClusterStateService	2015-02-01 12:13:57 -07:00
Simon Willnauer	42bb5deca2	Revert "[ENGINE] Fail engine if Lucene commit fails" This reverts commit `dda7242848`.	2015-01-31 23:48:34 +01:00
Simon Willnauer	dda7242848	[ENGINE] Fail engine if Lucene commit fails This is similar to refresh, if we fail to commit the data we have to fail the engine since in-ram data is likely discarded. Yet, it's still in translog and might be recoverable when the node is restarted but we have to treat the engine as failed.	2015-01-31 16:45:38 +01:00
Lee Hinman	9557625ae7	Disallow method pointer expressions in Groovy scripting	2015-01-30 15:55:19 -07:00
Lee Hinman	9fe84062a1	Add `beforeIndexAddedToCluster` callback This callback is executed only once, on the master node during an index's creation. An exception thrown during this listener will cancel the index creation. This also adds checks in `IndicesClusterStateService` for the indexService being null as well as if the `indicesService.createIndex` throws an exception on data nodes after an index has already been created.	2015-01-30 15:25:58 -07:00
Adrien Grand	b2010f788d	[TESTS] IndicesQueryCacheTests: Ensure that shards are searchable before starting to query them.	2015-01-30 23:22:27 +01:00
Boaz Leskes	eabc3cde98	Recovery: update access time of ongoing recoveries #8720 introduced a timeout mechanism for ongoing recoveries, based on a last access time variable. In the many iterations on that PR the update of the access time was lost. This adds it back, including a test that should have been there in the first place. Closes #9506	2015-01-30 21:06:28 +01:00
Adrien Grand	00d54fabb2	Search: Remove query-cache serialization optimization. The query-cache has an optimization to not deserialize the bytes at the shard level. However this is a bit fragile since it assumes that serialized streams can be concatenanted (which is not the case with shared strings) and also does not update the QueryResult object that is held by the SearchContext. So you need to make sure to use the right one. With this change, the query cache just deserializes bytes into the QueryResult object from the context. Close #9500	2015-01-30 20:02:18 +01:00
Simon Willnauer	fb377d48bd	Remove dead code	2015-01-30 13:52:26 +01:00
Simon Willnauer	380fcd1d02	Reset MergePolicProvider settings only if the value actually changed Due to some unreleased refactorings we lost the persitence of a perviously set values in MergePolicyProvider. This commit adds this back and adds a simple unittest. Closes #8890	2015-01-30 13:24:08 +01:00
Ryan Ernst	1ebc95ee28	Tests: Add type-unrestricted version of field mapper getter to SearchContext. This fixes an NPE when using TestSearchContext in SignificanceHeuristicTests.	2015-01-29 13:42:07 -08:00
Michael McCandless	ecc8b702d3	also remove force option from logger.trace	2015-01-29 16:18:21 -05:00
Clinton Gormley	eea22d7731	Docs: Fixed asciidoc error in snapshots.asciidoc	2015-01-29 20:57:12 +01:00
J Charitopoulos	be8d8d658c	Docs: minor syntax Closes #9481	2015-01-29 20:27:20 +01:00
Glen Smith	3d5fbfb997	Docs: Update pattern-replace-charfilter.asciidoc Remove invalid trailing comma from json Closes #9477	2015-01-29 20:24:08 +01:00
Ryan Ernst	4e0e5e7328	Aggs: Remove limitation on field access within aggs to the types provided in the search Currently, doing a field lookup within a terms agg will restrict the fields available to those within the types passed into the search request. However, when doing sub aggs within a children agg, the fields available should not be restricted to those of the search. This change makes the field lookup use the index level mapper service.	2015-01-29 10:49:38 -08:00
David Pilato	878e46d7f9	[Docs] fix missing space	2015-01-29 19:17:41 +01:00
Simon Willnauer	c0fa60eb26	Remove HandlesStreamInput/Output The optimization we do in the HandlesStreamInput / Output adds a lot of complexity with a rather unknown benefit. It tries to compress commonly used strings and write ids instead. This should rather be done on a lower level if at all necessary for the small message we send over the network.	2015-01-29 17:43:32 +01:00
Simon Willnauer	1d77c3af82	Fix compilation	2015-01-29 17:41:53 +01:00
Simon Willnauer	03f1fcc85e	[ENGINE] Remove dirty flag and force boolean for refresh Today we have a dirty flag indicating that a refresh must be executed. We also allow users to bypass this by setting a force=true boolean on the refresh request / command. All these flags are unneeded since the SearcherManager has all the information to do the right thing if it's dirty or not.	2015-01-29 17:30:00 +01:00
Simon Willnauer	b275e917b7	[CACHE] Use a smaller expected size when serializing query results BytesStreamOutput allows to pass the expected size but by default uses BigArrays.PAGE_SIZE_IN_BYTES which is 16k. A common cached result ie. a date histogram with 3 buckets is ~100byte so 16k might be very wasteful since we don't shrink to the actual size once we are done serializing. By passing 512 as the expected size we will resize the byte array in the stream slowly until we hit the page size and don't waste too much memory for small query results.	2015-01-29 17:27:08 +01:00
Britta Weber	0a07ce8916	core: disable auto gen id optimization This pr removes the optimization for auto generated ids. Previously, when ids were auto generated by elasticsearch then there was no check to see if a document with same id already existed and instead the new document was only appended. However, due to lucene improvements this optimization does not add much value. In addition, under rare circumstances it might cause duplicate documents: When an indexing request is retried (due to connect lost, node closed etc), then a flag 'canHaveDuplicates' is set to true for the indexing request that is send a second time. This was to make sure that even when an indexing request for a document with autogenerated id comes in we do not have to update unless this flag is set and instead only append. However, it might happen that for a retry or for the replication the indexing request that has the canHaveDuplicates set to true (the retried request) arrives at the destination before the original request that does have it set false. In this case both request add a document and we have a duplicated a document. This commit adds a workaround: remove the optimization for auto generated ids and always update the document. The asumtion is that this will not slow down indexing more than 10 percent, see: http://benchmarks.elasticsearch.org/ closes #8788 closes #9468	2015-01-29 16:26:04 +01:00
Oliver	e412dab63a	Docs: Fix sample query Closes #9472	2015-01-29 15:56:24 +01:00
Simon Willnauer	15a766084d	[CACHE] Use correct number of bytes in query cache accounting today we use the length of the BytesReference which is misleading since the reference is paged such that the length != ramBytesUsed. This can lead to a way higher memory consuption than expected if query results are tiny since each query result requires at least 16kb. Yet, we should rethink this strategy for query results that are very small ie. less than 20% of the ramBytesUsed but this commit first tries to make the acocunting correct.	2015-01-29 10:59:36 +01:00
Simon Willnauer	4917121de2	Remove Unused code and remove unnecessary abstraction HashedBytesArray is not used anymore and Releable makes only sense on Paged implementation such that the marker interface is unneeded.	2015-01-29 09:51:14 +01:00
Lee Hinman	86e52c30a1	Make `script.groovy.sandbox.method_blacklist_patch` truly append-only Additionally, this setting can be specified in elasticsearch.yml if desired, to pre-populate the list of methods to be added to the default blacklist. When making a change to this setting dynamically, the entire blacklist is logged as well.	2015-01-28 17:09:27 -07:00
Ryan Ernst	afcedb94ed	Mappings: Remove `index_analyzer` setting to simplify analyzer logic The `analyzer` setting is now the base setting, and `search_analyzer` is simply an override of the search time analyzer. When setting `search_analyzer`, `analyzer` must be set. closes #9371	2015-01-28 13:43:15 -08:00
Lee Hinman	cc461a837f	Avoid NullPointerException if optional Groovy jar is removed	2015-01-28 13:49:50 -07:00
Lee Hinman	c610524392	Make groovy sandbox method blacklist dynamically additive Using the `script.groovy.sandbox.method_blacklist_patch` setting, the blacklist can be dynamically added to by specifying a comma-separated list of methods (for example, "toString,size" would add .toString and .size to the blacklist). When the `script.groovy.sandbox.method_blacklist_patch` setting is changed, the script cache is cleared to force new scripts to be recompiled. Additionally the on-disk cache is cleared so that scripts in the `config/scripts` directory are re-compiled as well. This also fixes an issue where script engines were injected more than once, which can cause multiple instances of the script engine per node.	2015-01-28 12:26:09 -07:00
Zachary Tong	a4eb1d5505	Aggregations: Add standard deviation bounds to extended_stats Extended_stats now displays the upper and lower bounds on standard deviations (e.g. avg +/- std). Default is to show 2 std above/below, but can be changed using the `sigma` parameter. Accepts non-negative doubles Closes #9356	2015-01-28 11:47:20 -05:00
gmarz	3e4fc2659d	Nodes Stats: Fix open file descriptors count on Windows Closes #1563	2015-01-28 10:30:02 -05:00
J Charitopoulos	b359520849	Docs: Update snapshots.asciidoc minor syntax Closes #9457	2015-01-28 15:54:13 +01:00
Nicholas Knize	9622f78fe6	Revert "[GEO] Update GeoPolygonFilter to handle ambiguous polygons" This reverts commit `06667c6aa8` which introduces an undesireable dependency on JTS.	2015-01-28 08:03:26 -06:00
Clinton Gormley	8978aa5465	Docs: Improved the template query docs Added the `file` and `id` parameters. Closes #9458	2015-01-28 14:19:59 +01:00
Colin Goodheart-Smithe	29c24d75e7	Aggregations: Unify histogram implementations This change makes InternalHistogram the only InternalAggregation used by the Histogram Aggregator. There is still a separate Bucket implementation and Factory implementation. All buckets are created through the factory passed into the InternalHistogram meaning and the correct factory implementation is serialised as part of the aggregation to make sure the correct bucket types are always generate. This is needed by the Transformers (namely the derivative transformer) to allow it to generate buckets of the right type without having to know what the underlying bucket implementation is.	2015-01-28 10:45:28 +00:00
Boaz Leskes	1695f76f68	Test: testOldIndexes should disable merging It verifies some segments need to be upgraded, but if they are merged away, there are upgraded implicitly	2015-01-28 11:34:58 +01:00

1 2 3 4 5 ...

10707 Commits All Branches Search

10707 Commits

All Branches