OpenSearch

Commit Graph

Author	SHA1	Message	Date
Lee Hinman	5aa2d0cf61	Add support for the `_name` parameter to the simple_query_string query	2014-07-29 12:41:41 +02:00
Lee Hinman	36cf595367	[TESTS] spin in a loop checking request breaker, because multiple clusters could be running	2014-07-29 11:26:34 +02:00
javanna	91c4824a0f	Transport client: don't add listed nodes to connected nodes list in sniff mode This commit effectively reverts `e1aa91d` , as it is not needed anymore to add the original listed nodes. The cluster state local call made will in fact always return at least the local node (see #6811). There were a couple of downsides caused by putting the original listed nodes among the connected nodes: 1) in the following retries, they weren't seen as listed nodes anymore, thus the light connect wasn't used 2) among the connected nodes some were "bad" duplicates as they are already there and don't contain all needed info for each node. This was causing serialization problems for instance given that the node version was missing on the `DiscoveryNode` object. Closes #7067	2014-07-28 21:48:03 +02:00
javanna	fcf4d5a38d	Transport Client: fixed the node retry mechanism which could fail without trying all the connected nodes The RetryListener was notified twice for each single failure, which caused some additional retries, but more importantly was making the client reach the maximum number of retries (number of connected nodes) too quickly, meanwhile ongoing retries which could succeed were not completed yet. The TransportService used to throw ConnectTransportException due to throwConnectException set to true, and also notify the listener of any exception received from a separate thread through the request holder. Simplified exception handling by just removing the throwConnectException option from the TransportService, used only in the transport client. The transport client now relies solely on the request holder to notify of failures and eventually retry. Closes #6829	2014-07-28 20:34:46 +02:00
Simon Willnauer	eecbf8a559	Add [1.3.2] version constant	2014-07-28 17:22:18 +02:00
Lee Hinman	a93ee599d3	[TESTS] fix circuit breaker tests for remote clusters and bwc Adds additional version checks in NodeStats for older versions When using an external cluster (backwards compatibility tests), the act of checking the request breaker requires a network buffer, which increments the breaker. This change only checks the request breaker in InternalTestCluster and uses Guice to retrieve it instead of a (possible) network request. Also removed the now unused InternalCircuitBreakerService class	2014-07-28 17:18:24 +02:00
javanna	4e5ad568bb	Rest: fixed filters execution order to be from lowest to highest rather than the other way around Closes #7019	2014-07-28 16:54:42 +02:00
javanna	0e9594e02d	Internal: use AtomicInteger instead of volatile int for the current action filter position Also improved filter chain tests to not rely on execution time, and made filter chain tests look more similar to what happens in reality by removing multiple threads creation in testTooManyContinueProcessing (something we don't support anyway, makes little sense to test it). Closes #7021	2014-07-28 16:54:42 +02:00
David Pilato	264d59c3e2	Plugin Lucene version checker: use `Lucene.parseVersionLenient` With commit 07c632a2d4dbefe44e8f25dc4ded6cf143d60e41, we now have a new Lucene.parseVersionLenient(String, Version) method which tries to find an existing Lucene version based on the two first digits X.Y of X.Y.Z String.	2014-07-28 16:38:44 +02:00
Colin Goodheart-Smithe	162200f6ed	Aggregations: Stops direct subclassing of InternalNumericMetricsAggregation Must subclass either InternalNumericMetricsAggregation.SingleValue or InternalNumericMetricsAggregation.MultiValue	2014-07-28 14:13:23 +01:00
Itamar Syn-Hershko	dd0b42838d	[QUERY] Separate parsing impl from setter in SearchParseElement This commit makes it easier to reuse the inner highlighting, fetch and rescore parsing logic by plugins or other internal parts. Closes #3602	2014-07-28 14:53:04 +02:00
Simon Willnauer	d403e68f43	add missing import	2014-07-28 14:33:51 +02:00
Simon Willnauer	bf7f97d22f	[CORE] Support alpha/beta releases in version parsing too Pull Request #7055 fixed Version parsing for bugfix releases causing problems with minor version in segments files. Even though we never release anything with lucene in alpha / beta status this commit fixes lenient parsing for these cases. Relates to #7055	2014-07-28 14:04:39 +02:00
Simon Willnauer	d2493ea48a	[CORE] Support parsing lucene minor version strings We parse the version that is shipped with the Lucene segments in order to find the version of lucene that wrote a particular segment. Yet, some lucene version ie: * 4.3.1 (Elasticsearch 0.90.2) * 4.5.1 (Elasticsearch 0.90.7) * 3.6.1 (pre Elasticsearch 0.90.0) wrote illegal strings containing the minor version which causes IAE exceptions being thrown from lucenes parsing method. Closes #7055	2014-07-28 13:02:00 +02:00
Lee Hinman	07c9b5b08d	Change logging level for circuit breaking to warn	2014-07-28 12:10:13 +02:00
Lee Hinman	6abe4c951d	Add HierarchyCircuitBreakerService Adds a breaker for request BigArrays, which are used for parent/child queries as well as some aggregations. Certain operations like Netty HTTP responses and transport responses increment the breaker, but will not trip. This also changes the output of the nodes' stats endpoint to show the parent breaker as well as the fielddata and request breakers. There are a number of new settings for breakers now: `indices.breaker.total.limit`: starting limit for all memory-use breaker, defaults to 70% `indices.breaker.fielddata.limit`: starting limit for fielddata breaker, defaults to 60% `indices.breaker.fielddata.overhead`: overhead for fielddata breaker estimations, defaults to 1.03 (the fielddata breaker settings also use the backwards-compatible setting `indices.fielddata.breaker.limit` and `indices.fielddata.breaker.overhead`) `indices.breaker.request.limit`: starting limit for request breaker, defaults to 40% `indices.breaker.request.overhead`: request breaker estimation overhead, defaults to 1.0 The breaker service infrastructure is now generic and opens the path to adding additional circuit breakers in the future. Fixes #6129 Conflicts: src/main/java/org/elasticsearch/index/fielddata/IndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/IndexFieldDataService.java src/main/java/org/elasticsearch/index/fielddata/RamAccountingTermsEnum.java src/main/java/org/elasticsearch/index/fielddata/ordinals/GlobalOrdinalsBuilder.java src/main/java/org/elasticsearch/index/fielddata/ordinals/InternalGlobalOrdinalsBuilder.java src/main/java/org/elasticsearch/index/fielddata/plain/AbstractIndexOrdinalsFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/DisabledIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/IndexIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/NonEstimatingEstimator.java src/main/java/org/elasticsearch/index/fielddata/plain/PackedArrayIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/ParentChildIndexFieldData.java src/main/java/org/elasticsearch/index/fielddata/plain/SortedSetDVOrdinalsIndexFieldData.java src/main/java/org/elasticsearch/node/internal/InternalNode.java src/test/java/org/elasticsearch/index/aliases/IndexAliasesServiceTests.java src/test/java/org/elasticsearch/index/codec/CodecTests.java src/test/java/org/elasticsearch/index/fielddata/AbstractFieldDataTests.java src/test/java/org/elasticsearch/index/fielddata/IndexFieldDataServiceTests.java src/test/java/org/elasticsearch/index/mapper/MapperTestUtils.java src/test/java/org/elasticsearch/index/query/IndexQueryParserFilterCachingTests.java src/test/java/org/elasticsearch/index/query/SimpleIndexQueryParserTests.java src/test/java/org/elasticsearch/index/query/guice/IndexQueryParserModuleTests.java src/test/java/org/elasticsearch/index/search/FieldDataTermsFilterTests.java src/test/java/org/elasticsearch/index/search/child/ChildrenConstantScoreQueryTests.java src/test/java/org/elasticsearch/index/similarity/SimilarityTests.java	2014-07-28 11:27:33 +02:00
Martijn van Groningen	5631bbb02b	[TEST] All shards should be allocated before snapshotting.	2014-07-28 10:48:35 +02:00
Martijn van Groningen	86c0d693c3	[TEST] Ignore Lucene40 codec	2014-07-28 10:40:25 +02:00
Colin Goodheart-Smithe	f7b7f67522	Aggregations: fixed value count so it can be used in terms order Closes #7050	2014-07-28 09:19:01 +01:00
Martijn van Groningen	2e9ee5c937	The `nested` aggregator should also resolve and use the parentFilter of the closest `reverse_nested` aggregator. Closes #6994 Closes #7048	2014-07-28 10:07:57 +02:00
mikemccand	e42b73c6d4	Test: more verbosity for this test on failure	2014-07-26 04:42:26 -04:00
Adrien Grand	f682461b2f	Mappings: Enforce non-null settings. No that we are using the index created version to make index-time decisions, assuming that the version is the current version when settings are null is very error-prone. Instead we should ensure that settings are always non-null and contain the version when the index was created. Close #7032	2014-07-25 21:01:44 +02:00
David Pilato	11eced01da	Add multi_field support for Mapper externalValue (plugins) In context of mapper attachment and other mapper plugins, when dealing with multi fields, sub fields never get the `externalValue` although it was set. Here is a full script which reproduce the issue when used with mapper attachment plugin: ``` DELETE /test PUT /test { "mappings": { "test": { "properties": { "f": { "type": "attachment", "fields": { "f": { "analyzer": "english", "fields": { "no_stemming": { "type": "string", "store": "yes", "analyzer": "standard" } } } } } } } } } PUT /test/test/1 { "f": "VGhlIHF1aWNrIGJyb3duIGZveGVz" } GET /test/_search { "query": { "match": { "f": "quick" } } } GET /test/_search { "query": { "match": { "f.no_stemming": "quick" } } } GET /test/test/1?fields=f.no_stemming ``` Related to https://github.com/elasticsearch/elasticsearch-mapper-attachments/issues/57 Closes #5402.	2014-07-25 16:59:42 +02:00
Colin Goodheart-Smithe	655157c83a	Aggregations: Added an option to show the upper bound of the error for the terms aggregation. This is only applicable when the order is set to _count. The upper bound of the error in the doc count is calculated by summing the doc count of the last term on each shard which did not return the term. The implementation calculates the error by summing the doc count for the last term on each shard for which the term IS returned and then subtracts this value from the sum of the doc counts for the last term from ALL shards. Closes #6696	2014-07-25 14:24:24 +01:00
Alexander Reelsen	a1e335b1e9	CORS: Support regular expressions for origin to match against This commit adds regular expression support for the allow-origin header depending on the value of the request `Origin` header. The existing HttpRequestBuilder is also extended to support the OPTIONS HTTP method. Relates #5601 Closes #6891	2014-07-25 10:51:22 +02:00
Alexander Reelsen	35e562343f	Tests: Remove HttpClient to only use one Http client The HTTP client implementation used by the Elasticsearch REST tests is backed by apache http client instead of a self written helper class, that uses HttpUrlConnection. This commit removes the old simple HttpClient class and uses the more powerful and reliable one for all tests. It also fixes a minor bug, that when sending a 301 redirect, a Location header needs to be added as well, which was uncovered by the switching to the new client. Closes #7003	2014-07-25 10:26:52 +02:00
Adrien Grand	51fd2f513c	[TESTS] Fix NPE in FreqTermsEnumTests.	2014-07-25 09:12:01 +02:00
Martijn van Groningen	a0e5684d7b	[TEST] more logging	2014-07-25 01:16:32 +02:00
Adrien Grand	a3d8022dc5	Fielddata: Fix thread safety issue with field data on the `_index` field.	2014-07-24 19:04:22 +02:00
Lee Hinman	89e03910f4	Add a periodic cleanup thread for IndexFieldCache caches Fixes #7010	2014-07-24 17:23:52 +02:00
Martijn van Groningen	297a97cd23	Core: Use the provided cluster state instead of fetching a new cluster state from cluster service. Close #7013	2014-07-24 16:23:42 +02:00
Colin Goodheart-Smithe	5483c62de6	Geo: Fixes parse error with complex shapes The bug reproduces when the point under test for the placement of the hole of the polygon has an x coordinate which only intersects with the ends of edges in the main polygon. The previous code threw out these cases as not relevant but an intersect at 1.0 of the distance from the start to the end of an edge is just as valid as an intersect at any other point along the edge. The fix corrects this and adds a test. Closes #5773	2014-07-24 15:17:55 +01:00
Simon Willnauer	bd51d7a07f	Add `wait_if_ongoing` option to _flush requests This commit adds the ability to force blocking on the flush operaition to make sure all files have been written and synced to disk. Without this option a flush might be executing at the same time causing the current flush to fail and return before all files being synced. Closes #6996	2014-07-24 15:34:53 +02:00
Colin Goodheart-Smithe	127649d174	Aggregations: Added pre and post offset to histogram aggregation Added preOffset and postOffset parameters to the API for the histogram aggregation which work in the same way as in the date histogram Closes #6605	2014-07-24 14:32:33 +01:00
Adrien Grand	f5d1e0a37d	[TESTS] Ensure yellow in SimpleFacetsTests.testFilterFacetWithFacetFilterPostMode.	2014-07-24 15:21:20 +02:00
Shay Banon	eb37a5992b	remove use of recycled set in filters eviction closes #7012	2014-07-24 15:00:30 +02:00
javanna	d9ff42f88a	Internal: expose the indices names every action relates to if applicable Added two new interfaces: 1) IndicesRequest that allows to retrieve the indices the request relates to in a generic manner, together with the indices options that tell how they are going to get resolved and expanded 2) CompositeIndicesRequest for compound requests that hold multiple indices request like MultiSearchRequest, MultiGetRequest, MultiTermVectorsRequest, BulkRequest, BenchmarkRequest, PercolateRequest, MultiPercolateRequest and MoreLikeThisRequest Taken the chance to streamline the indices options and add them to every request where it makes sense (although they can't be changed from the outside), rather than leaving them implicit in the related TransportAction when indices get expanded (tipycally MetaData#concreteIndices or MetaData#concreteSingleIndex). Added IndicesOptions parameter to MetaData#concreteSingleIndex to make sure it is taken from the request, where the information belongs, instead of hardcoded within MetaData. The concreteSingleIndex method remains but it's just a utility method that returns a single index instead of an array and complains otherwise. Also made sure NPE is never thrown when setting indices(null) to IndicesAliasesRequest, similar to what SearchRequest does. Closes #6933	2014-07-24 14:42:40 +02:00
Adrien Grand	6f31b1135a	[Benchmark] Make TermsAggregationSearchBenchmark fairer to uninverted field data. The benchmark indexes 200 unique full-width longs. For uninverted field data we try to use the most memory-efficient storage, and in that case it would use two arrays: one for the doc->ordinals mapping and one for the ordinal->value mapping. Which is slower than what doc values do by storing directly the mapping from docs to values.	2014-07-24 14:35:47 +02:00
Colin Goodheart-Smithe	fdf2bb9371	Aggregations: Better JSON output scoping Before this change each aggregation had to output an object field with its name and write its JSON inside that object. This allowed for badly behaved aggregations which could write JSON content in the root of the 'aggs' object. this change move the writing of the aggregation name to a level above the aggregation itself, ensuring that aggregations can only write within there own scope in the JSON output. Closes #7004	2014-07-24 12:02:40 +01:00
Robert Muir	d8cd755445	Speed up string sort with custom missing value Today if the user supplies a custom missing value for a string sort, we do it in an extremely slow way, not using ordinals but dereferencing bytes for every document. Ordinals are only used if the missing value is _first or _last. Instead, use ordinals with custom missing values too. Closes #7005	2014-07-24 06:27:59 -04:00
Simon Willnauer	f130d60b72	[TEST] Don't randomize preference PRIMARY it might not try replicas depending on the clusterstate	2014-07-24 11:36:31 +02:00
Martijn van Groningen	73f7f426de	Made `_source` parsing in `top_hits` aggregation consistent with regular `_source` parsing in search api. Closes #6997	2014-07-24 11:23:59 +02:00
Adrien Grand	8cb4471cca	[TESTS] Add more assertions to SimpleFacetsTests.	2014-07-24 11:13:53 +02:00
Brian Murphy	ce864d4016	[REFACTOR] TransportActions Get rid of boilerplate code for handling transport actions. Make these transport actions extend HandledTransportAction where this code now lives.	2014-07-24 11:05:29 +01:00
javanna	3e30fa2089	Internal: streamline use of IndexClosedException when executing operation on closed indices Single index operations to use the newly added IndexClosedException introduced with #6475. This way we can also fail faster when we are trying to execute operations on closed indices and their use is not allowed (depending on indices options). Indices blocks are still checked but we can already throw error while resolving indices (MetaData#concreteIndices). Effectively this change also affects what we return when using one of the following apis: analyze, bulk, index, update, delete, explain, get, multi_get, mlt, term vector, multi_term vector. We now return `{"error":"IndexClosedException[[test] closed]","status":403}` instead of `{"error":"ClusterBlockException[blocked by: [FORBIDDEN/4/index closed];]","status":403}`. Closes #6988	2014-07-24 10:33:58 +02:00
Colin Goodheart-Smithe	dc9e9cb4cc	Aggregations: change to default shard_size in terms aggregation The default shard size in the terms aggregation now uses BucketUtils.suggestShardSideQueueSize() to set the shard size if the user does not specify it as a parameter. Closes #6857	2014-07-24 07:55:09 +01:00
Areek Zillur	5487c56c70	Search & Count: Add option to early terminate doc collection Allow users to control document collection termination, if a specified terminate_after number is set. Upon setting the newly added parameter, the response will include a boolean terminated_early flag, indicating if the document collection for any shard terminated early. closes #6876	2014-07-23 15:10:15 -04:00
Robert Muir	66825ac851	Change numeric data types to use SORTED_NUMERIC docvalues type instead of a custom encoding in BINARY. In low level benchmarks this is 2x to 5x faster: its also optimized for the common case where fields actually only contain at most one value for each document. Additionally SORTED_NUMERIC doesn't lose values if they appear more than once, so mathematical computations such as averages are correct. Closes #6967	2014-07-23 14:55:03 -04:00
Adrien Grand	ff2903d2c6	[TEST] Don't recycle in facets. The recycling happening in facets is done manually and arrays are sometimes not released. Aggregations do it in a less error-prone way by registering on to the SearchContext.	2014-07-23 20:20:16 +02:00
Adrien Grand	629f91ae57	Fielddata: goodbye comparators. This commit removes custom comparators in favor of the ones that are in Lucene. The major change is for nested documents: instead of having a comparator wrapper that deals with nested documents, this is done at the fielddata level by having a selector that returns the value to use for comparison. Sorting with custom missing string values might be slower since it is using TermValComparator since Lucene's TermOrdValComparator only supports sorting missing values first or last. But other than this particular case, this change will allow us to benefit from improvements on comparators from the Lucene side. Close #5980	2014-07-23 20:08:36 +02:00
Adrien Grand	76511158b5	Fielddata: Fix the ordinals impl for sparse fields. Caused by #6908	2014-07-23 17:39:43 +02:00
Simon Willnauer	b51bd3a645	Add version 1.2.4 and 1.3.1 to the version table	2014-07-23 16:26:48 +02:00
Simon Willnauer	be96f57c11	[TEST] Fix SimpleThreadPoolTests to exclude test infra threads	2014-07-23 14:44:08 +02:00
mikemccand	cc4d7c6272	Core: don't load bloom filters by default This change just changes the default for index.codec.bloom.load to false: with recent performance improvements to ID lookup, such as #6298, bloom filters don't give much of a performance gain anymore, and they can consume non-trivial RAM when there are many tiny documents. For now, we still index the bloom filters, so if a given app wants them back, it can just update the index.codec.bloom.load to true. Closes #6959	2014-07-23 05:58:41 -04:00
Lee Hinman	15ccd787a5	[TEST] Maven reproductions should always include 'clean' target	2014-07-23 11:47:58 +02:00
mikemccand	5ccd44519a	Test: make test less evil	2014-07-23 05:35:52 -04:00
mikemccand	55986907e8	Test: add more verbosity when this test fails	2014-07-23 05:23:15 -04:00
Lee Hinman	c38a9d73e7	[TEST] Add test for _score and doc[] access in Groovy scripts	2014-07-23 09:58:38 +02:00
Britta Weber	734e656a91	Make _all field accessible with GET GET only returned null even when stored if requested with GET like this: `curl -XGET "http://localhost:9200/test/test/1?fields=_all"` Instead, it should simply behave like a String field and return the concatenated fields as String. closes #6924	2014-07-23 09:16:35 +02:00
Adrien Grand	08f8731b6f	Core: Drop UnsafeUtils. This class potentially does unaligned memory access and does not bring much now that we switched to global ords for terms aggregations. Close #6962	2014-07-23 08:41:11 +02:00
Areek Zillur	f39d4e1f89	PhraseSuggester: Collate option should allow returning phrases with no matching docs A new option `prune` has been added to allow users to control phrase suggestion pruning when `collate` is set. If the new option is set, the phrase suggestion option will contain a boolean `collate_match` indicating whether the respective result had hits in collation. CLoses #6927	2014-07-22 17:17:15 -04:00
Simon Willnauer	0faffcf372	[TEST] Add simple sort assertions for bwc tests Today we only do count searches to ensure sane results are returned after upgrading etc. This change adds sorting to the picture asserting on simple numeric sorting that uses field data etc. after upgrading. Relates to #6967	2014-07-22 22:22:09 +02:00
Shay Banon	50ececbbcf	Unicast discovery: only disconnect from temporary connected nodes In unicast discovery, we try to reuse existing discovery nodes based on the node address they have. If we find an existing node based on its address, and for some reason its not connected, don't add it to the list of nodes to disconnect from, as that (full) connection is useful down the road closes #6966	2014-07-22 21:29:57 +02:00
Shay Banon	88f3afe4b5	Fix connect concurrency, can cause connection nodes to close Looking at the connect code, if 2 threads at the same time try and connect to a node, and both enter sequentially the connectLock code block, the second one would try and put the connection in the map, and close the replaced channels, which will cause the existing connection to close as well (since it removes the node from the connectedNodes map) To fix this, simply make sure we properly check the existence of the connection within the connectionLock block, so there won't be concurrent connections going on. While doing this, also went over all the mutation code that handles disconnections, and made sure they are properly done only within a connection lock. closes #6964	2014-07-22 19:48:47 +02:00
mikemccand	72b3d6ef75	Test: make sure randomizer doesn't swap in SerialMergeScheduler on us	2014-07-22 13:06:04 -04:00
mikemccand	1e92f0f4ff	Core: allow index.merge.scheduler.max_thread_count to be updated dynamically Lucene allows the max_thread_count to be updated, but this wasn't fully exposed in Elasticsearch. Closes #6925	2014-07-22 11:23:46 -04:00
Clinton Gormley	f14af3599a	Fixed typo in AbstractFieldMapper similariry -> similarity	2014-07-22 15:54:09 +02:00
Brian Murphy	7d9b012ca1	[FIX] Fix update parser to accept script_id	2014-07-22 14:22:57 +01:00
Adrien Grand	3c142e550d	Fielddata: Switch to Lucene DV APIs. This commits removes BytesValues/LongValues/DoubleValues/... and tries to use Lucene's APIs such as NumericDocValues or RandomAccessOrds instead whenever possible. The next step would be to take advantage of the fact that APIs are the same in Lucene and Elasticsearch in order to remove our custom comparators and use Lucene's. There are a few side-effects to this change: - GeoDistanceComparator has been removed, DoubleValuesComparator is used instead on top of dynamically computed values (was easier than migrating GeoDistanceComparator). - SortedNumericDocValues doesn't guarantee uniqueness so long/double terms aggregators have been updated to make sure a document cannot fall twice in the same bucket. - Sorting by maximum value of a field or running a `max` aggregation is potentially significantly faster thanks to the random-access API. Our aggs and p/c aggregations benchmarks don't report differences with this change on uninverted field data. However the fact that doc values don't need to be wrapped anymore seems to help a lot. For example TermsAggregationSearchBenchmark reports ~30% faster terms aggregations on doc values on string fields with this change, which are now only ~18% slower than uninverted field data although stored on disk. Close #6908	2014-07-22 15:16:24 +02:00
Brian Murphy	0de30e1798	[TEST] Fix indexed script test.	2014-07-22 13:58:07 +01:00
Nik Everett	79433d23e3	Update: Detect noop updates sent with doc_as_upsert This should help prevent spurious updates that just cause extra writing and cache invalidation for no real reason. Close #6822	2014-07-22 14:55:34 +02:00
Brian Murphy	ebcc1e0bf5	[FIX] Indexed scripts should always use script_id.	2014-07-22 13:49:31 +01:00
Simon Willnauer	55df94d91e	Revert "Fix InternalSearchHits serialization to be deterministic" This reverts commit `8db7b2b20b`.	2014-07-22 14:43:31 +02:00
Simon Willnauer	8db7b2b20b	Fix InternalSearchHits serialization to be deterministic The assertion on binary equality for streamable serialization sometimes fails due to the usage of identify hashmaps inside the InternalSearchHits serialization. This only happens if the number of shards the result set is composed of is very high. This commit makes the serialziation deterministic and removes the need to serialize the ordinal due to in-order serialization.	2014-07-22 14:14:31 +02:00
Brian Murphy	924f91588b	[TEST] Reduce the number of threads in StressUpdateDelete test.	2014-07-22 11:16:32 +01:00
Boaz Leskes	a344fe6590	IndicesStore fix logging format	2014-07-22 07:47:40 +03:00
Boaz Leskes	fee22fc0b2	[Discovery] fix MembershipAction.sendJoinRequestBlocking should now expect an empty response The change introduced in #6949 (do not serialize the cluster state) also means master now responds with an empty response rather then a JoinResponse. However, sendJoinRequestBlocking still expected a JoinRequest.	2014-07-22 00:43:01 +03:00
Boaz Leskes	690820dae3	[Discovery] remove unneeded cluster state serialization during cluster join process At the moment we serialize the cluster state in JoinResponse and ValidateJoinRequest. However this state is not used anywhere and can be removed to save on network overhead Closes #6949	2014-07-21 23:30:07 +03:00
Brian Murphy	c74552e006	[TEST] Stress update/delete Reduce top end of scaling parameters to avoid OOMs on nightly runs.	2014-07-21 21:16:41 +01:00
Shay Banon	8dca4b7e07	More lenient type parsing in histo/cardinality aggs closes #6948 closes #6893	2014-07-21 21:31:37 +02:00
Brian Murphy	b0ce090876	[TEST] Stress Update/Delete Create the test index with merging disabled so deletes don't get merged away this was breaking the version calculation.	2014-07-21 20:11:05 +01:00
Brian Murphy	6d641ea40d	[TEST] Remove TransportUpdateActionTest This test has been made obselete by the UpdateTests.	2014-07-21 17:55:51 +01:00
Brian Murphy	cbb4edf2ae	[TEST] Plugin Manager Undo removal of change.	2014-07-21 16:25:31 +01:00
Brian Murphy	8b169617a2	[TEST] Stress test for update and delete concurrency. This test deletes and updates using upserts documents over several threads in a tight loop. It counts the number of responses and verifies that the versions at the end are correct.	2014-07-21 16:04:21 +01:00
Simon Willnauer	e730c76be7	[RESTORE] Fail restore if snapshot is corrupted today if a snapshot is corrupted the restore operation never terminates. Yet, if the snapshot is corrupted there is no way to restore it anyway. If such a snapshot is restored today the only way to cancle it is to delete the entire index which might cause dataloss. This commit also fixes an issue in InternalEngine where a deadlock can occur if a corruption is detected during flush since the InternalEngine#snapshotIndex aqcuires a topLevel read lock which prevents closing the engine. Closes #6938	2014-07-21 16:15:53 +02:00
Simon Willnauer	d65a9d63a2	[Engine] `index.fail_on_corruption` is not updateable The `index.fail_on_corruption` was not updateable via the index settings API. This commit also fixed the setting prefix to be consistent with other setting on the engine. Yet, this feature is unreleased so this won't break anything. Closes #6941	2014-07-21 14:59:44 +02:00
Martijn van Groningen	f3114fe774	[TEST] top_hits aggs with breath_first collect mode only fails if scoring is used (score based sort or track_scores)	2014-07-21 11:21:50 +02:00
Martijn van Groningen	ba1042e7d1	Aggregations: Track scores should be applied properly for `top_hits` aggregation. Closes #6934	2014-07-21 10:03:42 +02:00
Martijn van Groningen	2d2088b259	[TEST] Temporarily ignore transport update tests.	2014-07-21 09:57:03 +02:00
Simon Willnauer	6e81cb5cd6	[TEST] Add SuppressSysoutChecks to DistributorDirectoryTest	2014-07-20 19:20:18 +02:00
Shay Banon	2c07588a1a	[TEST] an active shard might also be relocating	2014-07-20 10:44:24 +02:00
Alexander Reelsen	1816951b6b	Netty: Refactoring to make MessageChannelHandler extensible Small refactorings to make the MessageChannelHandler more extensible. Also allowed access to the different netty pipelines This is the fix after the first version had problems with the HTTP transport due to wrong reusing channel handlers, which is the reason why tests failed. Relates #6889 Closes #6915	2014-07-18 16:30:31 +02:00
javanna	bbf7e6be92	Plugins: Introduced pluggable filter chain to be able to filter transport actions execution It's now possible to inject action filters from plugins via `ActionModule#registerFilter` through the following code: ``` public void onModule(ActionModule actionModule) { actionModule.registerFilter(MyFilter.class); } ``` Also made `TransportAction#execute` methods final to enforce the execution of the filter chain. By default the chain is empty though. Note that the action filter chain is executed right after the request validation, as the filters might rely on a valid request to do their work. Closes #6921	2014-07-18 16:18:50 +02:00
Adrien Grand	97f4247880	[TESTS] Switch more tests to ElasticsearchSingleNodeTest. This avoids loading modules manually all the time.	2014-07-18 14:35:00 +02:00
Shay Banon	972fa10945	Better message for invalid internal transport message format closes ##6916	2014-07-18 13:49:38 +02:00
Simon Willnauer	a302f3a9dd	[TEST] Randomize Preference in search requests Closes #6863	2014-07-18 12:30:53 +02:00
Simon Willnauer	6b519633ab	[TEST] Randomize FS Translog implementation Closes #6658	2014-07-18 11:51:42 +02:00
Simon Willnauer	d79717c341	[CLIENT] Unknown node version should be a lower bound Today when we start a `TransportClient` we use the given transport addresses and create a `DiscoveryNode` from it without knowing the actual nodes version. We just use the `Version.CURRENT` which is an upper bound. Yet, the other node might be a version less than the currently running and serialisation of the nodes info might break. We should rather use a lower bound here which is the version of the first release with the same major version as `Version.CURRENT` since this is what we officially support. This commit moves to use the minimum major version or an RC / Snapshot if the current version is a snapshot. Closes #6894	2014-07-18 11:31:15 +02:00
Alex Ksikes	f22f3db30f	Term Vectors API: Computes term vectors on the fly if not stored in the index. Adds the ability to the Term Vector API to generate term vectors for some chosen fields, even though they haven't been explicitely stored in the index. Relates to #5184 Closes #6567	2014-07-17 23:29:05 +02:00
Simon Willnauer	6fc5acd760	Return null info if service is not started Closes #6906	2014-07-17 16:47:46 +02:00
Simon Willnauer	2bec59ec3e	[Bulk] Release semaphore again after close The bulk processor tries to acquire all leases for the semaphore to wait for all pending requests. Yet, we should release them afterwards again to ensure we don't ever deadlock if there is a bug in the processor. This commit also adds a testcase for this method	2014-07-17 16:24:43 +02:00
matt-preston	42b71a004a	[Bulk] Add blocking close method to BulkProcessor Blocks until all bulk requests have completed. Closes #4158 Closes #6314	2014-07-17 15:57:45 +02:00
uboness	cc8f7ddb9a	Added three frequency levels for resource watching It's now possible to register watchers along with a specified check frequency. There are three frequencies: low, medium, high. Each one is associated with a check interval that determines how frequent the watchers will check for changes and notify listeners if needed. By default, the intervals are 5s, 30s and 60s respectively, but they can also be customized in the settings. also: - Added the WatcherHandle construct by which one can stop it (remove it) and resume it (re add it). Also provices access to the watchers itself and the frequency by which it's checked - Change the default frequency to 30 seconds interval (used to be 60 seconds). The only watcher that is currently effected by this is the script watcher (now auto-loading scripts will auto-load every 30 seconds if changed)	2014-07-17 14:57:53 +02:00
Boaz Leskes	bdbe86dd2d	[Store] delete unallocated shards under a cluster state task This is to prevent a rare racing condition where the very same shard gets allocated to the node after our sanity check that the cluster state didn't check and the actual deletion of the files. Closes #6902	2014-07-17 14:51:07 +02:00
Alexander Reelsen	bb421d7ea3	Revert "Netty: Refactoring to make MessageChannelHandler extensible" This results in unstable tests, most likely due to Channels being mixed up by wrongly creating the pipelines. Needs investigation and a test. This reverts commit `db7f0d36af`.	2014-07-17 14:49:15 +02:00
Boaz Leskes	38d8e3ccc2	[Infra] remove indicesLifecycle.Listener from IndexingMemoryController The IndexingMemoryController determines the amount of indexing buffer size and translog buffer size each shard should have. It takes memory from inactive shards (indexing wise) and assigns it to other shards. To do so it needs to know about the addition and closing of shards. The current implementation hooks into the indicesService.indicesLifecycle() mechanism to receive call backs, such shard entered the POST_RECOVERY state. Those call backs are typically run on the thread that actually made the change. A mutex was used to synchronize those callbacks with IndexingMemoryController's background thread, which updates the internal engines memory usage on a regular interval. This introduced a dependency between those threads and the locks of the internal engines hosted on the node. In a very rare situation (two tests runs locally) this can cause recovery time outs where two nodes are recovering replicas from each other. This commit introduces a a lock free approach that updates the internal data structures during iterations in the background thread. Closes #6892	2014-07-17 14:31:38 +02:00
Adrien Grand	9714dd55c2	Refactoring: Rename FieldMapper.termsFilter to fieldDataTermsFilter. FieldMapper has two methods `Filter termsFilter(List values, @Nullable QueryParseContext)` which is supposed to work on the inverted index and `Filter termsFilter(QueryParseContext, List, QueryParseContext)` which is supposed to work on field data. Let's rename the second one to `fieldDataTermsFilter` and remove the unused `QueryParseContext`. Close #6888	2014-07-17 12:42:18 +02:00
Martijn van Groningen	9037365adf	[TEST] Use array* helper assert methods instead of verifying the array directly	2014-07-17 09:39:51 +02:00
David Pilato	26bac39e0e	bin/plugin removes itself If you call `bin/plugin --remove es-plugin` the plugin got removed but the file `bin/plugin` itself was also deleted. We now don't allow the following plugin names: * elasticsearch * plugin * elasticsearch.bat * plugin.bat * elasticsearch.in.sh * service.bat Closes #6745	2014-07-17 08:57:01 +02:00
Alexander Reelsen	db7f0d36af	Netty: Refactoring to make MessageChannelHandler extensible Small refactorings to make the MessageChannelHandler more extensible. Also allowed access to the different netty pipelines Closes #6889	2014-07-17 08:28:55 +02:00
Boaz Leskes	6fe1d9860e	[Scripts] GroovyScriptEngineService don't create ScoreAccess if lookup is null	2014-07-17 08:05:10 +02:00
Ryan Ernst	7139f44241	Scripts: Make ScoreAccessor utility available for other script engines. With the removal of setNextScore in #6864, script engines must use the Scorer to find the score of a document. The DocLookup is updated appropriately to do this, but most script engines require a Number to be bound for numeric variables. Groovy already had an encapsulation for this funtionality, and this moves it out to be shared with other script engines. closes #6898	2014-07-16 16:33:04 -07:00
Ryan Ernst	770447ce1a	Scripting: Remove setNextScore in SearchScript. While it would be nice to do this all the way up the chain (into score functions), this at least removes the weird dual setNextScore/setScorer for SearchScripts. closes #6864	2014-07-16 15:10:01 -07:00
Martijn van Groningen	ca7fa4f9ec	Query: When parent doc is found stop iterating over segments and continue with the next top matching child doc. The parent doc has been found and there is no need to check subsequent segments. Closes #6895 Closes #1843	2014-07-17 00:01:06 +02:00
Martijn van Groningen	473d1717be	Search: Added support for named filters/queries inside nested filters and queries. Closes #6293	2014-07-16 23:43:18 +02:00
Alexander Reelsen	70bad405c3	Serialization: Fix bwc issue by falling back to old threadpool serialization This fixes an issue introduced by the serialization changes in #6486 which are not needed at all. Node that the serialization itself is not broken but the TransportClient uses its own version on initial connect and getting the NodeInfos.	2014-07-16 17:58:32 +02:00
Simon Willnauer	90ea4610c8	[FIELDDATA] Use KeyedLock in IndexFieldDataService Today we synchronize when updating the IndexFieldDataService datastructures. This might unnecessarily block progress if multiple request need different fielddata instance for different fields. This commit also fixes clear calls to actually consistently clear the caches in the case of an exception. Closes #6855	2014-07-16 15:49:17 +02:00
Simon Willnauer	e30176cc69	Add read/write lock semantics to KeyedLock	2014-07-16 15:49:00 +02:00
Alexander Reelsen	b0c0ff8ac0	Threadpool Info: Allow to serialize negative thread pool sizes As a SizeValue is used for serializing the thread pool size, a negative number resulted in throwing an exception when deserializing (using -ea an assertionerror was thrown). This fixes a check for changing the serialization logic, so that negative numbers are read correctly, by adding an internal UNBOUNDED value. Closes #6325 Closes #5357	2014-07-16 15:25:53 +02:00
Martijn van Groningen	62da59f76f	[TEST] Verify if clear cache request went to all shards.	2014-07-16 13:48:39 +02:00
Colin Goodheart-Smithe	8260138e59	Aggregations: Fixed Histogram key_as_string bug The key as string field in the response for the histogram aggregation will now only show if format is specified on the request. Closes #6655	2014-07-16 11:36:48 +01:00
Brian Murphy	cbd2a97abd	[DOCS] : Indexed scripts/templates These are the docs for the indexed scripts/templates feature. Also moved the namespace for the REST endpoints. Closes #6851	2014-07-16 10:49:02 +01:00
Adrien Grand	7e44fe6017	[TESTS] Stabilize DisabledFieldDataFormatTests by setting number_of_replicas to 0.	2014-07-16 10:37:25 +02:00
Boaz Leskes	84ad29ddfa	[Recovery] don't start a gateway recovery if source node is not found Due to change introduced in #6825, we now start a local gateway recovery for replicas, if the source node can not be found. The recovery then fails because we never recover replicas from disk. Closes #6879	2014-07-16 10:01:01 +02:00
Boaz Leskes	d869163b66	[Infra] re-send failed shard messages when receiving a cluster state that still refers to them In rare cases we may fail to send a shard failure event to the master, or there is no known master when the shard has failed (ex. a couple of node leave the cluster canceling recoveries and causing a master to step down at the same time). When that happens and a cluster state arrives from the (new) master we should resend the shard failure in order for the master to remove the shard from this node. Closes #6881	2014-07-16 09:59:49 +02:00
Martijn van Groningen	f1c2cdb9c8	Store: Only send shard exists requests if shards exist locally on disk and are not allocated on that node according to the cluster state. Closes #6870	2014-07-16 08:23:32 +01:00
Ryan Ernst	35037e31b2	[AGGS] Pass current docid being processed to scripts. Scripts may internally cache based on docid (as expressions do). This change makes numeric aggregations using scripts pass the docid when it changes.	2014-07-15 14:40:12 -07:00
Shay Banon	323210729e	Releasable XContentBuilder make the builder releasable (auto closeable), and use it in shards state also make XContentParser releasable (AutoCloseable) and not closeable since it doesn't throw an IOException closes #6869	2014-07-16 04:28:03 +09:00
Shay Banon	9345194a65	Share numeric data analyzer instances between mappings use similar mechanism that shares numeric analyzers for long/double/... for dates as well. This has nice memory save properties with many date fields mapping case, as well as analysis saves (thread local resources) closes #6843	2014-07-16 04:24:52 +09:00
Ryan Ernst	e764c5f08a	Fix transient testScore failure by making DF consistent for query.	2014-07-15 11:18:13 -07:00
Brian Murphy	86dfad24fe	[FIX] normalize serialization of ScriptType in UpdateRequest	2014-07-15 18:14:48 +01:00
Nik Everett	da5fb34163	Mappings: Add transform to document before index. Closes #6566	2014-07-15 18:40:46 +02:00
Adrien Grand	56cfa8e47e	[TEST] Enforce that the node from ElasticsearchSingleNodeTest is alone in its cluster. In particular, this removes the ability to create new node and adds the ability to get a reference to the single node that the cluster is made of.	2014-07-15 17:26:52 +02:00
mikemccand	ba443a4819	Test: make test less evil so it doesn't take so long	2014-07-15 11:19:18 -04:00
Simon Willnauer	05d92b9b68	[TEST] Enforce provided settings in test base classes Enviorment variables might override the tests settings even if they are explicitly set. Other base classes like InternalTestCluster also specify `config.ignore_system_properties: true` to ensure `what we set is what we get`	2014-07-15 17:06:20 +02:00
Ryan Ernst	64ab22816c	Scripting: Add script engine for lucene expressions. These are javascript expressions, which can only access numeric fielddata, parameters, and _score. They can only be used for searches (not document updates). closes #6818	2014-07-15 07:49:01 -07:00
Simon Willnauer	1464bea00f	Added [1.4.0] version	2014-07-15 16:16:50 +02:00
Simon Willnauer	3900d61254	[TESTS] Add ElasticsearchSingleNodeTest. This test makes it easy to create a lightweight node (no http, indices stored in RAM, ...) whose main purpose is to get an instance of the Guice injector for unit tests. This should help not have to update lots of unit tests when we add a new Guice dependency.	2014-07-15 15:42:58 +02:00
Patrick Peschlow	9742d08f53	[UPDATE] Handle document missing exception on retry correctly. Throwables thrown on update retries are now caught and handled via the provided callback. This commit also contains an integration test demonstrating the bug and validating the fix. Closes #6355 Closes #6724	2014-07-15 14:52:29 +02:00
javanna	3c54eb9b8f	Internal: make transport action name available in TransportAction base class Each transport action is associated with at least an action name, which is the action name that gets serialized together with the request and identifies what to do with the request itself. Also, the action name is the name of the registered transport handler that handles incoming request for the transport action. This commit makes the action name available in a generic manner in the TransportAction base class, so that it can be used when needed by subclasses, or in the base class for instance for action filtering. Closes #6860	2014-07-15 14:35:35 +02:00
mikemccand	4194ab31c8	Core: don't close/reopen IndexWriter when changing RAM buffer size Today we close/reopen IW when we change the RAM buffer but that's costly because it means the next NRT reader is a full reopen. The RAM buffer size setting is a live one in IndexWriter, even if there are no buffered docs in RAM when you call it. Separately it would be nice if Lucene let you manage a "reader pool" that could outlive individual IW instances ... Closes #6856	2014-07-15 08:32:30 -04:00
Simon Willnauer	557b634f4a	[SEARCH] Wrap filter only once in ApplyAcceptedDocsFilter We potentially wrap the given filter multiple times when iterating the subreaders. We only need to do this once. Closes #6873	2014-07-15 12:44:35 +02:00
Simon Willnauer	7de9d3d2cb	[TEST] Fix test to use keyword analyzer since it relies on exact terms without processing	2014-07-15 12:26:51 +02:00
Simon Willnauer	82cc227da3	Added missing version based serialization	2014-07-15 12:13:45 +02:00
Brian Murphy	73d93e5fd8	Indexed Scripts/Templates: cleanup This commit cleans up some code around the indexed script/templates feature. Remove dead code in ScriptService. Remove setXScript methods for UpdateRequestBuilder and use setScript(script,type) instead	2014-07-15 10:49:39 +01:00
Simon Willnauer	2bac455cd0	Add version check before reading script type in UpdateRequest	2014-07-15 10:16:08 +02:00
Simon Willnauer	70bd24ed14	[TEST] use higher timeout to wait for balanced cluster CorruptFileTest sometimes hits conditions where lots of rebalancing happens. In such a case the default timeout is just not enough - this timeout just makes sure that the cluster has enough time to balance itself.	2014-07-15 09:56:41 +02:00
Areek Zillur	0bd774423d	[TEST] fix random preference string tests to accomodate for the new more strict preference type	2014-07-14 20:15:00 -04:00
Areek Zillur	d0d1b98d23	Stats: Expose IndexWriter and VersionMap RAM usage to ShardStats and _cat endpoint This commit adds the RAM usage of IndexWriter and VersionMap Closes #6483	2014-07-14 19:46:12 -04:00
Areek Zillur	4fb79fe787	[TEST] fix random preference string test to accomodate for the new more strict preference type	2014-07-14 19:01:04 -04:00

1 2 3 4 5 ...

4692 Commits