OpenSearch

Commit Graph

Author	SHA1	Message	Date
Clinton Gormley	75cc7077c7	Update plugins.asciidoc Added entity resolution plugin for duplication detection Related to #9131	2015-01-05 12:53:37 +01:00
Britta Weber	9454593d6a	[TEST] mute ExceptionRetryTests	2015-01-03 18:40:19 +01:00
David Pilato	a50d82c44b	[Doc] Use byte[] as example instead of String Closes #8973.	2015-01-02 16:20:32 +01:00
Britta Weber	f45e6ae3f9	[index] Prevent duplication of documents when retry indexing after fail If bulk index request fails due to a disconnect, unavailable shard etc, the request is retried once before actually failing. However, even in case of failure the documents might already be indexed. For autogenerated ids the request must not add the documents again and therfore canHaveDuplicates must be set to true. closes #8788	2015-01-02 15:44:47 +01:00
Nicholas Knize	b21024b5f9	[GEO] Throw helpful exception for Polygons with holes outside the shell A recent situation occured where a MultiPolygon coordinate array was accidentally defined as a single polygon with multiple holes. Since the intent was a MultiPolygon, the holes of the unintended Polygon fell outside the outer shell. This exposed a bug in the contains logic inside BasePolygonBuilder. An ArrayIndexOutOfBoundsException was being thrown instead of a more useful ElasticsearchParseException( "hole is not within polygon" ). This pull request fixes the bug and adds additional unit tests for verifying proper MultiPolygon type parsing. closes #9071	2015-01-02 08:14:07 -06:00
Simon Willnauer	93dddcdfd9	[TEST] Wait for green if index is closed and reopened if we reopen an index and the majority of the replicas where not created the reopen will fail sicne on master this runs with local gatway all the time.	2015-01-02 14:58:18 +01:00
Simon Willnauer	3e37c89932	[INTERNAL] Remove OperationRouting abstraction This commit removes the unneeded OperationRouting interface and flattens the package structure inside cluster.routing	2015-01-02 12:17:35 +01:00
Simon Willnauer	b936c2f845	[RECOVERY] Allow cancel waiting for mapping changes This commit interrupts the wait for mapping change if the index shard gateway is waiting for the master on a mapping update.	2015-01-02 11:02:45 +01:00
Simon Willnauer	54ce210c8e	[RECOVERY] Release store lock before blocking on mapping updates This can lead to sporadic shard creating timeouts if the same shard is created, closed and created again on the same node. The reason for this is that we holding on to the store reference while blocking on the mapping update that will prevent the shard lock from being released. Holding the lock is unnecessary in this case and can simply be removed.	2015-01-02 11:02:19 +01:00
Mikhail Korobov	707025fb7a	[Docs] fix curl examples in Nodes Stats docs Closes #9118	2014-12-31 14:01:37 +01:00
Adrien Grand	56974bf867	[TEST] Fix GroovyScriptTests failures.	2014-12-31 10:02:45 +01:00
Ryan Ernst	6304f68715	Scripting: Make _score in groovy scripts comparable closes #8828 closes #9094	2014-12-30 16:38:44 -08:00
Clinton Gormley	f83909f7ae	Docs: The regexp query defaults to the `ALL` flag, and removed the `AUTOMATON` flag which is not used in Elasticsearch. Closes #6180	2014-12-30 19:53:31 +01:00
Nicholas Knize	0e24f34b0c	[GEO] GIS envelope validation ShapeBuilder expected coordinates for Envelope types in strict Top-Left, Bottom-Right order. Given that GeoJSON does not enforce coordinate order (as seen in #8672) clients could specify envelope bounds in any order and be compliant with the GeoJSON spec but not the ES ShapeBuilder logic. This change loosens the ShapeBuilder requirements on envelope coordinate order, reordering where necessary. closes #2544 closes #9067 closes #9079 closes #9080	2014-12-30 11:54:07 -06:00
Lee Hinman	31652a8b3d	Fix TransportNodesListShardStoreMetaData for custom data paths Cleans up the testReusePeerRecovery test as well The actual fix is in TransportNodesListShardStoreMetaData.java, which needs to use `nodeEnv.shardDataPaths` instead of `nodeEnv.shardPaths`. Due to the difficulty in tracking this down, I've added a lot of additional logging. This also fixes a logging issue in GatewayAllocator	2014-12-30 17:50:38 +01:00
Clinton Gormley	904f20a41b	Update setup.asciidoc Add a note about using the same JVM version on all nodes and clients	2014-12-30 17:40:51 +01:00
Simon Willnauer	bc65afba8a	[TEST] Wait for threads to finish / start before asserting	2014-12-30 15:45:28 +01:00
Adrien Grand	3af3def30b	Remove some dead code.	2014-12-30 14:30:40 +01:00
Lee Hinman	a4e2230ebd	Add index.data_path setting This allows specifying the path an index will be at. `index.data_path` is specified in the settings when creating an index, and can not be dynamically changed. An example request would look like: POST /myindex { "settings": { "number_of_shards": 2, "data_path": "/tmp/myindex" } } And would put data in /tmp/myindex/0/index/0 and /tmp/myindex/0/index/1 Since this can be used to write data to arbitrary locations on disk, it requires enabling the `node.enable_custom_paths` setting in elasticsearch.yml on all nodes. Relates to #8976	2014-12-29 14:40:50 +01:00
dtpeacock	582d5e8d3c	Doc has store "false" not store "true" Came from `3465e69e83` due to changing "yes" to "false". Closes #9075	2014-12-29 11:59:22 +01:00
tlrx	2ccfde76f1	Native: Kernel32Library throws NoClassDefFound if JNA is missing Introduced by #8993	2014-12-28 21:05:02 +01:00
Martijn van Groningen	d8054ec299	inner_hits: Added another more compact syntax for inner hits. Closes #8770	2014-12-24 17:41:35 +01:00
Adrien Grand	cc71f7730a	[TESTS] Make sure to wait for all shards to be allocated before running the test.	2014-12-24 11:18:40 +01:00
Martijn van Groningen	a345e98575	Core: `ignore_unavailable` shouldn't ignore closed indices if a single index is specified in a search or broadcast request. Closes #9047 Closes #7153	2014-12-24 10:46:03 +01:00
Adrien Grand	7678ab5264	Parent/child: Fix concurrency issues of the _parent field data. `_parent` field data mistakenly shared some stateful data-structures across threads. Close #8396	2014-12-24 09:34:40 +01:00
Adrien Grand	67eba23b2d	Core: Terms filter lookup caching should cache values, not filters. The terms filter lookup mechanism today caches filters. Because of this, the cache values depend on two things: the values that can be found in the lookup index AND the mapping of the local index, since changing the mapping can change the way that the filter is parsed. We should make the cache depend solely on the content of the lookup index. For instance the issue I was seeing was due to the following scenario: - create index1 with _id indexed - run terms filter with lookup, the parsed filter looks like `_id: 1 OR _id: 2` - remove index1 - create index1 with _id not indexed - run terms filter without lookup, the parsed filter is `_uid: type#1 OR _uid: type#2` (the _id field mapper knows how to use the _uid field when _id is not indexed) - run terms filter with lookup, the filter is fetched from the cache: `_id: 1 OR _id: 2` but does not match anything since `_id` is not indexed. Close #9027	2014-12-24 09:33:21 +01:00
Adrien Grand	24591b3c70	Search: parse terms filters on a single term as a term filter. Running a terms filter on a single term is equivalent to loading a postings list into a bit set and then returning the bit set instead of reading the postings list on the fly. Close #9014	2014-12-24 09:33:21 +01:00
Janmejay Singh	01bb02a0a4	ignore intellij project/workspace files closes #9044	2014-12-23 12:00:11 -08:00
Ryan Ernst	39b3613420	Fix date histogram docs grammar.	2014-12-23 10:19:55 -08:00
Nicholas Knize	6d872843bd	[GEO] Removing unnecessary orientation enumerators PR #8978 included 4 unnecessary enumeration values ('cw', 'clockwise', 'ccw', 'counterclockwise'). Since the ShapeBuilder.parse method handles these as strings and maps them to LEFT and RIGHT enumerators, respectively, their enumeration counterpart is unnecessary. This minor change adds 4 static convenience variables (COUNTER_CLOCKWISE, CLOCKWISE, CCW, CW) for purposes of the API and removes the unnecessary values from the Orientation Enum. closes #9035	2014-12-22 22:00:40 -06:00
Nicholas Knize	77a7ef28b3	[GEO] Add optional left/right parameter to GeoJSON This feature adds an optional orientation parameter to the GeoJSON document and geo_shape mapping enabling users to explicitly define how they want Elasticsearch to interpret vertex ordering. The default uses the right-hand rule (counterclockwise for outer ring, clockwise for inner ring) complying with OGC Simple Feature Access standards. The parameter can be explicitly specified for an entire index using the geo_shape mapping by adding "orientation":{"left"\|"right"\|"cw"\|"ccw"\|"clockwise"\|"counterclockwise"} and/or overridden on each insert by adding the same parameter to the GeoJSON document. closes #8764	2014-12-22 12:09:45 -06:00
Adrien Grand	fb6c3b7c29	[Docs] Improve documentation of the new caching policy for filters.	2014-12-22 17:14:47 +01:00
Colin Goodheart-Smithe	391b5f3f5e	Aggregations: Adds methods to get to/from as Strings for Range Aggs Adds getToAsString and getFromAsString to Range interface and implements them for all range aggregations Closes #9003	2014-12-22 09:56:25 +00:00
Tomas Varaneckas	f8897a40af	Mappings: Include currentFieldName into ObjectMapper errors Without currentFieldName error is very generic and non informative Close #9020	2014-12-22 10:11:25 +01:00
Nik Everett	a95d75e074	Mappings: Reencode transformed result with same xcontent When I originally wrote the transform feature I didn't think that the XContentType of the reencoded source mattered. It actually matters because payloads for the completion suggester are stored and returned exactly as encoded by this XContentType. This revision changes the transform feature from always reencoding with smile to always reencoding with the provided XContentType to support the completion suggester. Closes #8959	2014-12-22 10:11:25 +01:00
tlrx	a4133ec4a3	Shutdown: Add support for Ctrl-Close event on Windows platforms to gracefully shutdown node This commit adds the support for the Ctrl-Close event on Windows using native system calls. This way, it is possible to catch the Ctrl-Close event sent by a 'taskill /pid' command (or when the user closes the console window where elasticsearch.bat was started) and gracefully close the node. Before this commit, the node was simply killed on taskkill/window closing.	2014-12-22 09:36:29 +01:00
David Pilato	90f2f1da84	Plugins: NPE when plugins dir is inaccessible Steps to reproduce: 1. Download fresh es. 2. `sudo mkdir plugins && sudo chmod 0700 plugins` 3. Start elasticsearch ``` elasticsearch-1.4.1 λ ./bin/elasticsearch [2014-12-09 12:18:59,025][INFO ][node ] [Piotr Rasputin] version[1.4.1], pid[16338], build[89d3241/2014-11-26T15:49:29Z] [2014-12-09 12:18:59,025][INFO ][node ] [Piotr Rasputin] initializing ... {1.4.1}: Initialization Failed ... - NullPointerException[null] ``` Closes #8837.	2014-12-21 11:59:54 +01:00
Boaz Leskes	defecb3f80	Test: added some logging to NodeEnvironmentTests.testDeleteSafe	2014-12-20 00:27:37 +01:00
Boaz Leskes	4d699bd76c	Internal: remove IndexCloseListener & Store.OnCloseListener Closes #9009	2014-12-19 21:11:46 +01:00
Boaz Leskes	c077683248	Test: ZenFaultDetectionTests.testNodesFaultDetectionConnectOnDisconnect should account for initial ping There was a race condition in the test in the case where the nodes fault detection would manage to send and initial ping, followed by 2 attempts before the target service was disconnected.	2014-12-19 13:12:39 +01:00
Boaz Leskes	cb0d462aa0	Test: fix racing condition in IndicesRequestTests a request could be captured after action array was cleared.	2014-12-19 11:25:12 +01:00
Boaz Leskes	635ae29bf1	Recovery: cleaner interrupt handling during cancellation RecoveryTarget initiates the recovery by sending a start recovery request to the source node and then waits for the recovery to complete. During recovery cancellation, we interrupt the thread so it will wake up and clean the recovery. Depending on timing, this can leave an unneeded interrupted thread status causing future IO commands to fail unneeded. RecoverySource already had a handy utility called CancellableThreads. This extracts it to a top level class, and uses it in RecoveryTarget as well. Closes #9000	2014-12-19 10:39:21 +01:00
Guillaume Hiron	8738583de6	FunctionScore: Fix 'avg' score mode to correctly implement weighted mean. closes #8992 closes #9004	2014-12-18 16:36:39 -08:00
Boaz Leskes	e6a190ec58	Test: AutoFilterCachingPolicy.HISTORY_SIZE should be large enough to accommodate other param	2014-12-18 21:00:47 +01:00
Adrien Grand	55d8bfd691	[TEST] Fix IndexStatsTests failures.	2014-12-18 19:33:05 +01:00
Adrien Grand	ce11e0ee6d	Filter cache: add a `_cache: auto` option and make it the default. Up to now, all filters could be cached using the `_cache` flag that could be set to `true` or `false` and the default was set depending on the type of the `filter`. For instance, `script` filters are not cached by default while `terms` are. For some filters, the default is more complicated and eg. date range filters are cached unless they use `now` in a non-rounded fashion. This commit adds a 3rd option called `auto`, which becomes the default for all filters. So for all filters a cache wrapper will be returned, and the decision will be made at caching time, per-segment. Here is the default logic: - if there is already a cache entry for this filter in the current segment, then return the cache entry. - else if the doc id set cannot iterate (eg. script filter) then do not cache. - else if the doc id set is already cacheable and it has been used twice or more in the last 1000 filters then cache it. - else if the filter is costly (eg. multi-term) and has been used twice or more in the last 1000 filters then cache it. - else if the doc id set is not cacheable and it has been used 5 times or more in the last 1000 filters, then load it into a cacheable set and cache it. - else return the uncached set. So for instance geo-distance filters and script filters are going to use this new default and are not going to be cached because of their iterators. Similarly, date range filters are going to use this default all the time, but it is very unlikely that those that use `now` in a not rounded fashion will get reused so in practice they won't be cached. `terms`, `range`, ... filters produce cacheable doc id sets with good iterators so they will be cached as soon as they have been used twice. Filters that don't produce cacheable doc id sets such as the `term` filter will need to be used 5 times before being cached. This ensures that we don't spend CPU iterating over all documents matching such filters unless we have good evidence of reuse. One last interesting point about this change is that it also applies to compound filters. So if you keep on repeating the same `bool` filter with the same underlying clauses, it will be cached on its own while up to now it used to never be cached by default. `_cache: true` has been changed to only cache on large segments, in order to not pollute the cache since small segments should not be the bottleneck anyway. However `_cache: false` still has the same semantics. Close #8449	2014-12-18 15:51:36 +01:00
Boaz Leskes	b9db5b178c	Internal: PlainTransportFuture should not set currentThread().interrupt() We use PlainTransportFuture as a future for our transport calls. If someone blocks on it and it is interrupted, we throw an ElasticsearchIllegalStateException. We should not set Thread.currentThread().interrupt(); in this case because we already communicate the interrupt through an exception. Closes #9001	2014-12-18 11:57:12 +01:00
javanna	d17db85794	[TEST] upgrade randomized runner to 2.1.11 2.1.11 contains the fix for this issue: https://github.com/carrotsearch/randomizedtesting/issues/179 Closes #8930	2014-12-18 10:40:05 +01:00
Adrien Grand	6d253aba08	Upgrade to lucene-5.0.0-snapshot-1646179.	2014-12-18 09:51:20 +01:00
Boaz Leskes	ee7ed387d4	Test: use less shards in SimpleQueryTests	2014-12-18 09:02:51 +01:00

1 2 3 4 5 ...

10451 Commits All Branches Search

10451 Commits

All Branches