OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-10 15:05:33 +00:00

Author	SHA1	Message	Date
Ali Beyad	58d6b9dcd1	This commit first reads the repository data and only upgrades if it determines the read data is in the legacy format. It writes the upgraded version if it is not a read-only repository and caches the repository data if it is a read-only repository.	2016-07-28 22:09:01 -04:00
Ali Beyad	299b8a7a52	Removes unnecessary blobExists() check before reading a blob in the Azure and Google cloud blob containers, as the APIs for both return a 404 in the case of a missing object, which we already handle through a NoSuchFileFoundException.	2016-07-23 23:24:56 -04:00
Ali Beyad	a6f5e0b0fe	Remove IndexMeta and addresses code review comments	2016-07-23 23:24:56 -04:00
Ali Beyad	d9ec959dfc	Index folder names now use a UUID (not the index UUID but one specific to snapshot/restore) and the index to UUID mapping is stored in the repository index file.	2016-07-22 13:59:13 -04:00
Ali Beyad	a0a4d67eae	All snapshot metadata files use UUID for the blob ID	2016-07-22 13:52:13 -04:00
Ali Beyad	630218a16f	Change the BlobContainer interface to throw a NoSuchFileFoundException for reads and deletes if the blob does not exist.	2016-07-22 13:49:25 -04:00
Ali Beyad	abaf8443e5	More robust handling of snapshot deletions Makes deleting snapshots more robust by first deleting the snapshot from the index generational file, then handling individual deletion file errors with log messages instead of failing the entire operation.	2016-07-22 13:49:25 -04:00
gfyoung	6a9f488b17	Caught exceptions during compromised snapshot deletion	2016-07-22 13:48:45 -04:00
gfyoung	95a118d9c6	Changed Files.deleteIfExists to Files.delete in FsBlobContainer	2016-07-22 13:48:45 -04:00
gfyoung	dfcdadb59f	Added HdfsBlobStoreContainer tests Added BlobContainer tests for HDFS storage and caught a bug at the same time in which deleteBlob was not raising an IOException when the blobName did not exist.	2016-07-22 13:48:45 -04:00
gfyoung	b02a6da8fd	Properly raise IOException for Azure, Fs, Hdfs, and S3	2016-07-22 13:48:45 -04:00
gfyoung	0620a3d6c2	Raised IOException on deleteBlob Closes gh-18530.	2016-07-22 13:48:45 -04:00
Jason Tedor	c27237be9f	Revert "Allow to listen on virtual interfaces" This reverts commit 4cb8b620c37decb69f31c92d08de765db1ec828e.	2016-07-22 13:30:05 -04:00
Michael Nitschinger	4cb8b620c3	Allow to listen on virtual interfaces Previously when trying to listen on virtual interfaces during bootstrap the application would stop working - the interface couldn't be found by the NetworkUtils class. The NetworkUtils utilize the underlying JDK NetworkInterface class which, when asked to lookup by name only takes physical interfaces into account, failing at virtual (or subinterfaces) ones (returning null). Note that when interating over all interfaces, both physical and virtual ones are taken into account. This changeset asks for all known interfaces, iterates over them and matches on the given name as part of the loop, allowing it to catch both physical and virtual interfaces. As a result, elasticsearch can now also serve on virtual interfaces. A test case has been added which at least makes sure that all iterable interfaces can be found by their respective name. (It's not easily possible in a unit test to "fake" virtual interfaces). Relates #19537	2016-07-22 12:33:21 -04:00
Ali Beyad	2b9cfff90f	Fixes CORS handling so that it uses the defaults Fixes CORS handling so that it uses the defaults for http.cors.allow-methods and http.cors.allow-headers if none are specified in the config. Closes #19520	2016-07-22 12:25:28 -04:00
Boaz Leskes	bd574d92ae	Verify lower level transport exceptions don't bubble up on disconnects (#19518 ) #19096 introduced a generic TCPTransport base class so we can have multiple TCP based transport implementation. These implementations can vary in how they respond internally to situations where we concurrently send, receive and handle disconnects and can have different exceptions. However, disconnects are important events for the rest of the code base and should be distinguished from other errors (for example, it signals TransportMasterAction that it needs to retry and wait for the a (new) master to come back). Therefore, we should make sure that all the implementations do the proper translation from their internal exceptions into ConnectTransportException which is used externally. Similarly we should make sure that the transport implementation properly recognize errors that were caused by a disconnect as such and deal with them correctly. This was, for example, the source of a build failure at https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-intake/1080 , where a concurrency issue cause SocketException to bubble out of MockTcpTransport. This PR adds a tests which concurrently simulates connects, disconnects, sending and receiving and makes sure the above holds. It also fixes anything (not much!) that was found it.	2016-07-22 14:35:47 +02:00
Tal Levy	19e7b1c737	fix: no other processors should be executed after on_failure is called in a compound processor (#19545 )	2016-07-21 14:27:04 -07:00
Ali Beyad	9765b4a6ff	Fixes the ActiveShardsObserverIT tests that have a very short index (#19540 ) creation timeout so they process the index creation cluster state update before the test finishes and attempts to cleanup. Otherwise, the index creation cluster state update could be processed after the test finishes and cleans up, thereby leaking an index in the cluster state that could cause issues for other tests that wouldn't expect the index to exist. Closes #19530	2016-07-21 11:47:21 -04:00
Yannick Welsch	d4771b993f	Use executor's describeTasks method to log task information in cluster service (#19531 ) This fixes the log output in some places of ClusterService where the executor's describeTasks wasn't used to log task information.	2016-07-21 14:32:37 +02:00
Simon Willnauer	302c7a521a	Fix analyzer alias processing (#19506 ) In the lack of tests the analyzer.alias feature was pretty much not working at all on current master. Issues like #19163 showed some serious problems for users using this feature upgrading to an alpha version. This change fixes the processing order and allows aliases to be set for existing analyzers like `default`. This change also ensures that if `default` is aliased the correct analyzer is used for `default_search` etc. Closes #19163	2016-07-21 09:32:47 +02:00
Jun Ohtani	cebad703fe	Analyze: Specify anonymous char_filters/tokenizer/token_filters in the analyze API Add parser for anonymous char_filters/tokenizer/token_filters Using Settings in AnalyzeRequest for anonymous definition Add breaking changes document Closed #8878	2016-07-21 11:06:36 +09:00
Tal Levy	f7cd86ef6d	rethrow script compilation exceptions into ingest configuration exceptions (#19318 ) * rethrow script compilation exceptions into ingest configuration exceptions * update readProcessor to rethrow any exception as an ElasticsearchException	2016-07-20 10:37:56 -07:00
Nik Everett	3a82c613e4	Migrate query registration from push to pull Remove `ParseField` constants used for names where there are no deprecated names and just use the `String` version of the registration method instead. This is step 2 in cleaning up the plugin interface for extending search time actions. Aggregations are next. This is breaking for plugins because those that register a new query should now implement `SearchPlugin` rather than `onModule(SearchModule)`.	2016-07-20 12:33:51 -04:00
Yannick Welsch	2cf94d2d8a	Fix race in testCreateIndexWaitsForAllActiveShards When index creation is not acknowledged (due to a very low request timeout) it is possible that the index is still created. If a subsequent index-exists request completes before the cluster state of the index creation has been fully applied, it might miss the newly created index.	2016-07-20 18:28:28 +02:00
Nik Everett	fc4b439635	Remove AggregationStreams and friends * Remove outdated aggregation registration method * Remove AggregationStreams * Adds StreamInput#readNamedWriteableList and StreamOutput#writeNamedWriteableList convenience methods. We strive to make the reading and writing from the streams terse so they are easier to scan visually. * Remove PipelineAggregatorStreams * Remove stream info from InternalAggreation.Type * Remove InternalAggregation#type * Remove Streamable from PipelineAggregator * Remove Streamable from MultiBucketsAggregation.Bucket	2016-07-20 09:46:04 -04:00
Daniel Mitterdorfer	a4f09d2b81	Restore parameter name auto_generate_phrase_queries (#19514 ) During query refactoring the query string query parameter 'auto_generate_phrase_queries' was accidentally renamed to 'auto_generated_phrase_queries'. With this commit we restore the old name. Closes #19512	2016-07-20 13:13:57 +02:00
Martijn van Groningen	9b1a477120	Fix ClusterInfo serialization	2016-07-20 09:16:27 +02:00
Ryan Ernst	0f2d7a84a8	Add tests for disabling positions and copy the check to text fields	2016-07-19 19:07:56 -07:00
Ryan Ernst	c85cb37cc4	Mappings: Fix not_analyzed string fields to error when position_increment_gap is set Currently if a string field is not_analyzed, but a position_increment_gap is set, it will lookup the default analyzer and set it, along with the position_increment_gap, before the code which handles setting the keyword analyzer for not_analyzed fields has a chance to run. This change adds a parsing check and test for that case.	2016-07-19 17:54:13 -07:00
Jason Tedor	128f0276d9	Fix Javadocs for ThreadPool#schedule This commit fixes an issue with an @throws tag on ThreadPool#schedule not containing a description.	2016-07-19 18:35:30 -04:00
Jason Tedor	770186f6cf	Catch the right rejected execution exception ThreadPool#schedule can throw a rejected execution exception. Yet, the rejected execution exception that it throws comes from the EsAbortPolicy which throws an EsRejectedExecutionException. This exception does not inherit from RejectedExecutionException so instead we must catch the former instead of the latter.	2016-07-19 16:45:12 -04:00
Jason Tedor	720b53b018	Handle rejected execution exception on reschedule A self-rescheduling runnable can hit a rejected execution exception but this exception goes uncaught. Instead, this exception should be caught and passed to the onRejected handler. Not catching handling this rejected execution exception can lead to test failures. Namely, a race condition can arise between the shutting down of the thread pool and cancelling of the rescheduling of the task. If another reschedule fires right as the thread pool is being terminated, the rescheduled task will be rejected leading to an uncaught exception which will cause a test failure. This commit addresses these issues. Relates #19505	2016-07-19 15:35:51 -04:00
Nik Everett	9e2221cae5	Migrate remaining aggregations to NamedWriteable After this we'll be able to remove AggregationStreams and PipelineAggregatorStreams.	2016-07-19 14:43:29 -04:00
jaymode	11389638f9	Require executor name when calling scheduleWithFixedDelay The ThreadPool#scheduleWithFixedDelay method does not make it clear that all scheduled runnable instances will be run on the scheduler thread. This becomes problematic if the actions being performed include blocking operations since there is a single thread and tasks may not get executed due to a blocking task. This change includes a few different aspects around trying to prevent this situation. The first is that the scheduleWithFixedDelay method now requires the name of the executor that should be used to execute the runnable. All existing calls were updated to use Names.SAME to preserve the existing behavior. The second aspect is the removal of using ScheduledThreadPoolExecutor#scheduleWithFixedDelay in favor of a custom runnable, ReschedulingRunnable. This runnable encapsulates the logic to deal with rescheduling a runnable with a fixed delay and mimics the behavior of executing using a ScheduledThreadPoolExecutor and provides a ScheduledFuture implementation that also mimics that of the typed returned by a ScheduledThreadPoolExecutor. Finally, an assertion was added to BaseFuture to detect blocking calls that are being made on the scheduler thread.	2016-07-19 12:47:47 -04:00
Adrien Grand	0854b03f13	Elasticsearch should reject dynamic templates with unknown `match_mapping_type`. #17285 When looking at the logstash template, I noticed that it has definitions for dynamic temilates with `match_mapping_type` equal to `byte` for instance. However elasticsearch never tries to find templates that match the byte type (only long or double as far as numbers are concerned). This commit changes template parsing in order to ignore bad values of `match_mapping_type` (given how the logstash template is popular, this would break many upgrades otherwise). Then I hope to fail the parsing on bad values in 6.0.	2016-07-19 15:38:00 +02:00
Nik Everett	a2a7ea1f17	Make ExtendedBounds immutable We used to mutate it as part of building the aggregation. That caused assertVersionSerializable to fail because it assumes that requests aren't mutated after they are sent. Closes #19481	2016-07-19 08:48:14 -04:00
Yannick Welsch	c4fe8e7bf2	Fix replica-primary inconsistencies when indexing during primary relocation with ongoing replica recoveries (#19287 ) Primary relocation violates two invariants that ensure proper interaction between document replication and peer recoveries, ultimately leading to documents not being properly replicated. Invariant 1: Document writes must be replicated based on the routing table of a cluster state that includes all shards which have ongoing or finished recoveries. This is ensured by the fact that do not start a recovery that is not reflected by the cluster state available on the primary node and we always sample a fresh cluster state before starting to replicate write operations. Invariant 2: Every operation that is not part of the snapshot taken for phase 2, must be succesfully indexed on the target replica (pending shard level errors which will cause the target shard to be failed). To ensure this, we start replicating to the target shard as soon as the recovery start and open it's engine before we take the snapshot. All operations that are indexed after the snapshot was taken are guaranteed to arrive to the shard when it's ready to index them. Note that this also means that the replication doesn't fail a shard if it's not yet ready to recieve operations - it's a normal part of a recovering shard. With primary relocations, the two invariants can be possibly violated. Let's consider a primary relocating while there is another replica shard recovering from the primary shard. Invariant 1 can be violated if the target of the primary relocation is so lagging on cluster state processing that it doesn't even know about the new initializing replica. This is very rare in practice as replica recoveries take time to copy all the index files but it is a theoretical gap that surfaces in testing scenarios. Invariant 2 can be violated even if the target primary knows about the initializing replica. This can happen if the target primary replicates an operation to the intializing shard and that operation arrives to the initializing shard before it opens it's engine but arrives to the primary source after it has taken the snapshot of the translog. Those operations will be currently missed on the new initializing replica. The fix to reestablish invariant 1 is to ensure that the primary relocation target has a cluster state with all replica recoveries that were successfully started on primary relocation source. The fix to reestablish invariant 2 is to check after opening engine on the replica if the primary has been relocated in the meanwhile and fail the recovery. Closes #19248	2016-07-19 14:07:58 +02:00
Simon Willnauer	f79fb4ada7	Create RecoveryTarget once we reset the source RecoveryTarget increments a reference on the store once it's created. If we fail to return the instance from the reset method we leak a reference causing shard locks to not be released. This change creates the reference in the return statement to ensure no references are leaked	2016-07-19 12:27:11 +02:00
Martijn van Groningen	52b1b3e31f	allocation explain: Also serialize `includeDiskInfo` field.	2016-07-19 11:54:43 +02:00
Yannick Welsch	79ab6d19af	Fix NPE when initializing replica shard has no unassignedInfo (#19491 ) An initializing replica shard might not have an UnassignedInfo object, for example when it is a relocation target. The method allocatedPostIndexCreate does not account for this situation.	2016-07-19 11:30:57 +02:00
Simon Willnauer	5b07f81fcf	Move `reset recovery` into RecoveriesCollection (#19466 ) Today when we reset a recovery because of the source not being ready or the shard is getting removed on the source (for whatever reason) we wipe all temp files and reset the recovery without respecting any reference counting or locking etc. all streams are closed and files are wiped. Yet, this is problematic since we assert that some files are on disk etc. when we finish writing a file. These assertions don't hold anymore if we concurrently wipe the tmp files. This change moves the logic out of RecoveryTarget into RecoveriesCollection which basically clones the RecoveryTarget on reset instead which allows in-flight operations to finish gracefully. This means we now have a single path for cleanups in RecoveryTarget and can safely use assertions in the class since files won't be removed unless the recovery is either canceled, failed or finished. Closes #19473	2016-07-19 10:23:02 +02:00
Adrien Grand	37e20c6f34	Automatically created indices should honor `index.mapper.dynamic`. #19478 Today they don't because the create index request that is implicitly created adds an empty mapping for the type of the document. So to Elasticsearch it looks like this type was explicitly created and `index.mapper.dynamic` is not checked. Closes #17592	2016-07-19 09:02:31 +02:00
Nik Everett	7861548786	Migrate serial_diff aggregation to NamedWriteable This is the last migration before AggregationStreams and PipelineAggregatorStreams can be removed to remove redundant code.	2016-07-18 13:00:06 -04:00
Adrien Grand	3bb6a4dea6	Try to prevent classloading deadlock. Closes #19316	2016-07-18 17:45:17 +02:00
Colin Goodheart-Smithe	e3d3f6b1f1	#19472 Enable option to use request cache for size > 0 Enable option to use request cache for size > 0	2016-07-18 16:28:07 +01:00
Yannick Welsch	4bec7ad58f	Do not throw AssertionError for expected exceptions in SearchWhileRelocatingIT (#19476 ) The test would previously catch Throwable and then decide if it was a critical exception or not. As the catch block was changed from Throwable to Exception this made the test fail for non-critical exceptions. This commit changes the test so that exceptions are only thrown when they're unexpected.	2016-07-18 16:45:07 +02:00
Martijn van Groningen	82e7f1fc43	parent/child: Make sure that no `_parent#null` gets introduces as default _parent mapping. Instead it should just be `_parent` field. Also added more tests regarding the join doc values field being added. Closes #19389	2016-07-18 16:38:13 +02:00
Nik Everett	16812cc032	Migrate moving_avg pipeline aggregation to NamedWriteable This is the first pipeline aggregation that doesn't have its own bucket type that needs serializing. It uses InternalHistogram instead. So that required reworking the new-style `registerAggregation` method to not require bucket readers. So I built `PipelineAggregationSpec` to mirror `AggregationSpec`. It allows registering any number of bucket readers or result readers.	2016-07-18 10:14:09 -04:00
Simon Willnauer	8394544548	Add a dedicated client/transport project for transport-client (#19435 ) The `client/transport` project adds a new jar build project that pulls in all dependencies and configures all required modules. Preinstalled modules are: * transport-netty * lang-mustache * reindex * percolator The `TransportClient` classes are still in core while `TransportClient.Builder` has only a protected construcutor such that users are redirected to use the new `TransportClientBuilder` from the new jar. Closes #19412	2016-07-18 15:42:24 +02:00
Colin Goodheart-Smithe	b717ad8eb6	Enable option to use request cache for size > 0 Previously if the size of the search request was greater than zero we would not cache the request in the request cache. This change retains the default behaviour of not caching requests with size > 0 but also allows the `request_cache=true` query parameter to enable the cache for requests with size > 0	2016-07-18 13:33:59 +01:00

1 2 3 4 5 ...

5773 Commits