OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-25 01:19:02 +00:00

Author	SHA1	Message	Date
Shay Banon	646ab22e3a	only process an internal refresh mapping once we only need to process a refresh on an index/type tuple once even if there are several pending, optimize to not do it several times	2013-07-24 19:12:21 +02:00
Simon Willnauer	08697b2ba8	Fix test-bug that produced ordinals with gaps	2013-07-24 18:04:54 +02:00
Shay Banon	6da558f81f	extra space...	2013-07-24 17:51:18 +02:00
Shay Banon	9f57f26be4	make base classes tests abstract (for intellij)	2013-07-24 17:28:51 +02:00
Shay Banon	a3c3d4073b	use host address instead of host name in tests to not do host lookup	2013-07-24 17:22:18 +02:00
Simon Willnauer	ed473e272d	Cut over to JUnit & Randomized Runner from TestNG For better integration with the Lucene Test Framework and the availabilty of RandomizedRunner / Randommized Testing this commit moves over from TestNG to JUnit. This commit also moves relevant places over to RandomzedRunner for reproduceability and removes copied classes from the Lucene Test Framework.	2013-07-24 16:59:36 +02:00
Martijn van Groningen	5de9877405	We should wait until the first blocking task is actual being executed. The pending tasks can be returned out of order.	2013-07-24 15:59:01 +02:00
Shay Banon	d67b8f1310	don't xcontent locale if its ROOT for date since its the default	2013-07-24 12:47:22 +02:00
Boaz Leskes	497032e6e7	FuzzyLikeThisFieldQueryBuilder now defaults failOnUnsupportedField to true, to be consistent with REST API. Closes #3374	2013-07-24 09:54:23 +02:00
Shay Banon	b85bcde640	better logging on failure to create index if the index already exists, trace log it, since it might happen as a result of multiple index requests at the same time creating the index, all other ones, should be debug and not warn in the same spirit of other APIs	2013-07-24 01:58:31 +02:00
Shay Banon	bb6df34671	bulk refresh and update mapping cluster events try and bulk as much as possible refresh and update mapping events, so they will all be processed at a single go, resulting in less cluster change events, and also reduce the load of multiple changes to the same index also, change the prio for those to HIGH, since we want URGENT ones (like create index, delete index) to execute	2013-07-24 01:58:11 +02:00
Shay Banon	6a5d2bf767	remove cached stream output start to build the infra for simpler migration to netty 4, starting with removing the cache stream output handling as it will come built in	2013-07-23 15:56:48 +02:00
Martijn van Groningen	b52243cdc2	Added cluster pending tasks api. Closes #3368	2013-07-23 15:33:29 +02:00
Simon Willnauer	69a7f8d71d	Removed XPatternCaptureGroupTokenFilter	2013-07-23 13:55:20 +02:00
Simon Willnauer	2e9851138e	Upgrade to Lucene 4.4	2013-07-23 13:55:15 +02:00
Shay Banon	92a7030558	Add optimize thread pool (size 1) dedicated to perform explicit optimize API Have a dedicated thread pool for explicit optimize calls (shard level optimize operations). By default, the size should be 1 to work the same with how things work currently allowing for only 1 shard level optimize on a node. The change allows to see the optimize thread pool stats now, and potentially increase the thread pool size for beefy machines. closes #3366	2013-07-23 11:05:15 +02:00
Shay Banon	f9ce791578	make sure we update last failure	2013-07-23 10:54:42 +02:00
Martijn van Groningen	aa5c15a1d7	Wait a bit longer for ttl to have occurred	2013-07-23 00:29:11 +02:00
Martijn van Groningen	18141b8da0	Fix SimplePercolatorTests#testPercolateStatistics	2013-07-23 00:10:27 +02:00
Shay Banon	4930b93c26	move master actions to accept a lister, improve cluster service state execution - master actions many times end up being executed on the cluster service, so there is no need to block them on the management thread pool to wait for a response, this remove the load on the management thread pool, and also simplifies the code implementing it - cluster service state update exception handling was improved to include a callback when a failure happens during state update execution, this makes sure that we catch all relevant exceptions and invoke the callback, as well as simplify the code of cluster state update tasks, as they can throw failures from within the execute method and then handle them properly	2013-07-23 00:08:45 +02:00
Adrien Grand	e943cc81a5	Add FastVectorHighlighter support for more complex queries. FastVectorHighlighter fails at highlighting some complex queries such as multi phrase queries which have two terms at the same position. This can be easily triggered by running a `match_phrase` query with an analyzer which outputs synonyms such as SynonymFilter or WordDelimiterFilter. Close #3357	2013-07-22 19:43:22 +02:00
Shay Banon	6b21414520	should be 500...	2013-07-22 17:53:44 +02:00
Shay Banon	c766f6bd97	Cluster State Update APIs (master node) to respect master_timeout better also respect the timeout when trying to obtain the md lock relates #3365	2013-07-22 17:53:18 +02:00
Shay Banon	235a68c3bd	Cluster State Update APIs (master node) to respect master_timeout better Currently, the master node might be processing too many cluster state events, and then be blocked on waiting for its respective even to be processed. We can use the new cluster state update timeout support to use the master_timeout value and respect it. closes #3365	2013-07-22 16:58:00 +02:00
Luca Cavanna	0b33394476	Added alias action validation Moved alias action validation to IndicesAliasesRequest, so that Java api and RestIndexPutAliasAction can benefit from it too. Added check in MetaDataIndexAliasesService too. Closes #3363	2013-07-22 15:27:20 +02:00
Martijn van Groningen	2f7d1189b1	Also serialize the routing and preference	2013-07-22 15:16:29 +02:00
Martijn van Groningen	d310b94904	Fix ConcurrentPercolatorTests replaces CountDownLatches array by a Semaphore. The writes to the CountDownLatches array wasn't visible by all threads when the countdown latch array slots were re-initialized.	2013-07-22 11:32:55 +02:00
Shay Banon	4da7086df8	catch failures and notify the listener in aliases action handling	2013-07-22 11:19:23 +02:00
Shay Banon	7a9350c9a1	Transport: Add a dedicated ping channel Today, we have a low/med/high channel groups in our transport layer. High is used to publish cluster state and send ping requests. Sometimes, the overhead of publishing large cluster states can interfere with ping requests. Introduce a new, dedicated ping channel (with size 1) to have a channel that only handles ping requests. closes #3362	2013-07-22 10:29:57 +02:00
Shay Banon	e2961c0c7a	add ability to associate a timeout with a priority executor enhancement that can be used later to timeout tasks that are pending, also added the ability to get the pending task list from the executor	2013-07-22 09:16:56 +02:00
Martijn van Groningen	e8ff7de6b8	Test to some extent the time spent on percolating.	2013-07-22 09:02:27 +02:00
Shay Banon	bd52d61d5d	reroute post shard started should have HIGH prio as well	2013-07-21 15:21:22 +02:00
Shay Banon	f2614b22de	Zen Discovery Cluster Events to have Priority.URGENT Master node cluster state events resulting in zen discovery (node gets added, removed, for example) should be processed with priority URGENT as its always better to process them as fast as possible, and not let other events get in the way. closes #3361	2013-07-21 14:57:26 +02:00
Shay Banon	6a25395c97	fix percolate stats tests failures	2013-07-21 14:25:26 +02:00
Adrien Grand	09362f47e9	Be consistent with the default value of `acceptable_overhead_ratio`.	2013-07-19 19:37:36 +02:00
Martijn van Groningen	32a96aea71	Added percolate statistics to indices and node stats. Relates to #3173	2013-07-19 19:14:49 +02:00
Shay Banon	eb75a815db	pattern replace with empty "" setting fails we should default the replacement to "", since in the settings, if its set to "", we remove the setting	2013-07-19 19:13:15 +02:00
skymeyer	08e35e4dbe	Use systemd configuration file for applying limits correctly In case systemd is used, ulimit is not called (as it would be in the initscript) and has to be configured in the systemd configuration. For more information about the parameters LimitNOFILE and LimitMEMLOCK see http://www.freedesktop.org/software/systemd/man/systemd.exec.html	2013-07-19 18:10:00 +02:00
Martijn van Groningen	c346ed3d2d	Fix ConcurrentPercolatorTests that failed occasionally when running with other tests, by not using the shared test cluster.	2013-07-19 16:54:14 +02:00
Shay Banon	b12acbcf9e	introduce read/writeSharedString while streaming currently, we treat all strings as shared (either by full equality or identity equality), while almost all times we know if they should be serialized as shared or not. Add an explicitly write/readSharedString, and use it where applicable, and all other write/readString will not treat them as shared relates to #3322	2013-07-19 16:17:22 +02:00
Shay Banon	74a7c46b0e	top level filter not resulting in an actual filter is ignored when parsing a filter, we use null to indicate that this filter should not match anything, the top level filter doesn't take it into account fixes #3356	2013-07-19 13:12:23 +02:00
Adrien Grand	fe4c2a9d02	Work around the fact that AssertionError(String message, Throwable cause) is a Java 1.7-only API.	2013-07-19 09:47:55 +02:00
Adrien Grand	12d9268db2	Make field data able to support more than 2B ordinals per segment. Although segments are limited to 2B documents, there is not limit on the number of unique values that a segment may store. This commit replaces 'int' with 'long' every time a number is used to represent an ordinal and modifies the data-structures used to store ordinals so that they can actually support more than 2B ordinals per segment. This commit also improves memory usage of the multi-ordinals data-structures and the transient memory usage which is required to build them (OrdinalsBuilder) by using Lucene's PackedInts data-structures. In the end, loading the ordinals mapping from disk may be a little slower, field-data-based features such as faceting may be slightly slower or faster depending on whether being nicer to the CPU caches balances the overhead of the additional abstraction or not, and memory usage should be better in all cases, especially when the size of the ordinals mapping is not negligible compared to the size of the values (numeric data for example). Close #3189	2013-07-19 09:10:08 +02:00
Martijn van Groningen	4d05c9cfd5	Optimize `has_child` query & filter execution with two short circuit mechanisms: * If all parent ids have been emitted as hit, abort the query / filter execution. * If the a relative small number of parent ids have been collected in the first phase then limit the number of second phase parent id lookups by putting a short circuit filter before parent document evaluation or omit the it in the case of the filter. This is contrable via the `short_circuit_cutoff` option which is exposed in the `has_child` query & filter. All parent / child queries and filters (expect `top_children` query) abort execution if no parent ids have been collected in the first phase. Closes #3190	2013-07-18 17:41:23 +02:00
Martijn van Groningen	c222ce28fc	Redesigned the percolator engine to execute in a distribute manner. With this design the percolate queries will be stored in a special `_percolator` type with its own mapping in the same index where the actual data is or in a different index (dedicated percolation index, which might require different sharding behavior compared to the index that holds actual data and being search on). This approach allows percolate requests to scale to the number of primary shards an index has been configured with and effectively distributes the percolate execution. This commit doesn't add new percolate features other than scaling. The response remains similar, with exception that a header similar to the search api has been added to the percolate response. Closes #3173	2013-07-18 16:52:42 +02:00
Adrien Grand	f38103a232	Eclipse: organize imports on save. It can happen that Eclipse fails at correctly adding a new import entry to an existing list of imports since we don't use its default rules. This commit forces Eclipse to organize imports on save.	2013-07-18 14:49:35 +02:00
Florian Schilling	1e5e8d83b1	Changed GeoPoint parsing in serveral parsers using Geopoint.parse() Closes #3351	2013-07-18 12:49:12 +02:00
Robin Hughes	45a756c203	Analysis: update ThaiAnalyzerProvider to use custom stopwords setting	2013-07-18 11:18:37 +02:00
Luca Cavanna	c28452ee67	added missing link to the elasticsearch.org website	2013-07-17 19:07:22 +02:00
Adrien Grand	ffcc710e4e	Add the ability to ignore or fail on numeric fields when executing more-like-this or fuzzy-like-this queries. More-like-this and fuzzy-like-this queries expect analyzers which are able to generate character terms (CharTermAttribute), so unfortunately this doesn't work with analyzers which generate binary-only terms (BinaryTermAttribute, the default CharTermAttribute impl being a special BinaryTermAttribute) such as our analyzers for numeric fields (byte, short, integer, long, float, double but also date and ip). To work around this issue, this commits adds a fail_on_unsupported_field parameter to the more-like-this and fuzzy-like-this parsers. When this parameter is false, numeric fields will just be ignored and when it is true, an error will be returned, saying that these queries don't support numeric fields. By default, this setting is true but the mlt API sets it to true in order not to fail on documents which contain numeric fields. Close #3252	2013-07-16 18:37:34 +02:00

1 2 3 4 5 ...

5082 Commits