OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-24 17:09:48 +00:00

Author	SHA1	Message	Date
Martijn van Groningen	145efbf6ea	Return missing (404) is a scroll_id is cleared that no longer exists. Closes #5730	2014-05-12 09:43:56 +02:00
Adrien Grand	51de01bae5	[TESTS] Tentative fix of BigArrays byte-accounting checks.	2014-05-12 09:25:49 +02:00
cccabot	58ebcf1252	Fixed typos in FieldSortBuilder	2014-05-10 02:57:51 +02:00
Andrew Selden	48879752a2	[TEST] Fix for benchmark tests - Fix bug where repeatedly calling computeSummaryStatistics() could accumulate some values incorrectly - Fix check for number of responsive nodes on list is <= number of candidate benchmark nodes - Add public getters for summary statistics - Add javadoc for new getters - Add javadoc comments about API use - Improve abort and status tests by calling awaitBusy() to wait for jobs to be completely submitted before testing them	2014-05-09 16:01:57 -07:00
mikemccand	5e40a4b95a	don't call isFinite from XAnalyzingSuggester; re-enable test on Java 8	2014-05-09 18:24:13 -04:00
javanna	6678da8c28	[TEST] randomly added node.bench=true to client node in test cluster and re-enabled REST benchmark tests based on number of bench nodes available In our REST tests we already have support for features and skip sections that allow to skip tests if a feature is not supported. We can then add a skip section based on the benchmark feature to the benchmark tests and execute them only when they are supported, knowing that they need at least a node with node.bench settings within the cluster. We can check that this requirement is met by calling the nodes info api. This way we can dynamically decide whether to execute those tests or not and we don't need to have a node.bench around all the time. In fact, given that the REST tests use the GLOBAL cluster, we want to be able to randomize settings as much as possible and run tests against default settings as well. Also, this mechanism can be easily supported by the external cluster implementation that is used during the release process. Introduced ability to disable benchmark nodes which is needed by BenchmarkNegativeTest.	2014-05-09 23:36:00 +02:00
Alex Ksikes	d8bb7c157a	[TEST] Removed the restriction on the number of bool clauses that must match. The test failed because 'percent_terms_to_match' defaults to 0.3, which results in requiring that some terms only found in the queried document must match, when all the documents are on the same shard.	2014-05-09 19:14:32 +02:00
Lee Hinman	e7e4ef859a	Add /_cat/fielddata to display fielddata usage Closes #4593	2014-05-09 13:18:02 +02:00
Alex Ksikes	dae48d9fe8	Added the ability to include the queried document for More Like This API. By default More Like This API excludes the queried document from the response. However, when debugging or when comparing scores across different queries, it could be useful to have the best possible matched hit. So this option lets users explicitly specify the desired behavior. Closes #6067	2014-05-09 12:59:39 +02:00
mikemccand	aa31c71776	mute this test until we fix isFinite	2014-05-09 05:24:22 -04:00
Martijn van Groningen	67fe88c63c	[TEST] Enforce that only one shard per node is allocated. The prevents during node shutdown, that a second shard is assigned the another node.	2014-05-09 10:43:08 +02:00
Martijn van Groningen	d7c05e5924	Temporarily disabling benchmark tests. Relates #6094	2014-05-08 13:18:12 +02:00
Martijn van Groningen	d5b95e3e8a	A number of changes to fix reduce failures if shard failures have occurred: * The shardTopDocs array should get created with the size equal to the total number of shard level requests and not the total number of requests that have a shard level result. * Make sure no null TopDocs entires are passed down to TopDocs#merge * Added dedicated scroll tests that tests scrolling on an index that has missing shards due to node failure. * Made sure that the sort fields in SimpleNestedTests exists by adding the fields in the mapping during index creation. Closes #6022	2014-05-08 10:17:00 +02:00
Martijn van Groningen	0efeeff49a	The percolator needs to deleted percolator documents into account when running in near realtime mode. This bug only occurs in non-realtime mode when query, filter, facet or aggs is used. Closes #5843 Closes #5840	2014-05-08 09:52:27 +02:00
Andrew Selden	c00120b818	Fix for benchmark test - Fix bug where repeatedly calling computeSummaryStatistics() could accumulate some values incorrectly. - Fix check for number of responsive nodes on list is <= number of candidate benchmark nodes. - Add public getters for summary statistics - Add javadoc for new getters - Add javadoc comments about API use	2014-05-07 18:42:39 -07:00
mikemccand	82aad78ff2	it's safe to use OneMerge.getTotalBytesSize (fixed in LUCENE-4775)	2014-05-07 17:25:06 -04:00
Andrew Selden	f23274523a	Integration tests for benchmark API. - Randomized integration tests for the benchmark API. - Negative tests for cases where the cluster cannot run benchmarks. - Return 404 on missing benchmark name. - Allow to specify 'types' as an array in the JSON syntax when describing a benchmark competition. - Don't record slowest for single-request competitions. Closes #6003, #5906, #5903, #5904	2014-05-07 14:14:54 -07:00
uboness	fc52db1209	Changed the respnose structure of the percentiles aggregation where now all the percentiles are placed under a `values` object (or `values` array in case the `keyed` flag is set to `false` Closes #5870	2014-05-07 18:35:24 +02:00
Shay Banon	743dc19acb	Node version sometimes empty in _cat/nodes closes #5480	2014-05-07 18:08:11 +02:00
Britta Weber	7944369fd1	Add `shard_min_doc_count` parameter for significant terms similar to `shard_size` Significant terms internally maintain a priority queue per shard with a size potentially lower than the number of terms. This queue uses the score as criterion to determine if a bucket is kept or not. If many terms with low subsetDF score very high but the `min_doc_count` is set high, this might result in no terms being returned because the pq is filled with low frequent terms which are all sorted out in the end. This can be avoided by increasing the `shard_size` parameter to a higher value. However, it is not immediately clear to which value this parameter must be set because we can not know how many terms with low frequency are scored higher that the high frequent terms that we are actually interested in. On the other hand, if there is no routing of docs to shards involved, we can maybe assume that the documents of classes and also the terms therein are distributed evenly across shards. In that case it might be easier to not add documents to the pq that have subsetDF <= `shard_min_doc_count` which can be set to something like `min_doc_count`/number of shards because we would assume that even when summing up the subsetDF across shards `min_doc_count` will not be reached. closes #5998 closes #6041	2014-05-07 18:02:56 +02:00
javanna	f554178fc7	Renamed IndicesOptions#strict and IndicesOptions#lenient to make it clearer what they actually return, reused methods and introduced new one Relates to #6059, where two new constants were introduced in IndicesOptions. There were already two constants there though, one of which we could have reused. This commit tries to unify them.	2014-05-07 17:40:57 +02:00
Alexander Reelsen	0c0f717aba	Removed Index Status API The functionality of the index status API has been replaced by the recovery API. Relates #4854	2014-05-07 16:57:19 +02:00
Adrien Grand	c49276cda7	Add a dedicated field data type for the _index field mapper. This makes aggregations work on the _index field, and also allows to remove the special facet aggregator for the _index field. Close #5848	2014-05-07 14:06:13 +02:00
Adrien Grand	c4f127fb6f	Limit the number of bytes that can be allocated to process requests. This should prevent costly requests from killing the whole cluster. Close #6050	2014-05-07 12:55:48 +02:00
Adrien Grand	8cd7811955	Lower initial sizing of sub aggregations. We currently compute initial sizings based on the cardinality of our fields. This can be highly exagerated for sub aggregations, for example if there is a parent terms aggregation that is executed over a field that has a very long tail: most buckets will only collect a couple of documents. Close #5994	2014-05-06 17:23:34 +02:00
Adrien Grand	c306d8c5f5	Don't assume fixed earth diameter in the geo-distance bounding box optimization. We switched to Lucene's SloppyMath way of computing an approximate value of the eath diameter given a latitude in order to compute distances, yet the bounding box optimization of the geo distance filter still assumed a constant earth diameter, equal to the average. Close #6008	2014-05-06 16:20:31 +02:00
Shay Banon	44fd962a9f	Improve 404 on missing scroll id This relates to #6040, the fix is twofold, first, not handling missing context specifically in the search code, but behave the same as we do in non scroll search, where if all the shards failed, raise an exception. The second is to apply this logic in both scroll cases.	2014-05-06 15:55:42 +02:00
Shay Banon	66296de38d	Remove unused dump infra Way back when, when ES started, there was an idea for a dump infrastructure, but it ended up supporting its serviceability aspects through APIs, remove the unused code	2014-05-06 14:02:24 +02:00
javanna	a8b6f81525	Made it mandatory to specify IndicesOptions when calling MetaData#concreteIndices Removed MetaData#concreteIndices variations that didn't require an IndicesOptions argument. Every caller should specify how indices should be resolved to concrete indices based on the indices options argument. Closes #6059	2014-05-06 12:45:16 +02:00
Adrien Grand	90b547cf2c	Remove RootMapper.validate and validate the routing key up-front. RootMapper.validate was only used by the routing field mapper, which makes buggy assumptions about how fields are indexed. For example, it assumes that the index representation of a field is the same as its external representation. Close #5844	2014-05-06 11:55:31 +02:00
Adrien Grand	589360c8b1	[TESTS] Don't randomize mappings in SimpleValidateQueryTests. This test relies on the fact that the _id field is not indexed.	2014-05-06 11:46:31 +02:00
Adrien Grand	17a32fca03	[TEST] Random dynamic templates. This change randomly indexes the _id field and randomizes field data formats and loading. Close #5834	2014-05-06 11:07:43 +02:00
Alexander Reelsen	d356881664	[REST] Missing scroll id now returns 404 A bad/non-existing scroll ID used to return a 200, however a 404 might be more useful. Also, this PR returns the right Exception (SearchContextMissingException) in the Java API. Additionally: Added StatusToXContent interface and RestStatusToXContentListener listener, so the appropriate RestStatus can be returned Closes #5729	2014-05-05 17:37:26 +02:00
Shay Banon	fad5e2d0e1	Remove operation threading from broadcast actions Similar to search removal, the operation threading options are not really ued, and the default should always be used. This also considerably simplifies the code. A side affect is that we can now remove the ShardIterator#firstOrNull method, which can cause for sneaky bugs to occur. closes #6044	2014-05-05 17:09:36 +02:00
Alexander Reelsen	799bb2491c	Analyze API: Default analyzer accidentally removed stopwords The analyze API used the standard analyzer from lucene and therefore removed stopwords instead of using the elasticsearch default analyzer. Closes #5974	2014-05-05 15:55:33 +02:00
Alexander Reelsen	d4fcf23057	Cluster State API: Remove index template filtering The possibility of filtering for index templates in the cluster state API had been introduced before there was a dedicated index templates API. This commit removes this support from the cluster state API, as it was not really clean, requiring you to specify the metadata and the index templates. Closes #4954	2014-05-05 14:54:14 +02:00
Shay Banon	7ce8306bc5	Remove search operation threading option Search operation threading is an option that is not really used, and current non default implementations are flawed. Handling it also creates quite the complexity in the search handling codebase... This is a breaking change, but one that is actually a good one, since I haven't seen/heard anybody use it, and if its used, its problematic... closes #6042	2014-05-05 11:39:16 +02:00
Benjamin Devèze	cea2d21c50	Fix bug in PropertyPlaceholder and add unit tests. Close #6034	2014-05-05 10:21:18 +02:00
Adrien Grand	727e6172e3	Restore read/write visibility is PlainShardsIterator. Change #5561 introduced a potential bug in that iterations that are performed on a thread are might not be visible to other threads due to the removal of the `volatile` keyword. Close #6039	2014-05-05 10:05:44 +02:00
Shay Banon	342a32fb16	Search might not return on thread pool rejection When a thread pool rejects the execution on the local node, the search might not return. This happens due to the fact that we move to the next shard only within the execution on the thread pool in the start method. If it fails to submit the task to the thread pool, it will go through the fail shard logic, but without "counting" the current shard itself. When this happens, the relevant shard will then execute more times than intended, causing the total opes counter to skew, and for example, if on another shard the search is successful, the total ops will be incremented beyond the expectedTotalOps, causing the check on == as the exit condition to never happen. The fix here makes sure that the shard iterator properly progresses even in the case of rejections, and also includes improvement to when cleaning a context is sent in case of failures (which were exposed by the test). Though the change fixes the problem, we should work on simplifying the code path considerably, the first suggestion as a followup is to remove the support for operation threading (also in broadcast), and move the local optimization execution to SearchService, this will simplify the code in different search action considerably, and will allow to remove the problematic #firstOrNull method on the shard iterator. The second suggestion is to move the optimization of local execution to the TransportService, so all actions will not have to explicitly do the mentioned optimization. fixes #4887	2014-05-05 09:24:53 +02:00
javanna	e96e634d10	[TEST] fixed _cat/thread_pool REST tests with local transport, in case the transport port is not available and gets returned as '-' Re-enabled REST tests suite Closes #6033	2014-05-04 22:10:03 +02:00
mikemccand	6bc3a744a1	Fix StackOverflowException for long suggestion strings Changed getFiniteStrings to use an iterative implementation instead of recursive, so we don't use a Java stack-frame per character for each suggestion at build & query time.	2014-05-04 13:35:05 -04:00
Shay Banon	c9f1792c81	Change default filter cache to 10% and circuit breaker to 60% The defaults we have today in our data intensive memory structures don't properly add up to properly protected from potential OOM. The circuit breaker, today at 80%, aims at protecting from extensive field data loading. The default threshold today is too permissive and can still cause OOMs. The filter cache today is at 20%, and its too high when adding it to other limits we have, reduce it to 10%, which is still a big enough portion of the heap, yet provides improved safety measure. closes #5990	2014-05-04 15:38:16 +02:00
Adrien Grand	01eb01cb70	[TEST] Disable REST tests until #6033 is fixed.	2014-05-04 11:58:30 +02:00
Boaz Leskes	694bf287d6	Do not start a recovery process if the primary shard is currently allocated on a node which is not part of the cluster state If a source node disconnect during recover, the target node will respond by canceling the recovery. Typically the master will respond by removing the disconnected node from the cluster state, promoting another shard to become primary. This is sent it to all nodes and the target node will start recovering from the new primary. However, if the drop of a node caused the node count to go bellow min_master_node, the master will step down and will not promote shard immediately. When a new master is elected we may publish a new cluster state (who's point is to notify of a new master) which is not yet updated. This caused the node to start a recovery to a non existent node. Before we aborted the recovery without cleaning up the shard, causing subsequent correct cluster states to be ignored. We should not start the recovery process but wait for another cluster state to come in. Closes #6024	2014-05-02 23:30:24 +02:00
Alex Ksikes	b55d8ed2e3	Fix behavior on default boost factor for More Like This. A boost terms factor of 1.0 is not the same as no boosting of terms. The desired behavior is to deactivate boosting by default. If the user specifies any value other than 0, then boosting is activated. Closes #6021	2014-05-02 16:59:09 +02:00
Holger Hoffstätte	f5c9bf6f0f	Update JNA to latest version Updating to this version allows to configure a special JNA directory, in case the /tmp directory is mounted with the noexec option, as JNA extracts some data and tries to execute parts of it. Also updated documentation to clarify mlockall and memory settings as well as pointing to the new jna.tmpdir system property. Closes #5493	2014-05-02 11:52:57 +02:00
Britta Weber	2e44040388	function_score parser throws exception if both functions:[] and single function given In addition, add a special warning if the misplaced function is a "boost_factor" function to avoid confusion of "boost" and "boost_function". closes #5995	2014-05-02 10:53:33 +02:00
Shay Banon	a557ee8daf	Support empty properties array in mappings closes #5887	2014-05-01 12:18:39 -04:00
Boaz Leskes	42a112f50b	debug log of receiving a cluster state from another master could be erroneously logged Added trace logging to MinimumMasterNodesTests.multipleNodesShutdownNonMasterNodes	2014-05-01 13:15:08 +02:00

... 3 4 5 6 7 ...

4196 Commits