5672 Commits

Author SHA1 Message Date
Martijn van Groningen
45699bae5a Make sure preference isn't null 2013-08-27 11:05:12 +02:00
Adrien Grand
db46946d16 Configurable sort order for missing string values.
This commit allows for configuring the sort order of missing values in BytesRef
comparators (used for strings) with the following options:
 - _first: missing values will appear in the first positions,
 - _last: missing values will appear in the last positions (default),
 - <any value>: documents with missing sort value will use the given value when
   sorting.

Since the default is _last, sorting by string value behaves differently than in
previous versions of elasticsearch, which used to insert missing values in the
first positions when sorting in ascending order.
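
For illustration, a minimal search request using the new option might look as follows (index and field names are placeholders; `missing` accepts `_first`, `_last`, or a literal value as described above):

```
curl -XGET 'http://localhost:9200/my-index/_search' -d '{
    "sort": [
        { "name": { "order": "asc", "missing": "_last" } }
    ]
}'
```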

Implementation notes:
 - Nested sorting is supported through the implementation of
   NestedWrappableComparator,
 - BytesRefValComparator was mostly broken since no field data implementation
   was using it; it is now tested through NoOrdinalsStringFieldDataTests,
 - Specialized BytesRefOrdValComparators have been removed now that the ordinals
   rely on packed arrays instead of raw arrays,
 - Field data tests hierarchy has been changed so that the numeric tests don't
   inherit from the string tests anymore,
 - When _first or _last is used, internally the comparators are told to use
   null or BytesRefFieldComparatorSource.MAX_TERM to replace missing values
   (depending on the sort order),
 - BytesRefValComparator just replaces missing values with the provided value
   and uses them for comparisons,
 - BytesRefOrdValComparator multiplies ordinals by 4 so that it can find
   ordinals for the missing value and the bottom value which are directly
   comparable with the segment ordinals. For example, if the segment values and
   ordinals are (a,1) and (b,2), they will be stored internally as (a,4) and
   (b,8) and if the missing value is 'ab', it will be assigned 6 as an ordinal,
   since it is between 'a' and 'b'. Then if the bottom value is 'abc', it will
   be assigned 7 as an ordinal since it is between 'ab' and 'b'.

Closes #896
2013-08-27 10:46:21 +02:00
Martijn van Groningen
2c939847b4 Simplified percolate reduce logic and the percolator recovery test 2013-08-27 10:45:09 +02:00
Martijn van Groningen
28e1744e79 No test is simple 2013-08-26 18:01:12 +02:00
Martijn van Groningen
29626dd201 Rename multi percolate item actions:
count_percolate -> count
percolate_existing_doc -> percolate
count_percolate_existing_doc -> count

If the header contains an `id` field, then it will automatically percolate an existing document.
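
A rough sketch of a multi percolate body under the renamed actions (the `_mpercolate` endpoint name, index/type names, and exact header options are assumptions here):

```
curl -XGET 'http://localhost:9200/_mpercolate' --data-binary '
{"percolate": {"index": "my-index", "type": "my-type"}}
{"doc": {"message": "some text"}}
{"percolate": {"index": "my-index", "type": "my-type", "id": "1"}}
{}
{"count": {"index": "my-index", "type": "my-type"}}
{"doc": {"message": "some text"}}
'
```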
2013-08-26 18:00:36 +02:00
Martijn van Groningen
3ca0239668 Added highlighter to percolate api.
The highlighter in the percolate api highlights snippets in the document being percolated. If highlighting is enabled then, for each matching query, highlight snippets will be generated.
All highlight options that are supported via the search api are also supported in the percolate api, since the percolate api embeds the same highlighter infrastructure as the search api.
The `size` option is required if highlighting is specified in the percolate api; other than that, the `highlight` request part can just be placed in the percolate api request body.
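
A hedged example of what such a request might look like (index, type, and field names are placeholders):

```
curl -XGET 'http://localhost:9200/my-index/my-type/_percolate' -d '{
    "doc": { "message": "some text to percolate" },
    "size": 5,
    "highlight": {
        "fields": {
            "message": {}
        }
    }
}'
```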

Closes #3574
2013-08-26 16:37:07 +02:00
Martijn van Groningen
df3922a22a Small cleanup 2013-08-26 15:42:19 +02:00
Martijn van Groningen
35dcdb0b5a Only use the client variable and don't use client from client() in percolator test 2013-08-26 15:37:18 +02:00
Shay Banon
8b295b53d0 Improve refresh logic when replica moves to started
closes #3573
2013-08-26 15:15:01 +02:00
Shay Banon
b329943632 improve search while create test
- improve the test to be more reproducible
- have tests for various replica counts, to check if failures are caused by searching on replicas that might not have been refreshed yet
- improve test to test explicit index creation, and index creation caused by index operation
- have an initial search go to _primary, to check if the failure occurs when searching on a replica because it missed a refresh
2013-08-26 14:29:40 +02:00
Martijn van Groningen
9f2e615ed9 Added extra logging 2013-08-26 09:48:04 +02:00
Boaz Leskes
a96ecea653 Multi term vector request
--------------------------

This feature allows retrieving [term vectors](https://github.com/elasticsearch/elasticsearch/issues/3114) for a list of documents. The json request has exactly the same [format](https://github.com/elasticsearch/elasticsearch/issues/3484) as the ```_termvectors``` endpoint.

To use it, call

```
curl -XGET 'http://localhost:9200/index/type/_mtermvectors' -d '{
    "fields": [
        "field1",
        "field2",
        ...
    ],
    "ids": [
        "docId1",
        "docId2",
        ...
    ],
    "offsets": false|true,
    "payloads": false|true,
    "positions": false|true,
    "term_statistics": false|true,
    "field_statistics": false|true
}'

```

The return format is an array, each entry of which contains the term vector response for one document:

```
{
   "docs": [
      {
         "_index": "index",
         "_type": "type",
         "_id": "docId1",
         "_version": 1,
         "exists": true,
         "term_vectors": {
         	...
         }
      },
      {
         "_index": "index",
         "_type": "type",
         "_id": "docId2",
         "_version": 1,
         "exists": true,
         "term_vectors": {
         ...
         }
      }
   ]
}
```

Note that, like the term vectors request, the multi term vectors request will silently skip over documents that have no term vectors stored in the index and will simply return an empty response in this case.

Closes #3536
2013-08-26 09:25:21 +02:00
Boaz Leskes
18c71b16b5 Refactor term vector api
This is necessary to allow adding a multi term vector request
2013-08-26 09:25:21 +02:00
Martijn van Groningen
a09f217b45 Don't reduce twice if failure occurs. 2013-08-25 21:50:04 +02:00
Simon Willnauer
020e68f2a0 Make sure all shards are refreshed during recovery test 2013-08-24 09:08:39 +02:00
Shay Banon
f3a35ccc90 upgrade to joda 2.3 2013-08-24 00:07:57 +02:00
Shay Banon
e892ec2c37 use our abort policy in the scheduler thread pool 2013-08-23 22:03:14 +02:00
Shay Banon
7791f2612d remove unused variable 2013-08-23 21:33:14 +02:00
Simon Willnauer
c688ed6c9f Don't assert a second time if awaitBusy returned successfully 2013-08-23 20:57:53 +02:00
Shay Banon
86c95ab2ab Setting index/bulk thread pools with queue_size can cause replica shard failures
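
For reference, the kind of elasticsearch.yml configuration this refers to might look like the following (queue sizes are arbitrary example values):

```
threadpool.index.queue_size: 200
threadpool.bulk.queue_size: 50
```
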
closes #3526
2013-08-23 19:24:47 +02:00
Simon Willnauer
19cce0b329 Use awaitBusy rather than a tight loop in IndicesStoreTests 2013-08-23 18:04:53 +02:00
Simon Willnauer
a943135ef6 Improve assertion and busy waiting for RecoveryWhileUnderLoadTests 2013-08-23 18:04:53 +02:00
Simon Willnauer
6c24a0af3e Wait for consistent view on both clients in ClusterServiceTests 2013-08-23 18:04:53 +02:00
Simon Willnauer
6d49170509 Introduce base test classes to share thread scope annotations
Currently we run unittests with clusters running in the background
that can potentially spawn threads, causing the thread leak control
to fire off in tests that don't use the test cluster. This commit
introduces some base classes for that purpose, shadowing lucene test
framework classes and adding the appropriate ThreadScope.
2013-08-23 18:04:53 +02:00
Simon Willnauer
71ebb14b58 Add ESAbortPolicy to cached pools
All ES ThreadPools / Executors should use the ESAbortPolicy or at least
one that throws the ESRejectedExecutionException.
2013-08-23 18:04:53 +02:00
Simon Willnauer
7c76819040 Randomize numeric types in MLT test and apply mapping ahead of time. 2013-08-23 18:04:52 +02:00
Britta Weber
8b9396b6da add additional method for setting combine mode with CombineFunction parameter 2013-08-23 16:04:00 +02:00
Britta Weber
8d6dc5908e add builders for nicer java api
This commit and the previous 10 commits close issue 3533.

Closes #3533
2013-08-23 16:03:23 +02:00
Britta Weber
5258940d9e format code 2013-08-23 14:44:34 +02:00
Britta Weber
1b085b069b rename reference -> origin 2013-08-23 14:44:34 +02:00
Britta Weber
9e7ad7249f rename scale_weight -> decay 2013-08-23 14:44:34 +02:00
Britta Weber
41b4a14933 Add offset to decay function score
Docs within the offset will be scored with 1.0; decay only starts after
the offset is reached.
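
A minimal sketch of a decay function using the new offset parameter (field name and values are placeholders):

```
{
    "function_score": {
        "query": { "match_all": {} },
        "gauss": {
            "timestamp": {
                "origin": "2013-08-23",
                "scale": "10d",
                "offset": "2d",
                "decay": 0.5
            }
        }
    }
}
```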
2013-08-23 14:44:22 +02:00
Britta Weber
c0288a62e6 rename 'total' to 'sum', both enum and for query 2013-08-23 14:44:01 +02:00
Britta Weber
6035134047 add more combine functions and rename PLAIN to REPLACE 2013-08-23 14:44:01 +02:00
Britta Weber
db100aa2de make GeoPoint parsable in lat/lon json format 2013-08-23 14:43:54 +02:00
Britta Weber
f125ac122c format code 2013-08-23 13:55:14 +02:00
Britta Weber
41c59c6b49 make mult default boost mode
Always multiply the query score with the function score. For script score
functions, this means that boost_mode has to be set to `plain` if
'function_score' should behave like 'custom_score'.
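
A rough sketch of that distinction (query, field, and script are placeholders; the parameter values follow the naming used in these commits and may have changed later):

```
{
    "function_score": {
        "query": { "match": { "message": "some text" } },
        "script_score": { "script": "doc['popularity'].value" },
        "boost_mode": "plain"
    }
}
```

With the default `mult` boost mode the query score is multiplied into the function score; with `plain` only the function score is used, mimicking 'custom_score'.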
2013-08-23 13:55:14 +02:00
Britta Weber
634f1036a0 add boost_mode to rest interface
Allow the user to set combine functions explicitly via the boost_mode variable.
2013-08-23 13:55:14 +02:00
Britta Weber
b007af1f46 Fix inconsistent usage of ScriptScoreFunction in FiltersFunctionScoreQuery
This commit fixes inconsistencies in `function_score` and `filters_function_score`
using scripts, see issue #3464

The method 'ScoreFunction.factor(docId)' is removed completely, since the name
suggests that this method actually computes a factor, which was not the case.
Multiplying the computed score is now handled by 'FiltersFunctionScoreQuery'
and 'FunctionScoreQuery' and not implicitly performed in
'ScoreFunction.factor(docId, subQueryScore)' as was the case for 'BoostScoreFunction'
and 'DecayScoreFunctions'.

This commit also fixes the explain function for FiltersFunctionScoreQuery. Here,
the influence of the maxBoost was never printed. Furthermore, the queryBoost was
printed as being multiplied into the filter score.

Closes #3464
2013-08-23 13:55:14 +02:00
Alexander Reelsen
2b03bc83a4 Dont write pidfile twice on startup
There is no need to write the pidfile in the bin/elasticsearch shell script
as this happens already in the java code.

Also cleaning up the bin/elasticsearch shell script a bit (no need to return
an error code when exec is called, as this forks and exits the shell script
immediately).

Closes #3529
Closes #1745
2013-08-23 13:20:29 +02:00
Shay Banon
1ac00a13fb cleanup removable of reject_policy 2013-08-23 12:15:47 +02:00
Louis Gueye
048a02eebc Debian init script: Add debian default java location
Adding /usr/lib/jvm/default-java to JAVA_HOME candidates to check
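
A sketch of the kind of check this adds to the init script (exact variable handling in the script may differ):

```sh
# Prefer an explicitly set JAVA_HOME, then fall back to Debian's default-java link
if [ -z "$JAVA_HOME" ] && [ -d /usr/lib/jvm/default-java ]; then
    JAVA_HOME=/usr/lib/jvm/default-java
fi
```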

Closes #3500
2013-08-23 11:27:12 +02:00
Martijn van Groningen
e173f9a369 Added test for verifying the id cache size if the clear cache api is invoked. 2013-08-23 10:04:13 +02:00
Dan Everton
c76e589bc5 Call onRemoval of shard IdCache during clear.
This looks like a copy/paste issue where onCached was being called
rather than onRemoval. This should fix the ID cache stats not being
correct after a call to /_cache/clear?id_cache=true
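
The call in question, for reference (index name is a placeholder):

```
curl -XPOST 'http://localhost:9200/my-index/_cache/clear?id_cache=true'
```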
2013-08-23 09:55:51 +02:00
uboness
4b3a883111 random_score function - Added the index name and shard id to the randomization, and improved the PRNG itself
Closes #3559
2013-08-23 02:48:43 +02:00
Boaz Leskes
109e2944f2 ClusterUpdateSettingsAction will hang if no changes were made
Closes #3560
2013-08-22 20:59:31 +02:00
Simon Willnauer
fc3133d087 Prevent NPE if all docs for the field are pruned during merge
During segment merges FieldConsumers for all fields from the source
segments are created. Yet, if all documents that have a value in a
certain field are deleted / pruned during the merge, the FieldConsumer
will not be called at all, causing the internally used FST Builder to
return `null` from its `build()` method. This means we can't store
it and potentially run into an NPE. This commit adds handling for
this corner case both during the merge and during the suggest phase, since
we also don't have a Lookup instance for this segment's field.

Closes #3555
2013-08-22 11:59:25 +02:00
Martijn van Groningen
7fda12316a Properly reduce in onFailure 2013-08-21 23:57:15 +02:00
David Pilato
352d2aaf18 Plugin Manager can not download _site plugins from github
Sounds like github changed the download url for the master zip file a bit.

From `https://github.com/username/reponame/zipball/master` to `https://github.com/username/reponame/archive/master.zip`.

We need to update plugin manager to reflect that change.

In the meantime, we invite users having this issue to use:

```sh
bin/plugin -install reponame -url https://github.com/username/reponame/archive/master.zip
```

For example:

```sh
bin/plugin -install paramedic -url https://github.com/karmi/elasticsearch-paramedic/archive/master.zip
```

Closes #3551
2013-08-21 17:44:08 +02:00
David Pilato
8668479b92 Plugin Manager can not download _site plugins from github
Sounds like github changed the download url for the master zip file a bit.

From `https://github.com/username/reponame/zipball/master` to `https://codeload.github.com/username/reponame/zip/master`.

We need to update plugin manager to reflect that change.

In the meantime, we invite users having this issue to use:

```sh
bin/plugin -install reponame -url https://codeload.github.com/username/reponame/zip/master
```

For example:

```sh
bin/plugin -install paramedic -url https://codeload.github.com/karmi/elasticsearch-paramedic/zip/master
```

Closes #3551
2013-08-21 16:08:00 +02:00