5379 Commits

Author SHA1 Message Date
Britta Weber
1ab037d4d0 Fix searching while shard is being relocated
Shard relocation caused queries to fail in the fetch phase either with a
`AlreadyClosedException` or a `SearchContextMissingException`.
This was caused by the `SearchService` releasing the `SearchContext`s via the
superfluous call of `releaseContextsForShard()` when the
2013-08-27 18:30:00 +02:00
Simon Willnauer
e7ff8ea509 Added unit tests or zone awareness 2013-08-27 18:05:42 +02:00
Luca Cavanna
d5b2c8e82f Fixed extraction of site plugins downloaded from github, so that we skip the top-level folder and we place the files directly under the _site folder
Closes #3551
2013-08-27 17:03:55 +02:00
Boaz Leskes
fa067e058a MultiTermVectorsAction was wrongly registered under mget 2013-08-27 13:46:33 +02:00
Simon Willnauer
39040b5f17 Add more tests for Zone Awareness 2013-08-27 12:25:35 +02:00
Simon Willnauer
6b8dd0b08f Added log info on test finished 2013-08-27 12:03:49 +02:00
Luca Cavanna
6e19ca8080 Fixed order of parameters when calling byteSizeField and timeValueField methods (introduced with #3432 - support for human readable flag) 2013-08-27 11:48:20 +02:00
Martijn van Groningen
45699bae5a Make sure preference isn't null 2013-08-27 11:05:12 +02:00
Adrien Grand
db46946d16 Configurable sort order for missing string values.
This commit allows for configuring the sort order of missing values in BytesRef
comparators (used for strings) with the following options:
 - _first: missing values will appear in the first positions,
 - _last: missing values will appear in the last positions (default),
 - <any value>: documents with missing sort value will use the given value when
   sorting.

Since the default is _last, sorting by string value will have a different
behavior than in previous versions of elasticsearch which used to insert missing
value in the first positions when sorting in ascending order.

Implementation notes:
 - Nested sorting is supported through the implementation of
   NestedWrappableComparator,
 - BytesRefValComparator was mostly broken since no field data implementation
   was using it, it is now tested through NoOrdinalsStringFieldDataTests,
 - Specialized BytesRefOrdValComparators have been removed now that the ordinals
   rely on packed arrays instead of raw arrays,
 - Field data tests hierarchy has been changed so that the numeric tests don't
   inherit from the string tests anymore,
 - When _first or _last is used, internally the comparators are told to use
   null or BytesRefFieldComparatorSource.MAX_TERM to replace missing values
   (depending on the sort order),
 - BytesRefValComparator just replaces missing values with the provided value
   and uses them for comparisons,
 - BytesRefOrdValComparator multiplies ordinals by 4 so that it can find
   ordinals for the missing value and the bottom value which are directly
   comparable with the segment ordinals. For example, if the segment values and
   ordinals are (a,1) and (b,2), they will be stored internally as (a,4) and
   (b,8) and if the missing value is 'ab', it will be assigned 6 as an ordinal,
   since it is between 'a' and 'b'. Then if the bottom value is 'abc', it will
   be assigned 7 as an ordinal since if it between 'ab' and 'b'.

Closes #896
2013-08-27 10:46:21 +02:00
Martijn van Groningen
2c939847b4 Simplified percolate reduce logic and the percolator recovery test 2013-08-27 10:45:09 +02:00
Martijn van Groningen
28e1744e79 No test is simple 2013-08-26 18:01:12 +02:00
Martijn van Groningen
29626dd201 Rename multi percolate item actions:
count_percolate -> count
percolate_existing_doc -> percolate
count_percolate_existing_doc -> count

If header contains `id` field, then it will automatically be percolation  an existing document.
2013-08-26 18:00:36 +02:00
Martijn van Groningen
3ca0239668 Added highlighter to percolate api.
The highlighter in the percolate api highlights snippets in the document being percolated. If highlighting is enabled then foreach matching query, highlight snippets will be generated.
All highlight options that are supported via the search api are also supported in the percolate api, since the percolate api embeds the same highlighter infrastructure as the search api.
The `size` option is a required option if highlighting is specified in the percolate api, other than that the `highlight`request part can just be placed in the percolate api request body.

Closes #3574
2013-08-26 16:37:07 +02:00
Martijn van Groningen
df3922a22a Small cleanup 2013-08-26 15:42:19 +02:00
Martijn van Groningen
35dcdb0b5a Only use the client variable and don't use client from client() in percolator test 2013-08-26 15:37:18 +02:00
Shay Banon
8b295b53d0 Improve refresh logic when replica move to started
closes #3573
2013-08-26 15:15:01 +02:00
Shay Banon
b329943632 improve search while create test
- improve the test to be more re-creatable
- have tests for various number of replica counts, to check if failures are caused by searching on replicas that might not have been refreshed yet
- improve test to test explicit index creation, and index creation caused by index operation
- have an initial search go to _primary, to check if failure fails when searching on replica because it missed a refresh
2013-08-26 14:29:40 +02:00
Martijn van Groningen
9f2e615ed9 Added extra logging 2013-08-26 09:48:04 +02:00
Boaz Leskes
a96ecea653 Multi term vector request
--------------------------

This feature allows to retrieve [term vectors](https://github.com/elasticsearch/elasticsearch/issues/3114) for a list of documents. The json request has exactly the same [format](https://github.com/elasticsearch/elasticsearch/issues/3484) as the ```_termvectors``` endpoint

It use it, call

```
curl -XGET 'http://localhost:9200/index/type/_mtermvectors' -d '{
    "fields": [
        "field1",
        "field2",
        ...
    ],
    "ids": [
        "docId1",
        "docId2",
        ...
    ],
    "offsets": false|true,
    "payloads": false|true,
    "positions": false|true,
    "term_statistics": false|true,
    "field_statistics": false|true
}'

```

The return format is an array, each entry of which conatins the term vector response for one document:

```
{
   "docs": [
      {
         "_index": "index",
         "_type": "type",
         "_id": "docId1",
         "_version": 1,
         "exists": true,
         "term_vectors": {
         	...
         }
      },
      {
         "_index": "index",
         "_type": "type",
         "_id": "docId2",
         "_version": 1,
         "exists": true,
         "term_vectors": {
         ...
         }
      }
   ]
}
```

Note that, like term vectors, the mult term vectors request will silenty skip over documents that have no term vectors stored in the index and will simply return an empty response in this case.

Closes #3536
2013-08-26 09:25:21 +02:00
Boaz Leskes
18c71b16b5 Refactor term vector api
This is necessary to allow adding a mult term vector request
2013-08-26 09:25:21 +02:00
Martijn van Groningen
a09f217b45 Don't reduce twice if failure occurs. 2013-08-25 21:50:04 +02:00
Simon Willnauer
020e68f2a0 Make sure all shards are refreshed during recovery test 2013-08-24 09:08:39 +02:00
Shay Banon
f3a35ccc90 upgrade to joda 2.3 2013-08-24 00:07:57 +02:00
Shay Banon
e892ec2c37 use our abort policy in the scheduler thread pool 2013-08-23 22:03:14 +02:00
Shay Banon
7791f2612d remove unused variable 2013-08-23 21:33:14 +02:00
Simon Willnauer
c688ed6c9f Don't assert a second time if awaitBusy returned successfully 2013-08-23 20:57:53 +02:00
Shay Banon
86c95ab2ab Setting index/bulk thread pools with queue_size can cause replica shard failures
closes #3526
2013-08-23 19:24:47 +02:00
Simon Willnauer
19cce0b329 Use awaiBusy rather than a tight loop in IndicesStoreTests 2013-08-23 18:04:53 +02:00
Simon Willnauer
a943135ef6 Improve assertion and busy waiting for RecoveryWhileUnterLoadTests 2013-08-23 18:04:53 +02:00
Simon Willnauer
6c24a0af3e Wait for consistent view on both clients in ClusterServiceTests 2013-08-23 18:04:53 +02:00
Simon Willnauer
6d49170509 Introduce base test classes to share thread scope annotations
Currently we run unittests with clusters running in the background
that can potentiallly spawn threads causeing the thread leak control
to fire off in tests that don't use the test cluster. This commit
introduces some base classes for that purpose shadowning lucene test
framework classes adding the approriate ThreadScope.
2013-08-23 18:04:53 +02:00
Simon Willnauer
71ebb14b58 Add ESAbortPolicy to cached pools
All ES ThreadPools / Executors should use the ESAbortPolicy or at least
one that throws the ESRejectedExecutionException.
2013-08-23 18:04:53 +02:00
Simon Willnauer
7c76819040 Randomize numeric types in MLT test and apply mapping ahead of time. 2013-08-23 18:04:52 +02:00
Britta Weber
8b9396b6da add additional method for setting combine mode with CombineFunction parameter 2013-08-23 16:04:00 +02:00
Britta Weber
8d6dc5908e add builders for nicer java api
This commit and the previous 10 commits close issue 3533.

Closes #3533
2013-08-23 16:03:23 +02:00
Britta Weber
5258940d9e format code 2013-08-23 14:44:34 +02:00
Britta Weber
1b085b069b rename reference -> origin 2013-08-23 14:44:34 +02:00
Britta Weber
9e7ad7249f rename scale_weight -> decay 2013-08-23 14:44:34 +02:00
Britta Weber
41b4a14933 Add offset to decay function score
Docs within the offset will be scored with 1.0, decay only starts after
offset is reached.
2013-08-23 14:44:22 +02:00
Britta Weber
c0288a62e6 rename 'total' to 'sum', both enum and for query 2013-08-23 14:44:01 +02:00
Britta Weber
6035134047 add more combine functions and rename PLAIN to REPLACE 2013-08-23 14:44:01 +02:00
Britta Weber
db100aa2de make GeoPoint parsable in lat/lon json format 2013-08-23 14:43:54 +02:00
Britta Weber
f125ac122c format code 2013-08-23 13:55:14 +02:00
Britta Weber
41c59c6b49 make mult default boost mode
always multiply query score to function score. For script score
functions, this means that boost_mode has to be set to `plain` if
'function_score' should behave like 'custom_score'
2013-08-23 13:55:14 +02:00
Britta Weber
634f1036a0 add boost_mode to rest interface
allow user to set combine functions explicitely via boost_mode variable.
2013-08-23 13:55:14 +02:00
Britta Weber
b007af1f46 Fix inconsistent usage of ScriptScoreFunction in FiltersFunctionScoreQuery
This commit fixes inconsistencies in `function_score` and `filters_function_score`
using scripts, see issue #3464

The method 'ScoreFunction.factor(docId)' is removed completely, since the name
suggests that this method actually computes a factor which was not the case.
Multiplying the computed score is now handled by 'FiltersFunctionScoreQuery'
and 'FunctionScoreQuery' and not implicitely performed in
'ScoreFunction.factor(docId, subQueryScore)' as was the case for 'BoostScoreFunction'
and 'DecayScoreFunctions'.

This commit also fixes the explain function for FiltersFunctionScoreQuery. Here,
the influence of the maxBoost was never printed. Furthermore, the queryBoost was
printed as beeing multiplied to the filter score.

Closes #3464
2013-08-23 13:55:14 +02:00
Alexander Reelsen
2b03bc83a4 Dont write pidfile twice on startup
There is no need to write the pidfile in the bin/elasticsearchshell script
as this happens already in the java code.

Also cleaning up the bin/elasticsearch shell script a bit (no need to return
an error code when exec is called, as this forks and exits the shell script
immediately).

Closes #3529
Closes #1745
2013-08-23 13:20:29 +02:00
Shay Banon
1ac00a13fb cleanup removable of reject_policy 2013-08-23 12:15:47 +02:00
Louis Gueye
048a02eebc Debian init script: Add debian default java location
Adding /usr/lib/jvm/default-java to JAVA_HOME candidates to check

Closes #3500
2013-08-23 11:27:12 +02:00
Martijn van Groningen
e173f9a369 Added test for verifying the id cache size if the clear cache api is invoked. 2013-08-23 10:04:13 +02:00