5258 Commits

Author SHA1 Message Date
Martijn van Groningen
15fcb17a81 During parent uid loading seek to next parent type when child type is encountered.
Relates to 
2013-05-14 16:22:05 +02:00
Simon Willnauer
6d5805c901 Use Recovery Throtteling by default.
To prevent to extensive resource use during recovery we use
recovery throtteling by default to prevent unexpected peak load
on clusters. The default is set to 20 MB/sec.

Closes 
2013-05-14 15:10:03 +02:00
Simon Willnauer
6624949501 Use Merge Throtteling by default on node level.
Merge Throtteling is one of the most recommended settings and crucial in the
RealTime indexing case. We should set the default to a reasonable setting
that allows folks to index in a production index and don't see large merge
peaks by default. The default is set to 20 MB/sec on the node level.

Closes 
2013-05-14 15:10:03 +02:00
Simon Willnauer
09fb2264d0 Raise search threadpool default size.
The default size used to be 2x availableProcessors which seemed to
be a to lowish value in practice. 3x appeared to be a sweetspot for
most application. The default is now 3 x availableProcessors

Closes 
2013-05-14 15:10:03 +02:00
uboness
d06a15ec3e Support for term facets on unmapped fields
Added support for unmapped & partially mapped fields (partially mapped fields may occur when searching across multiple indices where the faceted field is mapped on some and unmapped on others). If a shard doesn't have mappings for a field, the matching documents count on that shard will be added to the missing count for that facet.
2013-05-14 13:53:41 +02:00
Martijn van Groningen
906f278896 Make sure only relevant documents are evaluated in the second round lookup phase.
Both has_parent and has_child filters are internally executed in two rounds. In the second round all documents are evaluated whilst only specific documents need to be checked. In the has_child case only documents belonging to a specific parent type need to be checked and in the has_parent case only child documents need to be checked.

Closes 
2013-05-14 11:02:03 +02:00
Shay Banon
ae6c1b345f Allow to disable allocation on the index level
Similar to the global cluster wide disable allocation flags, allow to set those on a specific index by updating its settings. The keys are the same as the cluster one, except they start with an index, for example: index.routing.allocation.disable_allocation set to true.
closes 
2013-05-14 10:25:23 +02:00
Simon Willnauer
7b437e801a Added test for LimitTokenCountFilterFactory 2013-05-14 09:58:43 +02:00
Brusic
183ac1e04c Expose LimitTokenCountFilter as a TokenFilter
Closes 
2013-05-14 09:58:42 +02:00
Martijn van Groningen
669cf90d0c Not load the ids of child documents into memory.
Closes 
2013-05-14 09:46:43 +02:00
Alexander Reelsen
31b4b7ea58 Renaming span_multi_term query to span_multi
... due to discussing this on  in order to have a more concise name
2013-05-13 12:32:57 +02:00
Simon Willnauer
cffe333fe3 Ensure tests pass if store dir is a soft-link 2013-05-13 12:08:41 +02:00
Simon Willnauer
a3a2ca0ad3 Reduce branches in TopChildrenQuery
The branches used in the score method can be moved into the
scorer call and be essentially a constant operation rather than
a linear operation depending on the number of parent docs.
2013-05-13 12:08:41 +02:00
Alexander Reelsen
52654179e7 Fix for RPM postinstall on old OpenSUSE distributions
Older OpenSUSE distributions do not ship with systemd and therefore are
using chkconfig, but do not have their scripts placed at /etc/init.d/
This patch is more defensive and adds additional checks in the postinstall
script to prevent aborted post install scripts, which makes the RPM
uninstallable.
2013-05-13 11:48:04 +02:00
Martijn van Groningen
3c58176d29 Also support sum as score_mode option for the nested query.
Relates to 
2013-05-13 10:38:20 +02:00
Martijn van Groningen
6eaad25621 Made all the queries support score_mode parameter name in addition to the existing parameter name for score mode.
Closes 
2013-05-13 10:30:01 +02:00
Martijn van Groningen
bacf969dd3 Improved the stability of hl tests by adding waiting for at least yellow status.
In some test cases this was missing.
2013-05-13 10:18:36 +02:00
Shay Banon
21d749a6aa resolved empty setting values should be removed
when resolving empty settings values, their value should be removed, for example, when using ${env.ENV_VAR}, and ENV_VAR is not set, then the setting should be removed
2013-05-12 05:18:03 +02:00
Shay Banon
2ab72da7d6 update to joda 2.2 2013-05-11 23:37:56 +02:00
Shay Banon
e09e3eb73b upgrade to mvel 2.1.5 2013-05-11 23:17:00 +02:00
Shay Banon
ee636c2330 use throwable in transport layer
catch throwable when processing messages in the transport layer, to report back failures even under errors
2013-05-11 21:30:16 +02:00
Shay Banon
342e9cf18e test no longer needed... 2013-05-11 01:22:23 +02:00
Shay Banon
6e26efcd87 not active shards should translate to 503 not 500 2013-05-10 18:09:42 +02:00
Alexander Reelsen
21fcc482eb Allow to set headers in HTTP response
This commit allows to set custom headers in HTTP responses (like
setting the WWW-Authenticate header for basic auth) by adding
RestRequest.addHeader() method.

Closes 
Closes 

To get the history right: This is based on PR 
2013-05-10 17:58:46 +02:00
Shay Banon
da5dff9ee4 remove concrete bytes for field data
no really need for it, specifically with the fact that we don't need to deepCopy on makeSafe for the (default) paged bytes
2013-05-10 17:42:34 +02:00
Shay Banon
455b5da52f No need for deepCopy on makeSafe for pages field data
Since its a reference to a buffer in the PagedBytes, we don't need to deep copy it on makeSafe, just shallow copy it
2013-05-10 17:25:39 +02:00
Martijn van Groningen
2be23d2427 Added test that checks if a validation error is thrown when both doc and script provided in a update request.
Closes 
2013-05-10 16:43:20 +02:00
Alexander Kahn
47971ac808 Reject update request that has both script and doc 2013-05-10 16:31:54 +02:00
Martijn van Groningen
9ddd675a02 Added support for the update operation in the bulk api.
Update requests can now be put in the bulk api. All update request options are supported.

Example usage:
```
curl -XPOST 'localhost:9200/_bulk' --date-binary @bulk.json
```

Contents of bulk.json that contains two update request items:
```
{ "update" : {"_id" : "1", "_type" : "type1", "_index" : "index1", "_retry_on_conflict" : 3} }
{ "doc" : {"field" : "value"} }
{ "update" : { "_id" : "0", "_type" : "type1", "_index" : "index1", "_retry_on_conflict" : 3} }
{ "script" : "counter += param1", "lang" : "js", "params" : {"param1" : 1}, "upsert" : {"counter" : 1}}
```
The `doc`, `upsert` and all script related options are part of the payload. The `retry_on_conflict` option is part of the header.

Closes 
2013-05-10 16:03:24 +02:00
Shay Banon
c5e177dc56 lazy compute the hash and actually use it... 2013-05-10 11:51:55 +02:00
Igor Motov
4d66575abe Make GetField behavior more consitent for multivalued fields.
Before this change, the GetField#getValue() method was returning a list of values of a multivalued fields if the field values were obtained from source or if the field was stored and real-time get was used. If the field was stored but non-realtime get was used, GetField#getValue() was returning only the first element and the GetField#getValues() was returning a list of elements. This change makes behavior consistent. GetField#getValue() now always returns only the first value of the field and GetField#getValues() returns the entire list.
2013-05-09 12:45:49 -04:00
Shay Banon
522ca7e889 upgrade to jackson 2.2.1 2013-05-09 18:00:23 +02:00
Igor Motov
d69dd321fc Improve test stability 2013-05-09 10:21:21 -04:00
Shay Banon
8a2e5bbe68 Reroute Allocate to force primary allocation when enabled
Typically, the main reason a reroute allocation command with allow_primary is enabled, is to force create an empty new shard because a shard (and its replicas) were lost. This can't be done today because the shard expects to have a valid index where its allocated, we need to clear its post allocation flag to make sure it is allowed to create a fresh index.
2013-05-09 00:47:57 +02:00
Igor Motov
15c8510e65 Fix DfsSearchResult method names in AggregatedDfs 2013-05-08 18:21:47 -04:00
Simon Willnauer
436e23b8d4 Use simplified asserts and better naming
addOne / subOne are likely easier to understand without reading the docs.
If I would read my emails this would have made it in the last commit.
2013-05-08 22:06:01 +02:00
Simon Willnauer
1ef8761b70 Handle optional term and field statistics gracefully
Lucene provides a set of statistics that depend on the codec / postingsformat
as well as on the index options used when the field is created / indexed.
If a certain stats value is not available lucene return `-1` instead of the
correct value. We need to ensure that those values are encoded correctly if
we try to write vLongs as well as when we aggregate those values.

Closes 
2013-05-08 21:34:48 +02:00
Igor Motov
dbaf39c792 Add more informative toString method to StoreDirectory 2013-05-08 12:05:40 -04:00
Simon Willnauer
a89230945f Add NGramTokenizer and NGramTokenFilter to broken chains
NgramTokenizer and NGramTokenFilter are broken with a version < 4.2
We should still support these filters but should prevent the StringIOOB
exceptions. Adding these fitlers for the FragmentBuilderHelper will
allow seamless highlighting on fields indexed with those tokenizers or
tokenfilters
2013-05-08 17:57:20 +02:00
chilling
b7cd8a64cd Merge pull request from chilling/issue2986_scores
Fixed parsing of track_scores in RestSearchAction
2013-05-08 03:54:16 -07:00
Florian Schilling
19fab7cd0e Fixed parsing of track_scores in RestSearchAction
Closes 
2013-05-08 12:45:32 +02:00
Simon Willnauer
c1e8d4787a Don't use smart query wrapping for span term query
Lucenes span queries are a different family than 'ordinary' queries
in lucene. Spans only work with other spans such that smart query
wrapping doesn't work with span queries at all ie. we can't wrap
in filtered query.

Closes 
2013-05-07 22:52:57 +02:00
Simon Willnauer
992a40cbd8 Add field_masking_span to IndexQueryModule
The query parser for `field_masking_span` has never been added / bound to
the IndexQueryModule.

Closes 
2013-05-07 21:50:47 +02:00
Igor Motov
32abf6b890 Fix error getting array fields
Fixes 
2013-05-07 13:30:29 -04:00
Simon Willnauer
130f0f6afd Remove Java 7 only API
We still run on Java 6 as minimum requirement. Integer.compare(int,int)
was added in Java 7. This caused compile errors on CI.
2013-05-07 18:40:56 +02:00
uboness
74317fec8b Fixed custom hunspell dictionary directory
Properly loading dictionaries from based on the "indices.analysis.hunspell.dictionary.location" setting if one exists
2013-05-07 17:21:37 +02:00
Simon Willnauer
e1b66b34ea Don't fail hard if broken analysis is used.
Today an analysis chain with broken tokenfilters or tokenizers like
WordDelimiterFilter might produce somewhat broken term vectors that cause
`StringIndexOutOfBoundsExceptions` if FastVectorHighlighter is used
since the positions / offsets contract is violated and offsets of highlight
tokens are not increasing but decreasing even if their positions are increasing.

Yet, if we detect such a situation we can resort the tokens which might cause
somewhat odd highlights but doesn't fail hard with a StringIndexOOBException.

Closes 
2013-05-07 16:29:00 +02:00
uboness
14ae2fb765 Changed the priority of delete-index action to URGENT
All index meta data API's have urgent priority when it comes to cluster state updates. We'd like to remove indices asap to avoid things like unnecessary shards relocations
2013-05-07 05:43:08 +02:00
Simon Willnauer
758a4fcdc0 Enable Geo-Shape Relations Within and Disjoint 2013-05-06 18:03:16 +02:00
Simon Willnauer
2219925485 Upgrade to Lucene 4.3.0
This Lucene Release introduced a new API on DocIdSetIterator that requires each
implementation to return a `cost` upperbound as a function of the iterated documents.
This API allows for several optimizations during query execution especially in
Conjunction and Disjunction Queries with min_should_match set.

Closes 
2013-05-06 18:03:16 +02:00