Commit Graph

5128 Commits

Author SHA1 Message Date
Simon Willnauer 31f0aca65d Integrate forbiddenAPI checks into Maven build.
This commit integrates the forbiddenAPI checks that checks
Java byte code against a list of "forbidden" API signatures.
The commit also contains the fixes of the current source code
that didn't pass the default API checks.

See https://code.google.com/p/forbidden-apis/ for details.

Closes #3059
2013-05-19 23:25:44 +02:00
Simon Willnauer c4db582f26 Allow Date Fields to have a locale for date parsing
Currently if somebody uses a date format that is locale dependend
date fields can only parse a single format depending on the nodes
host locale. This can cause lots of problems since nodes might have
different locales. ie. "E, d MMM yyyy HH:mm:ss Z" where you have
"Wed, 06 Dec 2000 02:55:00 -0800" for en_EN while
"Mi, 06 Dez 2000 02:55:00 -0800" for de_DE.

Closes #3047
2013-05-19 23:25:44 +02:00
Shay Banon e580507fbe wait for yellow after the index is created
also, remove starting one node, it not useful for the test, and slows down the execution
2013-05-17 18:21:51 +02:00
Simon Willnauer 17681d7104 Grow array buffer in ScriptDocValues if needed
The buffer in ScriptDocValues for Strings was never called causeing
NPE in scripts if a document has > 10 distinct values in a field.

Closes #3051
2013-05-17 15:16:59 +02:00
Alexander Reelsen bd857d6d2e Ensuring test isolation when jvm plugins are loaded
Instead of specifying 'path.plugins' configuration option, 'plugin.types'
is used to load plugins in integration tests. This makes sure the JVM
plugins are not loaded in all following tests from then.

Also removed the now unneeded es-plugin.properties files from JVM test
plugins.
2013-05-17 13:21:26 +02:00
Alexander Reelsen 2485c4890c Packaging improvements & bugfixes
* RPM: Use the ES_USER variable to set the user (same name as in the debian package
  now), while retaining backwards compatibility to existing /etc/sysconfig/elasticsearch
* RPM: Bugfix: Remove the user when uninstalling the package
* RPM: Set an existing homedir when adding the user (allows one to run cronjobs as this user)
* DEB & RPM: Unify Required-Start/Required-Stop fields in initscripts
2013-05-17 11:14:44 +02:00
Alexander Reelsen 2e07af63ba Allowing pluggable highlighter implementations.
Currently elasticsearch ships with the plain and the fast-vector highlighter.
In order to support arbitrary highlighters via plugins, you only need to
implement a Highlighter interface and register your implementation in your
plugin at the HighlightModule.

In addition you can also add arbitrary options via the 'options' field in
the highlight request, which can be parsed in the highlighter implementation.

In order to find out how to write add your own analyzer, check out the tests
classes (CustomHighlighterSearchTests and CustomHighlighter).

Closes #2828
2013-05-17 09:07:13 +02:00
Martijn van Groningen db421742f7 Added support for nested sorting for script sorting and geo sorting.
Closes #3044
2013-05-16 18:45:00 +02:00
Martijn van Groningen 42d5bdd337 If matching root doc's inner objects don't match the `nested_filter` then the `missing` value should be used to sort the root doc.
Closes #3020
2013-05-16 10:12:02 +02:00
Shay Banon 2779967279 fix package name... 2013-05-15 17:06:42 +02:00
Martijn van Groningen bc0c7f8f28 Added simple id loading test.
Relates to #3028
2013-05-15 16:10:22 +02:00
Simon Willnauer 8235b89e9c Don't apply min frequency smoothing if suggest type is 'always'
Using an automatically detected 'min_doc_freq' if suggest type is set to
'always' is counter intuitive. If we suggest always ignore the frequency and
set threshold frequency to 0 to allow all possible candidates to be drawn if
they are within the given bounds.

Closes #3037
2013-05-15 15:17:49 +02:00
Martijn van Groningen 48cb06c9cf Keep backwards compatible with 0.90.0 on the transport layer.
Relates to #3039
2013-05-15 13:28:55 +02:00
Martijn van Groningen 585cbf6886 Routing value not serialized on transport layer.
Closes #3039
2013-05-15 13:09:13 +02:00
Clinton Gormley db805cf5a9 Corrected English in a shard error message 2013-05-15 12:41:49 +02:00
Clinton Gormley 4d09e7562a Corrected a typo and improved the English in a master-discovery error 2013-05-15 12:39:31 +02:00
Shay Banon f92eed8591 clean thread locals without needing a wrapper
clean thread locals smartly by identifying "our" classes, and removing them, so there is no need to wrap it in our our clenable values
2013-05-15 12:13:13 +02:00
Shay Banon 4d357660ca reuse version key in an actual operation
no need to compute the hash several times
2013-05-15 00:27:48 +02:00
Shay Banon 1fb78c53b8 remove unused class 2013-05-14 20:21:37 +02:00
Shay Banon 1c7d2442c8 use bytes instead of String as key in versionMap
no need to create a String every time we put or get a value from the version map
2013-05-14 20:18:54 +02:00
Martijn van Groningen 15fcb17a81 During parent uid loading seek to next parent type when child type is encountered.
Relates to #3028
2013-05-14 16:22:05 +02:00
Simon Willnauer 6d5805c901 Use Recovery Throtteling by default.
To prevent to extensive resource use during recovery we use
recovery throtteling by default to prevent unexpected peak load
on clusters. The default is set to 20 MB/sec.

Closes #3035
2013-05-14 15:10:03 +02:00
Simon Willnauer 6624949501 Use Merge Throtteling by default on node level.
Merge Throtteling is one of the most recommended settings and crucial in the
RealTime indexing case. We should set the default to a reasonable setting
that allows folks to index in a production index and don't see large merge
peaks by default. The default is set to 20 MB/sec on the node level.

Closes #3033
2013-05-14 15:10:03 +02:00
Simon Willnauer 09fb2264d0 Raise search threadpool default size.
The default size used to be 2x availableProcessors which seemed to
be a to lowish value in practice. 3x appeared to be a sweetspot for
most application. The default is now 3 x availableProcessors

Closes #3023
2013-05-14 15:10:03 +02:00
uboness d06a15ec3e Support for term facets on unmapped fields
Added support for unmapped & partially mapped fields (partially mapped fields may occur when searching across multiple indices where the faceted field is mapped on some and unmapped on others). If a shard doesn't have mappings for a field, the matching documents count on that shard will be added to the missing count for that facet.
2013-05-14 13:53:41 +02:00
Martijn van Groningen 906f278896 Make sure only relevant documents are evaluated in the second round lookup phase.
Both has_parent and has_child filters are internally executed in two rounds. In the second round all documents are evaluated whilst only specific documents need to be checked. In the has_child case only documents belonging to a specific parent type need to be checked and in the has_parent case only child documents need to be checked.

Closes #3034
2013-05-14 11:02:03 +02:00
Shay Banon ae6c1b345f Allow to disable allocation on the index level
Similar to the global cluster wide disable allocation flags, allow to set those on a specific index by updating its settings. The keys are the same as the cluster one, except they start with an index, for example: index.routing.allocation.disable_allocation set to true.
closes #3031
2013-05-14 10:25:23 +02:00
Simon Willnauer 7b437e801a Added test for LimitTokenCountFilterFactory 2013-05-14 09:58:43 +02:00
Brusic 183ac1e04c Expose LimitTokenCountFilter as a TokenFilter
Closes #3013
2013-05-14 09:58:42 +02:00
Martijn van Groningen 669cf90d0c Not load the ids of child documents into memory.
Closes #3028
2013-05-14 09:46:43 +02:00
Alexander Reelsen 31b4b7ea58 Renaming span_multi_term query to span_multi
... due to discussing this on #2610 in order to have a more concise name
2013-05-13 12:32:57 +02:00
Simon Willnauer cffe333fe3 Ensure tests pass if store dir is a soft-link 2013-05-13 12:08:41 +02:00
Simon Willnauer a3a2ca0ad3 Reduce branches in TopChildrenQuery
The branches used in the score method can be moved into the
scorer call and be essentially a constant operation rather than
a linear operation depending on the number of parent docs.
2013-05-13 12:08:41 +02:00
Alexander Reelsen 52654179e7 Fix for RPM postinstall on old OpenSUSE distributions
Older OpenSUSE distributions do not ship with systemd and therefore are
using chkconfig, but do not have their scripts placed at /etc/init.d/
This patch is more defensive and adds additional checks in the postinstall
script to prevent aborted post install scripts, which makes the RPM
uninstallable.
2013-05-13 11:48:04 +02:00
Martijn van Groningen 3c58176d29 Also support `sum` as `score_mode` option for the nested query.
Relates to #3026
2013-05-13 10:38:20 +02:00
Martijn van Groningen 6eaad25621 Made all the queries support `score_mode` parameter name in addition to the existing parameter name for score mode.
Closes #3026
2013-05-13 10:30:01 +02:00
Martijn van Groningen bacf969dd3 Improved the stability of hl tests by adding waiting for at least yellow status.
In some test cases this was missing.
2013-05-13 10:18:36 +02:00
Shay Banon 21d749a6aa resolved empty setting values should be removed
when resolving empty settings values, their value should be removed, for example, when using ${env.ENV_VAR}, and ENV_VAR is not set, then the setting should be removed
2013-05-12 05:18:03 +02:00
Shay Banon 2ab72da7d6 update to joda 2.2 2013-05-11 23:37:56 +02:00
Shay Banon e09e3eb73b upgrade to mvel 2.1.5 2013-05-11 23:17:00 +02:00
Shay Banon ee636c2330 use throwable in transport layer
catch throwable when processing messages in the transport layer, to report back failures even under errors
2013-05-11 21:30:16 +02:00
Shay Banon 342e9cf18e test no longer needed... 2013-05-11 01:22:23 +02:00
Shay Banon 6e26efcd87 not active shards should translate to 503 not 500 2013-05-10 18:09:42 +02:00
Alexander Reelsen 21fcc482eb Allow to set headers in HTTP response
This commit allows to set custom headers in HTTP responses (like
setting the WWW-Authenticate header for basic auth) by adding
RestRequest.addHeader() method.

Closes #2936
Closes #2540

To get the history right: This is based on PR #2723
2013-05-10 17:58:46 +02:00
Shay Banon da5dff9ee4 remove concrete bytes for field data
no really need for it, specifically with the fact that we don't need to deepCopy on makeSafe for the (default) paged bytes
2013-05-10 17:42:34 +02:00
Shay Banon 455b5da52f No need for deepCopy on makeSafe for pages field data
Since its a reference to a buffer in the PagedBytes, we don't need to deep copy it on makeSafe, just shallow copy it
2013-05-10 17:25:39 +02:00
Martijn van Groningen 2be23d2427 Added test that checks if a validation error is thrown when both doc and script provided in a update request.
Closes #2967
2013-05-10 16:43:20 +02:00
Alexander Kahn 47971ac808 Reject update request that has both script and doc 2013-05-10 16:31:54 +02:00
Martijn van Groningen 9ddd675a02 Added support for the update operation in the bulk api.
Update requests can now be put in the bulk api. All update request options are supported.

Example usage:
```
curl -XPOST 'localhost:9200/_bulk' --date-binary @bulk.json
```

Contents of bulk.json that contains two update request items:
```
{ "update" : {"_id" : "1", "_type" : "type1", "_index" : "index1", "_retry_on_conflict" : 3} }
{ "doc" : {"field" : "value"} }
{ "update" : { "_id" : "0", "_type" : "type1", "_index" : "index1", "_retry_on_conflict" : 3} }
{ "script" : "counter += param1", "lang" : "js", "params" : {"param1" : 1}, "upsert" : {"counter" : 1}}
```
The `doc`, `upsert` and all script related options are part of the payload. The `retry_on_conflict` option is part of the header.

Closes #2982
2013-05-10 16:03:24 +02:00
Shay Banon c5e177dc56 lazy compute the hash and actually use it... 2013-05-10 11:51:55 +02:00