Simon Willnauer
30f9f278c3
Added UNICODE_CHARACTER_CLASS support to Regex flags. This flag is only supported in Java7 and is ignored if set on a java 6 JVM
...
Closes #2895
2013-04-16 10:06:53 +02:00
uboness
eb21526552
Added missing support for lat, lats, lon, lons for doc notation in scripts
2013-04-13 13:58:30 -07:00
uboness
20e6df9f34
Optimization in fielddata cache where ordinals are used instead of flat arrays when number of unique values is low
2013-04-13 12:42:53 -07:00
Igor Motov
e7b49d8936
Add more dynamic settings validation
2013-04-12 20:55:45 -04:00
Shay Banon
d385e1b356
Clear Cache API: Streamline option names
...
closes #2890
2013-04-12 15:58:24 -07:00
Shay Banon
a2d72697eb
Expose field level field data statistics
...
closes #2889
2013-04-12 15:51:08 -07:00
David Pilato
3b7a195f6f
Add toString() for FilterBuilders
...
Closes #2887 .
2013-04-12 22:27:51 +02:00
Martijn van Groningen
bf21466291
CacheTests test fix.
2013-04-12 19:14:38 +02:00
Martijn van Groningen
80dbca0809
Field data: Try to load short values as byte values and load int values as short or byte values to reduce the size they take in memory.
2013-04-12 19:11:18 +02:00
Shay Banon
5fbd4a12a0
fix memory computation for int field data
2013-04-12 08:38:52 -07:00
Martijn van Groningen
5c90e5f940
If no options are specified with the clear cache api then all caches should be cleared.
...
Closes #2886
2013-04-12 15:24:50 +02:00
Igor Motov
00c035f88c
Make sure that settings are propagated to all nodes
2013-04-11 10:59:14 -04:00
Martijn van Groningen
2dfcc3c740
Test that size is actually computed.
...
Relates to #2882
2013-04-11 10:22:48 +02:00
Simon Willnauer
9a2d27a035
rename prefix_length to prefix_len for consistency
...
Closes #2883
2013-04-10 17:39:32 +02:00
Martijn van Groningen
4fd8c2c6d2
Ordinals were omitted from fielddata cache size calculation if field has more than one term.
...
Closes #2882
2013-04-10 14:50:07 +02:00
Martijn van Groningen
637eeacb20
Better error description if field(s) (statistical facet) and value_field (term_stats facet) are not a numeric field
2013-04-10 11:11:52 +02:00
Martijn van Groningen
6a3c53ef44
Should prevent OOM
2013-04-10 10:00:51 +02:00
Martijn van Groningen
b8b28041e5
Fix for extended facets test.
2013-04-10 00:47:00 +02:00
Shay Banon
80fc55a01d
upgrade to netty 3.6.5
2013-04-09 09:44:49 -07:00
Igor Motov
b0e44a2b40
Fix term counters in script field terms facet
...
Fixes #2878
2013-04-09 12:42:35 -04:00
Simon Willnauer
ae74a8dbb7
Configure FieldData using a hash not a string
...
Closes #2876
2013-04-09 15:53:05 +02:00
Simon Willnauer
374bbbfa7b
# FieldData Filter
...
FieldData is an in-memory representation of the term dictionary in an uninverted form. Under certain circumstances this FieldData representation can grow very large on high-cardinality fields like tokenized full-text. Depending on the use-case filtering the terms that are hold in the FieldData representation can heavily improve execution performance and application stability.
FieldData Filters can be applied on a per-segment basis. During FieldData loading the terms enumeration is passed through a filter predicate that either accepts or rejects a term.
## Frequency Filter
The Frequency Filter acts as a high / low pass filter based on the document frequencies of a certain term within the segment that is loaded into field data. It allows to reject terms that are very high or low frequent based on absolute frequencies or percentages relative to the number of documents in the segment or more precise the number of document that have at least one value in the field that is loaded in the current segment.
Here is an example mapping
Here is an example mapping:
```json
{
"tweet" : {
"properties" : {
"locale" : {
"type" : "string",
"fielddata" : "format=paged_bytes;filter.frequency.min=0.001;filter.frequency.max=0.1",
"index" : "analyzed",
}
}
}
}
```
### Paramters
* `filter.frequency.min` - the minimum document frequency (inclusive) in order to be loaded in to memory. Either a percentage if < `1.0` or an absolute value. `0` if omitted.
* `filter.frequency.max` - the maximum document frequency (inclusive) in order to be loaded in to memory. Either a percentage if < `1.0` or an absolute value. `0` if omitted.
* `filter.frequency.min_segment_size` - the minimum number of documents in a segment in order for the filter to be applied. Small segments might be omitted with this setting.
## Regular Expression Filter
The regular expression filter applies a regular expression to each term during loading and only loads terms into memory that match the given regular expression.
Here is an example mapping:
```json
{
"tweet" : {
"properties" : {
"locale" : {
"type" : "string",
"fielddata" : "format=paged_bytes;filter.regex=^en_.*",
"index" : "analyzed",
}
}
}
}
```
Closes #2874
2013-04-09 11:34:48 +02:00
Igor Motov
acc0950957
Get template should return warmers
...
Fixes #2868
2013-04-08 19:12:20 -04:00
Simon Willnauer
a10c80e20f
ensure that modificatons to the enum order trigger test failures since we rely on the ordinal
2013-04-08 23:29:56 +02:00
Simon Willnauer
7e77ddb88f
use enum to represent flags and fail if flags are not respected
2013-04-08 22:56:11 +02:00
Igor Motov
2a588dc1f1
Fix IndexMissingException in get template request
...
Fixes #2873
2013-04-08 16:25:09 -04:00
Shay Banon
3120457bfe
move to 0.90.0.RC3 snap
2013-04-08 05:48:29 -07:00
Shay Banon
3a8cba4d50
release 0.90.0.RC2
2013-04-08 05:46:26 -07:00
Shay Banon
5fa66cd592
Node Stats: Allow to explicitly get specific indices level node stats element
...
closes #2871
2013-04-07 20:22:48 -07:00
Shay Banon
15d7ae5983
FieldData Stats: Add field data stats to indices stats API
...
closes #2870
2013-04-07 18:30:24 -07:00
Martijn van Groningen
86c1714bf3
Also test the `fields` option.
2013-04-07 21:52:19 +02:00
Simon Willnauer
7ad03ed789
Use IndexOption.DOCS_ONLY for boolean fields
...
Closes #2866
2013-04-06 22:41:22 +02:00
Shay Banon
9f6c8c88f3
improve on shard level filter/id cache stats
...
use just the removal listener and back to the IndexReader#coreCacheKey as the actual field as part of the cache key
2013-04-06 00:02:42 +02:00
Shay Banon
815917fbf8
confusing code..., but we can't release the searcher in a get result case
...
we need that searcher later on..., need to think of how to simplify that..., added a comment for now
2013-04-05 23:27:03 +02:00
Simon Willnauer
36ffd6d582
release searcher in finally block rather than relying on an exception that is thrown
2013-04-05 22:45:52 +02:00
Shay Banon
84670212a6
Filter / Id Cache Stats: Add to Indices Stats API, revise node stats API
...
closes #2862
2013-04-05 20:02:32 +02:00
Simon Willnauer
5e7ad9832c
Added more evil tests for different field data implementations
2013-04-05 18:12:50 +02:00
Martijn van Groningen
224faffead
Added an extended test for terms facet with a decent number of documents / field values and randomly tests various options. Also fixed an issue where `regex` and `excludes` were ignored when `all_terms` was used.
2013-04-05 17:38:46 +02:00
David Pilato
4b1ec037f8
Fix test for #2668 .
2013-04-05 15:00:28 +02:00
Martijn van Groningen
9b5c74d43e
Made sure `all_terms` works consistently. In some cases the `all_terms` option was ignored: * Faceting on number based fields. * The `execution_type` was set to `map`. * In the case the `fields` option was used.
...
Closes #2861
2013-04-05 14:27:19 +02:00
Shay Banon
831ea789aa
rename getByOrd to getValueByOrd (to match BytesValues.WithOrdinals)
...
also make it public so it can be used when iterating over ords
2013-04-05 13:56:33 +02:00
Shay Banon
bcc14cde9f
make numeric namings consistent with bytes ones
...
also add the ability to get the ordinals from DoubleValues.WithOrdinals and LongValues.WithOrdinals
2013-04-05 13:33:56 +02:00
David Pilato
36b92be212
List of existing plugins with Node Info API
...
We want to display information about loaded plugins in Node Info API using plugin option:
```sh
curl http://localhost:9200/_nodes?plugin=true
```
For example, on a 4 nodes cluster, it could provide the following output:
```javascript
{
"ok" : true,
"cluster_name" : "test-cluster-MacBook-Air-de-David.local",
"nodes" : {
"lodYfbFTRnmwE6rjWGGyQQ" : {
"name" : "node1",
"transport_address" : "inet[/172.18.58.139:9300]",
"hostname" : "MacBook-Air-de-David.local",
"version" : "0.90.0.Beta2-SNAPSHOT",
"http_address" : "inet[/172.18.58.139:9200]",
"plugins" : [ ]
},
"hJLXmY_NTrCytiIMbX4_1g" : {
"name" : "node4",
"transport_address" : "inet[/172.18.58.139:9303]",
"hostname" : "MacBook-Air-de-David.local",
"version" : "0.90.0.Beta2-SNAPSHOT",
"http_address" : "inet[/172.18.58.139:9203]",
"plugins" : [ {
"name" : "test-plugin",
"description" : "test-plugin description",
"site" : true,
"jvm" : false
}, {
"name" : "test-no-version-plugin",
"description" : "test-no-version-plugin description",
"site" : true,
"jvm" : false
}, {
"name" : "dummy",
"description" : "No description found for dummy.",
"url" : "/_plugin/dummy/",
"site" : false,
"jvm" : true
} ]
},
"bnoySsBfTrSzbDRZ0BFHvg" : {
"name" : "node2",
"transport_address" : "inet[/172.18.58.139:9301]",
"hostname" : "MacBook-Air-de-David.local",
"version" : "0.90.0.Beta2-SNAPSHOT",
"http_address" : "inet[/172.18.58.139:9201]",
"plugins" : [ {
"name" : "dummy",
"description" : "This is a description for a dummy test site plugin.",
"url" : "/_plugin/dummy/",
"site" : false,
"jvm" : true
} ]
},
"0Vwil01LSfK9YgRrMce3Ug" : {
"name" : "node3",
"transport_address" : "inet[/172.18.58.139:9302]",
"hostname" : "MacBook-Air-de-David.local",
"version" : "0.90.0.Beta2-SNAPSHOT",
"http_address" : "inet[/172.18.58.139:9202]",
"plugins" : [ {
"name" : "test-plugin",
"description" : "test-plugin description",
"site" : true,
"jvm" : false
} ]
}
}
}
```
Information are cached for 10 seconds by default. Modify `plugins.info_refresh_interval` property if needed.
Setting `plugins.info_refresh_interval` to `-1` will cause infinite caching.
Setting `plugins.info_refresh_interval` to `0` will disable caching.
Closes #2668 .
2013-04-05 11:36:56 +02:00
Simon Willnauer
f3e6fe094a
beef up term facet tests
2013-04-05 11:05:24 +02:00
Simon Willnauer
9fbe075aec
Added test that compares concurrent facet execution results with a serial execution result
2013-04-05 10:36:53 +02:00
Shay Banon
5af6343697
allow to disable the optimization of removal of ords on single value numerics/geo field data
...
field data settings in the mappings can have ordinals=always option
2013-04-05 00:44:07 +02:00
Shay Banon
54f685674b
Thread Pool: Update default settings (move from default cached to fixed)
...
closes #2858
2013-04-04 23:24:49 +02:00
Simon Willnauer
f1dd867c4f
Catch Throwable when listener is called rather then Exception to prevent possible hangs if fatal exceptions or errors are thrown
2013-04-04 22:58:38 +02:00
Shay Banon
a206aa4548
Settings / Config: Allow to explicitly specify external environment variable syntax, in which case its optional
...
fixes #2855
2013-04-04 16:30:24 +02:00
Simon Willnauer
d758401add
Cleanup ScriptDocValues. This commit adds a getValues method to all ScriptDocValues for easy access
...
in scripts via doc['field'].values / value.
2013-04-04 16:07:54 +02:00