1026 Commits

Author SHA1 Message Date
Charles Allen
2ad134638d Merge pull request #2589 from b-slim/fix_real_time
Make realtime kafka firehose skip corrupt message
2016-03-04 12:14:23 -08:00
Slim Bouguerra
623e89aa54 skip corrupt message 2016-03-04 08:30:40 -06:00
jisookim
177b575d41 fix default number of connections on broker config documentation 2016-03-03 13:50:48 -08:00
Björn Zettergren
2462c82c0e New defaults for maxRowsInMemory rowFlushBoundary
To bring consistency to docs and source this commit changes the default
values for maxRowsInMemory and rowFlushBoundary to 75000 after
discussion in PR https://github.com/druid-io/druid/pull/2457.

The previous default was 500000 and it's lower now on the grounds that
it's better for a default to be somewhat less efficient, and work,
than to reach for the stars and possibly result in
"OutOfMemoryError: java heap space" errors.
2016-03-01 13:50:28 +01:00
Fangjin Yang
8f97b1e40c Merge pull request #2554 from guobingkun/improve_include_extensions
improve doc on including druid and hadoop extensions
2016-02-26 20:05:57 -08:00
Charles Allen
c6803c4364 Allow specifying peon javaOpts as an array 2016-02-26 13:24:35 -08:00
Bingkun Guo
18f9e05f0f improve doc on including druid and hadoop extensions 2016-02-26 13:53:08 -06:00
Gian Merlino
eb13d7afe3 Merge pull request #2521 from himanshug/fix_2497
RTR has multiple threads for assignment of pending tasks now
2016-02-26 08:14:15 -08:00
Nishant
9f8faabddb Merge pull request #2469 from pdeva/patch-10
correct service names
2016-02-26 21:15:58 +05:30
Himanshu Gupta
bc156effe7 RTR has multiple threads for assignment of pending tasks now. 2016-02-26 09:27:03 -06:00
Charles Allen
1fe277ee29 Merge pull request #2367 from se7entyse7en/feature-rackspace-cloud-files-static-firehose
Adds support to use Rackspace's cloudfiles as static firehose
2016-02-25 17:31:06 -08:00
jon-wei
fd3782522c Rename 'replaceMissingValues...' parameters in RegexExtractionFn 2016-02-24 13:12:56 -08:00
Nishant
6c9e1a28ad Merge pull request #2519 from gianm/unparseable-handling
Better handling of ParseExceptions.
2016-02-24 04:46:29 +05:30
Fangjin Yang
93540c0631 Merge pull request #2503 from gianm/jetty-qos
Add druid.indexer.server.maxChatRequests for QoS; deprecate separate ports.
2016-02-23 10:35:53 -08:00
Fangjin Yang
4f300cfe49 Merge pull request #2526 from druid-io/b-slim-patch-1
fix docs about sketches
2016-02-23 10:23:53 -08:00
Gian Merlino
3534483433 Better handling of ParseExceptions.
Two changes:
- Allow IncrementalIndex to suppress ParseExceptions on "aggregate".
- Add "reportParseExceptions" option to realtime tuning configs. By default this is "false".

Behavior of the counters should now be:

- processed: Number of rows indexed, including rows where some fields could be parsed and some could not.
- thrownAway: Number of rows thrown away due to rejection policy.
- unparseable: Number of rows thrown away due to being completely unparseable (no fields salvageable at all).

If "reportParseExceptions" is true then "unparseable" will always be zero (because a parse error would
cause an exception to be thrown). In addition, "processed" will only include fully parseable rows
(because even partial parse failures will cause exceptions to be thrown).

Fixes #2510.
2016-02-23 10:11:43 -08:00
Slim
86c4900347 fix thetaSketch post aggregator doc 2016-02-23 10:43:54 -06:00
Himanshu Gupta
c7cb5bff14 fix thetaSketchSetOp doc 2016-02-23 09:17:49 -06:00
Fangjin Yang
0c984f9e32 Merge pull request #2109 from himanshug/segments_in_delta_ingestion
idempotent batch delta ingestion
2016-02-22 14:00:45 -08:00
Himanshu Gupta
f7679dd5a9 updating thetaSketchSetOp post agg documentation to reflect the possibility of nesting 2016-02-22 09:38:58 -06:00
Himanshu Gupta
21b0b8a07d new coordinator endpoint to get list of used segment given a dataSource and list of intervals 2016-02-21 23:17:58 -06:00
Himanshu Gupta
09ffcae4ae give user the option to specify the segments for dataSource inputSpec 2016-02-21 23:15:31 -06:00
Gian Merlino
23c993c9e7 Add druid.indexer.server.maxChatRequests for QoS; deprecate separate ports.
- Add druid.indexer.server.maxChatRequests, which sets up a QoSFilter on the main Jetty server.
- Deprecate druid.indexer.runner.separateIngestionEndpoint
- Deprecate druid.indexer.server.chathandler.*
2016-02-19 13:36:09 -08:00
Bingkun Guo
9e4c908922 generate tarball by mvn package 2016-02-18 16:42:41 -06:00
Fangjin Yang
083f019a48 Merge pull request #2465 from druid-io/more-doc-fix
more doc fixes
2016-02-17 11:00:38 -08:00
Fangjin Yang
661c3aac4f Merge pull request #2483 from druid-io/b-slim-patch-1
clarification about monitoring period
2016-02-17 09:46:25 -08:00
fjy
7da6594bfe more doc fixes 2016-02-17 09:43:47 -08:00
Gian Merlino
3a996216bd Multivalued dimensions can be compressed since 0.8.0. 2016-02-17 08:33:21 -08:00
Slim
e9f1c94822 Update metrics.md 2016-02-17 09:27:15 -06:00
Slim
ebbb1aa74e Update metrics.md 2016-02-17 09:05:16 -06:00
pdeva
dd81b5ebe4 correct service names
use a `/` instead of `:` cause thats how the service names are declared in the respective config files of coordinator and overlord
2016-02-13 15:26:19 -08:00
Gian Merlino
e0c049c0b0 Make startup properties logging optional.
Off by default, but enabled in the example config files. See also #2452.
2016-02-12 14:12:16 -08:00
Jonathan Wei
d63eec65a1 Merge pull request #2208 from navis/metadataquery-minmax
Support min/max values for metadata query
2016-02-11 17:28:07 -08:00
navis.ryu
dd2375477a Support min/max values for metadata query (#2208) 2016-02-12 09:35:58 +09:00
navis.ryu
4d63196535 Support dimension spec for select query 2016-02-12 08:54:28 +09:00
Slim
368988d187 Merge pull request #2291 from druid-io/lookupManager
Promoting LookupExtractor state and LookupExtractorFactory to be a first class druid state object.
2016-02-11 16:07:27 -06:00
Fangjin Yang
6b45a9e4a7 Merge pull request #2449 from b-slim/fix_docs
fix docs about search query limit
2016-02-11 13:01:41 -08:00
Slim Bouguerra
438a4a9970 fix docs about search query limit 2016-02-11 13:20:59 -06:00
Himanshu
5779b32742 Merge pull request #2439 from metamx/fix2435
Make QuotableWhiteSpaceSplitter able to take JSON
2016-02-11 13:14:43 -06:00
Slim Bouguerra
4e119b7a24 Adding lookup ref manager and lookup dimension spec impl 2016-02-11 12:11:51 -06:00
Fangjin Yang
1430bc2c88 Merge pull request #2276 from harshjain2/feature-2021
Fix for issue 2021.
2016-02-10 17:04:45 -08:00
Fangjin Yang
f204dfbebe Merge pull request #2413 from pdeva/patch-9
added note about including extension lib
2016-02-10 17:01:27 -08:00
Charles Allen
3a6452c6d4 Make QuotableWhiteSpaceSplitter able to take json
* Fixes #2435
2016-02-10 16:42:14 -08:00
Fangjin Yang
69a6bdcf03 Merge pull request #2427 from gianm/rolling-doc
Freshen up rolling update docs
2016-02-09 15:32:27 -08:00
Gian Merlino
95d5526e7c Freshen up rolling update docs
1. Clarify what "Indexing Service / Realtime" means
2. Add info about restore-based middle manager rolling restarts
3. Add info about what happens in middle manager updates
4. More consistent capitalization and spelling of node types
2016-02-09 13:57:04 -08:00
Himanshu
f6eebf5884 Merge pull request #2422 from rasahner/docMinorFixes
some minor doc changes
2016-02-09 10:03:22 -06:00
Robin
1d57e3267d some minor doc changes 2016-02-09 08:20:53 -06:00
pdeva
b75862da7e make 0.9 compatible 2016-02-08 17:25:34 -08:00
David Lim
57bf61029b Merge pull request #2416 from druid-io/fix-docs
fix docs
2016-02-08 15:09:20 -07:00
fjy
6fc5bcb1ef fix docs 2016-02-08 13:40:53 -08:00