druid

Commit Graph

Author	SHA1	Message	Date
Michael Schnupp	33b4eb624d	fix freeSpacePercent in segmentCache.locations (#5765 ) * fix freeSpacePercent in segmentCache.locations * the check should probably test the other way around * documentation should put the option in the right place * examples have a superfluous backslash * add test to verify correct behavior * switch to Path and test with jimfs Path allows to use different filesystems. Jimfs provides an actual (in memory) filesystem. This also allows more complex test scenarios. The behavior should be unchanged by this commit. * Revert "switch to Path and test with jimfs" This reverts commit `8b9a418d65`.	2018-05-24 11:15:30 +09:00
Atul Mohan	1b9611a60e	Local indexing from RDBMS (#5441 ) * Local indexing from RDBMS * Fix content * Remove pom changes * Remove extraneous space * Add tests and update documentation * Fix comments * Fix docs * Fix build related issue * Handle invalid strings * Make target database independent of metadata storage * Add firehose connector * Fix accessibility * Add docs * Remove unused def * Remove lazy instantiation of jsoniterator * Move unused changes * Move unused changes * Fix build * Make Sqlfirehose method private	2018-05-22 12:33:01 +09:00
Dylan Wylie	c537ea56f6	Validate dataschema datasource (#5785 ) * Validate dataschema has a datasource * Fix tests * Use Guava Strings.isNullOrEmpty * Inverse nullempty check, whoops	2018-05-18 16:29:06 -07:00
Gian Merlino	f2cc6ce4d5	VersionedIntervalTimeline: Optimize construction with heavily populated holders. (#5777 ) * VersionedIntervalTimeline: Optimize construction with heavily populated holders. Each time a segment is "add"ed to a timeline, "isComplete" is called on the holder that it is added to. "isComplete" is an O(segments per chunk) operation, meaning that adding N segments to a chunk is an O(N^2) operation. This blows up badly if we have thousands of segments per chunk. The patch defers the "isComplete" check until after all segments have been inserted. * Fix imports.	2018-05-16 09:16:59 -07:00
Jihoon Son	9dca5ec76b	Simple cleanup for ThreadPoolTaskRunner and SetAndVerifyContextQueryRunner / Add ThreadPoolTaskRunnerTest (#5557 ) * Simple fix for ThreadPoolTaskRunner * fix build * address comments * update javadoc * fix build * fix test * add dependency	2018-05-15 22:53:11 +05:30
Surekha	2f8904e25f	Check against the real default of maxBytes (1/6 max mem) in AppenderatorImpl's add (#5758 ) * The check for maxBytesInMemory should be >= 0 instead of > 0 * if the default value is 0, the actual check could be skipped * fix the message for persistReasons * Address PR comments * if maxBytes is set to -1, make it Long.MAX_VALUE, so we do not need to check if it's 0 or -1 * set the maxBytesTuningconfig in AppenderatorImpl constructor to avoid duplicate code * fix the failing test cases * Address PR comments	2018-05-09 13:41:51 -07:00
Jihoon Son	c7a59394e0	Consider waiting and pending compaction tasks as well as running tasks in DruidCoordinatorSegmentCompactor (#5704 ) * Consider waiting and pending compaction tasks as well as running tasks in DruidCoordinatorSegmentCompactor * fix build * fix logging	2018-05-08 19:03:54 -07:00
Kirill Kozlov	67d0b0ee42	Add taskType dimension to task metrics (#5664 )	2018-05-07 09:42:26 -07:00
Fokko Driesprong	a95ec92296	Move to the org.lz4 dependency (#5746 ) The net.jpountz.lz4 moved to org.lz4	2018-05-07 08:16:45 -07:00
Slim Bouguerra	8aa8d9fa5b	Kerberos Spnego Authentication Router Issue (#5706 ) * Adding decoration method to proxy servlet Change-Id: I872f9282fb60bfa20524271535980a36a87b9621 * moving the proxy request decoration to authenticators Change-Id: I7f94b9ff5ecf08e8abf7169b58bc410f33148448 * added docs Change-Id: I901543e52f0faf4666bfea6256a7c05593b1ae70 * use the authentication result to decorate request Change-Id: I052650de9cd02b4faefdbcdaf2332dd3b2966af5 * adding authenticated by name Change-Id: I074d2933460165feeddb19352eac9bd0f96f42ca * ensure that authenticator is not null Change-Id: Idb58e308f90db88224a06f3759114872165b24f5 * fix types and minor bug Change-Id: I6801d49a05d5d8324406fc0280286954eb66db10 * fix typo Change-Id: I390b12af74f44d760d0812a519125fbf0df4e97b * use actual type names Change-Id: I62c3ee763363781e52809ec912aafd50b8486b8e * set authenitcatedBy to null for AutheticationResults created by Escalator. Change-Id: I4a675c372f59ebd8a8d19c61b85a1e4bf227a8ba	2018-05-05 20:33:51 -07:00
kaijianding	c12c16385e	support throwing on duplicate rows during realtime ingestion in RealtimePlumber (#5693 )	2018-05-04 10:12:25 -07:00
Stuart McLean	c2b5e5ec95	Default caffeine cache size (#5738 ) * add default caffeine cache size based on runtime Xmx or max 1GB * update docs for caffeine cache * fix formatting * test caffeine size should never be less than 0 * set caffeine max default size to 1G not 1M * fix caffeine cache tests	2018-05-04 09:29:11 -07:00
Surekha	13c616ba24	'maxBytesInMemory' tuningConfig introduced for ingestion tasks (#5583 ) * This commit introduces a new tuning config called 'maxBytesInMemory' for ingestion tasks Currently a config called 'maxRowsInMemory' is present which affects how much memory gets used for indexing. If this value is not optimal for your JVM heap size, it could lead to OutOfMemoryError sometimes. A lower value will lead to frequent persists which might be bad for query performance and a higher value will limit number of persists but require more jvm heap space and could lead to OOM. 'maxBytesInMemory' is an attempt to solve this problem. It limits the total number of bytes kept in memory before persisting. * The default value is 1/3(Runtime.maxMemory()) * To maintain the current behaviour set 'maxBytesInMemory' to -1 * If both 'maxRowsInMemory' and 'maxBytesInMemory' are present, both of them will be respected i.e. the first one to go above threshold will trigger persist * Fix check style and remove a comment * Add overlord unsecured paths to coordinator when using combined service (#5579) * Add overlord unsecured paths to coordinator when using combined service * PR comment * More error reporting and stats for ingestion tasks (#5418) * Add more indexing task status and error reporting * PR comments, add support in AppenderatorDriverRealtimeIndexTask * Use TaskReport instead of metrics/context * Fix tests * Use TaskReport uploads * Refactor fire department metrics retrieval * Refactor input row serde in hadoop task * Refactor hadoop task loader names * Truncate error message in TaskStatus, add errorMsg to task report * PR comments * Allow getDomain to return disjointed intervals (#5570) * Allow getDomain to return disjointed intervals * Indentation issues * Adding feature thetaSketchConstant to do some set operation in PostAgg (#5551) * Adding feature thetaSketchConstant to do some set operation in PostAggregator * Updated review comments for PR #5551 - Adding thetaSketchConstant * Fixed CI build issue * Updated review comments 2 for PR #5551 - Adding thetaSketchConstant * Fix taskDuration docs for KafkaIndexingService (#5572) * With incremental handoff the changed line is no longer true. * Add doc for automatic pendingSegments (#5565) * Add missing doc for automatic pendingSegments * address comments * Fix indexTask to respect forceExtendableShardSpecs (#5509) * Fix indexTask to respect forceExtendableShardSpecs * add comments * Deprecate spark2 profile in pom.xml (#5581) Deprecated due to https://github.com/druid-io/druid/pull/5382 * CompressionUtils: Add support for decompressing xz, bz2, zip. (#5586) Also switch various firehoses to the new method. Fixes #5585. * This commit introduces a new tuning config called 'maxBytesInMemory' for ingestion tasks Currently a config called 'maxRowsInMemory' is present which affects how much memory gets used for indexing. If this value is not optimal for your JVM heap size, it could lead to OutOfMemoryError sometimes. A lower value will lead to frequent persists which might be bad for query performance and a higher value will limit number of persists but require more jvm heap space and could lead to OOM. 'maxBytesInMemory' is an attempt to solve this problem. It limits the total number of bytes kept in memory before persisting. * The default value is 1/3(Runtime.maxMemory()) * To maintain the current behaviour set 'maxBytesInMemory' to -1 * If both 'maxRowsInMemory' and 'maxBytesInMemory' are present, both of them will be respected i.e. the first one to go above threshold will trigger persist * Address code review comments * Fix the coding style according to druid conventions * Add more javadocs * Rename some variables/methods * Other minor issues * Address more code review comments * Some refactoring to put defaults in IndexTaskUtils * Added check for maxBytesInMemory in AppenderatorImpl * Decrement bytes in abandonSegment * Test unit test for multiple sinks in single appenderator * Fix some merge conflicts after rebase * Fix some style checks * Merge conflicts * Fix failing tests Add back check for 0 maxBytesInMemory in OnHeapIncrementalIndex * Address PR comments * Put defaults for maxRows and maxBytes in TuningConfig * Change/add javadocs * Refactoring and renaming some variables/methods * Fix TeamCity inspection warnings * Added maxBytesInMemory config to HadoopTuningConfig * Updated the docs and examples * Added maxBytesInMemory config in docs * Removed references to maxRowsInMemory under tuningConfig in examples * Set maxBytesInMemory to 0 until used Set the maxBytesInMemory to 0 if user does not set it as part of tuningConfig and set to part of max jvm memory when ingestion task starts * Update toString in KafkaSupervisorTuningConfig * Use correct maxBytesInMemory value in AppenderatorImpl * Update DEFAULT_MAX_BYTES_IN_MEMORY to 1/6 max jvm memory Experimenting with various defaults, 1/3 jvm memory causes OOM * Update docs to correct maxBytesInMemory default value * Minor to rename and add comment * Add more details in docs * Address new PR comments * Address PR comments * Fix spelling typo	2018-05-03 16:25:58 -07:00
Gian Merlino	df01998213	SegmentLoadDropHandler: Fix deadlock when segments have errors loading on startup. (#5735 ) The "lock" object was used to synchronize start/stop as well as synchronize removals from segmentsToDelete (when a segment is done dropping). This could cause a deadlock if a segment-load throws an exception during loadLocalCache. loadLocalCache is run by start() while it holds the lock, but then it spawns loading threads, and those threads will try to acquire the "segmentsToDelete" lock if they want to drop a corrupt segment. I don't see any reason for these two locks to be the same lock, so I split them.	2018-05-03 09:59:01 -07:00
Jihoon Son	2c8296f94d	Fix Appenderator.push() to commit the metadata of all segments (#5730 ) * Remove persist from Appenderator * fix javadoc	2018-05-02 13:17:54 -07:00
Jihoon Son	d4311b4a5a	Support enablePathStyleAccess, disableChunkedEncoding, and forceGlobalBucketAccessEnabled for aws client (#5702 ) * Support enablePathStyleAccess and disableChunkedEncoding for aws client * add an option for forceGlobalBucketAccessEnabled * add missing doc	2018-05-02 10:45:38 -07:00
David Lim	8ec2d2fe18	Use unique segment paths for Kafka indexing (#5692 ) * support unique segment file paths * forbiddenapis * code review changes * code review changes * code review changes * checkstyle fix	2018-04-29 21:59:48 -07:00
Roman Leventov	9be000758d	Refactor index merging, replace Rowboats with RowIterators and RowPointers (#5335 ) * Refactor index merging, replace Rowboats with RowIterators and RowPointers * Add javadocs * Fix a bug in QueryableIndexIndexableAdapter * Fixes * Remove unused declarations * Remove unused GenericColumn.isNull() method * Fix test * Address comments * Rearrange some code in MergingRowIterator for more clarity * Self-review * Fix style * Improve docs * Fix docs * Rename IndexMergerV9.writeDimValueAndSetupDimConversion to setUpDimConversion() * Update Javadocs * Minor fixes * Doc fixes, more code comments, cleanup of RowCombiningTimeAndDimsIterator * Fix doc link	2018-04-27 17:34:32 -07:00
David Lim	55b003e5e8	Fix loadstatus?full double counting expected segments (#5667 ) * fix loadstatus?full double counting expected segments * remove possible flakiness from Thread.sleep() in test	2018-04-24 01:11:16 +05:30
Roman Leventov	a3a9ada843	Add GenericWhitespace checkstyle check (#5668 )	2018-04-24 01:09:14 +05:30
Jihoon Son	ca3f833426	Fix coordinator's dataSource api with full parameter (#5662 ) * Fix coordinator's dataSource api with full parameter * address comment * Add a constructor for json serde and fix result order * Change to immutableSortedMap * Revert immutableSortedMap to treeMap	2018-04-19 17:41:53 -07:00
Kirill Kozlov	a7ba2bf275	Detailed error message when unable to create temp dir (#5648 )	2018-04-17 15:12:46 -07:00
Jonathan Wei	d0b66a6af5	Fix HTTP OPTIONS request auth handling (#5638 ) * Fix HTTP OPTIONS request auth handling * PR comment * More PR comments * Fix * PR comment	2018-04-16 18:09:56 -07:00
Jonathan Wei	882b172318	Revert "Fix HTTP OPTIONS request auth handling (#5615 )" (#5637 ) This reverts commit `df51a7bcb7`.	2018-04-12 16:43:54 -07:00
Jonathan Wei	e91add6843	Fix coordinator loadStatus performance (#5632 ) * Optimize coordinator loadStatus * Add comment * Fix teamcity * Checkstyle * More checkstyle * Checkstyle	2018-04-12 15:07:52 -07:00
Jonathan Wei	df51a7bcb7	Fix HTTP OPTIONS request auth handling (#5615 ) * Fix HTTP OPTIONS request auth handling * Flip configuration boolean	2018-04-12 14:02:20 -07:00
Gian Merlino	d0400a0688	SegmentWithState: Add toString method. (#5635 ) The class appears in log messages, and the default toString method isn't very informative.	2018-04-12 14:01:09 -05:00
palanieppan-m	dbea5cb9b7	Load rules should honor partial overlap (#5595 ) Load rules should load segments that partially overlap with rule window, instead of loading only segments that fully overlap.	2018-04-12 09:46:00 -07:00
Atul Mohan	19f359957f	Add getters for AlertEvent (#5522 ) * Add getters for AlertEvent * Move PublicApi and ExtensionPoint to java-util * Fix publicapi annotation usage * Add publicapi annotations to ServiceMetricEvent and RequestLogEvent	2018-04-12 23:38:20 +07:00
Nishant Bangarwa	e6efd75a3d	Add config to allow setting up custom unsecured paths for druid nodes. (#5614 ) * Add config to allow setting up custom unsecured paths for druid nodes. * return all resources for Unsecured paths * review comment - Add test * fix tests * fix test	2018-04-11 17:10:07 -07:00
Clint Wylie	ea4f8544fb	revert lambda conversion to fix occasional jvm error (#5591 )	2018-04-06 14:18:55 -07:00
Gian Merlino	5ab17668c0	CompressionUtils: Add support for decompressing xz, bz2, zip. (#5586 ) Also switch various firehoses to the new method. Fixes #5585.	2018-04-06 08:06:45 -07:00
Niketh Sabbineni	270fd1ea15	Allow getDomain to return disjointed intervals (#5570 ) * Allow getDomain to return disjointed intervals * Indentation issues	2018-04-05 22:12:30 -07:00
Jonathan Wei	969342cd28	More error reporting and stats for ingestion tasks (#5418 ) * Add more indexing task status and error reporting * PR comments, add support in AppenderatorDriverRealtimeIndexTask * Use TaskReport instead of metrics/context * Fix tests * Use TaskReport uploads * Refactor fire department metrics retrieval * Refactor input row serde in hadoop task * Refactor hadoop task loader names * Truncate error message in TaskStatus, add errorMsg to task report * PR comments	2018-04-05 21:38:57 -07:00
Niketh Sabbineni	f0a94f5035	Remove unused config (#5564 ) * Remove unused config * Fix failing tests	2018-04-03 13:23:46 -07:00
Clint Wylie	f31dba6c5b	Coordinator drop segment selection through cost balancer (#5529 ) * drop selection through cost balancer * use collections.emptyIterator * add test to ensure does not drop from server with larger loading queue with cost balancer * javadocs and comments to clear things up * random drop for completeness	2018-04-03 11:22:51 -07:00
Clint Wylie	a81ae99021	add 'stopped' check and handling to HttpLoadQueuePeon load and drop segment methods (#5555 ) * add stopped check and handling to HttpLoadQueuePeon load and drop segment methods * fix unrelated timeout :( * revert unintended change * PR feedback: change logging * fix dumb	2018-04-03 11:21:52 -07:00
Clint Wylie	6feac204e3	Coordinator primary segment assignment fix (#5532 ) * fix issue where assign primary assigns segments to all historical servers in cluster * fix test * add test to ensure primary assignment will not assign to another server while loading is in progress	2018-04-02 09:40:20 -07:00
Jihoon Son	05547e29b2	Fix SQLMetadataSegmentManager to allow successive start and stop (#5554 ) * Fix SQLMetadataSegmentManager to allow successive start and stop * address comment * add synchronization	2018-03-30 12:43:19 -07:00
Clint Wylie	30fc4d3ba0	Coordinator balancer move then drop fix (#5528 ) * #5521 part 1 * formatting * oops * less magic tests	2018-03-29 10:30:12 -07:00
Kirill Kozlov	8878a7ff94	Replace guava Charsets with native java StandardCharsets (#5545 )	2018-03-28 21:00:08 -07:00
Atul Mohan	ec17a44e09	Add result level caching to Brokers (#5028 ) * Add result level caching to Brokers * Minor doc changes * Simplify sequences * Move etag execution * Modify cacheLimit criteria * Fix incorrect etag computation * Fix docs * Add separate query runner for result level caching * Update docs * Add post aggregated results to result level cache * Fix indents * Check byte size for exceeding cache limit * Fix indents * Fix indents * Add flag for result caching * Remove logs * Make cache object generation synchronous * Avoid saving intermediate cache results to list * Fix changes that handle etag based response * Release bytestream after use * Address PR comments * Discard resultcache stream after use * Fix docs * Address comments * Add comment about fluent workflow issue	2018-03-23 19:11:52 -07:00
Charles Allen	ef21ce5a64	Add graceful shutdown timeout for Jetty (#5429 ) * Add graceful shutdown timeout * Handle interruptedException * Incorporate code review comments * Address code review comments * Poll for activeConnections to be zero * Use statistics handler to get active requests * Use native jetty shutdown gracefully * Move log line back to where it was * Add unannounce wait time * Make the default retain prior behavior * Update docs with new config defaults * Make duration handling on jetty shutdown more consistent * StatisticsHandler is a wrapper * Move jetty lifecycle error logging to error	2018-03-23 09:38:17 -07:00
Jihoon Son	1ad898bde2	Use the official aws-sdk instead of jets3t (#5382 ) * Use the official aws-sdk instead of jets3t * fix compile and serde tests * address comments and fix test * add http version string * remove redundant dependencies, fix potential NPE, and fix test * resolve TODOs * fix build * downgrade jackson version to 2.6.7 * fix test * resolve the last TODO * support proxy and endpoint configurations * fix build * remove debugging log * downgrade hadoop version to 2.8.3 * fix tests * remove unused log * fix it test * revert KerberosAuthenticator change * change hadoop-aws scope to provided in hdfs-storage * address comments * address comments	2018-03-21 15:36:54 -07:00
Charles Allen	58f110f7f8	Future-proof some Guava usage (#5414 ) * Future-proof some Guava usage * Use a java-util EmptyIterator instead of Guava's * Change some of the guava future handling to do manual async transforms. Guava changes transform into transformAsync by deprecating transform in ONLY Guava 19. Then its gone in 20 * Use `Collections.emptyIterator()` * Pretty formatting * Make listenable future transforms a thing in default druid * Format fix * Add forbidden guava apis * Make the ListenableFutrues.transformAsync have comments * Undo intellij bad pattern matching in comments * Futrues --> Futures * Add empty iterators forbidding * Fix extra `A` * Correct method signature * Address review comments * Finish Gian review comments * Proper syntax from https://github.com/policeman-tools/forbidden-apis/wiki/SignaturesSyntax	2018-03-20 08:59:33 -07:00
Jonathan Wei	b22455b924	Fix supervisor tombstone auth handling (#5504 )	2018-03-19 12:55:47 -07:00
Roman Leventov	693e3575f9	Remove unused code and exception declarations (#5461 ) * Remove unused code and exception declarations * Address comments * Remove redundant Exception declarations * Make FirehoseFactoryV2.connect() to throw IOException again	2018-03-16 22:11:12 +01:00
Jonathan Wei	30e6bdedf3	Authorize supervisor history instead of current active supervisors for supervisor history API (#5501 )	2018-03-16 12:29:17 -07:00
Gian Merlino	a08efe4683	Fix round robining in router. (#5500 ) * Fix round robining in router. Say that ten times fast. For query endpoints, AsyncQueryForwardingServlet called hostFinder.getDefaultServer() to set a default server, followed by hostFinder.getServer(inputQuery) to override it with query-specific routing. Since hostFinder is round-robin, this skips a server. When there are only two servers, one server is _always_ skipped and the router sends all queries to the same broker. * Adjust spacing.	2018-03-15 18:45:59 -07:00
Gian Merlino	fdd55538e1	SQL: Remove unused escalator, authConfig from various classes. (#5483 ) DruidPlanner.plan is responsible for checking authorization, so these objects weren't needed in as many places as they were injected.	2018-03-14 13:28:51 -07:00
Jihoon Son	9b2a25bd84	Refactor supervisorReport to be type-safe (#5479 ) * refactor supervisorReport * use primitives	2018-03-13 09:28:44 -07:00
Himanshu	e968811583	HttpServerInventoryView: fixed startup wait time and more informative logging (#5336 )	2018-03-12 22:13:51 -07:00
Roman Leventov	6b158abe3f	Enforce optimal IndexedInts iteration (#5456 ) * Enforce optimal IndexedInts iteration * Fix remaining suboptimal usages	2018-03-09 09:42:40 -08:00
Alexander Korablev	6a3a5350b8	Make memcached protocol and locator configurable. (#5438 ) * Make memcached protocol and locator configurable. * Style fix. * Style fix. * Style fix.	2018-02-28 17:16:43 -08:00
Niketh Sabbineni	ac5034e241	Improve cache cost to handle heterogenous historicals (#5416 )	2018-02-23 13:17:31 -08:00
Jonathan Wei	e9977ce4ef	Automatically adjust com.metamx.metrics Monitor class references (#5412 ) * Automatically adjust com.metamx.metrics monitor class references * Log warning for old class names	2018-02-22 12:03:07 -08:00
vvc11	305ecc2a78	adding a properties endpoint in status resource (#5276 ) * adding a properties endpoint in status resource * checkstyle fixes * more checkstyle corrections * correcting the resource filter for properties endpoint * adding feature of hiding sensitive properties * checkstyle changes * review changes for adding default hidden properties and using jackson for arrays value * making review changes	2018-02-18 12:51:02 -08:00
David Lim	20a3164180	Support for router forwarding requests to active coordinator/overlord (#5369 ) * allow router to forward requests to coordinator and overlord * fix forbidden API * more forbidden api fixes * code review changes	2018-02-15 14:38:58 -08:00
Jihoon Son	cd929000ca	Change early publishing to early pushing in indexTask & refactor AppenderatorDriver (#5297 ) * Fix early publishing to early pushing in batch indexing & refactor appenderatorDriver * fix compile * rename and add more javadocs * Fix conflicts * address comments * revert await executors * fix test	2018-02-14 12:48:33 -08:00
Jihoon Son	0105cdbc19	Fix Json Serde (#5370 )	2018-02-08 13:13:52 -08:00
Roman Leventov	e64ffb10c2	Standardize on using Integer.BYTES instead of Ints.BYTES from Guava, same for other primitives (#5366 )	2018-02-07 13:24:30 -08:00
Gian Merlino	971d45ab3f	Use a separate snapshot file per lookup tier. (#5358 ) Prevents conflicts if two processes on the same machine use the same lookup snapshot directory but are in different tiers.	2018-02-07 11:28:53 -08:00
Jihoon Son	2099b43e5f	Add a new config object for compactConfig (#5264 ) * add a new config object for compactConfig * fix test * address comments * Update doc	2018-02-06 12:13:52 -08:00
Gian Merlino	c21ff6e81c	Properly set "identity" in query metrics. (#5330 ) * Properly set "identity" in query metrics. This patch adds an "identity" field to QueryPlus and sets it in QueryLifecycle when the query starts executing. This is important because it allows it to be used for future QueryMetrics created by that QueryPlus object. We also add "identity" to the request-level QueryMetrics object created in emitLogsAndMetrics. * Remove unused method.	2018-02-06 10:53:00 -08:00
Kevin Conaway	93fdbcb364	Change RealtimeIndexTask to use AppenderatorDriver (#5261 ) * Change RealtimeIndexTask to use AppenderatorDriver instead of RealtimePlumber. Related to #4774 * Remove unused throwableDuringPublishing * Fix usage of forbidden API * Update realtime index IT to account for not skipping older data any more * Separate out waiting on publish futures and handoff futures to avoid a race condition where the handoff timeout expires before the segment is published * #5261 Add separate AppenderatorDriverRealtimeIndexTask and revert changes to RealtimeIndexTask * #5261 Add separate AppenderatorDriverRealtimeIndexTask and revert changes to RealtimeIndexTask * #5261 Readability improvements in AppenderatorDriverRealtimeIndexTask. Combine publish and handoff futures in to single future * #5261 Add separate tuningConfig for RealtimeAppenderatorIndexTask. Revert changes to RealtimeTuningConfig * #5261 Change JSON type to realtime_appenderator to keep the same naming pattern as RealtimeIndexTask	2018-02-06 10:21:31 -08:00
Gian Merlino	8c738c7076	Fix races in LookupSnapshotTaker, CoordinatorPollingBasicAuthenticatorCacheManager (#5344 ) * Fix races in LookupSnapshotTaker, CoordinatorPollingBasicAuthenticatorCacheManager. Both were susceptible to the following conditions: 1. Two JVMs on the same machine (perhaps two peons) could conflict by one reading while the other was writing, or by writing to the file at the same time. 2. One JVM could partially write a file, then crash, leaving a truncated file. * Use StringUtils.format	2018-02-06 09:44:06 -08:00
Slim	37c09ce3f8	Use both Joda IDs and Java IDs as Timezone to string readers (#5349 ) * Use both Joda IDs and Java IDs as Timezone to string readers Change-Id: Ieb5c18559879f3f3a0104912ce2f0a354ad0aac3 * move the function to DateTimes and add org.joda.time.DateTimeZone#forID as part of forbidden api Change-Id: Iff97fa044758019ed0c231587d10e31a9cc18da0 * exclude class and remove other usage Change-Id: Ib458c2caaa1865535767e1009fbf017a92c8f615 * remove it from test classes Change-Id: I9b576324f6c7e17a74bd8b13879232c9a8cd40b4 * remove unused Change-Id: If1c5b70c26c2b7c83c20434cb72b2060653f5052	2018-02-06 16:34:11 +05:30
Gian Merlino	9a62b02cb7	Extensions: Option to load classes from extension jars first. (#5321 ) The behavior is configurable through druid.extensions.useExtensionClassloaderFirst. It is useful when extensions want to load a dependency different from one provided by Druid, for example a different version of geoip or protobuf.	2018-02-06 16:14:03 +05:30
Jonathan Wei	c9e7c0a817	Remove Escalator jetty http client escalation method (#5322 )	2018-02-02 12:43:02 -06:00
Gian Merlino	7e02408510	Update versions to 0.13.0-SNAPSHOT. (#5323 )	2018-02-02 12:06:38 -06:00
Gian Merlino	10b8540f80	CliCoordinator: LoadQueueTaskMaster should use an escalated http client. (#5329 ) Also remove Guice annotations from LoadQueueTaskMaster, since it is provided by CliCoordinator, so Guice does not need to know how to build one directly.	2018-02-02 10:44:32 -06:00
Himanshu	4cd47de62f	add LookupExtractorFactory.destroy() method (#5287 ) * add LookupExtractorFactory.destroy() method * fix LookupReferencesManagerTest	2018-02-01 22:56:09 -08:00
Gian Merlino	ed47a1e1a9	Lookups: Inherit "injective" from registered lookups, improve docs. (#5316 ) Code changes: - In the lookup-based extractionFns, inherit injective property from the lookup itself if not specified. Doc changes: - Add a "Query execution" section to the lookups doc explaining how injective lookups and their optimizations work. - Remove scary warnings against using registeredLookup extractionFns. They are necessary and important since they work with filters and function cascades -- two things that the dimension specs do not do. They deserve to be first class citizens. - Move the "registeredLookup" fn above the "lookup" fn. It's probably more commonly used, so the docs read better this way.	2018-02-01 18:30:19 -08:00
Jihoon Son	3a69b0e513	Handle nullable taskTypes for rolling upgrade (#5309 )	2018-01-30 13:32:54 -08:00
David Lim	be66d4b822	clean up intermediate_pushes directory for LocalDataSegmentPusher (#5306 )	2018-01-30 12:33:06 -06:00
Jonathan Wei	f6749f1229	Allow separate truststore conf for HttpEmitter (#5298 ) * Fix HttpEmitter TLS support, allow separate truststore conf * PR comment, fix tests	2018-01-26 10:46:06 -06:00
Jonathan Wei	80419752b5	Add metamx emitter, http clients, and metrics packages to druid java-util (#5289 ) * Add metamx java-util emitter, http clients, and metrics packages to druid java-util * Remove metamx java-util from pom.xml files * Checkstyle fixes * Import fix * TeamCity inspection fixes * Use slf4j, move some version defs to master pom.xml * Use parent jvm-attach-api and maven-surefire-plugin versions * Add ] to log msg, suppress inspection	2018-01-24 22:10:36 +01:00
Nishant Bangarwa	aca200fddb	Fix rewrite of queryPath for encoded joda intervals as query param on druid router (#5274 ) * Fix rewrite of queryPath for encoded joda intervals as query param on druid router * fix checkstyle * fix comment	2018-01-24 02:20:07 +05:30
Roman Leventov	61e6878afd	Check Javadoc reference integrity (#5279 )	2018-01-22 13:51:28 -08:00
Roman Leventov	a346bbc6f3	Enforce spacing around foreach colon with Checkstyle (#5271 )	2018-01-22 11:48:51 -08:00
Roman Leventov	f99c27e9e0	Fix bugs in ImmutableRTree; Merge bytebuffer-collections module into druid-processing (#5275 ) * Fix bugs in ImmutableRTree; optimize ImmmutableRTreeObjectStrategy.writeTo(); Merge bytebuffer-collections module into druid-processing * Remove unused declaration * Fix another bug	2018-01-23 00:49:59 +05:30
Roman Leventov	87c744ac1d	Add MethodParamPad, OneStatementPerLine and EmptyStatement Checkstyle checks (#5272 )	2018-01-18 11:29:23 -08:00
Akash Dwivedi	d6932c1621	java-util version update + Add UnusedConnectionTimeout config. (#5239 ) * java-util version update + Add UnusedConnectionTimeout config. * warn if unusedConnectionTime >= readTimeout. * Doc update + addressed comment. * Use compareTo to compare duration. * remove unused variable. * addressed comments and default for unusedConnectionTimeout.	2018-01-17 15:54:18 -06:00
Parag Jain	b6b12db8b4	do not include the index in toString (#5268 )	2018-01-17 20:03:53 +01:00
Jihoon Son	241efafbb2	Automatic compaction by coordinators (#5102 ) * Automatic compaction by coordinator * add links * skip compaction for very recent segments if they are small * fix finding search interval * fix finding search interval * fix TimelineHolder iteration * add test for newestSegmentFirstPolicy * add CompactionSegmentIterator * add numTargetCompactionSegments * add missing config * fix skipping huge shards * fix handling large number of segments per shard * fix test failure * change recursive call to loop * fix logging * fix build * fix test failure * address comments * change dataSources type * check running pendingTasks at each run * fix test * address comments * fix build * fix test * address comments * address comments * add doc for segment size optimization * address comment	2018-01-13 13:52:37 +09:00
Roman Leventov	8877ce38d6	Enforce modifier order with Checkstyle (#5246 )	2018-01-11 09:50:42 +01:00
Jihoon Son	5d0619f5ce	Support retrying for PrefetchableTextFilesFirehoseFactory when prefetch is disabled (#5162 ) * Add RetryingInputStream * unnecessary exception * fix PrefetchableTextFilesFirehoseFactoryTest * Fix retrying on connection reset * fix start offset * fix checkstyle * fix check connection reset * address comments * fix compile * address comments * address comments	2018-01-10 17:37:19 +01:00
Parag Jain	83c6c48bed	Fix state check bug in Kafka Index Task (#5204 ) * fix state check for replacement task * fix comments * rebase with master	2018-01-08 18:01:36 -08:00
Himanshu	a46d34daa2	HTTP based task/worker management. (#5104 ) * just renaming of SegmentChangeRequestHistory etc * additional change history refactoring changes * WorkerTaskManager a replica of WorkerTaskMonitor * HttpServerInventoryView refactoring to extract sync code and robustification * Introducing HttpRemoteTaskRunner * Additional Worker side updates	2018-01-04 19:19:35 -08:00
Roman Leventov	579f9fbedf	Add IndexedInts.debugToString() and AbstractIndex.toString(); Add Sequence.toList() and limit() (#5175 ) * Add IndexedInts.debugToString() and AbstractIndex.toString() * Fix AppenderatorTest	2018-01-04 09:56:47 +09:00
David Lim	a7967ade4d	Support replaceExisting parameter for segments pushers (#5187 ) * support replaceExisting parameter for segments pushers * code review changes * code review changes	2018-01-03 16:13:21 -08:00
Nishant Bangarwa	59af4d3b14	Fix broken KafkaEmitterConfig parsing (#5201 ) * Fix broken KafkaEmitterConfig parsing This was a regression introduced in https://github.com/druid-io/druid/pull/4722 KafkaEmitterConfig property names have dot(.) in the name of properties and JsonConfigurator behavior was changed to not support that. Added a test and fixed parsing of properties that have dot(.) in property names * Fix test failure	2018-01-03 12:08:40 -08:00
Charles Allen	0f773aff80	Fix lookup logging on node start (#5206 ) * Add better logging messages in lookups startup on query nodes * Make sure list is mutable * Move list to be with other `final` variables	2018-01-03 13:13:55 -06:00
Himanshu	0f5c7d1aec	Add freeSpacePercent config in segment location to enforce free space while storing segments (#5137 ) * Add freeSpacePercent config in segment location config to enforce free space while storing segments * address review comments * address review comments: more doc on freeSpacePercent and use Double for freeSpacePercent	2017-12-21 15:31:09 +03:00
Himanshu	f57496ed8b	FilteredHttpServerInventoryViewProvider to start with always false predicate for each segment discovered (#5123 ) * FilteredHttpServerInventoryViewProvider to start with always false predicate for each segment discovered * update HttpServerInventoryViewTest to ensure that predicates are honored * add docs for HttpServerInventoryView.defaultFilter * change to javadoc style comment	2017-12-20 18:56:00 -08:00
Nishant Bangarwa	494e0b79ed	Allow configuring header size for druid requests (#5174 ) * Allow configuring header size for druid requests * fix configuration name in doc. * add more info to docs. * Add info to kerberos doc.	2017-12-20 18:51:40 -08:00
Jihoon Son	9199d61389	Automatic pendingSegments cleanup (#5149 ) * PendingSegments cleanup * fix build * address comments * address comments * fix potential npe * address comments * fix build * fix test * fix test	2017-12-20 14:46:34 -08:00
Roman Leventov	5787d04fad	Bump Druid version to 0.12.0 (#5138 )	2017-12-15 07:37:01 -08:00
Jonathan Wei	f48c9d7be1	Basic auth extension (#5099 ) * Basic auth extension * Add auth configuration integration test * Fix missing authorizerName property * PR comments * Fix missing @JsonProperty annotation * PR comments * more PR comments	2017-12-14 10:36:04 -08:00
Roman Leventov	64848c7ebf	DataSegment memory optimizations (#5094 ) * Deduplicate DataSegments contents (loadSpec's keys, dimensions and metrics lists as a whole) more aggressively; use ArrayMap instead of default LinkedHashMap for DataSegment.loadSpec, because they have only 3 entries on average; prune DataSegment.loadSpec on brokers * Fix DataSegmentTest * Refinements * Try to fix * Fix the second DataSegmentTest * Nullability * Fix tests * Fix tests, unify to use TestHelper.getJsonMapper() * Revert TestUtil as ServerTestHelper, fix tests * Add newline * Fix indexing tests * Fix s3 tests * Try to fix tests, remove lazy caching of ObjectMapper in TestHelper, rename TestHelper.getJsonMapper() to makeJsonMapper() * Fix HDFS tests * Fix HdfsDataSegmentPusherTest * Capitalize constant names	2017-12-12 11:41:40 -08:00

3187 Commits