druid

Commit Graph

Author	SHA1	Message	Date
Surekha	2f8904e25f	Check against the real default of maxBytes(1/6 max mem) in AppenderatorImpl's add (#5758 ) * The check for maxBytesInMemory should be >= 0 instead of > 0 * if the default value is 0, the actual check could be skipped * fix the message for persistReasons * Address PR comments * if maxBytes set -1, make is Long.MAX_VAL, so we do not need to check if it's 0 or -1 * set the maxBytesTuningconfig in AppenderatorImpl constructor to avoid duplicate code * fix the failing test cases * Address PR comments	2018-05-09 13:41:51 -07:00
Jihoon Son	c7a59394e0	Consider waiting and pending compaction tasks as well as running tasks in DruidCoordinatorSegmentCompactor (#5704 ) * Consider waiting and pending compaction tasks as well as running tasks in DruidCoordinatorSegmentCompactor * fix build * fix logging	2018-05-08 19:03:54 -07:00
Kirill Kozlov	67d0b0ee42	Add taskType dimension to task metrics (#5664 )	2018-05-07 09:42:26 -07:00
Fokko Driesprong	a95ec92296	Move to the org.lz4 dependency (#5746 ) The net.jpountz.lz4 moved to org.lz4	2018-05-07 08:16:45 -07:00
Slim Bouguerra	8aa8d9fa5b	Kerberos Spnego Authentication Router Issue (#5706 ) * Adding decoration method to proxy servlet Change-Id: I872f9282fb60bfa20524271535980a36a87b9621 * moving the proxy request decoration to authenticators Change-Id: I7f94b9ff5ecf08e8abf7169b58bc410f33148448 * added docs Change-Id: I901543e52f0faf4666bfea6256a7c05593b1ae70 * use the authentication result to decorate request Change-Id: I052650de9cd02b4faefdbcdaf2332dd3b2966af5 * adding authenticated by name Change-Id: I074d2933460165feeddb19352eac9bd0f96f42ca * ensure that authenticator is not null Change-Id: Idb58e308f90db88224a06f3759114872165b24f5 * fix types and minor bug Change-Id: I6801d49a05d5d8324406fc0280286954eb66db10 * fix typo Change-Id: I390b12af74f44d760d0812a519125fbf0df4e97b * use actual type names Change-Id: I62c3ee763363781e52809ec912aafd50b8486b8e * set authenitcatedBy to null for AutheticationResults created by Escalator. Change-Id: I4a675c372f59ebd8a8d19c61b85a1e4bf227a8ba	2018-05-05 20:33:51 -07:00
kaijianding	c12c16385e	support throw duplcate row during realtime ingestion in RealtimePlumber (#5693 )	2018-05-04 10:12:25 -07:00
Stuart McLean	c2b5e5ec95	Default caffeine cache size (#5738 ) * add default caffeine cache size based on runtime Xmx or max 1GB * update docs for caffeine cache * fix formatting * test caffeine size should never be less than 0 * set caffeine max default size to 1G not 1M * fix caffeine cache tests	2018-05-04 09:29:11 -07:00
Surekha	13c616ba24	'maxBytesInMemory' tuningConfig introduced for ingestion tasks (#5583 ) * This commit introduces a new tuning config called 'maxBytesInMemory' for ingestion tasks Currently a config called 'maxRowsInMemory' is present which affects how much memory gets used for indexing.If this value is not optimal for your JVM heap size, it could lead to OutOfMemoryError sometimes. A lower value will lead to frequent persists which might be bad for query performance and a higher value will limit number of persists but require more jvm heap space and could lead to OOM. 'maxBytesInMemory' is an attempt to solve this problem. It limits the total number of bytes kept in memory before persisting. * The default value is 1/3(Runtime.maxMemory()) * To maintain the current behaviour set 'maxBytesInMemory' to -1 * If both 'maxRowsInMemory' and 'maxBytesInMemory' are present, both of them will be respected i.e. the first one to go above threshold will trigger persist * Fix check style and remove a comment * Add overlord unsecured paths to coordinator when using combined service (#5579) * Add overlord unsecured paths to coordinator when using combined service * PR comment * More error reporting and stats for ingestion tasks (#5418) * Add more indexing task status and error reporting * PR comments, add support in AppenderatorDriverRealtimeIndexTask * Use TaskReport instead of metrics/context * Fix tests * Use TaskReport uploads * Refactor fire department metrics retrieval * Refactor input row serde in hadoop task * Refactor hadoop task loader names * Truncate error message in TaskStatus, add errorMsg to task report * PR comments * Allow getDomain to return disjointed intervals (#5570) * Allow getDomain to return disjointed intervals * Indentation issues * Adding feature thetaSketchConstant to do some set operation in PostAgg (#5551) * Adding feature thetaSketchConstant to do some set operation in PostAggregator * Updated review comments for PR #5551 - Adding thetaSketchConstant * Fixed CI build issue * Updated review comments 2 for PR #5551 - Adding thetaSketchConstant * Fix taskDuration docs for KafkaIndexingService (#5572) * With incremental handoff the changed line is no longer true. * Add doc for automatic pendingSegments (#5565) * Add missing doc for automatic pendingSegments * address comments * Fix indexTask to respect forceExtendableShardSpecs (#5509) * Fix indexTask to respect forceExtendableShardSpecs * add comments * Deprecate spark2 profile in pom.xml (#5581) Deprecated due to https://github.com/druid-io/druid/pull/5382 * CompressionUtils: Add support for decompressing xz, bz2, zip. (#5586) Also switch various firehoses to the new method. Fixes #5585. * This commit introduces a new tuning config called 'maxBytesInMemory' for ingestion tasks Currently a config called 'maxRowsInMemory' is present which affects how much memory gets used for indexing.If this value is not optimal for your JVM heap size, it could lead to OutOfMemoryError sometimes. A lower value will lead to frequent persists which might be bad for query performance and a higher value will limit number of persists but require more jvm heap space and could lead to OOM. 'maxBytesInMemory' is an attempt to solve this problem. It limits the total number of bytes kept in memory before persisting. * The default value is 1/3(Runtime.maxMemory()) * To maintain the current behaviour set 'maxBytesInMemory' to -1 * If both 'maxRowsInMemory' and 'maxBytesInMemory' are present, both of them will be respected i.e. the first one to go above threshold will trigger persist * Address code review comments * Fix the coding style according to druid conventions * Add more javadocs * Rename some variables/methods * Other minor issues * Address more code review comments * Some refactoring to put defaults in IndexTaskUtils * Added check for maxBytesInMemory in AppenderatorImpl * Decrement bytes in abandonSegment * Test unit test for multiple sinks in single appenderator * Fix some merge conflicts after rebase * Fix some style checks * Merge conflicts * Fix failing tests Add back check for 0 maxBytesInMemory in OnHeapIncrementalIndex * Address PR comments * Put defaults for maxRows and maxBytes in TuningConfig * Change/add javadocs * Refactoring and renaming some variables/methods * Fix TeamCity inspection warnings * Added maxBytesInMemory config to HadoopTuningConfig * Updated the docs and examples * Added maxBytesInMemory config in docs * Removed references to maxRowsInMemory under tuningConfig in examples * Set maxBytesInMemory to 0 until used Set the maxBytesInMemory to 0 if user does not set it as part of tuningConfing and set to part of max jvm memory when ingestion task starts * Update toString in KafkaSupervisorTuningConfig * Use correct maxBytesInMemory value in AppenderatorImpl * Update DEFAULT_MAX_BYTES_IN_MEMORY to 1/6 max jvm memory Experimenting with various defaults, 1/3 jvm memory causes OOM * Update docs to correct maxBytesInMemory default value * Minor to rename and add comment * Add more details in docs * Address new PR comments * Address PR comments * Fix spelling typo	2018-05-03 16:25:58 -07:00
Gian Merlino	df01998213	SegmentLoadDropHandler: Fix deadlock when segments have errors loading on startup. (#5735 ) The "lock" object was used to synchronize start/stop as well as synchronize removals from segmentsToDelete (when a segment is done dropping). This could cause a deadlock if a segment-load throws an exception during loadLocalCache. loadLocalCache is run by start() while it holds the lock, but then it spawns loading threads, and those threads will try to acquire the "segmentsToDelete" lock if they want to drop a corrupt segments. I don't see any reason for these two locks to be the same lock, so I split them.	2018-05-03 09:59:01 -07:00
Jihoon Son	2c8296f94d	Fix Appenderator.push() to commit the metadata of all segments (#5730 ) * Remove persist from Appenderator * fix javadoc	2018-05-02 13:17:54 -07:00
Jihoon Son	d4311b4a5a	Support enablePathStyleAccess, disableChunkedEncoding, and forceGlobalBucketAccessEnabled for aws client (#5702 ) * Support enablePathStyleAccess and disableChunkedEncoding for aws client * add an option for forceGlobalBucketAccessEnabled * add missing doc	2018-05-02 10:45:38 -07:00
David Lim	8ec2d2fe18	Use unique segment paths for Kafka indexing (#5692 ) * support unique segment file paths * forbiddenapis * code review changes * code review changes * code review changes * checkstyle fix	2018-04-29 21:59:48 -07:00
Roman Leventov	9be000758d	Refactor index merging, replace Rowboats with RowIterators and RowPointers (#5335 ) * Refactor index merging, replace Rowboats with RowIterators and RowPointers * Add javadocs * Fix a bug in QueryableIndexIndexableAdapter * Fixes * Remove unused declarations * Remove unused GenericColumn.isNull() method * Fix test * Address comments * Rearrange some code in MergingRowIterator for more clarity * Self-review * Fix style * Improve docs * Fix docs * Rename IndexMergerV9.writeDimValueAndSetupDimConversion to setUpDimConversion() * Update Javadocs * Minor fixes * Doc fixes, more code comments, cleanup of RowCombiningTimeAndDimsIterator * Fix doc link	2018-04-27 17:34:32 -07:00
David Lim	55b003e5e8	Fix loadstatus?full double counting expected segments (#5667 ) * fix loadstatus?full double counting expected segments * remove possible flakiness from Thread.sleep() in test	2018-04-24 01:11:16 +05:30
Roman Leventov	a3a9ada843	Add GenericWhitespace checkstyle check (#5668 )	2018-04-24 01:09:14 +05:30
Jihoon Son	ca3f833426	Fix coordinator's dataSource api with full parameter (#5662 ) * Fix coordinator's dataSource api with full parameter * address comment * Add a constructor for json serde and fix result order * Change to immutableSortedMap * Revert immutableSortedMap to treeMap	2018-04-19 17:41:53 -07:00
Kirill Kozlov	a7ba2bf275	Detailed error message when unable to create temp dir (#5648 )	2018-04-17 15:12:46 -07:00
Jonathan Wei	d0b66a6af5	Fix HTTP OPTIONS request auth handling (#5638 ) * Fix HTTP OPTIONS request auth handling * PR comment * More PR comments * Fix * PR comment	2018-04-16 18:09:56 -07:00
Jonathan Wei	882b172318	Revert "Fix HTTP OPTIONS request auth handling (#5615 )" (#5637 ) This reverts commit `df51a7bcb7`.	2018-04-12 16:43:54 -07:00
Jonathan Wei	e91add6843	Fix coordinator loadStatus performance (#5632 ) * Optimize coordinator loadStatus * Add comment * Fix teamcity * Checkstyle * More checkstyle * Checkstyle	2018-04-12 15:07:52 -07:00
Jonathan Wei	df51a7bcb7	Fix HTTP OPTIONS request auth handling (#5615 ) * Fix HTTP OPTIONS request auth handling * Flip configuration boolean	2018-04-12 14:02:20 -07:00
Gian Merlino	d0400a0688	SegmentWithState: Add toString method. (#5635 ) The class appears in log messages, and the default toString method isn't very informative.	2018-04-12 14:01:09 -05:00
palanieppan-m	dbea5cb9b7	Load rules should honor partial overlap (#5595 ) Load rules should load segments that partially overlap with rule window, instead of loading only segments that fully overlap.	2018-04-12 09:46:00 -07:00
Atul Mohan	19f359957f	Add getters for AlertEvent (#5522 ) * Add getters for AlertEvent * Move PublicApi and ExtensionPoint to java-util * Fix publicapi annotation usage * Add publicapi annotations to ServiceMetricEvent and RequestLogEvent	2018-04-12 23:38:20 +07:00
Nishant Bangarwa	e6efd75a3d	Add config to allow setting up custom unsecured paths for druid nodes. (#5614 ) * Add config to allow setting up custom unsecured paths for druid nodes. * return all resources for Unsecured paths * review comment - Add test * fix tests * fix test	2018-04-11 17:10:07 -07:00
Clint Wylie	ea4f8544fb	revert lambda conversion to fix occasional jvm error (#5591 )	2018-04-06 14:18:55 -07:00
Gian Merlino	5ab17668c0	CompressionUtils: Add support for decompressing xz, bz2, zip. (#5586 ) Also switch various firehoses to the new method. Fixes #5585.	2018-04-06 08:06:45 -07:00
Niketh Sabbineni	270fd1ea15	Allow getDomain to return disjointed intervals (#5570 ) * Allow getDomain to return disjointed intervals * Indentation issues	2018-04-05 22:12:30 -07:00
Jonathan Wei	969342cd28	More error reporting and stats for ingestion tasks (#5418 ) * Add more indexing task status and error reporting * PR comments, add support in AppenderatorDriverRealtimeIndexTask * Use TaskReport instead of metrics/context * Fix tests * Use TaskReport uploads * Refactor fire department metrics retrieval * Refactor input row serde in hadoop task * Refactor hadoop task loader names * Truncate error message in TaskStatus, add errorMsg to task report * PR comments	2018-04-05 21:38:57 -07:00
Niketh Sabbineni	f0a94f5035	Remove unused config (#5564 ) * Remove unused config * Fix failing tests	2018-04-03 13:23:46 -07:00
Clint Wylie	f31dba6c5b	Coordinator drop segment selection through cost balancer (#5529 ) * drop selection through cost balancer * use collections.emptyIterator * add test to ensure does not drop from server with larger loading queue with cost balancer * javadocs and comments to clear things up * random drop for completeness	2018-04-03 11:22:51 -07:00
Clint Wylie	a81ae99021	add 'stopped' check and handling to HttpLoadQueuePeon load and drop segment methods (#5555 ) * add stopped check and handling to HttpLoadQueuePeon load and drop segment methods * fix unrelated timeout :( * revert unintended change * PR feedback: change logging * fix dumb	2018-04-03 11:21:52 -07:00
Clint Wylie	6feac204e3	Coordinator primary segment assignment fix (#5532 ) * fix issue where assign primary assigns segments to all historical servers in cluster * fix test * add test to ensure primary assignment will not assign to another server while loading is in progress	2018-04-02 09:40:20 -07:00
Jihoon Son	05547e29b2	Fix SQLMetadataSegmentManager to allow succesive start and stop (#5554 ) * Fix SQLMetadataSegmentManager to allow succesive start and stop * address comment * add synchronization	2018-03-30 12:43:19 -07:00
Clint Wylie	30fc4d3ba0	Coordinator balancer move then drop fix (#5528 ) * #5521 part 1 * formatting * oops * less magic tests	2018-03-29 10:30:12 -07:00
Kirill Kozlov	8878a7ff94	Replace guava Charsets with native java StandardCharsets (#5545 )	2018-03-28 21:00:08 -07:00
Atul Mohan	ec17a44e09	Add result level caching to Brokers (#5028 ) * Add result level caching to Brokers * Minor doc changes * Simplify sequences * Move etag execution * Modify cacheLimit criteria * Fix incorrect etag computation * Fix docs * Add separate query runner for result level caching * Update docs * Add post aggregated results to result level cache * Fix indents * Check byte size for exceeding cache limit * Fix indents * Fix indents * Add flag for result caching * Remove logs * Make cache object generation synchronous * Avoid saving intermediate cache results to list * Fix changes that handle etag based response * Release bytestream after use * Address PR comments * Discard resultcache stream after use * Fix docs * Address comments * Add comment about fluent workflow issue	2018-03-23 19:11:52 -07:00
Charles Allen	ef21ce5a64	Add graceful shutdown timeout for Jetty (#5429 ) * Add graceful shutdown timeout * Handle interruptedException * Incorporate code review comments * Address code review comments * Poll for activeConnections to be zero * Use statistics handler to get active requests * Use native jetty shutdown gracefully * Move log line back to where it was * Add unannounce wait time * Make the default retain prior behavior * Update docs with new config defaults * Make duration handling on jetty shutdown more consistent * StatisticsHandler is a wrapper * Move jetty lifecycle error logging to error	2018-03-23 09:38:17 -07:00
Jihoon Son	1ad898bde2	Use the official aws-sdk instead of jet3t (#5382 ) * Use the official aws-sdk instead of jet3t * fix compile and serde tests * address comments and fix test * add http version string * remove redundant dependencies, fix potential NPE, and fix test * resolve TODOs * fix build * downgrade jackson version to 2.6.7 * fix test * resolve the last TODO * support proxy and endpoint configurations * fix build * remove debugging log * downgrade hadoop version to 2.8.3 * fix tests * remove unused log * fix it test * revert KerberosAuthenticator change * change hadoop-aws scope to provided in hdfs-storage * address comments * address comments	2018-03-21 15:36:54 -07:00
Charles Allen	58f110f7f8	Future-proof some Guava usage (#5414 ) * Future-proof some Guava usage * Use a java-util EmptyIterator instead of Guava's * Change some of the guava future handling to do manual async transforms. Guava changes transform into transformAsync by deprecating transform in ONLY Guava 19. Then its gone in 20 * Use `Collections.emptyIterator()` * Pretty formatting * Make listenable future transforms a thing in default druid * Format fix * Add forbidden guava apis * Make the ListenableFutrues.transformAsync have comments * Undo intellij bad pattern matching in comments * Futrues --> Futures * Add empty iterators forbidding * Fix extra `A` * Correct method signature * Address review comments * Finish Gian review comments * Proper syntax from https://github.com/policeman-tools/forbidden-apis/wiki/SignaturesSyntax	2018-03-20 08:59:33 -07:00
Jonathan Wei	b22455b924	Fix supervisor tombstone auth handling (#5504 )	2018-03-19 12:55:47 -07:00
Roman Leventov	693e3575f9	Remove unused code and exception declarations (#5461 ) * Remove unused code and exception declarations * Address comments * Remove redundant Exception declarations * Make FirehoseFactoryV2.connect() to throw IOException again	2018-03-16 22:11:12 +01:00
Jonathan Wei	30e6bdedf3	Authorize supervisor history instead of current active supervisors for supervisor history API (#5501 )	2018-03-16 12:29:17 -07:00
Gian Merlino	a08efe4683	Fix round robining in router. (#5500 ) * Fix round robining in router. Say that ten times fast. For query endpoints, AsyncQueryForwardingServlet called hostFinder.getDefaultServer() to set a default server, followed by hostFinder.getServer(inputQuery) to override it with query-specific routing. Since hostFinder is round-robin, this skips a server. When there are only two servers, one server is _always_ skipped and the router sends all queries to the same broker. * Adjust spacing.	2018-03-15 18:45:59 -07:00
Gian Merlino	fdd55538e1	SQL: Remove unused escalator, authConfig from various classes. (#5483 ) DruidPlanner.plan is responsible for checking authorization, so these objects weren't needed in as many places as they were injected.	2018-03-14 13:28:51 -07:00
Jihoon Son	9b2a25bd84	Refactor supervisorReport to be type-safe (#5479 ) * refactor supervisorReport * use primitives	2018-03-13 09:28:44 -07:00
Himanshu	e968811583	HttpServerInventoryView: fixed startup wait time and more informative logging (#5336 )	2018-03-12 22:13:51 -07:00
Roman Leventov	6b158abe3f	Enforce optimal IndexedInts iteration (#5456 ) * Enforce optimal IndexedInts iteration * Fix remaining suboptimal usages	2018-03-09 09:42:40 -08:00
Alexander Korablev	6a3a5350b8	Make memcached protocol and locator configurable. (#5438 ) * Make memcached protocol and locator configurable. * Style fix. * Style fix. * Style fix.	2018-02-28 17:16:43 -08:00
Niketh Sabbineni	ac5034e241	Improve cache cost to handle heterogenous historicals (#5416 )	2018-02-23 13:17:31 -08:00
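
Note on a08efe4683 (#5500): the commit message describes the router calling hostFinder.getDefaultServer() and then hostFinder.getServer(inputQuery) on a round-robin host finder, so every query advances the cursor twice and, with two servers, always lands on the same broker. The following is a minimal Python sketch of that behaviour, not Druid's actual code; the class, function, and broker names are hypothetical stand-ins chosen for illustration.

```python
from collections import Counter
from itertools import cycle


class RoundRobinHostFinder:
    """Hypothetical stand-in for a round-robin host finder (not Druid's class)."""

    def __init__(self, servers):
        self._cycle = cycle(servers)

    def get_default_server(self):
        # Advances the shared round-robin cursor.
        return next(self._cycle)

    def get_server(self, query):
        # Assumption: query-specific routing advances the same cursor.
        return next(self._cycle)


def route_buggy(finder, query):
    # Pattern described in the commit message: pick a default, then override it.
    server = finder.get_default_server()  # cursor advances, result is discarded
    server = finder.get_server(query)     # cursor advances a second time
    return server


def route_fixed(finder, query):
    # One cursor advance per query, so servers alternate as expected.
    return finder.get_server(query)


servers = ["broker-a", "broker-b"]  # hypothetical broker names

buggy_finder = RoundRobinHostFinder(servers)
buggy_counts = Counter(route_buggy(buggy_finder, q) for q in range(10))

fixed_finder = RoundRobinHostFinder(servers)
fixed_counts = Counter(route_fixed(fixed_finder, q) for q in range(10))

print(dict(buggy_counts))
print(dict(fixed_counts))
```

With two servers the double advance always skips the first one, so the buggy path sends every query to the second broker, while the single-advance path splits the ten queries evenly.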

3132 Commits