druid

Commit Graph

Author	SHA1	Message	Date
Jonathan Wei	0b4f771062	Exclude hadoop-lzo from thrift-extensions build (#7151 )	2019-02-27 19:57:53 -08:00
Jonathan Wei	3d247498ef	Update tutorials for 0.14.0-incubating (#7157 )	2019-02-27 19:50:31 -08:00
Jihoon Son	cacdc83cad	Improve error message for integer overflow in compaction task (#7131 ) * improve error message for integer overflow in compaction task * fix build	2019-02-28 11:07:37 +08:00
Jihoon Son	6b232d8195	Improve compaction tutorial to demonstrate compaction with keepSegmentGranularity = true (#7079 ) * Improve compaction tutorial to demonstrate compaction with keepSegmentGranularity = true * typo * add a warning	2019-02-27 16:02:51 -08:00
Clint Wylie	9fa649b3bd	segment metadata fallback analysis if no bitmaps (#7116 ) * segment metadata fallback analysis if no bitmaps * remove accidental line * remove nonsense size estimation * less ternary * fix it * do the thing	2019-02-26 11:27:41 -08:00
Vadim Ogievetsky	b8f762037a	Downgrade blueprintjs version in the web console to one with a vanilla Apache 2.0 license (#7139 ) * revert bp * fix tests * move @types/hjson to dev dep * removed all the package upgrades	2019-02-25 20:54:56 -08:00
Mirko Jotic	f6a8e030cc	Select query failing if miliseconds used as time for indexing (#6937 ) * [#1332] Fix - select failing if milis used for idx. * Formating correction. * Address comment: throw original exception. * Using constant values in tests - Try converting to Integer and then multiply by 1000L to achieve milis. - If not successful try converting to Long or rethrow original exception. * DateTime#of has to support "2011-01-01T00:00:00" - in addition to seconds and milisecs, this method currently supports even a date string. * Handle only milisec timestamps and ISO8601 strings	2019-02-25 14:36:01 -08:00
Jihoon Son	9a066558a4	Fix exception when the scheme is missing in endpointUrl for S3 (#7129 ) * Fix exception when the scheme is missing in endpointUrl for S3 * add null check	2019-02-25 11:10:35 -08:00
Himanshu Pandey	8b803cbc22	Added checkstyle for "Methods starting with Capital Letters" (#7118 ) * Added checkstyle for "Methods starting with Capital Letters" and changed the method names violating this. * Un-abbreviate the method names in the calcite tests * Fixed checkstyle errors * Changed asserts position in the code	2019-02-23 20:10:31 -08:00
David Glasser	1c2753ab90	ParallelIndexSubTask: support ingestSegment in delegating factories (#7089 ) IndexTask had special-cased code to properly send a TaskToolbox to a IngestSegmentFirehoseFactory that's nested inside a CombiningFirehoseFactory, but ParallelIndexSubTask didn't. This change refactors IngestSegmentFirehoseFactory so that it doesn't need a TaskToolbox; it instead gets a CoordinatorClient and a SegmentLoaderFactory directly injected into it. This also refactors SegmentLoaderFactory so it doesn't depend on an injectable SegmentLoaderConfig, since its only method always replaces the preconfigured SegmentLoaderConfig anyway. This makes it possible to use SegmentLoaderFactory without setting druid.segmentCaches.locations to some dummy value. Another goal of this PR is to make it possible for IngestSegmentFirehoseFactory to list data segments outside of connect() --- specifically, to make it a FiniteFirehoseFactory which can query the coordinator in order to calculate its splits. See #7048. This also adds missing datasource name URL-encoding to an API used by CoordinatorBasedSegmentHandoffNotifier.	2019-02-23 17:02:56 -08:00
Jonathan Wei	417b9f2fe1	Add bug report and feature request GitHub issue templates (#7105 ) * Add bug report GitHub template * PR comments * Add feature request template * Tweak * Add [REQUEST] title * Remove request title, add note	2019-02-21 08:44:41 -08:00
Jihoon Son	4e2b085201	Remove DataSegmentFinder, InsertSegmentToDb, and descriptor.json file in deep storage (#6911 ) * Remove DataSegmentFinder, InsertSegmentToDb, and descriptor.json file * delete descriptor.file when killing segments * fix test * Add doc for ha * improve warning	2019-02-20 15:10:29 -08:00
Mingming Qiu	dd34691004	Coordinator await initialization before finishing startup (#6847 ) * Curator server inventory await initialization * address comments * print exception object in log * remove throws ISE * cachingCost awaitInitialization default to false	2019-02-20 11:56:23 -08:00
David Glasser	a81b1b8c9c	index_parallel: support !appendToExisting with no explicit intervals (#7046 ) * index_parallel: support !appendToExisting with no explicit intervals This enables ParallelIndexSupervisorTask to dynamically request locks at runtime if it is run without explicit intervals in the granularity spec and with appendToExisting set to false. Previously, it behaved as if appendToExisting was set to true, which was undocumented and inconsistent with IndexTask and Hadoop indexing. Also, when ParallelIndexSupervisorTask allocates segments in the explicit interval case, fail if its locks on the interval have been revoked. Also make a few other additions/clarifications to native ingestion docs. Fixes #6989. * Review feedback. PR description on GitHub updated to match. * Make native batch ingestion partitions start at 0 * Fix to previous commit * Unit test. Verified to fail without the other commits on this branch. * Another round of review * Slightly scarier warning	2019-02-20 10:54:26 -08:00
Furkan KAMACI	9a521526c7	Since notify might not wake up the right thread, notifyAll should be used instead. (#6931 ) * Since notify might not wake up the right thread, notifyAll should be used instead. * Comment is added about why notifyAll() is not used.	2019-02-20 09:02:58 -08:00
Justin Borromeo	871b9d2f4c	[Benchmarking] Call blackhole#consume() on collections instead of iterating through each element (#7002 ) * Replaced iteration with blackhole#consume(the collection) * Added javadoc on Sequence#toList()	2019-02-20 08:48:06 -08:00
Dylan Wylie	554b0142c3	Autoclose old PRs using stale bot. (#7031 ) * Autoclose old PRs using stale bot. * add apache license * Excempt bug label	2019-02-19 14:26:54 -08:00
Fangyuan Deng	7d1e8f353e	bugfix: when building materialized-view, if taskCount>1, may cause concurrentModificationException (#6690 ) * bugfix: when building materialized-view, if taskCount >1, may cause ConcurrentModificationException * remove entry after iteration instead of using ConcurrentMap, and add unit test * small change * modify unit test for coverage * remove unused method	2019-02-19 13:10:55 -08:00
Jonathan Wei	258485a2fb	Exclude github issue templates from license check (#7070 ) * Exclude github issue templates from license check * Adjust capitalization	2019-02-19 12:38:52 -08:00
Surekha	2b04e6d0bc	add note on consistency of results for sys.segments queries (#7034 ) * add doc * change docs * PR comments * few more changes	2019-02-19 10:52:37 -08:00
Clint Wylie	cadb6c5280	Missing Overlord and MiddleManager api docs (#7042 ) * document middle manager api * re-arrange * correction * document more missing overlord api calls, minor re-arrange of some code i was referencing * fix it * this will fix it * fixup * link to other docs	2019-02-19 10:52:05 -08:00
Surekha	80a2ef7be4	Support kafka transactional topics (#5404 ) (#6496 ) * Support kafka transactional topics * update kafka to version 2.0.0 * Remove the skipOffsetGaps option since it's not used anymore * Adjust kafka consumer to use transactional semantics * Update tests * Remove unused import from test * Fix compilation * Invoke transaction api to fix a unit test * temporary modification of travis.yml for debugging * another attempt to get travis tasklogs * update kafka to 2.0.1 at all places * Remove druid-kafka-eight dependency from integration-tests, remove the kafka firehose test and deprecate kafka-eight classes * Add deprecated in docs for kafka-eight and kafka-simple extensions * Remove skipOffsetGaps and code changes for transaction support * Fix indentation * remove skipOffsetGaps from kinesis * Add transaction api to KafkaRecordSupplierTest * Fix indent * Fix test * update kafka version to 2.1.0	2019-02-18 11:50:08 -08:00
Jonathan Wei	61272d6daa	Update handlebars dep to patch vulnerability (#7083 )	2019-02-18 18:06:47 +08:00
Justin Borromeo	c7eeeabf45	2528 Replace Incremental Index Global Flags with Getters (#7043 ) * Eliminated reportParseExceptions and deserializeComplexMetrics * Removed more global flags * Cleanup * Addressed Surekha's recommendations	2019-02-15 13:36:46 -08:00
scrawfor	0fa9000849	Add Postgresql SqlFirehose (#6813 ) * Add Postgresql SqlFirehose * Fix Code Style. * Fix style. * Fix Import Order. * Add Line Break before package.	2019-02-14 22:52:03 -08:00
awelsh93	ee91e27fe7	Update api-reference.md doc (#7065 ) - moving description of coordinator isLeader endpoint	2019-02-14 14:38:09 +00:00
Jonathan Wei	1f29940811	Fix momentsketch build issues (#7074 ) * Fix momentsketch build issues * Remove unused section in pom * Fix test * Remove unused method * Checkstyle	2019-02-13 21:32:43 -08:00
Edward Gan	90c1a54b86	Moments Sketch custom aggregator (#6581 ) * Moments Sketch Integration with Druid * updates, add documentation, fix warnings * nits * disallowed base64 * update to druid 0.14	2019-02-13 14:03:47 -08:00
Jonathan Wei	673396ae74	Add proposal template (#7062 ) * Add proposal template Adds a proposal template based on the discussion in https://lists.apache.org/thread.html/bb9c5e1f8ce9b3148a5c26f95059f9b6629fae3bf8c617121d671395@%3Cdev.druid.apache.org%3E * Add license	2019-02-13 13:43:31 -08:00
Jihoon Son	970308463d	Add doc for Hadoop-based ingestion vs Native batch ingestion (#7044 ) * Add doc for Hadoop-based ingestion vs Native batch ingestion * add links * add links	2019-02-13 11:23:08 -08:00
Jihoon Son	1701fbcad3	Improve error message for revoked locks (#7035 ) * Improve error message for revoked locks * fix test * fix test * fix test * fix toString	2019-02-13 11:22:48 -08:00
Jihoon Son	b1c4a5de0d	Fix and improve doc for partitioning of local index (#7064 )	2019-02-13 11:20:52 -08:00
Mingming Qiu	d0abf5c20a	fix kafka index task doesn't resume when recieve duplicate request (#6990 ) * fix kafka index task doesn't resume when recieve duplicate request * add unit test	2019-02-12 13:24:28 -08:00
Jonathan Wei	8ba11591b6	Add router conf to assembly.xml (#7051 )	2019-02-12 10:33:18 +08:00
Surekha	02ef14f262	Fix num_rows in sys.segments (#6888 ) * Fix the bug with num_rows in sys.segments * Fix segmentMetadataInfo update in DruidSchema * Add numRows to SegmentMetadataHolder builder's constructor, so it's not overwritten * Rename SegSegmentSignature to setSegmentMetadataHolder and fix it so nested map is appended instead of recreated * Replace Map<String, Set<String>> segmentServerMap with Set<String> for num_replica * Remove unnecessary code and update test * Add unit test for num_rows * PR comments * change access modifier to default package level * minor changes to comments * PR comments	2019-02-11 16:21:19 -08:00
Ankit Kothari	16a4a50e91	[Issue #6967 ] NoClassDefFoundError when using druid-hdfs-storage (#7015 ) * Fix: 1. hadoop-common dependency for druid-hdfs and druid-kerberos extensions Refactoring: 2. Hadoop config call in the inner static class to avoid class path conflicts for stopGracefully kill * Fix: 1. hadoop-common test dependency * Fix: 1. Avoid issue of kill command once the job is actually completed	2019-02-08 18:26:37 -08:00
Jihoon Son	d42de574d6	Add an api to get all lookup specs (#7025 ) * Add an api to get all lookup specs * add doc	2019-02-08 11:05:59 -08:00
Jihoon Son	c9f21bc782	Fix filterSegments for TimeBoundary and DataSourceMetadata queries (#7023 ) * Fix filterSegments for TimeBoundary and DataSourceMetadata queries * add javadoc * fix build	2019-02-08 10:03:02 -08:00
Don Bowman	b3dcbe70ad	Add docker container for druid (#6896 ) * Add docker container for druid This container is an 'omnibus' (since there is such a high overlap with the various services). It includes all contrib extension as well as the msql connector. It is intended to be run as `docker run NAME SERVICE` (e.g. docker run druid:latest broker) * Add Apache license header * Resolve issues from Pull Request review * Add comments at top of script per PR comments * Revert BUILDKIT. Not available everywhere. * Don't set hostname, allow default (IP) Some environments (e.g. Kubernetes Deployments) don't resolve hostname to IP. * Switch to amd64 glibc-based busybox from 32-bit uclibc * Override service-specific configuration * Replace MAINTAINER w/ LABEL * Add mysql connector to application classpath This works around issue #3770 https://github.com/apache/incubator-druid/issues/3770 * Add docker-compose and sample environment Signed-off-by: Don Bowman <don@agilicus.com>	2019-02-08 12:12:28 +00:00
Jonathan Wei	fafbc4a80e	Set version to 0.15.0-incubating-SNAPSHOT (#7014 )	2019-02-07 14:02:52 -08:00
Furkan KAMACI	3097562adf	Improper getter value is fixed. (#6930 ) * Improper getter value is fixed. * Test class is added.	2019-02-07 11:51:07 -08:00
Jihoon Son	8e3a58f723	Improve druid.storage.sse.kms.keyId and druid.s3.protocol (#7012 ) * Improve druid.storage.sse.kms.keyId and druid.s3.protocol * fix article	2019-02-06 15:00:51 -08:00
Justin Borromeo	6723243ed2	Create Scan Benchmark (#6986 ) * Moved Scan Builder to Druids class and started on Scan Benchmark setup * Need to form queries * It runs. * Remove todos * Change number of benchmark iterations * Changed benchmark params * More param changes * Made Jon's changes and removed TODOs * Broke some long lines into two lines * Decrease segment size for less memory usage * Committing a param change to kick teamcity	2019-02-06 14:45:01 -08:00
Furkan KAMACI	58f9507ccf	Improper equals override is fixed to prevent NullPointerException (#6938 ) * Improper equals override is fixed to prevent NullPointerException * Fixed curly brace indentation. * Test method is added for equals method of TaskLockPosse class.	2019-02-06 14:08:50 -08:00
anantmf	315ccb76b8	Fix for getSingleObjectSummary, replacing keyCount with objectSummaries().size (#7000 ) * Instead of using keyCount, changing it to check the size of objectSummaries. For issue: https://github.com/apache/incubator-druid/issues/6980 https://github.com/apache/incubator-druid/issues/6980#issuecomment-460006580 * Changing another usage of keyCount with size of objectSummaries. * Adding some comments to explain why using keyCount is not working as expected.	2019-02-05 15:45:44 -08:00
Surekha	ef451d3603	Add null checks in DruidSchema (#6830 ) * Add null checks in DruidSchema * Add unit tests * Add VisibleForTesting annotation * PR comments * unused import	2019-02-05 13:42:20 -08:00
Jihoon Son	75c70c2ccc	Add doc for S3 permissions settings (#7011 ) * Add doc for S3 permissions settings * add a comment about additional settings	2019-02-05 11:52:09 -08:00
Jonathan Wei	8bc5eaa908	Set version to 0.14.0-incubating-SNAPSHOT (#7003 )	2019-02-04 19:36:20 -08:00
Egor Riashin	97b6407983	maintenance mode for Historical (#6349 ) * maintenance mode for Historical forbidden api fix, config deserialization fix logging fix, unit tests * addressed comments * addressed comments * a style fix * addressed comments * a unit-test fix due to recent code-refactoring * docs & refactoring * addressed comments * addressed a LoadRule drop flaw * post merge cleaning up	2019-02-04 18:11:00 -08:00
David Glasser	7e48593b57	ParallelIndexSupervisorTask: don't warn about a default value (#6987 ) Native batch indexing doesn't yet support the maxParseExceptions, maxSavedParseExceptions, and logParseExceptions tuning config options, so ParallelIndexSupervisorTask logs if these are set. But the default value for maxParseExceptions is Integer.MAX_VALUE, which means that you'll get the maxParseExceptions flavor of this warning even if you don't configure maxParseExceptions. This PR changes all three warnings to occur if you change the settings from the default; this mostly affects the maxParseExceptions warning.	2019-02-04 12:00:26 -08:00

... 7 8 9 10 11 ...

9453 Commits All Branches Search

9453 Commits

All Branches