druid

Commit Graph

Author	SHA1	Message	Date
Jihoon Son	f456d2fcf8	Resource leak in DruidSegmentReader (#9476 ) * Close the Yielder in DruidSegmentReader * forbidden api	2020-03-09 10:05:25 -07:00
Clint Wylie	a677664811	allow optimization of single multi-value column input expr with repeated identifier (#9425 ) * allow optimization of single multi-value column input expr with repeated identifier * add test	2020-03-06 12:53:32 -08:00
Julian Jaffe	eda03630d0	Add OnHeapMemorySegmentWriteOutMediumFactory (#9454 ) * Add OnHeapMemorySegmentWriteOutMediumFactory Add a factory for OnHeapMemorySegmentWriteOutMedium to support direct writing via Spark. * Register OnHeapMemorySegmentWriteOutMediumFactory. Register OnHeapMemorySegmentWriteOutMediumFactory with SegmentWriteOutMediumFactory. * Remove unnecessary throws The base `makeSegmentWriteOutMedium` throws an IOException, but the particular implementation of OnHeapMemorySegmentWriteOutMediumFactory does not throw a checked exception. * Update SegmentWriteOutMedium docs to include onHeapMemory Update the SegmentWriteOutMedium section of the indexing docs to include a description of the new OnHeapSegmentMediumWriteOut option.	2020-03-05 22:34:08 -08:00
Jihoon Son	64afc05080	Open the licenses.yaml with an explicit encoding (#9462 )	2020-03-05 17:13:44 -08:00
Clint Wylie	32cd47bc8e	Fix home view styling (#9444 )	2020-03-04 19:39:36 -08:00
Jihoon Son	3016057178	Make Transform an ExtensionPoint (#9319 ) * Make Transform an ExtensionPoint * Add transform to the list of documented extensions * Add example transform implementation	2020-03-04 12:13:14 -08:00
Chi Cao Minh	4ed83f6af6	Fix superbatch merge last partition boundaries (#9448 ) * Fix superbatch merge last partition boundaries A bug in the computation for the last parallel merge partition could cause an IndexOutOfBoundsException or precondition failure due to an empty partition. * Improve comments and tests	2020-03-04 10:35:21 -08:00
Jihoon Son	9466ac7c9b	Skip empty files for local, hdfs, and cloud input sources (#9450 ) * Skip empty files for local, hdfs, and cloud input sources * split hint spec doc * doc for skipping empty files * fix typo; adjust tests * unnecessary fluent iterable * address comments * fix test * use the right lists * fix test * fix test	2020-03-03 20:51:06 -08:00
mcbrewster	99095c4ac5	Add Azure ingestion flow to web console (#9437 ) * add support for azure * change bucket to container * add azure to input menu * remove static-azure	2020-03-03 11:06:00 -08:00
Gian Merlino	1fd865b7c1	BufferArrayGrouper: Fix potential overflow in requiredBufferCapacity. (#9435 ) * BufferArrayGrouper: Fix potential overflow in requiredBufferCapacity. If cardinality was high, the computation could overflow an int. There were tests for this, but the tests were wrong. * Nicer.	2020-02-28 14:27:52 -08:00
Gian Merlino	81d8be6e39	CacheStrategy: Improve Javadocs. (#9280 ) * CacheStrategy: Improve Javadocs. * Update processing/src/main/java/org/apache/druid/query/CacheStrategy.java Co-Authored-By: Suneet Saldanha <44787917+suneet-s@users.noreply.github.com> Co-authored-by: Suneet Saldanha <44787917+suneet-s@users.noreply.github.com>	2020-02-28 11:30:58 -08:00
Vadim Ogievetsky	c294e0b7c6	Web console: Column counter (#9334 ) * Column counter * more general test	2020-02-27 12:04:27 -08:00
Gian Merlino	ef3d24e886	Add javadocs for enableFilterPushDown. (#9423 )	2020-02-26 22:07:33 -08:00
Gian Merlino	ae617bf5dd	Clarify InputSource.isSplittable usage. (#9424 ) Also removes TimedShutoffInputSource, which had a bug in isSplittable (it improperly returned true, even though it didn't implement SplittableInputSource). This bug had no user-visible impact, since the code wasn't used.	2020-02-26 22:05:46 -08:00
Chi Cao Minh	5d05b40e6d	Remove druid incubating references (#9405 )	2020-02-26 21:47:58 -08:00
Lijia Liu	063811710e	#8690 use utc interval when create pedding segments (#9142 ) Co-authored-by: Gian Merlino <gianmerlino@gmail.com>	2020-02-26 13:20:59 -08:00
Jihoon Son	b924161086	Add main method to VersionedIntervalTimelineBenchmark (#9404 )	2020-02-26 12:01:02 -08:00
Aditya	e506fc9fdf	fix cursor position after function autocomplete (#9396 ) Closes #9395	2020-02-26 09:41:24 -08:00
Gian Merlino	c9faf3e148	Add SQL GROUPING SETS support. (#9122 ) * Add SQL GROUPING SETS support. Built on top of the subtotalsSpec feature in the groupBy query. This also involves two changes to subtotalsSpec: - Alter behavior so limitSpec is applied after subtotalsSpec, rather than applied to each grouping set. This is more in line with SQL standard behavior. I think it is okay to make this change, since the old behavior was not documented, so users should hopefully not be depending on it. - Fix a bug where virtual columns were included in the subtotal queries, but they should not have been. Also fixes two bugs in query equality checking: - BaseQuery: Use getDuration() instead of "duration" in equals and hashCode, since the latter is lazily initialized and might be null in one query but not the other. - GroupByQuery: Include subtotalsSpec in equals and hashCode. * Fix bugs. * Fix tests. * PR updates. * Grouping class hygiene.	2020-02-26 08:52:39 -08:00
Maytas Monsereenusorn	92fb83726b	Add support for optional aws credentials for s3 for ingestion (#9375 ) * Add support for optional cloud (aws, gcs, etc.) credentials for s3 for ingestion * Add support for optional cloud (aws, gcs, etc.) credentials for s3 for ingestion * Add support for optional cloud (aws, gcs, etc.) credentials for s3 for ingestion * fix build failure * fix failing build * fix failing build * Code cleanup * fix failing test * Removed CloudConfigProperties and make specific class for each cloudInputSource * Removed CloudConfigProperties and make specific class for each cloudInputSource * pass s3ConfigProperties for split * lazy init s3client * update docs * fix docs check * address comments * add ServerSideEncryptingAmazonS3.Builder * fix failing checkstyle * fix typo * wrap the ServerSideEncryptingAmazonS3.Builder in a provider * added java docs for S3InputSource constructor * added java docs for S3InputSource constructor * remove wrap the ServerSideEncryptingAmazonS3.Builder in a provider	2020-02-25 20:59:53 -08:00
Jonathan Wei	5ce9c81b68	Add join prefix duplicate/shadowing check (#9384 ) * Add join prefix duplicate/shadowing check * Fix format string * PR comments * PR comment * Optimize loop PR comment	2020-02-25 18:17:23 -08:00
zachjsh	d771b42ed1	Move Azure extension into Core (#9394 ) * Move Azure extension into Core Moving the azure extension into Core. * * Fix build failure * * Add The MIT License (MIT) to list of compatible licenses * * Address review comments * * change reference to contrib azure to core azure * * Fix spelling mistakes.	2020-02-25 17:49:16 -08:00
Francesco Nidito	14accb50ad	Improves on the fix for 8918 (#9387 ) * Improves on the fix for 8918 * factorize constants for ITRetryUtil.retryUntil call * increasing retries and sleep in HttpUtil to cope with 401s in testing * adding retries in EventReceiverFirehoseTestClient * adding missing space	2020-02-25 15:50:27 -08:00
als-sdin	f619903403	Updated the configuration documentation on coordinator kill tasks to clarify whether they delete only unused segments. (#9400 )	2020-02-25 13:15:55 -08:00
Chi Cao Minh	7fc99ee206	Add common optional dependencies for extensions (#9399 ) * Add common optional dependencies for extensions Include hadoop-aws and postgres JDBC connector jar to improve out-of-the-box experience for extensions. The mysql JDBC connector jar is not bundled as it is GPL. * Update docs * Fix typo	2020-02-25 00:04:00 -08:00
Jihoon Son	3bc7ae782c	Create splits of multiple files for parallel indexing (#9360 ) * Create splits of multiple files for parallel indexing * fix wrong import and npe in test * use the single file split in tests * rename * import order * Remove specific local input source * Update docs/ingestion/native-batch.md Co-Authored-By: sthetland <steve.hetland@imply.io> * Update docs/ingestion/native-batch.md Co-Authored-By: sthetland <steve.hetland@imply.io> * doc and error msg * fix build * fix a test and address comments Co-authored-by: sthetland <steve.hetland@imply.io>	2020-02-24 17:34:39 -08:00
Fokko Driesprong	806dfe6de6	Bump Apache Avro to 1.9.2 (#9381 ) * Bump Apache Avro 1.9.2 Bugfixes that where discovered in other projects * Update missing license	2020-02-24 10:04:22 +01:00
Clint Wylie	6d8dd5ec10	string -> expression -> string -> expression (#9367 ) * add Expr.stringify which produces parseable expression strings, parser support for null values in arrays, and parser support for empty numeric arrays * oops, macros are expressions too * style * spotbugs * qualified type arrays * review stuffs * simplify grammar * more permissive array parsing * reuse expr joiner * fix it	2020-02-21 15:43:02 -08:00
Srinivas Reddy	05258dca37	Improved the readability and fixed few java warnings (#9163 ) * Improved the readability and fixed few java warnings * Fix the checkstyle Co-authored-by: Gian Merlino <gianmerlino@gmail.com>	2020-02-22 07:30:11 +09:00
zachjsh	f707064bed	Add Azure config options for segment prefix and max listing length (#9356 ) * Add Azure config options for segment prefix and max listing length Added configuration options to allow the user to specify the prefix within the segment container to store the segment files. Also added a configuration option to allow the user to specify the maximum number of input files to stream for each iteration. * * Fix test failures * * Address review comments * * add dependency explicitly to pom * * update docs * * Address review comments * * Address review comments	2020-02-21 14:12:03 -08:00
Jihoon Son	141d8dd875	Enable druid.coordinator.kill.pendingSegments.on by default (#9385 ) * Enable druid.coordinator.kill.pendingSegments.on by default * checkstyle	2020-02-21 13:13:49 -08:00
Chi Cao Minh	3f848e6a7c	Suppress CVE-2020-8840 for htrace-core-4.0.1 (#9379 ) CVE-2020-8840 was updated on 19 Feb 2020, which now gets flagged by the security vulnerability scan. Since the CVE is for jackson-databind, via htrace-core-4.0.1, it can be added to the existing list of security vulnerability suppressions for that dependency.	2020-02-21 11:05:00 -08:00
Björn Zettergren	30c24df4d3	Add config option for namespacePrefix (#9372 ) * Add config option for namespacePrefix opentsdb emitter sends metric names to opentsdb verbatim as what druid names them, for example "query.count", this doesn't fit well with a central opentsdb server which might have namespaced metrics, for example "druid.query.count". This adds support for adding an optional prefix. The prefix also gets a trailing dot (.), after it, so the metric name becomes <namespacePrefix>.<metricname> configureable as "druid.emitter.opentsdb.namespacePrefix", as documented. Co-authored-by: Martin Gerholm <martin.gerholm@deltaprojects.com> Signed-off-by: Martin Gerholm <martin.gerholm@deltaprojects.com> Signed-off-by: Björn Zettergren <bjorn.zettergren@deltaprojects.com> * Spelling for PR #9372 Added "namespacePrefix" to .spelling exceptions, it's a variable name used in documentation for opentsdb-emitter. * fixing tests for PR #9372 changed naming of variables to be more descriptive added test of prefix being an empty string: "". added a conditional to buildNamespacePrefix to check for empty string being fed if EventConverter called without OpentsdbEmitterConfig instance. * fixing checkstyle errors for PR #9372 used == to compare literal string, should be equals() * cleaned up and updated PR #9372 Created a buildMetric function as suggested by clintropolis, and removed redundant tests for empty strings as they're only used when calling EventConverter directly without going through OpentsdbEmitterConfig. * consistent naming of tests PR #9372 Changed names of tests in files to match better with what it was actually testing changed check for Strings.isNullOrEmpty to just check for `null`, as empty string valued `namespacePrefix` is handled in OpentsdbEmitterConfig. Co-authored-by: Martin Gerholm <inspector-martin@users.noreply.github.com>	2020-02-20 14:01:41 -08:00
Jonathan Wei	cab08f941d	Fix join filter push down post-join virtual column handling (#9373 ) * Fix join filter push down post-join virtual column handling * Remove unused adapter param, update javadocs * Fix TC * Update processing/src/main/java/org/apache/druid/segment/join/filter/JoinFilterAnalyzer.java Co-Authored-By: Suneet Saldanha <44787917+suneet-s@users.noreply.github.com> * Address PR comments Co-authored-by: Suneet Saldanha <44787917+suneet-s@users.noreply.github.com>	2020-02-19 15:51:05 -08:00
Chi Cao Minh	26eeba636a	Make java version check work on all shells (#9376 ) * Make java version check work on all shells Previously, "perl verify-java" would fail on shells like zsh, which would cause the quickstart scripts (e.g., bin/start-micro-quickstart) to fail unless the DRUID_SKIP_JAVA_SKIP environment variable is set. * Support dash (ubuntu)	2020-02-19 13:44:00 -08:00
Clint Wylie	b408a6d774	sql support for dynamic parameters (#6974 ) * sql support for dynamic parameters * fixup * javadocs * fixup from merge * formatting * fixes * fix it * doc fix * remove druid fallback self-join parameterized test * unused imports * ignore test for now * fix imports * fixup * fix merge * merge fixup * fix test that cannot vectorize * fixup and more better * dependency thingo * fix docs * tweaks * fix docs * spelling * unused imports after merge * review stuffs * add comment * add ignore text * review stuffs	2020-02-19 13:09:20 -08:00
Chi Cao Minh	e7eb45e648	Run IntelliJ inspections on Travis (#9179 ) * Run IntelliJ inspections on Travis Running IntelliJ inspections currently takes about 90 minutes, but they can be run in about 30 minutes on Travis. * Restore assert statements	2020-02-19 11:34:19 +03:00
Jonathan Wei	73a0181e34	Fix handling for columns that appear multiple times in join conditions (#9362 ) * Fix handling for columns that appear multiple times in join conditions * Remove unneeded comment * Fix test	2020-02-17 10:54:04 -08:00
Lucas Capistrant	5befd40638	Issue 4909 popped up again. I applied PR 5451 liberally to all new Calcite test classes introduced in PR 9279 to fix (#9324 )	2020-02-16 22:29:43 -08:00
Jihoon Son	3bb9e7e53a	Inject things instead of subclassing everything for parallel task testing (#9353 ) * Inject things instead of subclassing everything for parallel task testing * javadoc * fix compilation * fix wrong merge * Address comments	2020-02-16 13:00:12 -08:00
Clint Wylie	b1be88d79c	fix Expressions.toQueryGranularity to be more correct, improve javadocs of Expr.getIdentifierIfIdentifier and Expr.getBindingIfIdentifier (#9363 )	2020-02-16 08:36:40 -08:00
sthetland	6d52edddab	Remove references to Docker Machine (#9366 ) * Remove references to Docker Machine Removing a broken link to an obsolete repo. While at it, removing references to Docker Machine, which was obsolete as of Docker v1.12 (avail. 2016). This version introduced Docker as native MacOS and Windows apps. * Update README.md Wording nit.	2020-02-15 03:08:43 -08:00
Atul Mohan	043abd5529	Fix compatibility issues with SqlFirehose (#9365 ) * Make SqlFirehose compatible with FiniteFirehose * Fix build	2020-02-14 17:45:12 -08:00
Clint Wylie	2e54755a03	add docker tutorial, friendlier docker-compose.yml, experimental java 11 dockerfile (#9262 ) * add docker tutorial, experimental java 11 dockerfile * fix typo * spelling * doc adjustments	2020-02-13 21:24:45 -08:00
Chi Cao Minh	4221573317	Update web-console dependencies (#9361 ) Fixes security vulnerabilities as flagged by the "security vulnerability" travis cron job.	2020-02-13 21:02:34 -08:00
Jihoon Son	bcf8f91e46	Add unit tests for CoordinatorRuleManager (#9318 )	2020-02-13 19:29:57 -08:00
Adam Peck	e9aebd994a	Fix for building in Eclipse & VS Code. (#7481 ) Fixes #6866 Reverse dependencies from /main/ to /test/ Add generated-test-sources to source path for Eclipse	2020-02-13 14:58:32 -08:00
Jonathan Wei	48a0681f7e	Fix basic auth polling to skip retries when cachedSerializedGroupMappingMap returns 404 (#9354 )	2020-02-12 16:52:03 -08:00
Maytas Monsereenusorn	31528bcdaf	Integration tests for JDK 11 (#9249 ) * Integration tests for JDK 11 * fix vm option * fix superviosrd * fix pom * add integration tests for java 11 * add logs * update docs * Update dockerfile to ack AdoptOpenJdk for Java 11 install commands	2020-02-12 16:36:31 -08:00
Suneet Saldanha	b1f38131af	Fix timestamp extract fn to match postgreSQL (#9337 ) * Fix timestamp extract fn to match postgres Update the timestamp extract function so that it matches the PostgreSQL docs. Examples from the PostgreSQL docs were added as tests for DECADE, CENTURY and MILLENIUM extraction. There were bugs in CENTURY and MILLENIUM that were spotted because of intelliJ inspections - 'Integer division in floating point context' * Update CalciteQueryTest * remove useless round * mark integer division as an error	2020-02-12 15:39:19 -08:00

1 2 3 4 5 ...

10161 Commits All Branches Search

10161 Commits

All Branches