druid

Commit Graph

Author	SHA1	Message	Date
Clint Wylie	4d3987c1dd	lifecycle stage refactor to ensure proper start and stop ordering of servers and announcements (#7234 ) * lifecycle stage refactor to ensure proper ordering of servers and announcements * move DerivativeDataSourceManager to Lifecycle.Stage.NORMAL	2019-03-12 07:09:03 -07:00
Jihoon Son	e240fba247	Fix logs in SegmentLoaderLocalCacheManager (#7229 )	2019-03-11 21:16:03 -07:00
Justin Borromeo	0df8ee3f79	Fix ITRealtimeIndexTaskTest Flakiness (#7232 ) * Testing sleep change before posting events * Added comment	2019-03-11 20:25:24 -07:00
Jonathan Wei	5e7cbe39fa	Exclude non-source files from src assembly (#7235 ) * Exclude node_modules from src assembly * Remove git.version exclusion * Include binary LICENSE/NOTICE in source assembly	2019-03-11 19:23:10 -07:00
Ferris Tseng	c503ba9779	Write null byte when indexing numeric dimensions with Hadoop (#7020 ) * write null byte in hadoop indexing for numeric dimensions * Add test case to check output serializing null numeric dimensions * Remove extra line * Add @Nullable annotations	2019-03-11 21:02:03 -04:00
Gian Merlino	9178793ab5	Further improve caching documentation. (#7236 ) Follow-up to #7223 that fixes a doc bug (a result-level cache property was misspelled), changes the recommended "small cluster" threshold from 20 to 5 servers, and clarifies behavior of the various caching options.	2019-03-11 17:57:00 -07:00
Pierre-Emile Ferron	a88fbcd5db	Improve caching doc (#7223 ) - Set correct default values for query context result cache parameters - Add details about broker cache impact on local historical merging	2019-03-11 20:06:28 -04:00
Gian Merlino	dcfca03718	More accurate RealtimeMetricsMonitor messages. (#7230 ) The old messages did not reflect the full range of reasons why messages could be thrown away.	2019-03-11 19:50:32 -04:00
Venkatraman P	3118160387	Adding a tutorial in doc for using Kerberized Hadoop as deep storage. (#6863 ) * Adding a tutorial in doc for using Kerberized Hadoop as deep storage. * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md Fixed - to ~ in Apache License section. * Update tutorial-kerberos-hadoop.md * Update tutorial-kerberos-hadoop.md	2019-03-11 11:39:15 -07:00
Clint Wylie	d7ba19d477	sql, filters, and virtual columns (#6902 ) * refactor sql planning to re-use expression virtual columns when possible when constructing a DruidQuery, allowing virtual columns to be defined in filter expressions, and making resulting native druid queries more concise. also minor refactor of built-in sql aggregators to maximize code re-use * fix it * fix it in the right place * fixup for base64 stuff * fixup tests * fix merge conflict on import order * fixup * fix imports * fix tests * review comments * refactor * re-arrange * better javadoc * fixup merge * fixup tests * fix accidental changes	2019-03-11 11:37:58 -07:00
Jonathan Wei	e1d8c17746	Add commit ID milestone helper script (#7100 ) * Add commit ID milestone helper script * Filter on merged/closed in API call	2019-03-11 11:36:07 -07:00
Gian Merlino	4290e5ae7a	Cache selectors in QueryableIndexColumnSelectorFactory. (#7216 ) For selectors with internal caches (like SingleScanTimeDimensionSelector, SingleLongInputCachingExpressionColumnValueSelector, etc) we can get a perf boost and memory usage decrease by sharing selectors.	2019-03-11 11:33:01 -07:00
Samarth Jain	8804bd0dc1	Remove unnecessary check for contains() in LoadRule (#7073 ) See https://github.com/apache/incubator-druid/issues/7072	2019-03-11 13:52:46 -03:00
Jonathan Wei	94463b5778	Add missing redirects and fix broken links (#7213 ) * Add missing redirects * Fix zookeeper redirect * Fix broken links	2019-03-09 15:16:23 -08:00
Clint Wylie	5cc171419c	move jetty module to Lifecycle.Stage.LAST to allow graceful shutdown to work with lookups and stuff, put http-clint on lifecycle modules lifecycle (#7215 )	2019-03-09 15:14:09 -08:00
Jihoon Son	9bebf113ba	Fix race in historical when loading segments in parallel (#7203 ) * Fix race in historical when loading segments in parallel * revert unnecessary change * remove synchronized * add reference counting locking * fix build * fix comment	2019-03-08 17:54:05 -08:00
jorbay-au	62f0de9b89	Remove outdated instruction for rule updates (#7205 )	2019-03-08 16:42:08 -08:00
Surekha	6991735f73	Fix and add sys IT tests to travis script (#7208 ) * Add sys IT tests to travis script * minor fixes * Modify the test queries * modify query	2019-03-08 16:40:59 -08:00
Clint Wylie	a44df6522c	rename maintenance mode to decommission (#7154 ) * rename maintenance mode to decommission * review changes * missed one * fix straggler, add doc about decommissioning stalling if no active servers * fix missed typo, docs * refine docs * doc changes, replace generals * add explicit comment to mention suppressed stats for balanceTier * rename decommissioningVelocity to decommissioningMaxSegmentsToMovePercent and update docs * fix precondition check * decommissioningMaxPercentOfMaxSegmentsToMove * fix test * fix test * fixes	2019-03-08 16:33:51 -08:00
David Glasser	de55905a5f	integration-tests: make ITParallelIndexTest still work in parallel (#7211 ) * integration-tests: make ITParallelIndexTest still work in parallel Follow-up to #7181, which made the default behavior for index_parallel tasks non-parallel. * Validate that parallel index subtasks were run	2019-03-08 16:17:52 -08:00
Charles Allen	3ed250787d	Densify swapped hll buffer (#6865 ) * Densify swapped hll buffer * Make test loop limit pre-increment * Reformat * Fix test comments	2019-03-06 14:50:04 -08:00
Jihoon Son	e48a9c138e	Reduce default max # of subTasks to 1 for native parallel task (#7181 ) * Reduce # of max subTasks to 2 * fix typo and add more doc * add more doc and link * change default and add warning * fix doc * add test * fix it test	2019-03-05 22:06:36 -08:00
Jonathan Wei	9183e32876	Add more approximate algorithm docs (#7195 )	2019-03-05 16:44:02 -08:00
Roman Leventov	37cbad79b1	Adjust issue templates (#7188 ) * Adjust issue templates * typo * bug -> problem	2019-03-05 16:06:40 -08:00
Xue Yu	65118277a3	support sin cos etc trigonometric function in sql (#7182 ) * support triangle function in sql * feedback address	2019-03-04 19:18:22 -08:00
Jonathan Wei	5486c2abf8	Update LICENSE and NOTICE files (#7026 ) * Update LICENSE and NOTICE files * Update react-table version	2019-03-04 18:45:22 -08:00
Clint Wylie	3398d3982f	fix intellij UnusedInspectionsScope.xml (#7158 )	2019-03-04 14:56:41 -08:00
Roman Leventov	10c9f6d708	Fix and document concurrency of EventReceiverFirehose and TimedShutoffFirehose; Refine concurrency specification of Firehose (#7038 ) #### `EventReceiverFirehoseFactory` Fixed several concurrency bugs in `EventReceiverFirehoseFactory`: - Race condition over putting an entry into `producerSequences` in `checkProducerSequence()`. - `Stopwatch` used to measure time across threads, but it's a non-thread-safe class. - Use `System.nanoTime()` instead of `System.currentTimeMillis()` because the latter are [not suitable](https://stackoverflow.com/a/351571/648955) for measuring time intervals. - `close()` was not synchronized by could be called from multiple threads concurrently. Removed unnecessary `readLock` (protecting `hasMore()` and `nextRow()` which are always called from a single thread). Removed unnecessary `volatile` modifiers. Documented threading model and concurrent control flow of `EventReceiverFirehose` instances. Important: please read the updated Javadoc for `EventReceiverFirehose.addAll()`. It allows events from different requests (batches) to be interleaved in the buffer. Is this OK? #### `TimedShutoffFirehoseFactory` - Fixed a race condition that was possible because `close()` that was not properly synchronized. Documented threading model and concurrent control flow of `TimedShutoffFirehose` instances. #### `Firehose` Refined concurrency contract of `Firehose` based on `EventReceiverFirehose` implementation. Importantly, now it states that `close()` doesn't affect `hasMore()` and `nextRow()` and could be called concurrently with them. In other words, specified that `close()` is for "row supply" side rather than "row consume" side. However, I didn't check that other `Firehose` implementatations adhere to this contract. <hr> This issue is the result of reviewing `EventReceiverFirehose` and `TimedShutoffFirehose` using [this checklist](https://medium.com/@leventov/code-review-checklist-java-concurrency-49398c326154).	2019-03-04 18:50:03 -03:00
David Glasser	7bf1ee4dc0	ITIndexerTest: validate new data source after reindex (#7171 ) Previously, the test validated that the data source that we ingested from still had the same query responses that it did before the second ingestion. This is less useful than validating queries against the newly created data source. The new queries file differs from the old one in that its maxTime is earlier due to the interval selected by the reindex, and in that it does not query for the dropped metric "count".	2019-03-04 11:05:40 -08:00
Clint Wylie	050728b115	add license checker to web-console (#7028 ) * add license checker to web-console to ensure npm dependencies are apache license compatible * add generate licenses file * update check to remove excludes due to blueprintjs downgrade	2019-03-02 12:22:54 -08:00
Jihoon Son	ded03d9d4c	Improve doc for auto compaction (#7117 ) * Improve doc for auto compaction * fix doc * address comments	2019-03-02 12:21:50 -08:00
Gian Merlino	fa218f5160	Fix two SeekableStream serde issues. (#7176 ) * Fix two SeekableStream serde issues. 1) Fix backwards-compatibility serde for SeekableStreamPartitions. It is needed for split 0.13 / 0.14 clusters to work properly during a rolling update. 2) Abstract classes don't need JsonCreator constructors; remove them. * Comment fixes.	2019-03-01 22:27:08 -08:00
Jihoon Son	06c8229c08	Kill all running tasks when the supervisor task is killed (#7041 ) * Kill all running tasks when the supervisor task is killed * add some docs and simplify * address comment	2019-03-01 11:28:03 -08:00
Jihoon Son	45f12de9ad	Fix supported file formats for Hadoop vs Native batch doc (#7069 ) * Fix supported file formats * address comment	2019-02-28 19:44:45 -08:00
Jonathan Wei	32c418fdd8	Reword 'node' to 'process' (#7172 )	2019-02-28 18:10:39 -08:00
Vadim Ogievetsky	66e8d35ddf	downgrade react-table (#7170 )	2019-02-28 18:09:38 -08:00
Jihoon Son	9a62157a06	Make MapBasedRow immutable (#7130 ) * Make MapBasedRow immutable * add null check	2019-02-28 16:07:14 -08:00
Jonathan Wei	a0afd7931d	Add web consoles doc page (#7123 ) * Add web consoles doc page * PR comments * Remove 'unified' * PR comments * Fix TOC * PR comments * More revisions * GUI -> UI * Update router docs * Reword router doc	2019-02-28 14:02:39 -08:00
Furkan KAMACI	e432965c13	* Overview and Login precedure are added. (#7135 ) * Typos are fixed.	2019-02-27 20:51:59 -08:00
Justin Borromeo	c7082ba36e	Added friendlier dsql error message for 405 (which occurs when druid.sql.enabled=false) (#7112 ) * Added friendlier error message for dsql 405 * no extra char * Changed error message * fixed weird spacing	2019-02-27 20:40:30 -08:00
Jonathan Wei	0b4f771062	Exclude hadoop-lzo from thrift-extensions build (#7151 )	2019-02-27 19:57:53 -08:00
Jonathan Wei	3d247498ef	Update tutorials for 0.14.0-incubating (#7157 )	2019-02-27 19:50:31 -08:00
Jihoon Son	cacdc83cad	Improve error message for integer overflow in compaction task (#7131 ) * improve error message for integer overflow in compaction task * fix build	2019-02-28 11:07:37 +08:00
Jihoon Son	6b232d8195	Improve compaction tutorial to demonstrate compaction with keepSegmentGranularity = true (#7079 ) * Improve compaction tutorial to demonstrate compaction with keepSegmentGranularity = true * typo * add a warning	2019-02-27 16:02:51 -08:00
Clint Wylie	9fa649b3bd	segment metadata fallback analysis if no bitmaps (#7116 ) * segment metadata fallback analysis if no bitmaps * remove accidental line * remove nonsense size estimation * less ternary * fix it * do the thing	2019-02-26 11:27:41 -08:00
Vadim Ogievetsky	b8f762037a	Downgrade blueprintjs version in the web console to one with a vanilla Apache 2.0 license (#7139 ) * revert bp * fix tests * move @types/hjson to dev dep * removed all the package upgrades	2019-02-25 20:54:56 -08:00
Mirko Jotic	f6a8e030cc	Select query failing if miliseconds used as time for indexing (#6937 ) * [#1332] Fix - select failing if milis used for idx. * Formating correction. * Address comment: throw original exception. * Using constant values in tests - Try converting to Integer and then multiply by 1000L to achieve milis. - If not successful try converting to Long or rethrow original exception. * DateTime#of has to support "2011-01-01T00:00:00" - in addition to seconds and milisecs, this method currently supports even a date string. * Handle only milisec timestamps and ISO8601 strings	2019-02-25 14:36:01 -08:00
Jihoon Son	9a066558a4	Fix exception when the scheme is missing in endpointUrl for S3 (#7129 ) * Fix exception when the scheme is missing in endpointUrl for S3 * add null check	2019-02-25 11:10:35 -08:00
Himanshu Pandey	8b803cbc22	Added checkstyle for "Methods starting with Capital Letters" (#7118 ) * Added checkstyle for "Methods starting with Capital Letters" and changed the method names violating this. * Un-abbreviate the method names in the calcite tests * Fixed checkstyle errors * Changed asserts position in the code	2019-02-23 20:10:31 -08:00
David Glasser	1c2753ab90	ParallelIndexSubTask: support ingestSegment in delegating factories (#7089 ) IndexTask had special-cased code to properly send a TaskToolbox to a IngestSegmentFirehoseFactory that's nested inside a CombiningFirehoseFactory, but ParallelIndexSubTask didn't. This change refactors IngestSegmentFirehoseFactory so that it doesn't need a TaskToolbox; it instead gets a CoordinatorClient and a SegmentLoaderFactory directly injected into it. This also refactors SegmentLoaderFactory so it doesn't depend on an injectable SegmentLoaderConfig, since its only method always replaces the preconfigured SegmentLoaderConfig anyway. This makes it possible to use SegmentLoaderFactory without setting druid.segmentCaches.locations to some dummy value. Another goal of this PR is to make it possible for IngestSegmentFirehoseFactory to list data segments outside of connect() --- specifically, to make it a FiniteFirehoseFactory which can query the coordinator in order to calculate its splits. See #7048. This also adds missing datasource name URL-encoding to an API used by CoordinatorBasedSegmentHandoffNotifier.	2019-02-23 17:02:56 -08:00


9093 Commits
