druid

Commit Graph

Author	SHA1	Message	Date
Vadim Ogievetsky	3b62d7929c	Web console: Data loader should allow for multiline JSON messages in kafka (#13709 ) * stricter * data loader should allow for mulit-line json * add await * kinesis also	2023-01-25 21:23:18 -08:00
Vadim Ogievetsky	4ee4d99b8d	better error reporting (#13636 )	2023-01-05 20:00:33 -08:00
Vadim Ogievetsky	fdc8aa2833	better show totals when grouping (#13631 )	2023-01-05 18:06:32 -08:00
Vadim Ogievetsky	8773d619a2	Web console: tidy up stage UI (#13615 ) * show the right info * sort indicator * nicer marker * move error icon	2022-12-22 12:52:16 -08:00
Vadim Ogievetsky	e23abc710a	Web console: default max workers to cluster capacity and simplify live reports (#13577 ) * step * better capacity * start with capacity * more compressed stats display * better rule editor * fix SQL data loader also * update snapshots * new line * better formatting	2022-12-16 15:13:32 -08:00
Vadim Ogievetsky	639decdf2e	fix preview droping out of MSQ mode (#13586 )	2022-12-16 15:13:07 -08:00
Rishabh Singh	4ebdfe226d	Druid automated quickstart (#13365 ) * Druid automated quickstart * remove conf/druid/single-server/quickstart/_common/historical/jvm.config * Minor changes in python script * Add lower bound memory for some services * Additional runtime properties for services * Update supervise script to accept command arguments, corresponding changes in druid-quickstart.py * File end newline * Limit the ability to start multiple instances of a service, documentation changes * simplify script arguments * restore changes in medium profile * run-druid refactor * compute and pass middle manager runtime properties to run-druid supervise script changes to process java opts array use argparse, leave free memory, logging * Remove extra quotes from mm task javaopts array * Update logic to compute minimum memory * simplify run-druid * remove debug options from run-druid * resolve the config_path provided * comment out service specific runtime properties which are computed in the code * simplify run-druid * clean up docs, naming changes * Throw ValueError exception on illegal state * update docs * rename args, compute_only -> compute, run_zk -> zk * update help documentation * update help documentation * move task memory computation into separate method * Add validation checks * remove print * Add validations * remove start-druid bash script, rename start-druid-main * Include tasks in lower bound memory calculation * Fix test * 256m instead of 256g * caffeine cache uses 5% of heap * ensure min task count is 2, task count is monotonic * update configs and documentation for runtime props in conf/druid/single-server/quickstart * Update docs * Specify memory argument for each profile in single-server.md * Update middleManager runtime.properties * Move quickstart configs to conf/druid/base, add bash launch script, support python2 * Update supervise script * rename base config directory to auto * rename python script, changes to pass repeated args to supervise * remove exmaples/conf/druid/base dir * add docs * restore changes in conf dir * update start-druid-auto * remove hashref for commands in supervise script * start-druid-main java_opts array is comma separated * update entry point script name in python script * Update help docs * documentation changes * docs changes * update docs * add support for running indexer * update supported services list * update help * Update python.md * remove dir * update .spelling * Remove dependency on psutil and pathlib * update docs * Update get_physical_memory method * Update help docs * update docs * update method to get physical memory on python * udpate spelling * update .spelling * minor change * Minor change * memory comptuation for indexer * update start-druid * Update python.md * Update single-server.md * Update python.md * run python3 --version to check if python is installed * Update supervise script * start-druid: echo message if python not found * update anchor text * minor change * Update condition in supervise script * JVM not jvm in docs	2022-12-09 11:04:02 -08:00
Vadim Ogievetsky	d8e27eaab4	update error anchors (#13527 )	2022-12-08 13:18:35 -08:00
Vadim Ogievetsky	d85fb8cc4e	Web console: improve compaction status display (#13523 ) * improve compaction status display * even more accurate * fix snapshot	2022-12-07 21:03:59 -08:00
Vadim Ogievetsky	9679f6a9b5	Web console: add arrayOfDoublesSketch and other small fixes (#13486 ) * add padding and keywords * add arrayOfDoubles * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * partiton int * fix docs Co-authored-by: Charles Smith <techdocsmith@gmail.com>	2022-12-06 21:21:49 -08:00
Vadim Ogievetsky	2fdcfffe40	don't render duration if aggregated (#13455 )	2022-11-30 19:21:07 -08:00
Vadim Ogievetsky	d8f4353c43	Web console: be more robust to aux queries failing and improve kill tasks (#13431 ) * be more robust to aux queries failing * feedback fixes * remove empty block * fix spelling * remove killAllDataSources from the console	2022-11-28 16:50:38 -08:00
Vadim Ogievetsky	a2d5e335f3	Web console: Index spec dialog (#13425 ) * add index spec dialog * add sanpshot	2022-11-28 11:40:45 -08:00
Vadim Ogievetsky	fe34ecc5e3	add ability to make inputFormat part of the example datasets (#13402 )	2022-11-21 12:50:44 -08:00
Kashif Faraz	133054bf27	Make batched segment sampling the default, minor cleanup of coordinator config (#13391 ) The batch segment sampling performs significantly better than the older method of sampling if there are a large number of used segments. It also avoids duplicates. Changes: - Make batch segment sampling the default - Deprecate the property `useBatchedSegmentSampler` - Remove unused coordinator config `druid.coordinator.loadqueuepeon.repeatDelay` - Cleanup `KillUnusedSegments` - Simplify `KillUnusedSegmentsTest`, add better tests, remove redundant tests	2022-11-21 20:31:46 +05:30
Vadim Ogievetsky	08fa0383b9	add supported indexSpec options (#13388 )	2022-11-19 00:00:32 -08:00
Vadim Ogievetsky	c628947c31	Web console: streaming json input format specifics (#13381 ) * streaming json input format specifics * goodies	2022-11-18 14:15:16 -08:00
Vadim Ogievetsky	a3d45f6086	treat user cancelation seriously (#13376 )	2022-11-18 14:04:16 -08:00
Kashif Faraz	71b133f3ff	Add `RoundRobinServerSelector` to speed up segment assignments (#13367 ) Segment assignments can take very long due to the strategy cost computation for a large number of segments. This commit allows segment assignments to be done in a round-robin fashion within a tier. Only segment balancing takes cost-based decisions to move segments around. Changes - Add dynamic config `useRoundRobinSegmentAssignment` with default value false - Add `RoundRobinServerSelector`. This does not implement the `BalancerStrategy` as it does not conform to that contract and may also be used in conjunction with a strategy (round-robin for `RunRules` and a cost strategy for `BalanceSegments`) - Drops are still cost-based even when round-robin assignment is enabled.	2022-11-16 20:05:17 +05:30
Vadim Ogievetsky	f6aca21e82	Web console: update DQT to version 0.17 (#13323 ) * update to DQT 17 * update licenses * after npm i	2022-11-07 17:47:11 -08:00
Vadim Ogievetsky	176934e849	Web console: expose in the UI that a query is stuck waiting for task slots (#13291 ) * add pending info * update tests * fixes after review	2022-11-01 09:40:55 -07:00
Margaret Brewster	e40c7f2a67	Update data loader parse screen help text (#13241 ) Co-authored-by: Margaret Brewster <margaretbrewster@Maggie-Brewster.local>	2022-10-29 11:51:37 +08:00
Vadim Ogievetsky	32020247d1	Web console: Update dqt to a version that adds quotes by default (#13243 ) * update dqt * auto quote by default	2022-10-28 01:06:45 -07:00
Vadim Ogievetsky	573e12c75f	Web console: making the cell filter menu more functional, removing the old query view, and updating d3 (#13169 ) * remove old query view * update tests * add filter * fix test * bump d3 things to latest versions * rent too far into the future with d3 * make config dialogs load * goodies * update snapshots * only compute duration when running or pending	2022-10-07 12:44:40 -07:00
Vadim Ogievetsky	ebfe1c0c90	Web console: fix DQT import (#13159 ) * fix dqt import * update licenses * update tests	2022-09-30 09:31:06 -07:00
Vadim Ogievetsky	a910764e41	better spec conversion with issues (#13136 )	2022-09-22 10:46:57 -07:00
Vadim Ogievetsky	6c1dc6589e	initialize all counters for stages with input (#13137 )	2022-09-22 08:10:50 -07:00
Vadim Ogievetsky	f1d3728371	append to exisitng callout (#13130 )	2022-09-21 19:39:28 -07:00
Vadim Ogievetsky	b9edfe34a4	be consistent about referring to the web console by its name (#13118 )	2022-09-19 15:02:17 -07:00
Vadim Ogievetsky	de8f229bed	Web console: correctly escape path based flatten specs (#13105 ) * fix path generation * do escape * fix replace * fix replace for good	2022-09-17 14:02:42 -07:00
Vadim Ogievetsky	c62a822121	support kafka lookups (#13098 )	2022-09-16 15:25:25 -07:00
Vadim Ogievetsky	078b50ebe1	link to error docs (#13094 )	2022-09-15 15:06:08 -07:00
Vadim Ogievetsky	08d6aca528	Web console: better detection for arrays containing objects (#13077 ) * better detection for arrays containing objects * include boolean also	2022-09-12 18:50:29 -07:00
Vadim Ogievetsky	5cc5f7b60c	quote columns, datasources in auto complete if needed (#13060 )	2022-09-09 11:22:40 -07:00
Kashif Faraz	7e20d70242	Fix web-console message in MSQ data loader (#12996 ) * Fix typo in web-console message * Prettify the changes	2022-09-07 13:34:10 -07:00
Vadim Ogievetsky	92789cfc4a	default to no compare (#13041 )	2022-09-07 08:28:28 -07:00
Vadim Ogievetsky	0ae515bd3c	Web console: don't crash if cookies are totally disabled (#13013 ) * fix local storage detection * fix numeric input dialog	2022-09-01 16:10:23 -07:00
Vadim Ogievetsky	054688528f	don't show transform actions on * queries (#13005 )	2022-08-30 21:54:18 -07:00
Vadim Ogievetsky	acb09ff18b	grab warnings from correct key + test (#12977 )	2022-08-25 18:47:33 -07:00
Lee Surprenant	4c61378ad1	fix broken link in web-console (#12976 ) In 0.23.0, the info-button links to https://druid.apache.org/docs/0.23.0/ingestion/flatten-json.html which is a 404 If I got the spot right, this should now link to https://druid.apache.org/docs/[VERSION]/ingestion/data-formats.html#flattenspec	2022-08-25 16:47:41 -07:00
Vadim Ogievetsky	fd6cfcb8fb	Web console: fix pagination, add error delimiters (#12969 ) * fix pagination, add error delimiters * reword debug message	2022-08-25 08:18:21 -07:00
Vadim Ogievetsky	04ee7abeff	Web console: Multi-stage query support (#12919 ) * MSQ web console * fix typo in comments * remove useless conditional * wrap SQL_DATA_TYPES * fixes sus regex * rewrite regex * remove problematic regex * fix UTs * convert PARTITIONED / CLUSTERED BY to ORDER BY for preview * fix log * updated to use shuffle * Web console: Use Ace.Completion directly (#1405) * Use Ace.Completion directly * Another Ace.Completion * better comment * fix column ordering in e2e test * add nested data example also Co-authored-by: John Gozde <john.gozde@imply.io>	2022-08-24 16:17:12 -07:00
Abhishek Agarwal	f0fc45028e	Update year in the notice file and the release process instructions (#12622 ) * Update notice file * Update release process instructions * Better release instructions * Update copyright year * Web console: updated the doc link (#12619) * updated the doc link * update snapshots Co-authored-by: Vadim Ogievetsky <vadim@ogievetsky.com>	2022-08-23 18:17:18 +05:30
AmatyaAvadhanula	d294404924	Kinesis ingestion with empty shards (#12792 ) Kinesis ingestion requires all shards to have at least 1 record at the required position in druid. Even if this is satisified initially, resharding the stream can lead to empty intermediate shards. A significant delay in writing to newly created shards was also problematic. Kinesis shard sequence numbers are big integers. Introduce two more custom sequence tokens UNREAD_TRIM_HORIZON and UNREAD_LATEST to indicate that a shard has not been read from and that it needs to be read from the start or the end respectively. These values can be used to avoid the need to read at least one record to obtain a sequence number for ingesting a newly discovered shard. If a record cannot be obtained immediately, use a marker to obtain the relevant shardIterator and use this shardIterator to obtain a valid sequence number. As long as a valid sequence number is not obtained, continue storing the token as the offset. These tokens (UNREAD_TRIM_HORIZON and UNREAD_LATEST) are logically ordered to be earlier than any valid sequence number. However, the ordering requires a few subtle changes to the existing mechanism for record sequence validation: The sequence availability check ensures that the current offset is before the earliest available sequence in the shard. However, current token being an UNREAD token indicates that any sequence number in the shard is valid (despite the ordering) Kinesis sequence numbers are inclusive i.e if current sequence == end sequence, there are more records left to read. However, the equality check is exclusive when dealing with UNREAD tokens.	2022-08-05 22:38:58 +05:30
Vadim Ogievetsky	f2a7970a6c	reindex flow should take order from Druid (#12790 )	2022-07-14 20:03:33 -07:00
Yuanli Han	50f1f5840d	show json and add search box (#12784 )	2022-07-14 17:01:30 -07:00
Yuanli Han	82315779ff	fix segment timeline bar chart (#12782 )	2022-07-14 16:58:24 -07:00
Vadim Ogievetsky	14e5b8325c	make tick formatting more robust (#12788 )	2022-07-14 16:56:53 -07:00
Vadim Ogievetsky	6f7fa334fd	Web console: totalNumMergeTasks can be set on range also (#12648 ) * totalNumMergeTasks can be set on range also * fix formatting	2022-06-14 11:18:17 -07:00
Vadim Ogievetsky	a235aca2b3	Web console: fix go to segments not working (#12541 ) * use correct filter syntax * fix tests	2022-05-19 14:34:03 -07:00

1 2 3 4 5 ...

382 Commits