druid

Commit Graph

Author	SHA1	Message	Date
Himanshu	654cdc07f5	Document HTTP based segment management and Deprecate classes to remove in future (#4997 ) * document http segment management * deprecated classes that shouldn't be used any further	2017-11-01 12:59:27 -04:00
Gian Merlino	6c725a7e06	Fix havingSpec on complex aggregators. (#5024 ) * Fix havingSpec on complex aggregators. - Uses the technique from #4883 on DimFilterHavingSpec too. - Also uses Transformers from #4890, necessitating a move of that and other related classes from druid-server to druid-processing. They probably make more sense there anyway. - Adds a SQL query test. Fixes #4957. * Remove unused import.	2017-11-01 12:58:08 -04:00
Jihoon Son	e96daa2593	Fix SQLMetadataSegmentManager (#5001 )	2017-10-31 08:02:41 -07:00
Gian Merlino	0ce406bdf1	Introduce "transformSpec" at ingest-time. (#4890 ) * Introduce "transformSpec" at ingest-time. It accepts a "filter" (standard query filter object) and "transforms" (a list of objects with "name" and "expression"). These can be used to do filtering and single-row transforms without need for a separate data processing job. The "expression" fields use the same expression language as other expression-based feature. * Remove forbidden api. * Fix compile error. * Fix tests. * Some more changes. - Add nullable annotation to Firehose.nextRow. - Add tests for index task, realtime task, kafka task, hadoop mapper, and ingestSegment firehose. * Fix bad merge. * Adjust imports. * Adjust whitespace. * Make Transform into an interface. * Add missing annotation. * Switch logger. * Switch logger. * Adjust test. * Adjustment to handling for DatasourceIngestionSpec. * Fix test. * CR comments. * Remove unused method. * Add javadocs. * More javadocs, and always decorate. * Fix bug in TransformingStringInputRowParser. * Fix bad merge. * Fix ISFF tests. * Fix DORC test.	2017-10-30 17:38:52 -07:00
Jonathan Wei	3e0a6fc374	Filter unauthorized datasources in INFORMATION_SCHEMA queries (#4998 ) * Filter unauthorized datasources in INFORMATION_SCHEMA queries * PR comments	2017-10-26 12:36:47 -07:00
Roman Leventov	125a912067	Add ability to inject extra dimensions for service emitter (#4982 ) * Add ability to inject extra dimensions for service emitter * Docs	2017-10-26 23:57:01 +05:30
Andy Sloane	ee66db900e	Fix binary serialization in caching (#4993 ) * Fix binary serialization in caching The previous caching code just concatenated a list of objects into a byte array -- this is actually not valid because jackson-databind uses enumerated references to strings internally, and concatenating multiple binary serialized objects can throw off the references. This change uses a single JsonGenerator to serialize the object list rather than concatenating byte arrays. * remove unused imports	2017-10-23 12:10:24 -07:00
Roman Leventov	772ca783cd	Fix race in CachingCostBalancerStrategyFactory (#4989 ) * Fix race in CachingCostBalancerStrategyFactory * Remote timeout	2017-10-20 16:53:51 -07:00
Himanshu	ef4a8cb724	Optional segment load/drop management without zookeeper using http (#4966 ) * introducing CuratorLoadQueuePeon * HttpLoadQueuePeon based off of current code * Revert "Remove SegmentLoaderConfig.numLoadingThreads config (#4829)" This reverts commit `d8b3bfa63c`. * SegmentLoadDropHandler copy/pasted from ZkCoordinator * Revert "1-based counts in ZkCoordinator (#4917)" This reverts commit `e725ff4146`. * remove non-zk part from ZkCoordinator * remove zk part from SegmentLoadDropHandler * additional changes for segment load/drop management with http * address review comments * add some more logs * Execs class is moved	2017-10-19 12:41:23 -07:00
Roman Leventov	26b87c9f8e	Fix CachingCostBalancerStrategyFactory's constructor (#4974 ) * Fix CachingCostBalancerStrategyFactory's constructor * Fix CachingCostBalancerStrategyFactory not registered in Lifecycle	2017-10-18 16:21:54 -05:00
Gian Merlino	5fc6891404	Reduce code duplication between test ExprMacroTables. (#4979 )	2017-10-18 15:57:49 -05:00
Gian Merlino	4881bb273b	Only consider loaded replicants when computing replication status. (#4921 ) * Only consider loaded replicants when computing replication status. This affects the computation of segment/underReplicated/count and segment/unavailable/count, as well as the loadstatus?simple and loadstatus?full APIs. I'm not sure why they currently consider segments in the load queues, but it would make more sense to me if they only considered segments that are actually loaded. * Fix tests. * Fix imports.	2017-10-18 11:11:42 -07:00
Roman Leventov	dc7cb117a1	Refactor ColumnSelectorFactory; Rely on ColumnValueSelector's polymorphism (#4886 ) * Refactor ColumnSelectorFactory; Rely on ColumnValueSelector's polymorphism * Fix MapVirtualColumn.makeColumnValueSelector() * Minor fixes * Fix IndexGeneratorCombinerTest * DimensionSelector to return zeros when treated as numeric ColumnValueSelector * Fix IncrementalIndexTest * Fix IncrementalIndex.makeColumnSelectorFactory() * Optimize MapBasedRow.getMetric() * Fix VarianceAggregatorTest * Simplify IncrementalIndex.makeColumnSelectorFactory() * Address comments * More comments * Test	2017-10-13 21:44:17 -05:00
Jihoon Son	8d9902831e	Refactoring PrefetchableTextFilesFirehoseFactory (#4836 ) * Refactoring prefetchable firehose * Fix to read cache when prefetch is disabled * More tests * Cleanup codes * Add Fetcher * Fix test failure * Count file size * Fix test * rename generic parameter * address comments * address comments * reuse buffer * move Execs to java-util * use execs * Fix build	2017-10-13 21:39:28 -05:00
Jihoon Son	675c6c00dd	Add checkstyle and intellij rule to prohibit unnecessary qualifiers in interfaces (#4958 ) * add checkstyle and intellij rule * fix tc fail	2017-10-13 07:56:19 -07:00
Atul Mohan	c07678b143	Synchronization of lookups during startup of druid processes (#4758 ) * Changes for lookup synchronization * Refactor of Lookup classes * Minor refactors and doc update * Change coordinator instance to be retrieved by DruidLeaderClient * Wait before thread shutdown * Make disablelookups flag true by default * Update docs * Rename flag * Move executorservice shutdown to finally block * Update LookupConfig * Refactoring and doc changes * Remove lookup config constructor * Revert Lookupconfig constructor changes * Add tests to LookupConfig * Make executorservice local * Update LRM * Move ListeningScheduledExecutorService to ExecutorCompletionService * Move exception to outer block * Remove check to see future is done * Remove unnecessary assignment * Add logging	2017-10-12 21:22:24 -05:00
Jihoon Son	d95915f8d2	Implement get methods for PrefetchableFirehose (#4948 )	2017-10-12 16:14:45 +09:00
Jihoon Son	dfa9cdc982	Prioritized locking (#4550 ) * Implementation of prioritized locking * Fix build failure * Fix tc fail * Fix typos * Fix IndexTaskTest * Addressed comments * Fix test * Fix spacing * Fix build error * Fix build error * Add lock status * Cleanup suspicious method * Add nullables * add doInCriticalSection to TaskLockBox and revert return type of task actions * fix build * refactor CriticalAction * make replaceLock transactional * fix formatting * fix javadoc * fix build	2017-10-11 23:16:31 -07:00
Roman Leventov	7a9940d624	Add /readiness to HistoricalResource (#4916 ) * Add /loadStatusCode to HistoricalResource * Address comments * Fixes	2017-10-11 20:35:52 -07:00
Jihoon Son	56fb11ce0b	Lazy initialization for JavaScript functions (#4871 ) * Lazy initialization of JavaScript functions * Fix test failure * Fix thread-safety and postpone js conf check * Fix test fail * Fix test * Fix KafkaIndexTaskTest * Move config check	2017-10-10 21:52:42 -07:00
Roman Leventov	e725ff4146	1-based counts in ZkCoordinator (#4917 )	2017-10-10 13:00:51 -07:00
Kevin Conaway	1bc4b71a34	Reduce Chance of Duplicates in EventReceiverFireHose (#4903 ) * Add ability to optionally specify a sequence identifier to reduce the possibility of duplicate events entering the event receiver firehose * Add ability to optionally specify a sequence identifier to reduce the possibility of duplicate events entering the event receiver firehose * Add a hard coded limit to the maximum number of possible producer IDs to prevent a malicious (or uninformed) client from overflowing the heap	2017-10-10 11:17:17 -07:00
Parag Jain	7cc18226cd	add more tls configs to enable/disable specific cipher suites and protocols (#4902 ) * add more tls configs to enable/disable specific cipher suites and protocols * fix doc, allow empty list	2017-10-09 13:53:12 -07:00
Gian Merlino	797b54d283	DruidLeaderClient: Throw IOException on retryable errors. (#4913 ) * DruidLeaderClient: Throw IOException on retryable errors. Fixes #4911. * Adjustments.	2017-10-06 15:12:09 -05:00
Himanshu	0e856ee806	add configs to enable fast request failure on broker and historical (#4540 ) * add configs to enable fast request failure on broker * address review comments * fix styling error * fix style error * have enableRequestLimit config instead of having user specify max limit * add comment * fix style error * add UT for LimitRequestsFilter * address review comments * fix test * make LimitRequestsFilterTest more robust * fix JettyQosTest	2017-10-06 14:45:13 -05:00
praveev	4ff12e4394	Hadoop indexing: Fix NPE when intervals not provided (#4686 ) * Fix #4647 * NPE protect bucketInterval as well * Add test to verify timezone as well * Also handle case when intervals are already present * Fix checkstyle error * Use factory method instead for Datetime * Use Intervals factory method	2017-10-05 22:46:07 -07:00
Akash Dwivedi	2ee32399ff	granularity method in QueryMetrics. (#4570 ) * granularity method in QueryMetrics. PR to emit granularity dimension for timeseries, search, groupBy, select and topN queries. * QueryMetricsFactory classes for search and select queries. * Empty implementation for Granularity() method. * Review comment changes. * Remove unused import. * empty query() method. * checkstyle fix. * Import fix.	2017-10-04 09:42:52 -07:00
Jonathan Wei	07aa405a6f	Fix PreResponseAuthorizationCheckFilter HTTP error masking (#4900 ) * Fix PreResponseAuthorizationCheckFilter HTTP error masking * Add remote addr and host to missing auth check log message	2017-10-03 16:58:57 -05:00
Jonathan Wei	5e60ccade1	Add context map to AuthenticationResult (#4870 )	2017-10-02 17:08:14 -05:00
Jonathan Wei	9deab26d8b	Fix auth check in InventoryViewUtils (#4869 )	2017-10-02 11:38:45 -07:00
Niketh Sabbineni	3e9391433d	Coord resource throws NPE when segments are requested (#4759 )	2017-10-02 10:13:27 -07:00
Goh Wei Xiang	26fd2b3a8e	Priority on loading for primary replica (#4757 ) * Priority on loading for primary replica * Simplicity fixes * Fix on skipping drop for quick return. * change to debug logging for no replicants. * Fix on filter logic * swapping if-else * Fix on wrong "hasTier" logic * Refactoring of LoadRule * Rename createPredicate to createLoadQueueSizeLimitingPredicate * Rename getHolderList to getFilteredHolders * remove varargs * extract out currentReplicantsInTier * rename holders to holdersInTier * don't do temporary removal of tier. * rename primaryTier to tierToSkip * change LinkedList to ArrayList * Change MinMaxPriorityQueue in DruidCluster to TreeSet. * Adding some comments. * Modify log messages in light of predicates. * Add in-method comments * Don't create new Object2IntOpenHashMap for each run() call. * Cache result from strategy call in the primary assignment to be reused during the same run. * Spelling mistake * Cleaning up javadoc. * refactor out loading in progress check. * Removed redundant comment. * Removed forbidden API * Correct non-forbidden API. * Precision in variable type for NavigableSet. * Obsolete comment. * Clarity in method call and moving retrieval of ServerHolder into method call. * Comment on mutability of CoordinatoorStats. * Added auxiliary fixture for dropping.	2017-09-28 13:02:05 -07:00
Gian Merlino	a19f22b5bb	Add identity to query metrics, logs. (#4862 ) * Add identity to query metrics, logs. Also fix a bug where unauthorized requests would not emit any logs or metrics, and instead would log a "Tried to emit logs and metrics twice" warning. Also rename QueryResource's "getServer" to "cancelQuery", because that's what it does. * Do not emit identity by default.	2017-09-28 11:45:23 -07:00
Himanshu	f69c9280c4	remove ServerConfig from DruidNode as all information needs to be present in DruidNode serialized form (#4858 ) * remove ServerConfig from DruidNode as all information needs to be present in DruidNode serialized form * sanitize output of /druid/coordinator/v1/cluster endpoint	2017-09-28 10:40:59 -05:00
Goh Wei Xiang	2c30d5ba55	Add org.joda.time.DateTime.parse() to forbidden APIs (#4857 ) * Added org.joda.time.DateTime#(java.lang.String) to forbidden API. * Added org.joda.time.DateTime#(java.lang.String, org.joda.time.format.DateTimeFormatter) to forbidden API. * Add additional APIs that may create DateTime with default time zone * Add helper function that accepts formatter to parse String. * Add additional forbidden APIs * Replace existing usage of forbidden APIs * Use wrapper class to enforce Chronology on DateTimeFormatter. * Creates constant UtcFormatter for constant ISODateTimeFormat.	2017-09-27 17:46:44 -05:00
Gian Merlino	999c6d800e	Fix Router handling of SQL queries. (#4851 )	2017-09-27 10:58:24 -07:00
Roman Leventov	9c126e2aa9	Forbid MapMaker (#4845 ) * Forbid MapMaker * Shorter syntax * Forbid Maps.newConcurrentMap()	2017-09-27 06:49:47 -07:00
Charles Allen	a6470c1d03	Move caffeine out of extension and make it the default cache implementation. (#4810 ) * Move caffeine out of extension. * Remove `JsonTypeName` from the class itself * Fix bad docs * Fix distribution pom * Fix unused import * Make caffeine default * Address code comments * Add more description around the jre version in the readme * Add suggested comments	2017-09-22 10:46:55 -07:00
Jonathan Wei	09fcb75583	Add RequestLogEvent emitters config to graphite-emitter (#4678 ) * Add RequestLogEvent emitters config to graphite-emitter * eagerly compute emitter list * use lambdas * checkstyle	2017-09-22 06:14:32 -07:00
Roman Leventov	e267f3901b	Enforce Indentation with Checkstyle (#4799 )	2017-09-21 13:06:48 -07:00
Roman Leventov	d8b3bfa63c	Remove SegmentLoaderConfig.numLoadingThreads config (#4829 )	2017-09-20 21:27:43 -07:00
Charles Allen	47ebc48059	Use java 8 features in TierSelectorStrategy implementations (#4827 ) * Use java 8 features in TierSelectorStrategy implementations * Minor code cleanup * More java8 coolness * Code comments	2017-09-20 22:09:29 -05:00
Roman Leventov	88e9a80636	Rename ObjectValueSelector.get() to getObject(); Add getObject() and classOfObject() to ColumnValueSelector (#4801 )	2017-09-19 14:47:20 -05:00
Jonathan Wei	3a4a483bb0	Single auth check for authorized resource filtering (#4818 ) * Single auth check for authorized resource filtering * PR comment * PR comments	2017-09-19 21:46:08 +05:30
Jonathan Wei	c2a0e753b6	Extension points for authentication/authorization (#4271 ) * Extension points for authentication/authorization * Address some PR comments * Authorization result caching * Add unit tests for SecuritySanityCheckFilter and PreResponseAuthorizationCheckFilter * Use Set for auth caching, close outputstreams in filters * Don't close output stream on success in sanity check filter * Add ConfigResourceFilter to coordinator lookups * Fix filtering authorization check for empty resource list * HttpClient users must explicitly escalate the client * Remove response modification from PreResponseAuthorizationCheckFilter * Remove extraneous pom.xml * Fix unit test * Better lifecycle management * Rename AuthorizationManager to Authorizer * Fix authorization denials for empty supervisor list * Address some PR comments * Address more PR comments * Small cleanup * Add Jetty HttpClient wrapper to Authenticator * Remove Authorizer start/stop * Restore immutable context map in DruidConnection, UT fix * Fix/update docs * Add authorization checks to EventReceiverFirehose * Fix router authorization check failure, restore PreResponseAuthorizationFilter changes * Compile fixes * Test fixes * Update Authenticator/Authorizer doc comments * Merge fixes * PR comments * Fix test * Fix IT * More PR comments * PR comments * SSL fix	2017-09-15 23:45:48 -07:00
Himanshu	d37be5e6e9	don't hold thread while waiting after failure from server (#4795 )	2017-09-14 17:19:25 -05:00
Akash Dwivedi	a17e48fe69	search package name correction. (#4785 ) * search package name correction. * Refactor search.search pkg to search. * remove unused import.	2017-09-14 13:50:23 -07:00
Roman Leventov	267f415dc3	Update emitter library and add support for ParametrizedUriEmitter (#4722 ) * Move emitters from io.druid.server.initialization to the dedicated io.druid.server.emitter package; Update emitter library to 0.6.0; Add support for ParametrizedUriEmitter; Support hierarchical properties in JsonConfigurator (was needed for ParametrizedUriEmitter) * Log created RequestLoggers * Fix forbidden API * Test fix * More Http and Parametrized Http Emitter docs * Switch to debug level	2017-09-13 17:17:19 -05:00
Himanshu	7919469de6	fixes HttpServerInventoryView to call server/segment callbacks correctly and Unit Tests for the class (#4767 ) * fixes HttpServerInventoryView to call server/segment callbacks correctly and Unit Tests for the class * fix checkstyle and forbidden-api errors * HttpServerInventoryView to finish start() only after server inventory is initialized * fix compilation errors * address review comments * add exponential backoff instead of fixed 5 secs on successive failures * update test to exercise server fail scenarios * use AtomicInteger for requestNum and increment only once	2017-09-13 14:24:19 -05:00
Gian Merlino	2ce8123bdb	Move scan-query from a contrib extension into core. (#4751 ) * Move scan-query from a contrib extension into core. Based on a proposal at: https://groups.google.com/d/topic/druid-development/ME_OatUDnbk/discussion This patch also adds support for virtual columns to the Scan query, and updates Druid SQL to use Scan instead of Select. This patch also makes some behavioral changes to handling of the __time column. In particular, it is now returned as "__time" rather than "timestamp"; it is no longer included if you do not specifically ask for it in your "columns"; and it is returned as a long rather than a string. Users can revert time handling to the legacy extension behavior by setting "legacy" : true in their queries, or setting the property druid.query.scan.legacy = true. This is meant to provide a migration path for users that were formerly using the contrib extension. * Adjustments from review. * Add back Select query. * Adjust SQL docs. * Restore SelectQuery link.	2017-09-13 09:51:24 -07:00


2236 Commits