From 99ee3f4dc38ca2df1f6b3859db732cb81190eca2 Mon Sep 17 00:00:00 2001 From: Gian Merlino Date: Tue, 7 Jun 2016 10:43:35 -0700 Subject: [PATCH] Fixups, clarifications to lookup docs. (#3060) --- .../extensions-core/lookups-cached-global.md | 21 ++++++++++--------- docs/content/querying/lookups.md | 10 +++++---- docs/content/toc.md | 1 + 3 files changed, 18 insertions(+), 14 deletions(-) diff --git a/docs/content/development/extensions-core/lookups-cached-global.md b/docs/content/development/extensions-core/lookups-cached-global.md index edd0d40f584..5b720ed79b7 100644 --- a/docs/content/development/extensions-core/lookups-cached-global.md +++ b/docs/content/development/extensions-core/lookups-cached-global.md @@ -2,7 +2,7 @@ layout: doc_page --- -# Namespaced Lookup +# Globally Cached Lookups
Lookups are an experimental feature. @@ -12,16 +12,17 @@ Make sure to [include](../../operations/including-extensions.html) `druid-lookup ## Configuration
-Static configuration is no longer supported. Only cluster wide configuration is supported +Static configuration is no longer supported. Lookups can be configured through +dynamic configuration.
-Cached namespace lookups are appropriate for lookups which are not possible to pass at query time due to their size, +Globally cached lookups are appropriate for lookups which are not possible to pass at query time due to their size, or are not desired to be passed at query time because the data is to reside in and be handled by the Druid servers, and are small enough to reasonably populate on a node. This usually means tens to tens of thousands of entries per lookup. -Cached namespace lookups all draw from the same cache pool, allowing each node to have a fixed cache pool that can be used by namespace lookups. +Globally cached lookups all draw from the same cache pool, allowing each node to have a fixed cache pool that can be used by cached lookups. -Cached namespace lookups can be specified as part of the [cluster wide config for lookups](../../querying/lookups.html) as a type of `cachedNamespace` +Globally cached lookups can be specified as part of the [cluster wide config for lookups](../../querying/lookups.html) as a type of `cachedNamespace` ```json { @@ -73,7 +74,7 @@ The parameters are as follows If `firstCacheTimeout` is set to a non-zero value, it should be less than `druid.manager.lookups.hostUpdateTimeout`. If `firstCacheTimeout` is NOT set, then management is essentially asynchronous and does not know if a lookup succeeded or failed in starting. In such a case logs from the lookup nodes should be monitored for repeated failures. -Proper functionality of Namespaced lookups requires the following extension to be loaded on the broker, peon, and historical nodes: +Proper functionality of globally cached lookups requires the following extension to be loaded on the broker, peon, and historical nodes: `druid-lookups-cached-global` ## Example configuration @@ -148,9 +149,9 @@ So if total `cachedNamespace` lookup size is in excess of 10MB, the extra will b For additional lookups, please see our [extensions list](../extensions.html). -## URI namespace update +## URI lookup -The remapping values for each namespaced lookup can be specified by a json object as per the following examples: +The remapping values for each globally cached lookup can be specified by a json object as per the following examples: ```json { @@ -296,7 +297,7 @@ The `simpleJson` lookupParseSpec does not take any parameters. It is simply a li } ``` -## JDBC namespaced lookup +## JDBC lookup The JDBC lookups will poll a database to populate its local cache. If the `tsColumn` is set it must be able to accept comparisons in the format `'2015-01-01 00:00:00'`. For example, the following must be valid sql for the table `SELECT * FROM some_lookup_table WHERE timestamp_column > '2015-01-01 00:00:00'`. If `tsColumn` is set, the caching service will attempt to only poll values that were written *after* the last sync. If `tsColumn` is not set, the entire table is pulled every time. @@ -330,4 +331,4 @@ The JDBC lookups will poll a database to populate its local cache. If the `tsCol # Introspection -Cached namespace lookups have introspection points at `/keys` and `/values` which return a complete set of the keys and values (respectively) in the lookup. Introspection to `/` returns the entire map. Introspection to `/version` returns the version indicator for the lookup. +Globally cached lookups have introspection points at `/keys` and `/values` which return a complete set of the keys and values (respectively) in the lookup. Introspection to `/` returns the entire map. Introspection to `/version` returns the version indicator for the lookup. diff --git a/docs/content/querying/lookups.md b/docs/content/querying/lookups.md index d7015514b1b..cd57d92e502 100644 --- a/docs/content/querying/lookups.md +++ b/docs/content/querying/lookups.md @@ -28,14 +28,16 @@ and such data belongs in the raw denormalized data for use in Druid. Very small lookups (count of keys on the order of a few dozen to a few hundred) can be passed at query time as a "map" lookup as per [dimension specs](../querying/dimensionspecs.html). -For static lookups defined in `runtime.properties` rather than embedded in the query, please look at the experimental [namespaced lookup extension](../development/extensions-core/namespaced-lookup.html). +Other lookup types are available as extensions, including: -For other additional lookups, please see our [extensions list](../development/extensions.html). +- Globally cached lookups from local files, remote URIs, or JDBC through [lookups-cached-global](../development/extensions-core/lookups-cached-global.html). +- Globally cached lookups from a Kafka topic through [kafka-extraction-namespace](../development/extensions-core/kafka-extraction-namespace.html). -Dynamic configuration +Dynamic Configuration ---------------------
-Dynamic lookup configuration is an experimental feature. +Dynamic lookup configuration is an experimental feature. Static +configuration is no longer supported.
The following documents the behavior of the cluster-wide config which is accessible through the coordinator. The configuration is propagated through the concept of "tier" of servers. diff --git a/docs/content/toc.md b/docs/content/toc.md index e8eb6c041c9..843418ffa99 100644 --- a/docs/content/toc.md +++ b/docs/content/toc.md @@ -42,6 +42,7 @@ * [Context](../querying/query-context.html) * [Multi-value dimensions](../querying/multi-value-dimensions.html) * [SQL](../querying/sql.html) + * [Lookups](../querying/lookups.html) * [Joins](../querying/joins.html) * [Multitenancy](../querying/multitenancy.html) * [Caching](../querying/caching.html)