druid/docs/querying
Pranav 20de7fd95a
Geo spatial interfaces (#16029)
This PR creates an interface for ImmutableRTree and moved the existing implementation to new class which represent 32 bit implementation (stores coordinate as floats). This PR makes the ImmutableRTree extendable to create higher precision implementation as well (64 bit).
In all spatial bound filters, we accept float as input which might not be accurate in the case of high precision implementation of ImmutableRTree. This PR changed the bound filters to accepts the query bounds as double instead of float and it is backward compatible change as it compares double to existing float values in RTree. Previously it was comparing input float to RTree floats which can cause precision loss, now it is little better as it compares double to float which is still not 100% accurate.
There are no changes in the way that we query spatial dimension today except input bound parsing. There is little improvement in string filter predicate which now parse double strings instead of float and compares double to double which is 100% accurate but string predicate is only called when we dont have spatial index.
With allowing the interface to extend ImmutableRTree, we allow to create high precision (HP) implementation and defines new search strategies to perform HP search Iterable<ImmutableBitmap> search(ImmutableDoubleNode node, Bound bound);
With possible HP implementations, Radius bound filter can not really focus on accuracy, it is calculating Euclidean distance in comparing. As EARTH 🌍 is round and not flat, Euclidean distances are not accurate in geo system. This PR adds new param called 'radiusUnit' which allows you to specify units like meters, km, miles etc. It uses https://en.wikipedia.org/wiki/Haversine_formula to check if given geo point falls inside circle or not. Added a test that generates set of points inside and outside in RadiusBoundTest.
2024-04-01 14:58:03 +05:30
..
aggregations.md Update the docs for EARLIEST_BY/LATEST_BY aggregators with the newly added numeric capabilities (#15670) 2024-02-01 10:24:43 +05:30
arrays.md MSQ: Validate that strings and string arrays are not mixed. (#15920) 2024-03-13 15:37:27 -07:00
caching.md remove group-by v1 (#14866) 2023-08-23 12:44:06 -07:00
datasource.md Introduce dynamic table append (#15897) 2024-03-01 04:31:57 -05:00
datasourcemetadataquery.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
dimensionspecs.md Reverse, pull up lookups in the SQL planner. (#15626) 2024-01-12 00:06:31 -08:00
filters.md document arrayContainsElement filter (#15455) 2023-12-07 00:14:00 -08:00
geo.md Geo spatial interfaces (#16029) 2024-04-01 14:58:03 +05:30
granularities.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
groupbyquery.md Fix dictionarySize overrides in tests (#15354) 2023-11-28 18:49:09 +05:30
having.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
hll-old.md De-incubation cleanup in code, docs, packaging (#9108) 2020-01-03 12:33:19 -05:00
joins.md Sort-merge join and hash shuffles for MSQ. (#13506) 2023-03-08 14:19:39 -08:00
limitspec.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
lookups.md Add sqlReverseLookupThreshold for ReverseLookupRule. (#15832) 2024-02-06 16:32:05 +05:30
math-expr.md start process of deprecating non-sql compatible legacy configurations (#15713) 2024-02-13 15:31:45 +05:30
multi-value-dimensions.md MSQ: Validate that strings and string arrays are not mixed. (#15920) 2024-03-13 15:37:27 -07:00
multitenancy.md Docs: Fix some typos. (#14663) 2023-07-26 21:24:18 +05:30
nested-columns.md [Docs] Refactor streaming ingestion section (#15591) 2024-02-12 13:52:42 -08:00
post-aggregations.md explicit outputType for ExpressionPostAggregator, better documentation for the differences between arrays and mvds (#15245) 2023-11-02 00:31:37 -07:00
query-context.md Reverse, pull up lookups in the SQL planner. (#15626) 2024-01-12 00:06:31 -08:00
query-execution.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
query-from-deep-storage.md Query from deep storage doc fixes. (#15382) 2023-11-16 14:05:20 +05:30
query-processing.md Revamp design page (#15486) 2023-12-08 11:40:24 -08:00
querying.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
scan-query.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
searchquery.md remove search auto strategy, estimateSelectivity of BitmapColumnIndex (#15550) 2023-12-13 16:30:01 -08:00
segmentmetadataquery.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
select-query.md Add "offset" parameter to the Scan query. (#10233) 2020-08-13 14:56:24 -07:00
sorting-orders.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
sql-aggregations.md start process of deprecating non-sql compatible legacy configurations (#15713) 2024-02-13 15:31:45 +05:30
sql-array-functions.md start process of deprecating non-sql compatible legacy configurations (#15713) 2024-02-13 15:31:45 +05:30
sql-data-types.md start process of deprecating non-sql compatible legacy configurations (#15713) 2024-02-13 15:31:45 +05:30
sql-functions.md update window functions doc (#15902) 2024-03-07 15:16:52 -08:00
sql-json-functions.md minor doc adjustments (#15531) 2023-12-11 18:22:44 -08:00
sql-metadata-tables.md [Docs] Refactor streaming ingestion section (#15591) 2024-02-12 13:52:42 -08:00
sql-multivalue-string-functions.md start process of deprecating non-sql compatible legacy configurations (#15713) 2024-02-13 15:31:45 +05:30
sql-operators.md Document pivot and unpivot operators (#15669) 2024-01-25 09:53:39 -08:00
sql-query-context.md Add sqlReverseLookupThreshold for ReverseLookupRule. (#15832) 2024-02-06 16:32:05 +05:30
sql-scalar.md LOOKUP docs: clarify behavior of replaceMissingValueWith. (#15879) 2024-02-11 13:11:00 -08:00
sql-translation.md update window functions doc (#15902) 2024-03-07 15:16:52 -08:00
sql-window-functions.md update window functions doc (#15902) 2024-03-07 15:16:52 -08:00
sql.md Introduce dynamic table append (#15897) 2024-03-01 04:31:57 -05:00
timeboundaryquery.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
timeseriesquery.md update timeseries to reflect NULL filling (#15512) 2023-12-07 14:41:27 -08:00
tips-good-queries.md remove references to Jupyter notebooks within the Druid repo (#15143) 2023-11-01 13:17:06 -07:00
topnmetricspec.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
topnquery.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00
troubleshooting.md remove group-by v1 (#14866) 2023-08-23 12:44:06 -07:00
using-caching.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
virtual-columns.md Docusaurus2 upgrade for master (#14411) 2023-08-16 19:01:21 -07:00