druid相關的時間序列資料庫——也用到了倒排相關的優化技術

桃子紅了吶發表於2017-11-15

Cattell [6] maintains a great summary about existing Scalable SQL and NoSQL data stores. Hu [18] contributed another great summary for streaming databases. Druid feature-wise sits some-

where between Google’s Dremel [28] and PowerDrill [17]. Druid has most of the features implemented in Dremel (Dremel handles arbitrary nested data structures while Druid only allows for a single

level of array-based nesting) and many of the interesting compression algorithms mentioned in PowerDrill. Although Druid builds on many of the same principles as other distributed columnar data stores [15], many of these data stores are

designed to be more generic key-value stores [23] and do not sup

port computation directly in the storage layer. There are also other

data stores designed for some of the same data warehousing issues

that Druid is meant to solve. These systems include in-memory

databases such as SAP’s HANA [14] and VoltDB [43]. These data

stores lack Druid’slowlatency ingestion characteristics. Druidalso

has native analytical features baked in, similar to ParAccel [34],

however, Druid allows system wide rolling software updates with

no downtime.

Druid is similiar to C-Store [38] and LazyBase [8] in that it has

twosubsystems,aread-optimizedsubsysteminthehistoricalnodes

andawrite-optimizedsubsysteminreal-timenodes. Real-timenodes

are designed to ingest a high volume of append heavy data, and do

not support data updates. Unlike the two aforementioned systems,

Druid is meant for OLAP transactions and not OLTP transactions.

Druid’s low latency data ingestion features share some similar-

ities with Trident/Storm [27] and Spark Streaming [45], however,

both systems are focused on stream processing whereas Druid is

focused on ingestion and aggregation. Stream processors are great

complements to Druid as a means of pre-processing the data before

the data enters Druid.

There are a class of systems that specialize in queries on top of

cluster computing frameworks. Shark [13] is such a system for

queriesontopofSpark,andCloudera’sImpala[9]isanothersystem

focused on optimizing query performance on top of HDFS. Druid

historical nodes download data locally and only work with native

Druid indexes. We believe this setup allows for faster query laten

cies.

Druid leverages a unique combination of algorithms in its archi-

tecture. Although we believe no other data store has the same set

of functionality as Druid, some of Druid’s optimization techniques

suchas using inverted indices to perform fast filter sarealsousedin

other data stores [26].

druid白皮書：http://static.druid.io/docs/druid.pdf

本文轉自張昺華-sky部落格園部落格，原文連結：http://www.cnblogs.com/bonelee/p/6433333.html，如需轉載請自行聯絡原作者

資料庫效能優化-索引與sql相關優化
2018-08-01
資料庫優化索引SQL
時間相關的操作
2021-11-10
大資料相關技術有哪些？
2018-04-22
大資料
時間相關的工具類
2018-03-06
【OPTIMIZATION】Oracle影響優化器選擇的相關技術
2021-10-15
Oracle優化
Mysql的優化的相關知識
2018-09-18
MySql優化
資料庫（相關練習）
2018-11-01
資料庫
MSSQL系列（一）：資料庫的相關操作
2020-07-17
SQL資料庫
ios效能優化相關
2018-09-12
iOS優化
MySQL資料庫部署及初始化相關
2022-09-01
MySql資料庫
資料庫事物相關問題
2020-09-20
資料庫
python 時間相關模組
2019-02-16
Python
區塊鏈(BlockChain)技術開發相關資料
2018-05-19
區塊鏈Blockchain
有關動態規劃的相關優化思想
2022-04-29
動態規劃優化
倒排索引及ES相關概念對比MySQL
2024-10-16
索引MySql
Hive優化相關設定
2018-11-29
Hive優化
記憶體優化相關
2019-07-23
記憶體優化
強化學習相關資料
2024-11-03
強化學習
驗證碼的作用和相關技術
2018-04-09
樹的相關術語
2022-02-28
SCM通道模型和SCME通道模型的matlab特性模擬,對比空間相關性,時間相關性,頻率相關性
2024-09-14
模型Matlab
Java Record 的一些思考 - 序列化相關
2022-01-04
Java
SAP CRM One Order header資料庫表幾個和時間戳相關的欄位
2020-08-23
Header資料庫時間戳
『現學現忘』Docker相關概念 — 8、虛擬化技術和容器技術的關係
2022-03-09
Docker
技術乾貨 | 解鎖Redis 時間序列資料的應用
2022-04-14
Redis
【PG管理】postgresql資料庫管理相關
2019-01-09
SQL資料庫
資料庫相關知識點提要
2023-01-28
資料庫
oracle臨時表空間相關
2023-12-29
Oracle
時間函式：與時間相關那些事。。。
2024-10-14
函式
蒐集到的Weex 相關資料
2018-03-14
運維相關的資料整理
2019-05-10
運維
Linux技術相關命令有哪些
2020-05-20
Linux
微服務框架相關技術整理
2021-03-02
微服務框架
資料的相關性或因果關係 - KDnuggets
2022-05-12
圖解Linux的IO模型和相關技術
2021-09-09
圖解Linux模型
vue相關的UI元件庫
2018-06-26
VueUI元件
時間序列化資料庫選型？時序資料庫的選擇？
2022-05-18
資料庫
MySQL 資料庫相關流程圖 / 原理圖
2019-10-08
MySql資料庫流程圖
Oracle undo保留時間的幾個相關引數
2018-08-03
Oracle

druid相關的時間序列資料庫——也用到了倒排相關的優化技術

相關文章