Apache Druid vs. ClickHouse

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Druid
Score 9.0 out of 10
N/A
Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of the 3 systems into its ingestion layer, storage format, querying layer, and core architecture.N/A
ClickHouse
Score 7.3 out of 10
N/A
ClickHouse is an open-source, column-oriented OLAP database system enabling real-time analytical reports using SQL queries. With linear scalability, it handles trillions of rows and petabytes of data. ClickHouse Cloud offers a scalable serverless solution for real-time analytics.N/A
Pricing
Apache DruidClickHouse
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
DruidClickHouse
Free Trial
NoYes
Free/Freemium Version
NoYes
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeOptional
Additional DetailsPay for what is used: It automatically scales up and down compute resources based on the user's workload It scales storage and compute separately It automatically scales unused resources down to zero so that users don’t pay for idle services
More Pricing Information
Community Pulse
Apache DruidClickHouse
User Ratings
Apache DruidClickHouse
Likelihood to Recommend
9.0
(0 ratings)
10.0
(0 ratings)
User Testimonials
Apache DruidClickHouse
Likelihood to Recommend
It is extremely well suited to rapid ingest of data from large data sources, due to the fact that you can restrict what is ingested by column/field, so that you only pull in the data you actually want or need.
As stated earlier, the open source version could use better cluster management tools, and troubleshooting tools for failing jobs/tasks.
Read full review
ClickHouse delivers exceptional speed and performance, positioning it as the top choice for managing large-scale analytical workloads. With a bunch of built-in functions, it empowers analysts to extract maximum insights from data effortlessly. If your scenario is to deal with analytical questions, then ClickHouse is for you, but if you are looking into a transactional database, that's not the case; even their table engines are not made for this.
Read full review
Pros
  • Rapid ingest
  • Limiting ingest to only the relevant fields/columns
  • Easy ingest spec creation
Read full review
  • Their MergeTree table engine provide impressive performance for data insert in bulk
  • Not only data insert but also the way MergeTree engine uses Primary Keys to sort the data and perform data skipping based on the granules its also their secret for ridiculous fast queries
  • Data compression its also great
  • They provide especial table engines that allow you to read data directly from other sources like S3
  • Since its written with C++ you have very granular data types and especial ones like enum, LowCardinality and etc, they save you a lot of storage since are stored as integer values
  • ClickHouse functions besides the ones that respect ANSI Standards are also awesome and useful
Read full review
Cons
  • Security configuration is problematic
  • Cluster management could have more features
  • Troubleshooting incomplete tasks/jobs is a chore
Read full review
  • Avro data manipulation
  • Kafka consistency
  • DDL operations errors (by replica configuration)
Read full review
Alternatives Considered
No answers on this topic
ClickHouse was not compared to them as a competitor but as the ideal partner to complete an information analysis system, providing users with the most complete and efficient tools. Therefore, in this case it was considered that it would be the ideal candidate due to its characteristics compared to the other competitors.
Read full review
Return on Investment
  • Integration with S3 storage has saved about 35% on our storage, over HDFS
  • The rapid ingest has saved user's time in the query aspects of their applications.
  • The ability to ingest from a variety of data sources has made overall user application queries much simpler
Read full review
  • Queries that used to take more than 2 minutes now take less than 1 second
  • Possibility to analyze use cases in real time (before was impossible)
  • The applications are more complete and the users decisions are better
Read full review
ScreenShots