Compare CrateDB

How is CrateDB different from other databases in the market?

TimescaleDB is a time-series database built as a PostgreSQL extension, greatly improving the functionality of PostgreSQL for time-series workloads. However, TimescaleDB is not the best option for large IoT use cases with high data variety and a need for efficient, simple scalability.

TimescaleDB was conceived as a single-node database. In contrast, CrateDB was built to scale: distribution is at the core of CrateDB's architecture, which provides unlimited horizontal scalability since the start. Many developments later, CrateDB is now a fully mature distributed database—while the multi-node version of Timescale was released very recently (late 2020).

 

  TimescaleDB CrateDB
Cluster type Multi-node Multi-node (shared-nothing)
Data replication Yes At table level
Open-source Apache 2.0 Apache 2.0
Access language ANSI SQL ANSI SQL
Schemas Static Dynamic
Columnar indexing Yes Yes
Aggregation queries Yes Yes
JOINs Yes Full
Full-text search No Yes (Lucene powered)

Since scalability is at the core of CrateDB, to scale it is as easy as can be—especially when operated in the cloud. Data replication and cluster rebalancing are automatic in CrateDB, and it performs aggregations, JOINs, sub-selects, and ad-hoc queries at in-memory speed. It also includes Lucene-powered full-text search.

(TimescaleDB supports JSON columns, but their performance is not optimal... And indices must be manually configured in order to get a good response. With CrateDB, indexing is automatic.)

Besides, TimescaleDB needs two times more storage than CrateDB in the IIoT workloads we've tested. 

For the same IIoT dataset and budget, CrateDB was able to store one year's worth of data—vs one month in TimescaleDB 

In summary: for IoT use cases involving huge data volumes, high concurrency, and requiring a real-time response, we believe CrateDB is a better fit than TimescaleDB.

  • Scalability is the heart and soul of CrateDB. Customers have been using clusters with 80+ nodes in production for years
  • CrateDB's excellent indexing structure assures the handling of time series data with efficiency
  • At the same time, its shared-nothing distribution allows for horizontal scalability without a single point of failure
  • On top of it, CrateDB offers mature features like automatic cluster rebalancing and full-text search, still not present on TimescaleDB

...but don't take our word for it.

Download CrateDB and test it out yourself

continue reading
 
Blogpost

Comparing databases for a IIoT use case

Testing out MongoDB, TimescaleDB, InfluxDB and CrateDB

 
White Paper

Time-series data in manufacturing

Why are they different from other time series use cases?

 
Blog

Amazon Timestream is finally released...

And these are our first impressions