Skip to content

The Database
for Real-Time Analytics
and Hybrid Search

Any type of data. Distributed. Native SQL.

rockset-logo[Rockset] is being discontinued.
Migrate to CrateDB with a special offer  ->
CrateDB is a leader in Time Series Databases on G2
CrateDB was awarded with the badge Best Relationship Winter 2025 in the Time Series Databases category on G2

Real-time Analytics

Execute ad-hoc queries on billions of records in milliseconds. Columnar storage guarantees ultra-fast aggregations, enabling instant data-driven decisions. Begin with a simple query and delve into complex data relationships, revealing trends and patterns across diverse data types. Learn more >

Hybrid Search

Effortless search across structured, semi-structured, geospatial, and vector data. Perform full-text, vector search or similarity searches and combine the results with other data types. The fully distributed SQL query engine, built on top of Apache Lucene, ensures unmatched performance and scalability. Learn more >

Marketecture

Leading Companies Leveraging Real-Time Insights at Scale

Real-time Ingestion and Dynamic Indexing

Enjoy the power of instant indexing and adaptability, perfectly suited for handling complex and evolving data structures.

Real-time indexing automatically indexes all columns, including nested structures, as data is ingested, ensuring immediate query availability with no latency.

The flexible data schema dynamically adapts based on the data you ingest, offering seamless integration and instant readiness for analysis.

Real-time Querying and Search

Experience ultra-fast response times, even for complex ad-hoc queries, with results delivered in milliseconds. Perform on-the-fly aggregations, effortlessly handling complex joins, large datasets, and historical data.

Leverage the power of full-text and vector search without needing additional databases. Seamlessly integrate with AI/ML frameworks for advanced data analysis.

Enhanced Developer Productivity

Boost your developer productivity with native SQL for simple queries and quick onboarding. Analyze relational, JSON, time-series, geospatial, full-text, and vector data within a single system.

PostgreSQL compatibility ensures easy integration with third-party tools, enhancing compatibility and migration. Utilize the vector store to seamlessly integrate with AI/ML tools and LangChain, allowing you the freedom to choose your LLM and embedding algorithms.

The power and flexibility of the open-source licensing model liberates you from vendor lock-in, and provides support from the growing developer community.

 
        
/* Based on device data, this query returns the average
 * of the battery level for every hour for each device_id
 */
WITH avg_metrics AS (
    SELECT device_id,
       DATE_BIN('1 hour'::INTERVAL, time, 0) AS period,
       AVG(battery_level) AS avg_battery_level
    FROM devices.readings
    GROUP BY 1, 2 
    ORDER BY 1, 2
)
SELECT period,
       t.device_id,
       manufacturer,
       avg_battery_level  
FROM avg_metrics t, devices.info i
WHERE t.device_id = i.device_id 
      AND model = 'mustang'
LIMIT 10;

        
        
        
+---------------+------------+--------------+-------------------+
|    period     |  device_id | manufacturer | avg_battery_level |
+---------------+------------+--------------+-------------------+
| 1480802400000 | demo000001 |    iobeam    | 49.25757575757576 |
| 1480806000000 | demo000001 |    iobeam    | 47.375            |
| 1480802400000 | demo000007 |    iobeam    | 25.53030303030303 |
| 1480806000000 | demo000007 |    iobeam    | 58.5              |
| 1480802400000 | demo000010 |    iobeam    | 34.90909090909091 |
| 1480806000000 | demo000010 |    iobeam    | 32.4              |
| 1480802400000 | demo000016 |    iobeam    | 36.06060606060606 |
| 1480806000000 | demo000016 |    iobeam    | 35.45             |
| 1480802400000 | demo000025 |    iobeam    | 12                |
| 1480806000000 | demo000025 |    iobeam    | 16.475            |
+---------------+------------+--------------+-------------------+

        
        

Streamlined Operations

Experience a cost-efficient, robust, and scalable architecture that delivers high performance at any scale. Eliminate the hassle of combining and synchronizing different databases, reducing overhead, and minimizing your carbon footprint.

Ensure high availability with automatic failover, recovery, and replication, keeping your data safe and accessible. The resilient architecture detects failures and maintains cluster health, offering peace of mind even in distributed environments.

Choose from multiple deployment models: DBaaS, hybrid cloud, of self-managed, providing flexibility to meet your operational needs, even for Edge deployment with limited connectivity. Whether you're running on a single laptop or dozens of servers with terabytes of data, seamlessly scale from prototype to production.

Introduction to CrateDB

Turbocharge your aggregations, search and AI models and get real-time insights

Upcoming Events

Webinar

Join our live webinar on February 25th to learn how to master Change Data Capture and supercharge your analytics pipeline!

Meetup

Join CrateDB’s Engineering Team in Amsterdam for a dynamic, hands-on morning that will transform the way you think about data analytics and AI....