Vector Database

CrateDB is an open source, multi-model, distributed database built for high performance, scalability, and flexibility. Store vector data alongside your other data in a single, scalable database that you query with SQL. This streamlines data management and reduces development time and total cost of ownership.

Open source vector database, all with SQL

Hyper-fast. Queries in milliseconds.

SELECT text, _score
FROM word_embeddings
WHERE knn_match(embedding, [0.3, 0.6, 0.0, 0.9], 2)
ORDER BY _score DESC;

|------------------------|--------|
|         text           | _score |
|------------------------|--------|
|Discovering galaxies    |0.917431|
|Discovering moon        |0.909090|
|Exploring the cosmos    |0.909090|
|Sending the mission     |0.270270|
|------------------------|--------|
        
SELECT text, _score
FROM word_embeddings
WHERE knn_match(embedding, (SELECT embedding FROM word_embeddings WHERE text = 'Discovering galaxies'), 2)
ORDER BY _score DESC;

|------------------------|--------|
|         text           | _score |
|------------------------|--------|
|Discovering galaxies    |1       |
|Discovering moon        |0.952381|
|Exploring the cosmos    |0.840336|
|Sending the mission     |0.250626|
|------------------------|--------|
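
The scores in both result tables are consistent with a Euclidean similarity of the form 1 / (1 + d²), where d is the distance between the query vector and a stored vector. The Python sketch below reproduces the first table under that assumption. The four stored vectors are hypothetical values chosen to match the scores shown; they do not appear on this page, and the scoring formula is inferred from the numbers rather than quoted from CrateDB.

```python
# Illustrative only: the stored vectors are assumed values, picked so that
# the scores match the first result table above.
ASSUMED_EMBEDDINGS = {
    "Discovering galaxies": [0.2, 0.4, 0.2, 0.9],
    "Discovering moon": [0.2, 0.4, 0.1, 0.7],
    "Exploring the cosmos": [0.1, 0.5, -0.2, 0.8],
    "Sending the mission": [0.5, 0.9, -0.1, -0.7],
}


def euclidean_score(a, b):
    """Map squared Euclidean distance to a score in (0, 1]; 1 means identical."""
    squared_distance = sum((x - y) ** 2 for x, y in zip(a, b))
    return 1.0 / (1.0 + squared_distance)


def rank(query, embeddings):
    """Score every stored vector against the query, highest score first."""
    scored = [(text, euclidean_score(vec, query)) for text, vec in embeddings.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for text, score in rank([0.3, 0.6, 0.0, 0.9], ASSUMED_EMBEDDINGS):
        print(f"{text:<22} {score:.6f}")
```

Passing the assumed "Discovering galaxies" vector as the query gives that row a score of exactly 1, as in the second table, because the distance from a vector to itself is zero.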

Streamlined data management

Eliminate the need to manage multiple systems. CrateDB keeps your (meta-)data and vector representations together and aligned, without separate data synchronization processes. Alongside vector search, it supports time series, geospatial, JSON, full-text search, and other data types.

Data enriched with semantics

Store vector embeddings in the same rows as your (meta-)data, which keeps context aligned and makes results easier to explain.

Advanced search capabilities

Combine vector, full-text, and keyword searches for improved semantic similarity and keyword matching, enhancing search precision and relevance.
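
One common way to combine a vector ranking with a full-text ranking is reciprocal rank fusion (RRF). The sketch below is a generic Python illustration of that technique, not CrateDB's implementation; the document IDs, the two input rankings, and the constant k = 60 are all assumptions made for the example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of IDs; each hit contributes 1 / (k + rank)."""
    fused = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by ID for a stable order.
    return sorted(fused, key=lambda doc_id: (-fused[doc_id], doc_id))


if __name__ == "__main__":
    # Hypothetical result lists, best match first.
    vector_hits = ["doc_a", "doc_b", "doc_c"]
    fulltext_hits = ["doc_b", "doc_d", "doc_a"]
    print(reciprocal_rank_fusion([vector_hits, fulltext_hits]))
```

Documents that rank well in both lists rise to the top, while a document found by only one search still appears in the fused result.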

Enhanced AI model integration

Leverage CrateDB's native support for complex data analytics to speed up integration with AI models, optimizing your AI projects.

Improved scalability

Handle vector data with confidence, eliminating the need for separate vector databases and enabling smoother scaling as your data grows.

Faster development & lower maintenance

Save development time by avoiding the integration work and learning curve that come with external solutions for vector data storage and retrieval.

Keynote - The transformative effects of real-time AI 

In this keynote at the AI & Big Data Expo Europe 2023, CrateDB's VP Product shares his vision for the future with multi-model SQL databases and Large Language Models.

Dev Talk - How to Use Private Data in Generative AI

This talk at FOSDEM 2024 focuses on combining CrateDB with LangChain. It shows how to get started with using private data as context for large language models through LangChain, applying the concept of Retrieval Augmented Generation (RAG).

5 Essential Things You Need to Know about Vector Databases

This infographic covers the basics of vector databases, from what to look for when choosing one to combining vector data with other data types.

As a vector store, CrateDB provides two key capabilities: vector storage and similarity search.

  • Vector storage lets you store embeddings produced by your preferred machine learning models, giving you one place to manage and access vectorized data.
  • Similarity search finds the items in a dataset whose vectors are closest to a given query vector, supporting data exploration and in-depth analysis.

By offering these vector database capabilities within a single, scalable product, CrateDB streamlines data management, cutting down both development time and total cost of ownership.

Typical use cases for vector databases

Unlock the potential of CrateDB's vector storage and similarity search across a range of industries and applications:

E-commerce recommendations

Elevate e-commerce experiences by enabling semantic vector and full-text searches to deliver more relevant product recommendations.

Chatbots & customer support

Answer customer questions more accurately by matching on meaning rather than exact wording. Conversations stay in context, and user inquiries are understood regardless of the terms used.

Anomaly & fraud detection

Detect outliers in sectors ranging from finance to IoT and cybersecurity by normalizing user or machine behavior and encoding it into vectors.
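
As a rough illustration of the idea, the Python sketch below normalizes behavior vectors to unit length and flags those that lie unusually far from the centroid. The centroid approach, the feature values, and the threshold are assumptions made for the example; a production system would more likely rely on nearest-neighbor distances from the database and a tuned threshold.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so magnitude does not dominate distance."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def find_outliers(vectors, threshold):
    """Return indices of vectors farther than `threshold` from the centroid."""
    unit = [normalize(v) for v in vectors]
    dims = len(unit[0])
    centroid = [sum(v[i] for v in unit) / len(unit) for i in range(dims)]
    return [i for i, v in enumerate(unit) if math.dist(v, centroid) > threshold]


if __name__ == "__main__":
    # Hypothetical per-user features, e.g. counts of three kinds of events.
    behavior = [[10, 1, 0], [12, 1, 0], [11, 2, 0], [9, 1, 1], [1, 0, 15]]
    print(find_outliers(behavior, threshold=0.5))
```

Here the last vector points in a very different direction from the others, so it is the only one reported.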

Multimodal search

Extend your search capabilities beyond text to include images and videos.

Generative AI

Store embeddings, supply additional context for prompts, and use the database as conversational memory for LLM-based applications. Vector search also supports retrieval augmented generation (RAG), which lets LLMs draw on your specific data.
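
A minimal Python sketch of the retrieval step in RAG: rank stored text chunks by similarity to the question embedding, then place the best matches into the prompt. The chunks, their three-dimensional embeddings, the use of cosine similarity, and the prompt wording are all hypothetical; in practice the embeddings come from a model and the ranking from a vector search in the database.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(question_vec, chunks, top_k=2):
    """Return the texts of the top_k chunks most similar to the question."""
    ranked = sorted(chunks, key=lambda c: cosine_similarity(question_vec, c[1]), reverse=True)
    return [text for text, _ in ranked[:top_k]]


def build_prompt(question, context):
    """Put retrieved chunks ahead of the question so the model can use them."""
    joined = "\n".join(f"- {text}" for text in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {question}"


if __name__ == "__main__":
    # Hypothetical (text, embedding) pairs; real embeddings come from a model.
    chunks = [
        ("Refunds are issued within 14 days.", [0.9, 0.1, 0.0]),
        ("Our office is closed on Sundays.", [0.0, 0.2, 0.9]),
        ("Refund requests need an order number.", [0.8, 0.3, 0.1]),
    ]
    context = retrieve([1.0, 0.2, 0.0], chunks)
    print(build_prompt("How do I get a refund?", context))
```

The resulting prompt would then be sent to the language model of your choice; that call is left out because it depends on the provider.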

Additional resources on vector databases