Milvus Vector Database
What is Milvus?
Milvus is a highly flexible, reliable, and blazing-fast cloud-native, open-source vector database. It powers embedding similarity search and AI applications and strives to make vector databases accessible to every organization. Milvus can store, index, and manage a billion+ embedding vectors generated by deep neural networks and other machine learning (ML) models.
Built on a strong and growing community.
22,868+ GitHub stars
2,512+ Slack Community
5,327,339+ Downloads
255+ Contributors
Why Milvus?
Milvus is a cloud-native, open-source vector database built to power embedding similarity search and AI applications.
Easy to Use
With the Milvus vector database, you can create a large-scale similarity search service in less than a minute. Simple and intuitive SDKs are also available for a variety of languages.
Blazing Fast
Milvus is hardware efficient and provides advanced indexing algorithms, achieving a 10x performance boost in retrieval speed.
Highly Available
Milvus vector database has been battle-tested by over a thousand enterprise users in various use cases. With extensive isolation of individual system components, Milvus is highly resilient and reliable.
Highly Scalable
Milvus's distributed and high-throughput nature makes it a natural fit for serving large-scale vector data.
Cloud-native
The Milvus database adopts a systemic approach to cloud-nativity, separating compute from storage and allowing you to scale up and out.
Feature-rich
Milvus vector search supports various data types, attribute filtering, user-defined functions (UDFs), configurable consistency levels, time travel, and more.
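To make attribute filtering concrete, here is a minimal pure-Python sketch of the idea: narrow the candidates with a predicate on scalar fields, then rank what remains by vector distance. It does not use the Milvus SDK and is not how Milvus implements filtering internally. The function `filtered_search`, the record fields, and the sample data are all hypothetical.

```python
import math

def filtered_search(records, query, predicate, k=2):
    """Return the ids of the k records closest to `query` among those passing `predicate`."""
    # Step 1: attribute filtering on scalar fields (hypothetical schema).
    candidates = [r for r in records if predicate(r)]
    # Step 2: rank the survivors by Euclidean distance to the query vector.
    candidates.sort(key=lambda r: math.dist(r["vector"], query))
    return [r["id"] for r in candidates[:k]]

# Hypothetical catalog: each record carries scalar attributes plus an embedding.
records = [
    {"id": 1, "category": "shoes", "price": 60, "vector": [0.1, 0.9]},
    {"id": 2, "category": "shoes", "price": 120, "vector": [0.2, 0.8]},
    {"id": 3, "category": "hats", "price": 20, "vector": [0.15, 0.85]},
    {"id": 4, "category": "shoes", "price": 45, "vector": [0.9, 0.1]},
]

result = filtered_search(
    records,
    query=[0.1, 0.9],
    predicate=lambda r: r["category"] == "shoes" and r["price"] < 100,
)
print(result)
```

Records 2 and 3 are close to the query but fail the predicate, so they never reach the ranking step. A production system would combine the filter with an index rather than scanning every record.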
How does Milvus Work?
Milvus consists of a storage layer and a compute layer, and to enhance elasticity and flexibility, all components in Milvus are stateless. The system comprises four layers:
Access layer
The access layer comprises a group of stateless proxies. It serves as the front layer of the system and the endpoint for users.
Coordinator service
The coordinator service assigns tasks to the worker nodes and functions as the system's brain.
Worker nodes
The worker nodes function as the system's arms and legs. They are dumb executors that follow instructions from the coordinator service and execute user-triggered DML/DDL commands.
Storage
Storage is the backbone of the system and is responsible for data persistence. It comprises meta storage, a log broker, and object storage.
What is Milvus Used For?
Today, Milvus is used to build similarity search applications across hundreds of use cases. Any company that relies on or works with unstructured data can benefit.
Semantic text search
Processing and querying text across multiple vectors like intent, location, and previous search history can provide the context necessary for more accurate and nuanced results.
AI advertising
Vector databases can be used in targeted advertising to improve the relevance and effectiveness of ad targeting. In this context, the database can store and index large amounts of data related to user behavior, demographics, and interests as high-dimensional vectors. Ads are then mapped to the same space as the users, making targeted advertising as simple as performing a query in Milvus.
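As an illustration of mapping ads and users into the same vector space, the sketch below ranks ads for one user with a brute-force scan. It is plain Python, not the Milvus SDK. The three-dimensional vectors, the ad names, and the function `top_k_ads` are hypothetical, and inner product is chosen as the similarity score purely for simplicity.

```python
def inner_product(a, b):
    """Dot product of two equal-length vectors; a larger value means more similar."""
    return sum(x * y for x, y in zip(a, b))

def top_k_ads(user_vector, ad_vectors, k=2):
    """Score every ad against the user vector and return the k best (ad_id, score) pairs."""
    scored = [(ad_id, inner_product(user_vector, vec)) for ad_id, vec in ad_vectors.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

# Hypothetical embeddings: users and ads live in the same 3-dimensional space.
ads = {
    "running_shoes": [0.9, 0.1, 0.0],
    "cookware": [0.0, 0.2, 0.9],
    "fitness_tracker": [0.8, 0.3, 0.1],
}
user = [1.0, 0.2, 0.0]

best = [ad_id for ad_id, _ in top_k_ads(user, ads, k=2)]
print(best)  # ['running_shoes', 'fitness_tracker']
```

A vector database replaces the full scan with an index so the same query stays fast across millions of ads.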
Video, audio, and image similarity search
Discover similar videos, audio, and images within large datasets with our state-of-the-art similarity search. Supported similarity metrics include Euclidean distance, cosine similarity, and Jaccard similarity.
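The three metrics named above can be written out in a few lines of plain Python. This is a reference sketch of the standard definitions, not Milvus code. The function names are hypothetical, and Jaccard similarity is shown on sets, which is how it is usually applied to binary features.

```python
import math

def euclidean_distance(a, b):
    """Straight-line distance between two vectors; smaller means more similar."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)

def jaccard_similarity(a, b):
    """Size of the intersection divided by size of the union of two sets."""
    a, b = set(a), set(b)
    if not a and not b:
        raise ValueError("Jaccard similarity is undefined for two empty sets")
    return len(a & b) / len(a | b)

print(euclidean_distance([0, 0], [3, 4]))         # 5.0
print(cosine_similarity([1, 0], [0, 1]))          # 0.0
print(jaccard_similarity({1, 2, 3}, {2, 3, 4}))   # 0.5
```

Euclidean distance is sensitive to vector magnitude, cosine similarity compares direction only, and Jaccard similarity suits set-like or binary data.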
Recommender system
Vector databases such as Milvus can power product recommendation engines for e-commerce by combining multiple sources of unstructured data, such as search history and past purchases. Additionally, user-generated content of various formats can be stored as a single vector representation, making recommending new content as easy as querying over content users have liked or engaged with previously.
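One simple way to "query over content users have liked" is to average the vectors of liked items into a profile vector and then look for the nearest items the user has not seen. The sketch below assumes that approach; it is one of many possible strategies, not a method prescribed by Milvus. The item names, the two-dimensional vectors, and the function `recommend` are hypothetical.

```python
import math

def recommend(items, liked_ids, k=1):
    """Recommend the k unseen items closest to the centroid of the liked items."""
    liked_vectors = [items[item_id] for item_id in liked_ids]
    dims = len(liked_vectors[0])
    # Profile vector: component-wise mean of everything the user liked.
    profile = [sum(vec[d] for vec in liked_vectors) / len(liked_vectors) for d in range(dims)]
    # Exclude items the user already engaged with, then rank by distance to the profile.
    unseen = [item_id for item_id in items if item_id not in liked_ids]
    unseen.sort(key=lambda item_id: math.dist(items[item_id], profile))
    return unseen[:k]

# Hypothetical item embeddings.
items = {
    "tent": [0.9, 0.1],
    "sleeping_bag": [0.8, 0.2],
    "hiking_boots": [0.7, 0.3],
    "blender": [0.1, 0.9],
}

suggestions = recommend(items, liked_ids=["tent", "sleeping_bag"], k=1)
print(suggestions)  # ['hiking_boots']
```

Averaging works when a user's tastes cluster together. For users with several distinct interests, querying once per liked item and merging the results may give better coverage.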
Question answering system
A question answering system is able to answer questions posed in natural language. These systems can be used for a variety of applications, including customer support, internal information retrieval, and more.
AI drug discovery
In drug discovery, vector representations of compounds capture their overall structure and biological properties. A vector database can store and index this data as high-dimensional vectors, so that candidate compounds can be found simply by querying for similar vectors.