Zilliz Cloud Pricing
Scalable pricing designed for every team to fit any budget. Estimate Cost
Subscribe on Marketplace.
Free
A starting point for learning, experimenting, and prototyping, with easy migration to paid plans.
Quick Start$0/mo.
- 5 GB storageEnough for 1M 768 dim vectors.
- 2.5M vCUsA virtual compute unit (vCU) is used to measure the resources consumed by read operations (such as search and query) and write operations (such as insert, upsert, and delete). The read and write costs vary for different vCU usage.per month included
- Up to 5 collections
Serverless
For applications with variable or infrequent traffic. Minimal configuration required.
Dedicated
Dedicated clusters offer use case optimized CUs to achieve high control, consistent performance, and cost-effectiveness. Suitable for development and testing.
Try FreeFrom
$99 /mo. (Up to 30-day free trial)
- Multiple cloud providers and regions
- Use case optimized CU types
- Basic metrics and monitors
- Contact Us
BYOC
Designed for organizations prioritizing custom infrastructure, enhanced data protection, and compliance.
- Deploy on your infra of choice
- Enhanced data control and security
- Flexibility and scalability on demand
Dedicated clusters offer diverse CUs tailored to fit your use cases
A CU (compute unit) is a measure of computational resources for data processing. Each CU type offers different combination of CPU, memory, and storage.
CU Type | Search QPS | Search Latency | Per CU Capacity | Cost per Million Vectors | Best Suited for |
---|---|---|---|---|---|
Performance-optimized | 500~1500 | sub-10 ms | 1.5 million vectors | from $65/mo. | Ideal for real-time applications requiring instant search results and high concurrent traffic. |
Capacity-optimized | 100~300 | tens-ms | 5 million vectors | from $20/mo. | Perfect for applications handling large vector datasets while maintaining reliable search speeds. |
Extended-capacityNew! Contact Sales | 5~20 | hundreds-ms | 20 million vectors | from $10/mo. | Designed for massive-scale datasets where optimizing total cost is prioritized over latency. |
*This table is based on evaluations of 768-dimensional vectors.
Free
Quick StartServerless
Try FreeDedicated (Standard)
Try FreeDedicated (Enterprise)
Try Free
Deployment | ||||
---|---|---|---|---|
Environment | Shared | Shared | Dedicated | Dedicated |
CU type | ||||
Performance-optimized Ideal for applications requiring low latency and high throughput. Each CU can handle about 1.5 million 768-dim vectors. | ||||
Capacity-optimized Suited for managing large datasets with moderate search performance requirements. Each CU can handle about 5 million 768-dim vectors. | ||||
Extended-capacityNew! | ||||
Public cloud provider | Google Cloud | Google Cloud | AWS, Google Cloud, Azure | AWS, Google Cloud, Azure |
Scale Scale up and down with zero downtime. | Auto-scaling | Manual scaling to 32 CUs | Auto-scaling Manual scaling to 256 CUs or more | |
Uptime SLA Guaranteed uptime for production workloads. | 99.95% |
High Availability | ||||
---|---|---|---|---|
Availability zone | Single | Single | Multiple | |
Replica Zilliz Cloud supports cluster-level replication, delivering a QPS that scales proportionally with the number of replicas. This replication feature automatically distributes replicas across different availability zones (AZs), enhancing both throughput and high availability. |
Data Service | ||||
---|---|---|---|---|
Vector Search A basic vector search finds the topK most similar results with tunable recall rates. | ||||
Filtered Search A filtered search conducts metadata filtering before conducting vector searches, narrowing down the search scope to only the entities matching the specified filtering conditions. | ||||
Range Search A range search improves search result relevancy by restricting the distance or score of the returned entities within a specific range. | ||||
Grouping Search A grouping search groups search results by the values in a specified field to aggregate data at a higher level, enhancing diversity of search results. | ||||
Hybrid Search A hybrid search supports searching for multiple vectors simultaneously and enhances search accuracy. | ||||
Full Text Search A full text search retrieves documents containing specific terms or phrases in text datasets, then ranking the results based on relevance. | ||||
Text Match A text match enables precise document retrieval based on specific terms. | ||||
Query A query allows you to find results that match specified metadata. |
Data Management | ||||
---|---|---|---|---|
Cross-tier data migration Easy migration from Free tier, Serverless, and Standard. | From Free tier | From Free tier and Serverless | From Free tier, Serverless & Dedicated(Standard) | |
Zero downtime migration Allows your service to remain operational throughout the migration process. | ||||
Migration from external sources Easy migration from Milvus, Pinecone, Qdrant, Elasticsearch, PostgreSQL, Tencent Cloud VectorDB, etc. | ||||
High speed data import High speed data import from object storage like S3. | ||||
Recycle bin Dropped collections will be retained for 30 days to facilitate easy recovery. |
Data Security & Compliance | ||||
---|---|---|---|---|
OAuth 2.0 OAuth 2.0 for authorizing account access without sharing or storing user login credentials. | ||||
Enterprise SSO Streamlined user authentication which supports both Okta and SAML 2.0 protocol. | Public Preview | |||
MFA | ||||
Auditing Comprehensive auditing logs that capture all UI and RESTful API operations on the control plane, as well as all SDK and RESTful API operations on the data plane. | ||||
API key management | ||||
Data encryption in transit and at rest | ||||
Backup and restore Supports backup at both the cluster and collection levels, with options for manual and automatic backups. | ||||
IP address access control | ||||
Private networking Private connection between your VPC and Zilliz Cloud VPC. | ||||
SOC 2 Type II and ISO/ICE 27001 compliant, GDPR-ready | ||||
HIPPA-ready |
Observability | ||||
---|---|---|---|---|
Fine-grained metrics with real-time monitoring dashboards Metrics to monitor performance, storage, usage, data statistics, etc. | ||||
Alerts Seamless integration with various alerting channels including emails, PagerDuty, Slack, Opsgenie, Lark, Webhook, etc. | ||||
Alerting and monitoring integrations Monitoring API and integrations with Prometheus and Datadog. | ||||
Job Center A centralized job center page to track the progress of tasks including migration, data import, backup and restore, clone collection, and create sample collection, etc. |
Role-based Access Control | ||||
---|---|---|---|---|
Organization and project management | 1 organization 1 project | 1 organization Up to 10 projects | 1 organization Up to 10 projects | 1 organization Up to 10 projects |
Organization and project RBAC Role-based access control at both organization and project levels. | ||||
Data plane RBAC Data layer RBAC enables precise permission control over collections, partitions, and operations, enhancing security and operational alignment. |
Integrations and Tools | ||||
---|---|---|---|---|
Intuitive RESTful APIs for control and data plane operations | ||||
User-friendly SDKs in multiple programming languages | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs |
VTS (Vector Transport Service) An open-source tool for securely moving unstructured and vector data in private environments. | ||||
VectorDBBench An open-source benchmarking tool for mainstream vector databases. It is also a tool for ultimate performance and cost-effectiveness comparison. |
Support | ||||
---|---|---|---|---|
Community support | ||||
Email support | ||||
Response time SLAs | ||||
Urgent | 4 hours | 4 hours | 1 hour | |
High | 1 business day | 1 business day | 4 hours | |
Normal | 2 business days | 2 business days | 1 business day | |
Technical contacts | Up to 1 | Up to 1 | Up to 4 |
Estimate Your Cost
Use this calculator to understand how Zilliz Cloud charges.
Cloud Provider
Cloud Region
Pricing Plan
StandardOffers high control and consistent performance, cost-efficient for development and testing environments.
CU Type
Capacity-optimizedSuited for managing large datasets with moderate search performance requirements. Each CU can handle about 5 million 768-dim vectors.
Number of Entities
Vector Dimensions
mmap
Enabling mmap (memory mapping) can optimize memory usage and the amount of data that can be stored in the same CU size will be increased. Learn More
Estimated total cost per month
Total Cost (excl. tax) = CU Cost × Replica Count + Storage Cost
The price estimate is monthly, but the actual cost is billed hourly. You can suspend clusters anytime to save costs.
$
- CU
Unit Price
$0/CU
A CU is the basic unit of compute resources used for parallel processing of data.
$
GB
Storage
Unit Price
$0/GB
$
Acknowledgement: Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on a variety of factors, including your actual usage of services.
Frequently Asked Questions
What is a Compute Unit (CU)?
What is a vCU?
Which type of CU should I pick?
How many CUs do I need for a given collection?
How can I get Zilliz Cloud discounts?
How can I request a new cloud region?