Zilliz Cloud Pricing
Scalable pricing designed for every team to fit any budget. Estimate Cost
Subscribe on Marketplace.
Free
A starting point for learning, experimenting, and prototyping, with easy migration to paid plans.
Quick Start$0/mo.
- 5 GB storageEnough for 1M 768 dim vectors.
- 2.5M vCUsA virtual compute unit (vCU) is used to measure the resources consumed by read operations (such as search and query) and write operations (such as insert, upsert, and delete). The read and write costs vary for different vCU usage.per month included
- Up to 5 collections
Serverless
For applications with variable or infrequent traffic. Minimal configuration required.
Dedicated
Dedicated clusters offer use case optimized CUs to achieve high control, consistent performance, and cost-effectiveness. Suitable for development and testing.
Try FreeFrom
$99 /mo. (Up to 30-day free trial)
- Multiple cloud providers and regions
- Use case optimized CU types
- Basic metrics and monitors
- Contact Us
BYOC
Designed for organizations prioritizing custom infrastructure, enhanced data protection, and compliance.
- Deploy on your infra of choice
- Enhanced data control and security
- Flexibility and scalability on demand
Dedicated clusters offer diverse CUs tailored to fit your use cases
A CU (compute unit) is a measure of computational resources for data processing. Each CU type offers different combination of CPU, memory, and storage.
CU Type | Search QPS | Search Latency | Per CU Capacity | Cost per Million Vectors | Best Suited for |
---|---|---|---|---|---|
Performance-optimized | 500~1500 | sub-10 ms | 1.5 million vectors | from $65/mo. | Ideal for real-time applications requiring instant search results and high concurrent traffic. |
Capacity-optimized | 100~300 | tens-ms | 5 million vectors | from $20/mo. | Perfect for applications handling large vector datasets while maintaining reliable search speeds. |
Extended-capacityNew! Contact Sales | 5~20 | hundreds-ms | 20 million vectors | from $10/mo. | Designed for massive-scale datasets where optimizing total cost is prioritized over latency. |
*This table is based on evaluations of 768-dimensional vectors.
Free
Quick StartServerless
Try FreeDedicated (Standard)
Try FreeDedicated (Enterprise)
Try Free
Deployment | ||||
---|---|---|---|---|
Environment | Shared | Shared | Dedicated | Dedicated |
CU type | ||||
Performance-optimized Ideal for applications requiring low latency and high throughput. Each CU can handle about 1.5 million 768-dim vectors. | ||||
Capacity-optimized Suited for managing large datasets with moderate search performance requirements. Each CU can handle about 5 million 768-dim vectors. | ||||
Extended-capacityNew! | ||||
Public cloud provider | Google Cloud | Google Cloud | AWS, Google Cloud, Azure | AWS, Google Cloud, Azure |
Scale Scale up and down with zero downtime. | Auto-scaling | Manual scaling to 32 CUs | Auto-scaling Manual scaling to 256 CUs or more | |
Uptime SLA Guaranteed uptime for production workloads. | 99.95% |
High Availability | ||||
---|---|---|---|---|
Availability zone | Single | Single | Multiple | |
Replica Zilliz Cloud supports cluster-level replication, delivering a QPS that scales proportionally with the number of replicas. This replication feature automatically distributes replicas across different availability zones (AZs), enhancing both throughput and high availability. |
Data Management | ||||
---|---|---|---|---|
Cross-tier data migration Easy migration from Free tier, Serverless, and Standard. | From Free tier | From Free tier and Serverless | From Free tier, Serverless & Dedicated(Standard) | |
Migration from external sources Easy migration from Milvus, Pinecone, Qdrant, Elasticsearch, PostgreSQL, Tencent Cloud VectorDB, etc. | ||||
High speed data import High speed data import from object storage like S3. | ||||
Recycle bin Dropped collections will be retained for 30 days to facilitate easy recovery. |
Data Security & Compliance | ||||
---|---|---|---|---|
OAuth 2.0 OAuth 2.0 for authorizing account access without sharing or storing user login credentials. | ||||
Enterprise SSO Streamlined user authentication which supports both Okta and SAML 2.0 protocol. | Public Preview | |||
MFA | ||||
Auditing Comprehensive auditing logs that capture all UI and RESTful API operations on the control plane, as well as all SDK and RESTful API operations on the data plane. | ||||
API key management | ||||
Data encryption in transit and at rest | ||||
Backup and restore Supports backup at both the cluster and collection levels, with options for manual and automatic backups. | ||||
IP address access control | ||||
Private networking Private connection between your VPC and Zilliz Cloud VPC. | ||||
SOC 2 Type II and ISO/ICE 27001 compliant, GDPR and HIPPA ready |
Observability | ||||
---|---|---|---|---|
Fine-grained metrics with real-time monitoring dashboards Metrics to monitor performance, storage, usage, data statistics, etc. | ||||
Alerts Seamless integration with various alerting channels including emails, PagerDuty, Slack, Opsgenie, Lark, Webhook, etc. | ||||
Alerting and monitoring integrations Monitoring API and integrations with Prometheus and Datadog. | ||||
Job Center A centralized job center page to track the progress of tasks including migration, data import, backup and restore, clone collection, and create sample collection, etc. |
Role-based Access Control | ||||
---|---|---|---|---|
Organization and project management | 1 organization 1 project | 1 organization Up to 10 projects | 1 organization Up to 10 projects | 1 organization Up to 10 projects |
Organization and project RBAC Role-based access control at both organization and project levels. | ||||
Data plane RBAC Data layer RBAC enables precise permission control over collections, partitions, and operations, enhancing security and operational alignment. |
Integrations and Tools | ||||
---|---|---|---|---|
Intuitive RESTful APIs for control and data plane operations | ||||
User-friendly SDKs in multiple programming languages | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs |
VectorDBBench An open-source benchmarking tool for mainstream vector databases. It is also a tool for ultimate performance and cost-effectiveness comparison. |
Support | ||||
---|---|---|---|---|
Community support | ||||
Email support | ||||
Response time SLAs | ||||
Urgent | 4 hours | 4 hours | 1 hour | |
High | 1 business day | 1 business day | 4 hours | |
Normal | 2 business days | 2 business days | 1 business day | |
Technical contacts | Up to 1 | Up to 1 | Up to 4 |
Estimate Your Cost
Use this calculator to understand how Zilliz Cloud charges.
Cloud Provider
Cloud Region
Pricing Plan
StandardOffers high control and consistent performance, cost-efficient for development and testing environments.
CU Type
Capacity-optimizedSuited for managing large datasets with moderate search performance requirements. Each CU can handle about 5 million 768-dim vectors.
Number of Entities
Vector Dimensions
Estimated total cost per month
Total Cost (excl. tax) = CU Cost × Replica Count + Storage Cost
The price estimate is monthly, but the actual cost is billed hourly. You can suspend clusters anytime to save costs.
$
CU Cost
CUCU Usage
CU
Unit Price
$0/CU
A CU is the basic unit of compute resources used for parallel processing of data.
$
Storage Cost
GBStorage Usage
GB
Unit Price
$0/GB
$
Acknowledgement: Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on a variety of factors, including your actual usage of services.
Frequently Asked Questions
What is a Compute Unit (CU)?
What is a vCU?
Which type of CU should I pick?
How many CUs do I need for a given collection?
How can I get Zilliz Cloud discounts?
How can I request a new cloud region?