Zilliz Cloud Pricing
Built to scale with your usage, meet your security and compliance needs, and remain cost-effective for any budget. Estimate Cost
Free
Starting point for learning and personal projects.
- 5 GB storage
- 2.5M per month includedvCUs
- Up to 5 collections
Standard
Managed essentials for non-critical workloads. Best for: prototypes and testing environments.
From $0.3/GB/month (Serverless)
- Fully managed vector databases with core APIs
- Backup, restore, and basic monitoring
- Built-in encryption for data in transit and at rest
EnterpriseMOST POPULAR
MOST POPULAR
Enterprise-grade reliability and controls. Best for: production applications.
- 99.95% uptime SLA
- Audit logs, SSO (SAML 2.0 based), granular RBAC
- Multi-replica and elastic scaling
- Private endpoint and VPC peering
- included
Business Critical
Regulated-ready with maximum resilience. Best for: healthcare, finance, and other highly regulated, mission-critical systems.
- Global cluster with high-level availability and disaster recovery
- Advanced security: CMEK and full-path in-transit encryption (edge TLS + internal)
- HIPAA-eligible with enhanced data privacy features
- and rapid incident response
BYOC (Bring Your Own Cloud)
Designed for organizations that prioritize custom infrastructure, enhanced data protection, and compliance.
- Deploy on your infrastructure of choice
- High-level control and security
- Same features and experience as SaaS Dedicated clusters
Flexible Deployment Options
Dedicated
Dedicated provides isolated, reserved environments for production workloads that demand consistent and predictable performance. This option is ideal for sustained high-throughput and latency-sensitive applications.
- Predictable Performance- Leverage dedicated compute units (CU) to ensure stable performance without resource contention. 
- Fixed and Transparent Costs- Easily manage your budget with a clear, pay-as-you-go pricing model. 
- Full Control & Customization- Gain granular control over your resources and access advanced features to meet your specific application requirements. 
Cluster Types
Dedicated clusters offer diverse types tailored to fit your use cases. A CU (compute unit) is a measure of computational resources for data processing. Each cluster type offers different combination of CPU, memory, and storage.
Performance-optimized
Ideal for real-time applications requiring instant search results and high concurrent traffic.
Per CU Capacity
1.5 million vectors
Search QPS
500-1500
Search Latency
10 ms
From $65
per million vectors / month
Capacity-optimized
Perfect for applications handling large vector datasets while maintaining reliable search speeds.
Per CU Capacity
5 million vectors
Search QPS
100-300
Search Latency
50-100 ms
From $20
per million vectors / month
Tiered-storage
Best for ultra-large-scale, cost-sensitive workloads with clear hot and cold data patterns. Each query CU can handle about 20 million 768-dim vectors.
Per CU Capacity
20 million vectors
Hot Data Access
Search QPS
100-150
Search Latency
20-40 ms
Cold Data Access
Search QPS
5-20
Search Latency
200-1000 ms
From $7
per million vectors / month
*This data is based on evaluations of 768-dimensional vectors.
View Plan Feature Details
- FreeStart for Free
- ServerlessTry Free
- DedicatedStandardTry Free
- DedicatedEnterpriseTry Free
- DedicatedBusiness CriticalContact Us
| Deployment | |||||
|---|---|---|---|---|---|
| Environment | Shared | Shared | Dedicated | Dedicated | Dedicated | 
| Cluster Type | |||||
| Performance-optimizedIdeal for applications requiring low latency and high throughput. Each query CU can handle about 1.5 million 768-dim vectors. | |||||
| Capacity-optimizedSuited for managing large datasets with moderate search performance requirements. Each query CU can handle about 5 million 768-dim vectors. | |||||
| Tiered-storageBest for ultra-large-scale, cost-sensitive workloads with clear hot and cold data patterns. Each query CU can handle about 20 million 768-dim vectors. | |||||
| Public cloud provider | Google Cloud | AWS, Google Cloud | AWS, Google Cloud, Azure | AWS, Google Cloud, Azure | AWS, Google Cloud, Azure | 
| Compute Scaling | System-managed auto-scaling (No configuration required) | Manual scaling to 32 CUs | Configurable auto-scaling Manual scaling to 256 CUs or more | Configurable auto-scaling Manual scaling to 256 CUs or more | |
| Uptime SLA | 99.95% | 99.99% (If mutli-replica enabled) | |||
Estimate Your Cost
Use this calculator to understand how Zilliz Cloud charges.
Number of Entities
Vector Dimensions
mmap
Enabling mmap (memory mapping) can optimize memory usage and the amount of data that can be stored in the same Query CU will be increased. Learn More
Estimated total cost per month
$
Evaluate your cost with $200 free credits
Prices are estimates and may differ from actual costs. We recommend running a Proof of Concept (PoC) using the free credits to validate costs and performance, or contacting us for a tailored cost-optimization plan and additional PoC resources.
- Query CU$ 
- GB - Storage - $ 
Frequently Asked Questions
- What is a Compute Unit (CU)? 
- What is a vCU? 
- Which type of cluster should I pick? 
- How many query CUs do I need for a given collection? 
- How can I get Zilliz Cloud discounts? 
- How can I request a new cloud region?