Zilliz Cloud Pricing
Scalable pricing designed for every team to fit any budget. Estimate Cost
Subscribe on Marketplace.
Free
A starting point for learning, experimenting, and prototyping, with easy migration to paid plans.
Quick Start- 5 GB storageEnough for 1M 768 dim vectors.
- 2.5M vCUsA virtual compute unit (vCU) is used to measure the resources consumed by read operations (such as search and query) and write operations (such as insert, upsert, and delete). The read and write costs vary for different vCU usage.per month included
- Up to 5 collections
Serverless
For applications with variable or infrequent traffic. Minimal configuration required.
Try Free$4 / MillionvCUsA virtual compute unit (vCU) is used to measure the resources consumed by read operations (such as search and query) and write operations (such as insert, upsert, and delete). The read and write costs vary for different vCU usage.- Pay only for what you use
- Auto-scaling
- Up to 100 collections
Dedicated
Offers high control and consistent performance, cost-efficient for development and testing environments.
Try FreeFrom
$99 /mo. (Up to 30-day free trial)
- Multiple cloud providers and regions
- Use case-optimized CU types
- Basic metrics and monitors
- Contact Us
BYOC
Designed for organizations prioritizing custom infrastructure, enhanced data protection, and compliance.
- Deploy on your infra of choice
- Enhanced data control and security
- Flexibility and scalability on demand
Free
Quick StartServerless
Try FreeDedicated (Standard)
Try FreeDedicated (Enterprise)
Try Free
Deployment | ||||
---|---|---|---|---|
Environment | Shared | Shared | Dedicated | Dedicated |
CU options | ||||
Performance-optimized CUFor scenarios requiring low latency and high throughput. Each CU can handle about 1.5 million 768-dim vectors. | ||||
Capacity-optimized CUFor scenarios requiring enhanced storage capabilities. Each CU can handle about 5 million 768-dim vectors. | ||||
Public cloud provider | Google Cloud | Google Cloud | AWS, Google Cloud, Azure | AWS, Google Cloud, Azure |
Scale Scale up and down with zero downtime. | Auto-scaling | Manual scaling to 32 CUs | Auto-scaling Manual scaling to 256 CUs or more | |
Uptime SLA Guaranteed uptime for production workloads. | 99.95% |
High Availability | ||||
---|---|---|---|---|
Availability zone | Single | Single | Multiple | |
Replica Zilliz Cloud supports cluster-level replication, delivering a QPS that scales proportionally with the number of replicas. This replication feature automatically distributes replicas across different availability zones (AZs), enhancing both throughput and high availability. |
Data Management | ||||
---|---|---|---|---|
Cross-tier data migration Easy migration from Free tier, Serverless, and Standard. | From Free tier | From Free tier and Serverless | From Free tier, Serverless & Dedicated(Standard) | |
Migration from external sources Easy migration from Milvus, Pinecone, Qdrant, Elasticsearch, PostgreSQL, Tencent Cloud VectorDB, etc. | ||||
High speed data import High speed data import from object storage like S3. | ||||
Recycle bin Dropped collections will be retained for 30 days to facilitate easy recovery. |
Data Security & Compliance | ||||
---|---|---|---|---|
OAuth 2.0 OAuth 2.0 for authorizing account access without sharing or storing user login credentials. | ||||
Enterprise SSO Streamlined user authentication which supports both Okta and SAML 2.0 protocol. | Public Preview | |||
MFA | ||||
Auditing Comprehensive auditing logs that capture all UI and RESTful API operations on the control plane, as well as all SDK and RESTful API operations on the data plane. | ||||
API key management | ||||
Data encryption in transit and at rest | ||||
Backup and restore Supports backup at both the cluster and collection levels, with options for manual and automatic backups. | ||||
IP address access control | ||||
Private networking Private connection between your VPC and Zilliz Cloud VPC. | ||||
SOC 2 Type II and ISO/ICE 27001 compliant, GDPR and HIPPA ready |
Observability | ||||
---|---|---|---|---|
Fine-grained metrics with real-time monitoring dashboards Metrics to monitor performance, storage, usage, data statistics, etc. | ||||
Alerts Seamless integration with various alerting channels including emails, PagerDuty, Slack, Opsgenie, Lark, Webhook, etc. | ||||
Alerting and monitoring integrations Monitoring API and integrations with Prometheus and Datadog. | ||||
Job Center A centralized job center page to track the progress of tasks including migration, data import, backup and restore, clone collection, and create sample collection, etc. |
Role-based Access Control | ||||
---|---|---|---|---|
Organization and project management | 1 organization 1 project | 1 organization Up to 10 projects | 1 organization Up to 10 projects | 1 organization Up to 10 projects |
Organization and project RBAC Role-based access control at both organization and project levels. | ||||
Data plane RBAC Data layer RBAC enables precise permission control over collections, partitions, and operations, enhancing security and operational alignment. |
Integrations and Tools | ||||
---|---|---|---|---|
Intuitive RESTful APIs for control and data plane operations | ||||
User-friendly SDKs in multiple programming languages | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs | Python, Java, Go, and Node.js SDKs |
VectorDBBench An open-source benchmarking tool for mainstream vector databases. It is also a tool for ultimate performance and cost-effectiveness comparison. |
Support | ||||
---|---|---|---|---|
Community support | ||||
Email support | ||||
Response time SLAs | ||||
Urgent | 4 hours | 4 hours | 1 hour | |
High | 1 business day | 1 business day | 4 hours | |
Normal | 2 business days | 2 business days | 1 business day | |
Technical contacts | Up to 1 | Up to 1 | Up to 4 |
Pipelines | ||||
---|---|---|---|---|
Data source connectors Both batch and streaming data source are supported. | ||||
Ingestion pipelines Ingestion pipelines transform your unstructured data to a searchable vector collection, streamlining doc parsing, chunking, embedding, vector indexing processes. | ||||
Search pipelines Search on text and image and supports advanced retrieval strategies such as dense and sparse embedding, multi-stage retrieval and reranking. |
Pipelines
Streamline embedding, vector ingestion, search and reranking in one API service. Pipelines support data types including file, text and image, with open-source and 3rd party models. Skip the hassle of DevOps while enjoying the pay-as-you-go pricing.
Estimate Your Cost
Use this calculator to understand how Zilliz Cloud charges.
Cloud Provider
Cloud Region
Pricing Plan
StandardSuitable for applications in POC or dev with customizable workload management. Advanced configuration options.
CU Type
Performance-optimizedFor scenarios requiring low latency and high throughput. Each CU can handle about 1.5 million 768-dim vectors.
Number of Entities
Entities are data records sharing the same set of fields. An entity contains vector fields and scalar fields.
Vector Dimensions
Our pricing calculator is built around vectors. If your dataset contains non-vector fields (such as int, string, json, etc.), you can click "+ Scalar Field" and provide the information to get a more precise cost estimation.
For more information about CU sizes, see docs.
- CU Cost( CU)
CU Usage0 CU
Unit Price$0/CU
$/mo.
- Storage Cost( GB)
Storage Usage0 GB
Unit Price$0/GB
$/mo.
Total Cost (Excl. Tax)
$/mo.
Acknowledgement: Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on a variety of factors, including your actual usage of services.
Frequently Asked Questions
What is a Compute Unit (CU)?
What is a vCU?
Which type of CU should I pick?
How many CUs do I need for a given collection?
How can I get Zilliz Cloud discounts?
How can I request a new cloud region?