Hugging Face / all-mpnet-base-v2
Milvus Integrated
Task: Embedding
Modality: Text
Similarity Metric: Any (Normalized)
License: Apache 2.0
Dimensions: 768
Max Input Tokens: 384
Price: Free
Model Overview
all-mpnet-base-v2 is a sentence and short-paragraph encoder that maps input text to a 768-dimensional vector. It is a fine-tuned version of the microsoft/mpnet-base model, trained on a dataset of 1 billion sentence pairs with a contrastive learning objective. It is well suited to tasks such as information retrieval, clustering, and sentence similarity.
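The "Any (Normalized)" similarity metric listed above can be illustrated with a small sketch: when vectors have unit length, inner product, cosine similarity, and L2 distance all produce the same ranking. The toy 3-dimensional vectors and helper names below are illustrative assumptions, not output of the model.

```python
import math

def normalize(v):
    """Scale a vector to unit length (L2 norm of 1)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cosine(a, b):
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))

def l2_distance(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Toy 3-dimensional stand-ins for real 768-dimensional embeddings.
query = normalize([1.0, 2.0, 3.0])
docs = [normalize(v) for v in ([3.0, 2.0, 1.0], [1.0, 2.0, 2.5], [-1.0, 0.5, 0.0])]

# Rank document indices from most to least similar under each metric.
by_dot = sorted(range(len(docs)), key=lambda i: -dot(query, docs[i]))
by_cosine = sorted(range(len(docs)), key=lambda i: -cosine(query, docs[i]))
by_l2 = sorted(range(len(docs)), key=lambda i: l2_distance(query, docs[i]))

print(by_dot, by_cosine, by_l2)  # prints [1, 0, 2] [1, 0, 2] [1, 0, 2]
```

The rankings agree because, for unit vectors, the squared L2 distance equals 2 minus twice the inner product, so a larger inner product always means a smaller distance.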
How to create embeddings using all-mpnet-base-v2
There are two primary ways to generate vector embeddings:
- PyMilvus: the Python SDK for Milvus, which integrates the all-mpnet-base-v2 model.
- SentenceTransformer library: the Python library sentence-transformers.
Once the vector embeddings are created, they can be stored in a vector database like Zilliz Cloud (a fully managed vector database powered by Milvus) and used for semantic similarity search.
Here are four key steps:
- Sign up for a Zilliz Cloud account for free.
- Set up a serverless cluster and obtain the Public Endpoint and API Key.
- Create a vector collection and insert your vector embeddings.
- Run a semantic search on the stored embeddings.
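Steps 2 to 4 above might be sketched with PyMilvus's `MilvusClient` as follows. This is a minimal sketch under assumptions: the collection name `mpnet_demo` and the `text` field are hypothetical, the collection is assumed not to exist yet, and the endpoint and API key must come from your own cluster. The import is done lazily so the helper functions can be read and tested without a live connection.

```python
COLLECTION = "mpnet_demo"  # hypothetical collection name
DIM = 768                  # output size of all-mpnet-base-v2

def connect(endpoint, api_key):
    """Step 2: open a client using the cluster's Public Endpoint and API Key."""
    from pymilvus import MilvusClient  # lazy import; needs `pip install pymilvus`
    return MilvusClient(uri=endpoint, token=api_key)

def store(client, texts, vectors):
    """Step 3: create a collection and insert one row per text."""
    if any(len(v) != DIM for v in vectors):
        raise ValueError(f"expected {DIM}-dimensional vectors")
    # Quick-setup collection with an "id" primary key and a "vector" field.
    client.create_collection(
        collection_name=COLLECTION, dimension=DIM, metric_type="COSINE"
    )
    rows = [
        {"id": i, "vector": list(v), "text": t}  # "text" is an assumed extra field
        for i, (t, v) in enumerate(zip(texts, vectors))
    ]
    client.insert(collection_name=COLLECTION, data=rows)
    return rows

def search(client, query_vector, limit=3):
    """Step 4: return the nearest stored rows for one query vector."""
    return client.search(
        collection_name=COLLECTION,
        data=[list(query_vector)],
        limit=limit,
        output_fields=["text"],
    )
```

With real credentials, `client = connect("<Public Endpoint>", "<API Key>")` followed by `store(client, texts, vectors)` and `search(client, query_vector)` would cover the last two steps; the vectors themselves come from one of the two embedding routes described on this page.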
Create embeddings via PyMilvus and insert them into Zilliz Cloud for semantic search
Coming soon
See the PyMilvus documentation for more details about its integration with all-mpnet-base-v2.
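Pending the official example, the PyMilvus route might look roughly like the sketch below. It assumes the optional model extra is installed (`pip install "pymilvus[model]"`) and uses `SentenceTransformerEmbeddingFunction` from `pymilvus.model.dense`; the helper names `make_embedding_function` and `embed` are hypothetical, and the first call downloads the model weights.

```python
def make_embedding_function():
    """Build the PyMilvus wrapper around all-mpnet-base-v2 (downloads weights on first use)."""
    from pymilvus import model  # lazy import; assumes the `pymilvus[model]` extra is installed
    return model.dense.SentenceTransformerEmbeddingFunction(
        model_name="all-mpnet-base-v2",
        device="cpu",  # assumption: CPU inference; e.g. "cuda:0" for a GPU
    )

def embed(ef, docs, queries):
    """Encode documents and queries, returning plain Python lists of floats."""
    doc_vectors = [[float(x) for x in v] for v in ef.encode_documents(docs)]
    query_vectors = [[float(x) for x in v] for v in ef.encode_queries(queries)]
    return doc_vectors, query_vectors
```

Calling `embed(make_embedding_function(), docs, queries)` should yield one 768-dimensional vector per input string; converting to plain lists keeps the result easy to serialize and insert into a collection.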
Create embeddings via the SentenceTransformer library and insert them into Zilliz Cloud for semantic search
Coming soon
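In the meantime, a minimal sketch of the sentence-transformers route is shown here. It assumes `pip install sentence-transformers` and the Hugging Face model ID `sentence-transformers/all-mpnet-base-v2`; the helper names `load_model` and `embed_texts` are hypothetical.

```python
MODEL_NAME = "sentence-transformers/all-mpnet-base-v2"

def load_model():
    """Load the encoder (downloads the weights on first use)."""
    from sentence_transformers import SentenceTransformer  # lazy import
    return SentenceTransformer(MODEL_NAME)

def embed_texts(encoder, texts):
    """Encode texts into unit-length vectors, returned as plain Python lists."""
    # normalize_embeddings=True asks the library to L2-normalize each vector.
    vectors = encoder.encode(texts, normalize_embeddings=True)
    return [[float(x) for x in v] for v in vectors]
```

For example, `embed_texts(load_model(), ["first sentence", "second sentence"])` should return two 768-dimensional vectors, which can then be inserted into a Zilliz Cloud collection and searched as outlined in the four steps above.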