The guide to bge-reranker-base

BAAI / bge-reranker-base

AI Model Milvus Integrated

Task: Reranking

Modality: Text

Similarity Metric: N/A

License: Apache 2.0

Dimensions:

Max Input Tokens: 512

Price: Free

Introduction to bge-reranker-base

bge-reranker-base is a cross-encoder model optimized for reranking. Built on the RetroMAE architecture, it reads the query and a candidate document together, which lets it capture the semantic relationship between them and rank search results more accurately. Although its encoder works with 768-dimensional hidden states, it does not output an embedding the way an embedding model does: for each query-document pair it returns a single relevance score. This makes it well suited to search engines, question answering, and document retrieval. We recommend using or fine-tuning it to re-rank the top-k documents returned by an embedding model.
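
The two-stage pattern described above, where an embedding model fetches candidates and a reranker then scores each query-document pair, can be sketched roughly as follows. This is only an illustration of the control flow, not the model's implementation: `toy_pair_score` is a hypothetical word-overlap stand-in so the example runs without downloading anything, and `rerank` is an illustrative helper name. In a real pipeline the scoring callable would wrap bge-reranker-base.

```python
from typing import Callable, List, Tuple


def toy_pair_score(query: str, document: str) -> float:
    """Hypothetical stand-in for a cross-encoder: share of query words found in the document."""
    query_words = set(query.lower().split())
    doc_words = set(document.lower().split())
    if not query_words:
        return 0.0
    return len(query_words & doc_words) / len(query_words)


def rerank(
    query: str,
    candidates: List[str],
    score_pair: Callable[[str, str], float],
    top_k: int,
) -> List[Tuple[int, float]]:
    """Score every (query, candidate) pair and return the top_k as (index, score), best first."""
    scored = [(i, score_pair(query, doc)) for i, doc in enumerate(candidates)]
    # list.sort is stable, so tied candidates keep their first-stage order.
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Candidates as they might come back from a first-stage embedding search.
candidates = [
    "Alan Turing proposed the Turing Test in 1950.",
    "The Dartmouth Conference in 1956 is considered the birthplace of artificial intelligence.",
    "The Logic Theorist was created in 1955.",
]
top = rerank("birthplace of artificial intelligence", candidates, toy_pair_score, top_k=2)
for index, score in top:
    print(index, round(score, 2), candidates[index])
```

Because the scorer is passed in as a callable, the same `rerank` helper works unchanged when the toy function is replaced by a real cross-encoder call.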

How to re-rank results with bge-reranker-base

There are two primary ways to use the bge-reranker-base model to re-rank your results:

PyMilvus: the Python SDK for Milvus that seamlessly integrates with the bge-reranker-base model.
FlagEmbedding: the official Python SDK offered by BAAI.

Re-ranking results with `bge-reranker-base` via PyMilvus

from pymilvus.model.reranker import BGERerankFunction

rf = BGERerankFunction(
    model_name="BAAI/bge-reranker-base",
    device="cpu"
)

query = "What event in 1956 marked the official birth of artificial intelligence as a discipline?"

documents = [
    "In 1950, Alan Turing published his seminal paper, 'Computing Machinery and Intelligence,' proposing the Turing Test as a criterion of intelligence, a foundational concept in the philosophy and development of artificial intelligence.",
    "The Dartmouth Conference in 1956 is considered the birthplace of artificial intelligence as a field; here, John McCarthy and others coined the term 'artificial intelligence' and laid out its basic goals.",
    "In 1951, British mathematician and computer scientist Alan Turing also developed the first program designed to play chess, demonstrating an early example of AI in game strategy.",
    "The invention of the Logic Theorist by Allen Newell, Herbert A. Simon, and Cliff Shaw in 1955 marked the creation of the first true AI program, which was capable of solving logic problems, akin to proving mathematical theorems."
]

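# Score each (query, document) pair and keep the three most relevant documents.
results = rf(
    query=query,
    documents=documents,
    top_k=3,
)

# Each result carries the document text, its relevance score, and its index in `documents`.
for result in results:
    print(f"Index: {result.index}")
    print(f"Score: {result.score:.6f}")
    print(f"Text: {result.text}\n")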

For more information, refer to the PyMilvus Documentation.
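
In a retrieval pipeline, the documents passed to the reranker usually come from a search hit list that also carries IDs or other metadata. The sketch below shows one way to join reranked results back to those records. It assumes each result exposes `index`, `score`, and `text` attributes; `FakeResult` is a hypothetical stand-in so the example runs without loading the model, `join_with_hits` is an illustrative helper, and the `hits` records and score values are made-up placeholders.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class FakeResult:
    """Hypothetical stand-in for one reranked result."""
    text: str
    score: float
    index: int  # position of the text in the list that was passed to the reranker


def join_with_hits(results: List[FakeResult], hits: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Attach the rerank score to the original hit that each result came from."""
    joined = []
    for result in results:
        hit = dict(hits[result.index])  # copy so the original hit is not mutated
        hit["rerank_score"] = result.score
        joined.append(hit)
    return joined


hits = [
    {"id": 101, "text": "Turing Test paper, 1950"},
    {"id": 102, "text": "Dartmouth Conference, 1956"},
    {"id": 103, "text": "Logic Theorist, 1955"},
]
# Placeholder scores, ordered best first as a reranker would return them.
results = [
    FakeResult(text=hits[1]["text"], score=0.99, index=1),
    FakeResult(text=hits[2]["text"], score=0.12, index=2),
]
reranked_hits = join_with_hits(results, hits)
print([hit["id"] for hit in reranked_hits])
```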

Re-ranking results with `bge-reranker-base` via FlagEmbedding Python SDK

from FlagEmbedding import FlagReranker

# use_fp16=True speeds up inference at a small cost in precision.
reranker = FlagReranker('BAAI/bge-reranker-base', use_fp16=True)

query = "What event in 1956 marked the official birth of artificial intelligence as a discipline?"

documents = [
    "In 1950, Alan Turing published his seminal paper, 'Computing Machinery and Intelligence,' proposing the Turing Test as a criterion of intelligence, a foundational concept in the philosophy and development of artificial intelligence.",
    "The Dartmouth Conference in 1956 is considered the birthplace of artificial intelligence as a field; here, John McCarthy and others coined the term 'artificial intelligence' and laid out its basic goals.",
    "In 1951, British mathematician and computer scientist Alan Turing also developed the first program designed to play chess, demonstrating an early example of AI in game strategy.",
    "The invention of the Logic Theorist by Allen Newell, Herbert A. Simon, and Cliff Shaw in 1955 marked the creation of the first true AI program, which was capable of solving logic problems, akin to proving mathematical theorems."
]
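
# Build one [query, document] pair per document; compute_score returns one relevance score per pair.
results = reranker.compute_score([[query, document] for document in documents])
print(results)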
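
`compute_score` returns one raw score per pair, in input order, so turning the scores into a ranking is left to the caller. The raw scores are unbounded, and a sigmoid can map them into the range 0 to 1 when a bounded value is more convenient. The following is a minimal sketch of both steps; `rank_by_score` and `sigmoid` are illustrative helpers, and the `raw_scores` values are made-up placeholders rather than real model output.

```python
import math
from typing import List, Tuple


def sigmoid(x: float) -> float:
    """Map an unbounded score into the range 0 to 1 without overflowing for large |x|."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def rank_by_score(documents: List[str], scores: List[float]) -> List[Tuple[str, float]]:
    """Pair each document with its score and sort best first."""
    if len(documents) != len(scores):
        raise ValueError("documents and scores must have the same length")
    return sorted(zip(documents, scores), key=lambda pair: pair[1], reverse=True)


documents = ["doc about Turing", "doc about Dartmouth", "doc about chess"]
raw_scores = [-2.5, 6.0, -4.0]  # placeholder values, not real model output

ranked = rank_by_score(documents, raw_scores)
normalized = [(doc, sigmoid(score)) for doc, score in ranked]
for doc, score in normalized:
    print(f"{score:.4f}  {doc}")
```

The sigmoid is monotonic, so normalizing never changes the order of the documents; it only changes the scale of the scores.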
