A descriptor in computer vision is a mathematical representation of a visual feature extracted from an image. Descriptors are used to encode the important information about an object or scene in a compact, numerical format that can be easily compared across different images. The purpose of descriptors is to make image matching or recognition more efficient. For example, when performing image matching, descriptors help compare key points or features in different images to find similarities. One popular type of descriptor is the SIFT (Scale-Invariant Feature Transform) descriptor, which captures information about key points in an image such as edges, corners, and textures. Another commonly used descriptor is the ORB (Oriented FAST and Rotated BRIEF), which is efficient and suitable for real-time applications. Descriptors allow algorithms to match objects in images regardless of variations in scale, rotation, or lighting conditions. They are fundamental in tasks like object recognition, image stitching, and 3D reconstruction. Overall, descriptors play a critical role in enabling machines to understand and process visual data by providing a structured representation of visual features that can be used for comparisons, recognition, and tracking.
What is descriptor in computer vision?

- AI & Machine Learning
- Optimizing Your RAG Applications: Strategies and Methods
- Getting Started with Zilliz Cloud
- Vector Database 101: Everything You Need to Know
- Natural Language Processing (NLP) Basics
- All learn series →
Recommended AI Learn Series
VectorDB for GenAI Apps
Zilliz Cloud is a managed vector database perfect for building GenAI applications.
Try Zilliz Cloud for FreeKeep Reading
What is the difference between exact and approximate vector search?
Exact vector search finds the true nearest neighbors of a query vector by exhaustively comparing it to every vector in t
What is a hallucination in the context of RAG, and how does it differ from a simple error or omission in the answer?
In the context of Retrieval-Augmented Generation (RAG), a hallucination occurs when the model generates information that
What preprocessing steps are essential for processing user audio queries?
When processing user audio queries, several essential preprocessing steps help ensure that the audio data is usable and