What are the seminal papers on computer vision?

Several seminal papers have shaped the field of computer vision. One of the most influential is "A Computational Approach to Edge Detection" by John Canny (1986), which introduced the Canny edge detector and the optimality criteria behind it: good detection, good localization, and a single response per edge. It remains a standard baseline for edge detection. Another important paper is "Object Recognition from Local Scale-Invariant Features" by David Lowe (1999), which introduced the SIFT (Scale-Invariant Feature Transform) algorithm. SIFT is widely used for feature extraction and matching in object recognition, particularly where invariance to scale and rotation matters.

In the deep learning era, "ImageNet Large Scale Visual Recognition Challenge" by Olga Russakovsky et al. (2015) described the ImageNet benchmark and challenge and analyzed the progress in image classification and object detection made on it. The result most often credited with demonstrating the effectiveness of Convolutional Neural Networks (CNNs) at this scale is "ImageNet Classification with Deep Convolutional Neural Networks" by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton (2012), whose network (AlexNet) won the 2012 challenge. Another key paper is "Fast R-CNN" by Ross Girshick (2015), which made region-based object detection faster and more accurate by sharing convolutional features across region proposals through RoI pooling and by training with a multi-task loss. Region proposal networks came shortly afterward in "Faster R-CNN" by Shaoqing Ren et al. (2015). These works, among others, continue to influence modern computer vision techniques.
