DINOv2: Learning Robust Visual Features without Supervision

Abstract
The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision. These models could greatly simplify the use of images in any system by producing all-purpose visual features, i.e., features that work across image distributions and tasks without finetuning. This work shows that existing pr...
Keywords
Computer science
Pipeline (software)
Artificial intelligence
Machine learning
Scale (ratio)
Training set
Image (mathematics)
Physics
Quantum mechanics
Programming language
Sustainable Development Goals (SDG)
Quality education

