Second Brain

❯

000 Zettelkasten

❯

Adaptation of Image Models to Video

Adaptation of Image Models to Video

Jan 11, 20261 min read

computer_vision
video
world-models

in between the backbone layers

Multi-View Foundation Models
Exploring Temporally-Aware Features for Point Tracking (technically not video, but related)

after the backbone layers

Advancing Video Self-Supervised Learning via Image Foundation Models

DINOv2/DINOv3 features does incredibly well on video tasks (video classification, dense forecasting, intuitive physics).

From DINOv3.

From Back to the Features - DINO as a Foundation for Video World Models

Graph View

Created with Quartz v4.5.2 © 2026