Adaptation of Image Models to Video


Jan 11, 2026 · 1 min read

  • computer_vision
  • video
  • world-models

Adaptation in between the backbone layers:

  • Multi-View Foundation Models
  • Exploring Temporally-Aware Features for Point Tracking (technically not video, but related)

Adaptation after the backbone layers:

  • Advancing Video Self-Supervised Learning via Image Foundation Models
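The two placements listed above can be sketched with a toy example. This is only an illustration under loud assumptions: the "backbone" is a stack of random per-frame layers rather than a real image model, and `temporal_mix` is a hypothetical parameter-free stand-in for whatever learned temporal module (for example temporal attention) a given paper uses. All names and sizes here are made up for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)
T, N, D = 4, 6, 8  # frames, tokens per frame, feature dim (toy sizes)

# Stand-in for a frozen image backbone: a stack of per-frame layers.
# Hypothetical random weights, not a real pretrained checkpoint.
backbone = [rng.standard_normal((D, D)) / np.sqrt(D) for _ in range(3)]


def spatial_layer(x, w):
    """Apply one image layer to every frame independently: (T, N, D) -> (T, N, D)."""
    return np.tanh(x @ w)


def temporal_mix(x):
    """Toy temporal module: add each token's mean over time as a residual.

    A real adapter would be learned; this only shows where it sits.
    """
    return x + x.mean(axis=0, keepdims=True)


def adapt_in_between(video):
    """Temporal module interleaved with the backbone layers."""
    x = video
    for w in backbone:
        x = spatial_layer(x, w)
        x = temporal_mix(x)
    return x


def adapt_after(video):
    """Backbone runs per frame; the temporal module only sees its final features."""
    x = video
    for w in backbone:
        x = spatial_layer(x, w)
    return temporal_mix(x)


video = rng.standard_normal((T, N, D))
feats_between = adapt_in_between(video)
feats_after = adapt_after(video)
```

Both variants return features of the same shape; the difference is whether intermediate image features already carry temporal context or whether time is only modeled on top of per-frame outputs.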

DINOv2/DINOv3 features do very well on video tasks (video classification, dense forecasting, intuitive physics).

From DINOv3.

From Back to the Features - DINO as a Foundation for Video World Models

