Second Brain

Tag: multimodal

6 items with this tag.

  • Jul 08, 2025

    NeoBabel - A Multilingual Open Tower for Visual Generation

    • multimodal
    • llm
  • Feb 13, 2024

    PIN - Positional Insert Unlocks Object Localisation Abilities in VLMs

    • multimodal
    • object_localisation
  • Dec 11, 2023

    4M - Massively Multimodal Masked Modeling

    • multimodal
  • Nov 28, 2023

    MobileCLIP - Fast Image-Text Models through Multi-Modal Reinforced Training

    • efficient_dl
    • efficient_vision
    • computer_vision
    • multimodal
  • Oct 24, 2023

    TiC-CLIP - Continual Training of CLIP models

    • continual_learning
    • multimodal
  • Jun 22, 2023

    Learning Unseen Modality Interaction

    • multimodal
    • llm

Created with Quartz v4.5.2 © 2025