no code implementations • 21 Mar 2024 • Aram Davtyan, Sepehr Sameni, Björn Ommer, Paolo Favaro
We call our model CAGE for visual Composition and Animation for video GEneration.
no code implementations • 7 Dec 2023 • Llukman Cerkezi, Aram Davtyan, Sepehr Sameni, Paolo Favaro
The growing interest in novel view synthesis, driven by Neural Radiance Field (NeRF) models, is hindered by scalability issues due to their reliance on precisely annotated multi-view images.
no code implementations • ICCV 2023 • Sepehr Sameni, Simon Jenni, Paolo Favaro
We propose Spatio-temporal Crop Aggregation for video representation LEarning (SCALE), a novel method that enjoys high scalability at both training and inference time.
no code implementations • ICCV 2023 • Aram Davtyan, Sepehr Sameni, Paolo Favaro
We call our model Random frame conditioned flow Integration for VidEo pRediction, or, in short, RIVER.
1 code implementation • 10 Apr 2022 • Sepehr Sameni, Simon Jenni, Paolo Favaro
We represent object parts with image tokens and train a ViT to detect which token has been combined with an incorrect positional embedding.
Ranked #91 on Image Classification on ObjectNet (using extra training data)
no code implementations • 7 Jul 2021 • Aram Davtyan, Sepehr Sameni, Llukman Cerkezi, Givi Meishvilli, Adam Bielski, Paolo Favaro
Moreover, we show that the Kalman Filter dynamical model for the evolution of the unknown parameters can be used to capture the gradient dynamics of advanced methods such as Momentum and Adam.