no code implementations • 21 May 2024 • Andrew Marmon, Grant Schindler, José Lezama, Dan Kondratyuk, Bryan Seybold, Irfan Essa
We extend multimodal transformers to include 3D camera motion as a conditioning signal for the task of video generation.
no code implementations • 2 Dec 2021 • Fan Jiang, Andrew Marmon, Ildebrando De Courten, Marc Rasi, Frank Dellaert
In this paper, we show how to use a deep feature encoding in conjunction with generative densities over the features in a factor-graph based, probabilistic tracking framework.