3 code implementations • 7 Feb 2023 • Haoran Zhu, Boyuan Chen, Carter Yang
The vision transformer (ViT) is an attention-based neural network architecture that has been shown to be effective for computer vision tasks.
Image Classification