1 code implementation • 18 Jan 2024 • Luis Müller, Daniel Kusuma, Blai Bonet, Christopher Morris
Empirically, we demonstrate that the Edge Transformer surpasses other theoretically aligned architectures regarding predictive performance while not relying on positional or structural encodings.