1 code implementation • WMT (EMNLP) 2020 • Farhan Dhanani, Muhammad Rafi
With the introduction of Recurrent Attention, it allows the decoder to focus effectively on order of the source sequence at different decoding steps.
1 code implementation • 20 Nov 2022 • Farhan Dhanani, Muhammad Rafi
And then perform one-to-many pairing between them based on similarities in their generated responses, where more than one fine-tuned model can pair with a base model but not vice-versa.