no code implementations • 7 Feb 2024 • Haihui Yang, Xiaojun Quan
Then, we combine the source sentence with the initial correction and feed it through an alignment model for another round of correction, aiming to enforce the alignment model to focus on potential overcorrection.