no code implementations • 6 Jan 2024 • Sanjit Kakarla, Danielle Thomas, Jionghao Lin, Shivang Gupta, Kenneth R. Koedinger
By analyzing 50 real-life tutoring dialogues, we find both GPT-3.5-Turbo and GPT-4 demonstrate proficiency in assessing the criteria related to reacting to students making errors.