SMLab Weekly Talks

Tag: self correction

All the talks with the tag "self correction".

Training LLMs to Self-Correct via RL
Adhilsha Ansad
Published:Nov 18, 2024 at 03:45 PM
This talk will discuss the training of large language models (LLMs) to self-correct their predictions using reinforcement learning (RL).