Tag: self correction
All the talks with the tag "self correction".
Training LLMs to Self-Correct via RL
Adhilsha AnsadPublished: at 03:45 PMThis talk will discuss the training of large language models (LLMs) to self-correct their predictions using reinforcement learning (RL).