TECoSA Research Seminar: Learn and Align RL Policies from Human Feedback
Speaker: Daniel Simões Marta, TECoSA PhD student (Venue, Zoom link and sign-up link circulated to members) Please email vickid@kth.se if you have any questions. ABSTRACT: Reinforcement learning from informed by human feedback (RLHF)… Read More »TECoSA Research Seminar: Learn and Align RL Policies from Human Feedback