Reinforcement Learning from Human Feedback with Hugging Face
Thu, Mar 7, 2024
4:30 PM – 6 PM EST (GMT-5)
49 registered
Registration
Registration is now closed (this event already took place).
Details
Speakers
Costa Huang
Machine learning engineer
Hugging Face
Costa Huang is a machine learning engineer at Hugging Face, specializing in Reinforcement Learning from Human Feedback (RLHF). He holds a Ph.D. from Drexel University, where his research focused on efficient and reproducible reinforcement learning. He is also the creator of CleanRL, a user-friendly RL library designed for researchers.