
Reinforcement Learning from Human Feedback with Hugging Face
Registration
Registration is now closed (this event already took place).
Details
Speakers
Costa Huang
Machine learning engineer
Hugging Face
Costa Huang is a machine learning engineer at Hugging Face, specializing in Reinforcement Learning from Human Feedback (RLHF). He holds a Ph.D. from Drexel University, where his research focused on efficient and reproducible reinforcement learning. He is also the creator of CleanRL, a user-friendly RL library designed for researchers.
Hosted By
Co-hosted with: GradFUTURES