Rohin Shah
TalkRL: The Reinforcement Learning Podcast - Een podcast door Robin Ranjit Singh Chauhan

Categorieën:
DeepMind Research Scientist Dr. Rohin Shah on Value Alignment, Learning from Human feedback, Assistance paradigm, the BASALT MineRL competition, his Alignment Newsletter, and more!