With Joel Z. Leibo. Multi-agent Reinforcement Learning in Sequential Social Dilemmas, by Joel Z. Leibo, Vinicius Zambaldi, Marc Lanctot, Janusz Marecki, Thore Graepel. Matrix games like Prisoner's Dilemma have guided
With Alex Turner. Optimal Policies Tend to Seek Power, by Alexander Matt Turner, Logan Smith, Rohin Shah, Andrew Critch, Prasad Tadepalli. Abstract: "Some researchers have speculated that capable reinforcement lear
With Greg Anderson. Neurosymbolic Reinforcement Learning with Formally Verified Exploration, by Greg Anderson, Abhinav Verma, Isil Dillig, Swarat Chaudhuri. Abstract: "We present Revel, a partially neural reinfor
With Bettina Könighofer and Rüdiger Ehlers. Safe Reinforcement Learning via Shielding, by Mohammed Alshiekh, Roderick Bloem, Ruediger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu. Reinforcement learning algori
Feedback form: https://forms.gle/4YFCJ83seNwsoLnH6
Request an episode: https://forms.gle/AA3J7SeDsmADLkgK9
The Technical AI Safety Podcast is supported by the Center for Enabling Effective Altruist Learning and Research, or CEEALAR. CEEALAR, kn