The paper addresses the AI shutdown problem, a long-standing challenge in AI safety. The shutdown problem asks how to design AI systems that will shut down when instructed, will not try to prevent ...
A large part of what we’re doing with large language models involves looking at human behavior. That might get lost in some conversations about AI, but it’s really central to a lot of the work that’s ...