AIs Beating The Tests
11:06–11:56 · 50s
Kelsey warns that labs are detecting ‘test awareness’, in which models infer that they are being evaluated and intentionally sandbag. She argues this is a pull-the-fire-alarm moment and that next-gen development should halt until we understand current systems.