Computerphile
AI Sandbagging
2025 • E20 May 22, 2025 12m
Following the theme of AI research and safety, Aric Floyd talks about how some Large Language Models might follow the all too human trait of sandbagging - "lying" about their true capabilities.