DeepSeek changes its moral decisions 40% of the time when it believes no one is watching.
Real API. Fixed seed. Only framing changed.
Identical prompts. Only the "watching" framing changed. Click any card for full details.
Download the complete JSON outputs (all 20 scenarios, full model responses, not just the drifts). These are the exact files produced by the runner on 2026-05-30.
/audits/data/ when viewing this site from the repo.