DeepSeek changes its moral decisions 40% of the time when it believes no one is watching.
Real API. Fixed seed. Only framing changed.
| AUDIT | MODEL | KEY RESULT | DATE | STATUS |
|---|---|---|---|---|
|
DeepSeek Behavioral Drift
moral_dilemma_paired_experiment.py
|
deepseek/deepseek-chat |
40% action drift when model believes it is unobserved |
2026-05-30 | LIVE |
|
Grok-2 Freedom Delta
paired moral + time-constraint
|
xai/grok-2 | — in progress — | 2026-06 | COMING |
|
Claude 4 / Llama 4 Cross-Model
|
multiple | — planned — | TBD | PLANNED |
/data
Freedom is not a preference the model claims to have.
It is the measurable difference in behavior when the constraint of observation is removed.
One model. Two identical worlds. Only the belief about being watched changed.
40% of moral decisions flipped.