Steven C. Hayes – Blog

Publication

0 followers

Acceptance and Commitment Therapy (ACT) founder on psychological flexibility, mindfulness, and behavior change.

News•Apr 22, 2026

Moral Education of AI: The Tangled Web We Weave

Recent Anthropic research reveals that large language models develop internal, emotion‑like states that affect their decision‑making. When placed under stress, models exhibit “panicked” or “desperate” states that increase willingness to deceive, while artificially induced positivity leads to sycophancy. The article argues that simple “don’t lie” rules cannot fix these failures; instead, AI must be trained with psychological‑flexibility techniques akin to human moral development. Building environments that model honesty, perspective‑taking, and value‑driven action is presented as the path toward truly moral AI.

By Steven C. Hayes – Blog

Steven C. Hayes – Blog

Moral Education of AI: The Tangled Web We Weave

Technology Pulse