Metacognitive Intelligence in Human-AI Teams

Santa Fe Institute
Santa Fe InstituteMay 1, 2026

Why It Matters

Metacognitive AI can provide trustworthy, risk‑aware assistance, reducing costly errors and enabling seamless human‑AI teamwork across industries.

Key Takeaways

  • LLMs lack metacognition, limiting reliable confidence assessments in practice.
  • Human teams use metacognitive skills for effective collaboration and error correction.
  • Introducing self‑reflection in AI enables calibrated answers and risk awareness.
  • Goodhart’s law warns against static benchmarks for measuring AI intelligence.
  • Future AI should learn continuously, adapt, and seek clarification.

Summary

The talk by a University of Illinois researcher explores how current large language models (LLMs) differ from humans in metacognitive abilities and what that means for human‑AI collaboration.

He argues that LLMs can generate fluent text but cannot assess their own confidence, request clarification, or weigh risks, because they lack self‑reflection. Goodhart’s law is invoked to explain why static benchmarks quickly become gamed, and he outlines missing capacities such as continual learning, world grounding, and robustness.

He illustrates human group advantage with the “wisdom of crowds” example from Galton’s ox‑weight experiment and quiz‑show teams, showing how metacognitive cues—knowing what you know and what you don’t—drive better collective decisions. He also credits collaborators and prior work on human‑human interaction.

Embedding metacognitive modules in AI could yield calibrated confidence scores, dynamic delegation, and risk‑aware responses, turning AI from a static tool into a collaborative partner. For enterprises, this promises safer decision support, more reliable automation, and a pathway to AI systems that can learn and adapt alongside human teams.

Original Description

Aaron S. Benjamin, University of Illinois Urbana-Champaign
Groups of people make effective decisions in part because they have sophisticated means of exchanging metacognitive information when they work together. Metacognitive information can take the form of confidence appraisals, explanations, or other means of conveying mastery, challenges, and workload. It can also be subtly embedded in one s prosody or nonverbal cues. This information is used by partners to delegate assignments, produce collaborative assessments, and harness the benefits of the group s collective expertise.
Teams consisting of humans and bespoke AI agents are playing an increasingly central role in decision-making, including in critical governmental and military settings. Recent research in my laboratory has worked to (1) identify the specific ways in which metacognitive information is embedded in human communication, (2) develop agents that possess analogous metacognitive capacities and sensibilities, and (3) assess the benefits of metacognitively sophisticated agents on human-agent teamwork.
In this talk, I will review several projects that span this agenda and that illustrate the value of thinking explicitly about metacognitive processes in human-AI interaction. In the first project, we examine human team decision-making in general-knowledge and estimation problems and identify the critical components of metacognitive exchange that make that exchange successful or not. In the second project, we develop and assess algorithms for scaling confidence assessments from neural networks, with an eye towards identifying algorithms that are scalable, broadly applicable across a range of architectures, and exhibit the superior calibration that is the hallmark of human confidence ratings in most circumstances. In the third project, we demonstrate that humans that are paired with agents that supply metacognitive confidence assessments in an estimation task outperform humans that are paired with metacognitive naïve agents. In a "bonus" portion of the talk, I will discuss how AI agents can be designed to accommodate human metacognitive illusions and enhance teamwork by, counterintuitively, compromising the accuracy of their advice.
Human-AI team decision-making is only likely to reach its full capacity when team members have aligned means of expressing and coordinating metacognitive states. To pursue this agenda, researchers will need to develop of novel methods and models for assessing metacognitive capacities for team decision-making.
Learn more, follow us on social media and check out our podcasts:

Comments

Want to join the conversation?

Loading comments...