Why It Matters
Without a clear Step 2, businesses risk investing in unproven AI solutions, while policymakers grapple with how to ensure safe, effective deployment that delivers real economic value.
Key Takeaways
- •AI agents failed most of 480 real‑world tasks in Mercur study
- •Anthropic predicts job impact, but forecasts lack workplace validation
- •Lack of regulation leaves AI rollout in a strategic vacuum
- •Hype‑driven claims can sway markets despite weak evidence
- •Transparent models and new evaluation metrics needed for true transformation
Pulse Analysis
The AI boom today reads like the infamous underpants‑gnomes meme: companies have mastered Step 1—building powerful large‑language models—and tout Step 3—industry‑wide transformation—while Step 2 remains a hazy blank. Activist group Pause AI highlighted this gap with a protest flyer that asked regulators to define the missing middle. Yet, without clear policy or implementation pathways, the promise of a digital super‑mind hangs on speculation, and investors are left navigating hype rather than concrete roadmaps. The uncertainty also fuels activist calls for a pause until Step 2 is defined, echoing broader societal concerns about AI’s unchecked rollout.
Recent empirical work underscores the uncertainty. Anthropic’s labor‑impact model flags managers, architects and media professionals as most vulnerable, yet its forecasts are based on task‑level capabilities rather than real‑world performance. In a separate Mercur study, AI agents from OpenAI, Anthropic and DeepMind were tested on 480 banking, consulting and legal tasks and failed to complete the majority. These mixed signals reveal a gap between laboratory benchmarks and operational effectiveness, cautioning firms against over‑reliance on headline metrics. Furthermore, the study highlighted that even state‑of‑the‑art models struggle with nuanced judgment, a core requirement for many professional services.
Bridging the missing middle will require coordinated regulation, model transparency and new evaluation standards. Policymakers must define accountability frameworks that address bias, safety and economic displacement before AI systems are embedded in critical workflows. Meanwhile, model developers need to open up training data and performance logs so businesses can conduct independent audits. Only with rigorous, real‑world testing and clear governance can the industry move beyond speculative profit promises to deliver measurable productivity gains and sustainable growth. Establishing industry‑wide benchmarks that simulate real‑world conditions will help investors differentiate viable solutions from hype‑driven hype.
The missing step between hype and profit

Comments
Want to join the conversation?
Loading comments...