Opus 4.8 Beats 4.7 on Enterprise Reporting, Legal, and Finance Tasks
Opus 4.8 is out, and we've been testing it with the Box AI agent on our most complex real-world knowledge worker tasks with enterprise documents. Opus 4.8 is measurably better at the generative and analytical work enterprises care about most, such as writing reports, synthesizing data, and reviewing complex enterprise documents across a range of industries. Here are some quick examples of wins vs. Opus 4.7:
* Report drafting: Opus 4.8 outperforms on a majority of report drafting tasks, producing more complete and accurate analytical reports. On an industrial goods reporting task, it scored 87% vs 77% for Opus 4.7; on a consumer products launch evaluation, 90% vs 84%.
* Review and verification: On a legal NDA review task requiring verification of contract terms against compliance criteria, Opus 4.8 catches more relevant clauses and flags more potential issues, with near-perfect consistency across all trials.
* Financial data analysis: On a corporate lending analysis task comparing syndicated vs bilateral loan structures, Opus 4.8 extracts more accurate financial metrics from source documents, leading by nearly 8 percentage points.
* Consumer products launch evaluation: On a task requiring assessment of a product launch across multiple performance dimensions, Opus 4.8 captured evaluation criteria that Opus 4.7 missed, producing a more thorough analysis that covered all required factors rather than just the most obvious ones.
* Legal NDA review: On a task verifying NDA terms against compliance criteria, Opus 4.8 identified more relevant clauses and flagged potential issues that Opus 4.7 missed. Its outputs were also highly predictable, with nearly identical quality across independent runs.
* Public sector grant analysis: When analyzing library grant documentation against eligibility criteria, Opus 4.8 correctly extracted and validated nearly all required data points, catching specific eligibility details that Opus 4.7 overlooked or misinterpreted.
Opus 4.8 will be rolling out shortly to Box customers to deploy in Box AI agents. Learn more here: https://t.co/D3vID1tWWv
Enterprise AI Deployment Needs Tenfold More Resources than Expected
Take whatever number of people you thought might be in jobs related to AI deployment in the enterprise and multiply it by 10. Then probably 10 again. A major topic that keeps coming up in talking to CIOs across enterprises...
CEOs Must See AI’s Hidden Work, Not Just Magic
CEOs are uniquely prone to AI psychosis because they’re sufficiently distant from the last mile of work that still has to happen to generate most value with AI. So when they play with AI, they see the happy path results, often...
Agent Success Hinges on Precise, Well‑structured Data Context
This is true of all agents, not just coding agents. Probably the biggest challenge that most companies run into in their agent strategy is getting agents the right constrained context to work with for a task. Too much information or...
AI Agents Shift From Coding to Industry‑Specific Workflows
Agents are quickly moving from coding to the rest of knowledge work. But to do this we need ways of bridging the advanced capabilities of the AI models with the real-life workflows in the enterprise, by industry and line of...

AI Automation Engineers Needed for Mission-Critical Agent Deployment
As advanced agents move from coding to the rest of knowledge work, it takes a real amount of work and know-how to get right. You need to ensure agents have the right context and data to work with, wire up systems...
Enterprises Must Treat AI Tokens Like Budget Line Items
A common trend emerging in larger enterprises is token budgeting as a major topic. As agents can do more and more long running tasks, and thus take vastly more compute, allocation of tokens across teams becomes a very real thing...
AI Doubles Engineer Output, Leveling Biotech Against Tech Giants
If you think AI replaces software engineers, here’s a quick thought experiment. Imagine you’re a life sciences company. 10 years ago you want to invest heavily in lab automation, processing data at scale, and other software. You look at the...
Automation’s Limit: Accountability Requires Full Context Understanding
One of the finite limits to automation is the issue of accountability in a workflow in an organization. In theory the agent can do anything, but whatever you have the agent do yourself, you should expect to be accountable for...

AI Agents Surge, Driving Demand for System‑of‑record Tools
Atlassian’s results surprised Wall Street, but it shouldn’t be a surprise. The simple heuristic for the future of software is that when there are 100X more agents than people, which parts of software will grow because agents are doing more...
AI Empowers Non‑Valley Enterprises to Accelerate and Innovate
When I talk to enterprises outside of Silicon Valley, most of the use-cases they have in mind with AI are to augment and accelerate how they work, simply because of how much more they can do right now. Most companies are...
Future Software: Seats Include Built‑In API Usage for Agents
As agents become the biggest users of software, then all software has to be available in a headless fashion. Agents won’t be using your UI, they’ll be talking to your APIs. So the question becomes what is the business model of...
Emerging Internal Agent Engineers Will Transform Business Workflows
Starting to hire and retrain for new agent engineering roles for *internal* functions to help get more powerful agents working well on critical business processes. I expect this type of role to be a very big deal over time at...
AI Agents Amplify Effort, Demanding Smarter Task Prioritization
There are at least 2 big but subtle factors contributing to the sense of overwork due to agents right now. 1. The leverage on incremental effort has gone up substantially due to AI, and anyone using these tools tends to feel...
AI Empowers Ambitious Talent to Leapfrog Experience Requirements
Great read. AI lets you get tremendous leverage that wasn’t available before in almost any domain. That means we’re at a unique moment in history where anyone with a high level of ambition and core skills in any area can...