Aaron Levie

Creator

ceo @box - your business lives in content. unleash it with AI

Social•May 28, 2026

Opus 4.8 Beats 4.7 in Enterprise Report, Legal, Finance

Opus 4.8 is out, and we've been testing it with the Box AI agent on our most complex real-world knowledge worker tasks with enterprise documents. Opus 4.8 is measurably better at the generative and analytical work enterprises care about most, like writing reports, synthesizing data, and reviewing complex enterprise documents across a range of industries. Here are some quick examples of wins vs. Opus 4.7:

* Report drafting: Opus 4.8 outperforms on a majority of report drafting tasks, producing more complete and accurate analytical reports. On an industrial goods reporting task, it scored 87% vs 77% for Opus 4.7; on a consumer products launch evaluation, 90% vs 84%.
* Review and verification: On a legal NDA review task requiring verification of contract terms against compliance criteria, Opus 4.8 catches more relevant clauses and flags more potential issues, with near-perfect consistency across all trials.
* Financial data analysis: On a corporate lending analysis task comparing syndicated vs bilateral loan structures, Opus 4.8 extracts more accurate financial metrics from source documents, leading by nearly 8 percentage points.
* Consumer products launch evaluation: On a task requiring assessment of a product launch across multiple performance dimensions, Opus 4.8 captured evaluation criteria that Opus 4.7 missed, producing a more thorough analysis that covered all required factors rather than just the most obvious ones.
* Legal NDA review: On a task verifying NDA terms against compliance criteria, Opus 4.8 identified more relevant clauses and flagged potential issues that Opus 4.7 missed. Its outputs were also highly predictable, with nearly identical quality across independent runs.
* Public sector grant analysis: When analyzing library grant documentation against eligibility criteria, Opus 4.8 correctly extracted and validated nearly all required data points, catching specific eligibility details that Opus 4.7 overlooked or misinterpreted.

Opus 4.8 will be rolling out shortly to Box customers to deploy in Box AI agents. Learn more here: https://t.co/D3vID1tWWv

By Aaron Levie

Social•May 28, 2026

Enterprise AI Deployment Needs Tenfold More Resources than Expected

Take whatever number of people you thought might be in jobs related to AI deployment in the enterprise and multiply it by 10. Then probably 10 again. A major topic that keeps coming up in talking to CIOs across enterprises...

By Aaron Levie

Social•May 24, 2026

CEOs Must See AI’s Hidden Work, Not Just Magic

CEOs are uniquely prone to AI psychosis because they’re sufficiently distant from the last mile of work that still has to happen to generate most value with AI. So when they play with AI, they see the happy path results, often...

By Aaron Levie

Social•May 19, 2026

Agent Success Hinges on Precise, Well‑structured Data Context

This is true of all agents, not just coding agents. Probably the biggest challenge that most companies run into in their agent strategy is getting agents the right constrained context to work with for a task. Too much information or...

By Aaron Levie

Social•May 12, 2026

AI Agents Shift From Coding to Industry‑Specific Workflows

Agents are quickly moving from coding to the rest of knowledge work. But to do this we need ways of bridging the advanced capabilities of the AI models with the real-life workflows in the enterprise, by industry and line of...

By Aaron Levie

Social•May 11, 2026

AI Automation Engineers Needed for Mission-Critical Agent Deployment

As advanced agents move from coding to the rest of knowledge work, it takes a real amount of work and know-how to get right. You need to ensure agents have the right context and data to work with, wire up systems...

By Aaron Levie

Social•May 9, 2026

Enterprises Must Treat AI Tokens Like Budget Line Items

A common trend emerging in larger enterprises is token budgeting as a major topic. As agents can do more and more long running tasks, and thus take vastly more compute, allocation of tokens across teams becomes a very real thing...

By Aaron Levie

Social•May 2, 2026

AI Doubles Engineer Output, Leveling Biotech Against Tech Giants

If you think AI replaces software engineers, here’s a quick thought experiment. Imagine you’re a life sciences company. 10 years ago you want to invest heavily in lab automation, processing data at scale, and other software. You look at the...

By Aaron Levie

Social•May 2, 2026

Automation’s Limit: Accountability Requires Full Context Understanding

One of the finite limits to automation is the issue of accountability in a workflow in an organization. In theory the agent can do anything, but whatever you have the agent do yourself, you should expect to be accountable for...

By Aaron Levie

Social•May 1, 2026

AI Agents Surge, Driving Demand for System‑of‑record Tools

Atlassian’s results surprised Wall Street, but it shouldn’t be a surprise. The simple heuristic for the future of software is that when there are 100X more agents than people, which parts of software will grow because agents are doing more...

By Aaron Levie

Social•May 1, 2026

AI Empowers Non‑Valley Enterprises to Accelerate and Innovate

When I talk to enterprises outside of Silicon Valley, most of the use-cases they have in mind with AI are to augment and accelerate how they work, simply because of how much more they can do right now. Most companies are...

By Aaron Levie

Social•May 1, 2026

Future Software: Seats Include Built‑In API Usage for Agents

As agents become the biggest users of software, all software has to be available in a headless fashion. Agents won’t be using your UI, they’ll be talking to your APIs. So the question becomes what is the business model of...

By Aaron Levie

Social•Apr 30, 2026

Emerging Internal Agent Engineers Will Transform Business Workflows

Starting to hire and retrain for new agent engineering roles for *internal* functions to help get more powerful agents working well on critical business processes. I expect this type of role to be a very big deal over time at...

By Aaron Levie

Social•Apr 26, 2026

AI Agents Amplify Effort, Demanding Smarter Task Prioritization

There are at least 2 big but subtle factors contributing to the sense of overwork due to agents right now. 1. The leverage on incremental effort has gone up substantially due to AI, and anyone using these tools tends to feel...

By Aaron Levie

Social•Apr 26, 2026

AI Empowers Ambitious Talent to Leapfrog Experience Requirements

Great read. AI lets you get tremendous leverage that wasn’t available before in almost any domain. That means we’re at a unique moment in history where anyone with a high level of ambition and core skills in any area can...

By Aaron Levie
