
Google’s Gemini 3.5 Flash Dominates Benchmarks, Boosts Coding
Genuinely impressive release by Google today (remember when they were behind?) Gemini 3.5 Flash perf: * Building on prior strengths (83.6% of MMMU-Pro for multimodal), * big jump on agentic coding (76.2% on Terminal-Bench for agentic coding and 56.5% on Toolathon for real world tasks) * progress and expert tasks (57.9% on Finance Agent 2... we are cooked) * leading scores across SWE-Bench, OSWorld etc. (also, elegant to bold the top scores in the chart below even if when it's not Google leading) Ofc, just benchmarks, and also not cheap (~$9/M output), but Google is cookin'... we are all so spoiled to have the 3 labs compete
AI Agent Pricing Shifts Toward Seat‑Based Features
The more I think about AI agents, the less obvious it is that pricing goes purely consumption-based Token costs matter... but enterprise agents may need identities, roles, auth, budgets, audit logs etc That sounds oddly seat-like? just not human-seat-like
AI Safety Requires Global Collaboration, Not Just Scale
Deeply thoughtful conversation with @zicokolter, board member at @OpenAI and head of the machine learning department at @CarnegieMellon, about AI safety, AI security, agents and frontier AI 00:00 Intro 01:32 OpenAI board role and Safety & Security Committee 03:53 How OpenAI reviews...
AI Shifts From Writing Code to Autonomous Maintenance
.@RampLabs (the AI unit of @tryramp) has been *cooking* with agentic innovation Here's @a_levitator discussing and demo'ing code self-maintaining software and the concept of AI software factories #DataDrivenNYC ______________ 00:04 - Intro 01:11 - The shift from writing code to code maintenance 01:59 - Introducing...
Europe Seeks AI Independence Amid US Platform Dominance
Combining Cohere and Aleph Alpha is such an interesting moment. Spending a lot of time outside the US, hard to overstate how much reluctance there is to outsourcing intelligence to a handful of US platforms, given current geopolitical chaos.
NYC AI Night: Data-Driven Meetups with RampLabs & Estuary
AI folks in NYC -- Data Driven NYC (#121) this Tuesday at 6pm. Come meet fellow AI builders and our speakers: * @RampLabs has been cooking lately with a lot of agentic innovation; Alex Levinson will demo * @EstuaryDev provides unified...

AI Customer Service Is Already Solved, Too Many Vendors
"There are some many vendors in AI for customer service, it's basically a solved problem" https://t.co/Kd8GLBAFPL
AI Agents Become Local Co‑workers, Reshaping Software Development
Claude Cowork, Mythos, and the Future of Software: my conversation with @felixrieseberg, who leads Cowork at @AnthropicAI 00:00 Intro 01:53 Claude Mythos Preview and the “step-function change” 06:16 Why Anthropic is treating Mythos differently 11:19 The real story behind Claude Cowork’s “10-day” build 12:42...
Crypto Winter Mutes Even Carreyrou's Satoshi Claim
You know it's a deep crypto winter when John Carreyrou (of Theranos investigation fame) thinks he's figured out who Satoshi is and no one on my timeline cares
Iranian Pilot Rescue Poised for Netflix Series
“The pilot rescue in Iran will make for a great Netflix show one day” AI: here you go
Traditional Marketing Fails in the AI-Driven Era
if you're running traditional marketing playbooks in an age where OpenAI acquires TBPN, you're ngmi. The game has completely changed

VCs Gone From Metrics to Geopolitics Overreach
VCs used to have opinions on startup metrics Then we became experts in epidemiology Then we mastered central bank interest rates policy discussions Today we will educate you in all things Straight of Hormuz, oil prices and middle eastern naval strategies...
A New Mathematical Renaissance Fuels Scientific Breakthroughs
"We are on the threshold of a mathematical renaissance and massive scientific discoveries" - @CarinaLHong, CEO of @axiommathai, which just announced their $200M Series A Full episode 👇 https://t.co/26VUb0Cuqx

Podcasters Aren’t All Alike – Celebrate Differences
Podcasters: we are not all the same Also: (@MatthewBerman and I keeping our man @dylan522p on message 🤝) https://t.co/o6kr36PccQ

Series A Startup Hosts Premier AI Infra ConferenceSeries A Startup Hosts Premier AI Infra Conference
It’s sort of ridiculous that a Series A company would be able to pull off a major conference with some of the very best speakers in AI infra at the Chase Center… but that’s exactly what @daytonaio did today. ...