The best way to use AI is as an interface to information that lets you deepen and improve your own knowledge and mental models. The worst way to use AI is as a crutch to outsource and forsake your own cognition.
Sufficiently advanced agentic coding is essentially machine learning: the engineer sets up the optimization goal as well as some constraints on the search space (the spec and its tests), then an optimization process (coding agents) iterates until the goal is...
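A toy sketch of that framing, not a real agent: the tests act as the loss, a fixed pool of candidate functions stands in for the search space, and a loop proposes candidates until every test passes. All names here (`TESTS`, `CANDIDATES`, `loss`, `optimize`) are hypothetical and exist only for illustration. A real coding agent generates and edits programs rather than picking from a list.

```python
import random
from typing import Callable, Dict, Optional, Tuple

# The "spec": input/output pairs that an accepted solution must satisfy.
TESTS = [(0, 0), (1, 1), (2, 4), (3, 9)]

# Hypothetical search space: a stand-in for the programs an agent might propose.
CANDIDATES: Dict[str, Callable[[int], int]] = {
    "identity": lambda x: x,
    "double": lambda x: 2 * x,
    "square": lambda x: x * x,
    "cube": lambda x: x ** 3,
}


def loss(fn: Callable[[int], int]) -> int:
    """Number of failing tests; zero means the spec is satisfied."""
    return sum(1 for x, expected in TESTS if fn(x) != expected)


def optimize(max_steps: int = 10, seed: int = 0) -> Tuple[Optional[str], int]:
    """Try candidates in a shuffled order until the loss reaches zero."""
    rng = random.Random(seed)
    names = list(CANDIDATES)
    rng.shuffle(names)

    best_name: Optional[str] = None
    best_loss = len(TESTS) + 1
    for name in names[:max_steps]:
        current = loss(CANDIDATES[name])
        if current < best_loss:
            best_name, best_loss = name, current
        if best_loss == 0:
            break  # goal reached: every test passes
    return best_name, best_loss


if __name__ == "__main__":
    print(optimize())
```

The engineer's leverage in this picture sits in `TESTS` and `CANDIDATES`: a weak spec lets a wrong candidate reach zero loss, which is the coding analogue of overfitting to the objective.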
If you're a researcher in academia using Keras 3 (PhD student, postdoc, professor...) and you want to train on TPUs, you could receive compute awards from Google for your research. Google is running a new academic grant program, separate from...
The 6th gen platform reportedly costs ~$70k per vehicle ($50k base + $20k custom fit & sensors). The cost has room to fall by 50% in the next 2 years (>$30k vehicle + $10k sensors). Waymo currently does over 500,000 driverless...
There are two categories of people: those who quickly figure out that chatbots give you the answer you expect when you ask questions in a biased way, and the ascended polymaths currently out-thinking every expert on Earth
One of the first signs of superhuman AGI will be the emergence of a quantitative trading firm with impossible returns
Short term, this is still the best reference point. Long term, past references might become much less useful.
Interesting finding on frontier model performance on ARC -- due to extensive direct targeting of the benchmark, models are overfitting to the original ARC encoding format. Frontier model performance remains largely tied to a familiar input distribution.
Right now it's still taking me more time to generate medium-complexity diagrams by describing them to Nano Banana than by drawing them manually in Google Slides...
I don't think the rise of AGI will lead to a sudden exponential explosion in AI capabilities. There are bottlenecks on the sources of new capability improvements, and horizontally scaling intelligence in silicon (even by a massive factor) doesn't lift...
Reaching AGI won't be beating a benchmark. It will be the end of the human-AI gap. Benchmarks are simply a way to estimate the current gap, which is why we need to continually release new benchmarks (focused on the remaining...
A good canary in the coal mine for AI-caused job loss will be call centers. We're currently projecting ~2.75M call center jobs in the US in 2026. In 2016 it was ~2.63M. The global call center market size has grown...
Lots of folks spread false narratives about how ARC-1 was created in response to LLMs, or how ARC-2 was only created because ARC-1 was saturated. Setting the record straight: 1. ARC-1 was designed 2017-2019 and released in 2019 (pre-LLMs). 2. The...