I now constantly get questions about the SAAS meltdown, role of AI, system of records etc. I don't have an answer to all these. But I do know that we saw an acceleration in our business in Q2, Q3, and now finished the year with accelerating Q4. The question is, why? Short answer: AI. But the underlying reason is subtle. We are growing fast because we are finally removing the biggest bottleneck in data: the technical barrier to entry. For years, if you didn’t know SQL, Python, you were locked out of the value chain. That has changed fundamentally with the 𝐆𝐞𝐧𝐢𝐞 𝐟𝐚𝐦𝐢𝐥𝐲, and it is the "secret sauce" behind our recent momentum: • 𝐆𝐞𝐧𝐢𝐞: Analysts can query data without any SQL. I use this every day myself. • 𝐃𝐚𝐭𝐚 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐆𝐞𝐧𝐢𝐞: Builds end-to-end AI models for you, similar to Cursor for ML on your data. • 𝐃𝐚𝐭𝐚 𝐄𝐧𝐠𝐢𝐧𝐞𝐞𝐫 𝐆𝐞𝐧𝐢𝐞: Write Spark pipelines, does plumbing, troubleshooting. We've been talking about DATA + AI democratization, but generative AI finally enabled it in a way that wasn't possible before. That's why we're seeing a market response. Take 𝐋𝐚𝐤𝐞𝐛𝐚𝐬𝐞 𝐏𝐨𝐬𝐭𝐠𝐫𝐞𝐬. We launched this serverless engine for agents and apps recently. At 8 months into its journey, its revenue is already 2x what our Data Warehouse product was at the same stage. All this taken together, we ended up with the following stats for Q4: 🚀 $5.4B Revenue Run-Rate, growing >65% YoY 🚀 $1.4B AI Revenue Run-Rate 🚀 FCF Positive for the year 🚀 NRR >>140% https://t.co/yq3riYyr8r
Checkout this video of how to use Agent Skills in Databricks. 👩🏽🔬 Gavita does a great job in this video.
Yes, since we added "External Orchestration" to Lakeflow more and more ppl use it to orchestrate all their estate, not just Databricks jobs/Spark etc, but also EC2 machines, other platforms (e.g. Snowflake), scripts, etc.
This post explains why we've seen massive migration from Airflow to Lakeflow in Databricks over the last 3 years. We started 3 years ago to focus on these workloads because reliability is the most important thing when you're running data...