Richard Seroter

Creator

0 followers

Chief Evangelist, Google Cloud; platform engineering, integration, app modernization

Social•May 11, 2026

AI Shifts Engineering Management Focus, Not Playbook

Should tech managers and leaders run the same engineering playbook with AI in the mix? The ideas might be the same, but the focus is probably different. This @InfoQ piece looks at topics like team metrics, skills development, and guardrails. https://t.co/Tt5YEK3x5J

By Richard Seroter

Social•May 11, 2026

Keep Coding Actively, Don’t Let AI Dull Your Skills

If you're not manually coding as much anymore, how do you stay sharp as an engineer? This post says we shouldn't passively consume AI output and atrophy our skills. Rather, actively engage and do things like reading diffs and debugging errors. https://t.co/KETBZiEjVO...

By Richard Seroter

Social•May 8, 2026

Kubernetes 1.36 Launches with 70+ Upgrades, Now on GKE

Kubernetes v1.36 came out a couple of weeks ago and has over 70 enhancements. https://t.co/iMLh6dcHvg Oh, and it's already been available in @googlecloud GKE for a week. https://t.co/HjvEHc0GQh

By Richard Seroter

Social•May 8, 2026

Tech Hiring Spikes to Three-Year High, 271k New Posts

Tech job postings hit 3-year high https://t.co/NUHjpucj2y < "... more than 271,483 new job postings and more than 575,000 total active job postings." Seems like good news as companies grow their tech investment.

By Richard Seroter

Social•May 8, 2026

Will AI Ever Produce Trustworthy, Fully Autonomous Code?

What do you think? Will you ever get to "write-only code" that's never skimmed, reviewed, or tweaked by a human? What would have to be true to trust the output? That's the idea here, and it's happening in some places. https://t.co/xVaMIq7HM7

By Richard Seroter

Social•May 8, 2026

Google Cloud Cuts Cold Starts, Adds Sub‑ms Bigtable Tier

It's apparently "faster performance" Friday at @GoogleCloudTech. With faster node startup for GKE, say goodbye to cold-start latency https://t.co/NU88mzkOPj New Bigtable in-memory tier for sub-millisecond read latency https://t.co/s0GBEMEBQr https://t.co/2JdqpUcLmO

By Richard Seroter

Social•May 8, 2026

Pinecone Moves Beyond RAG to Upstream Knowledge

The half-life of a "best practice" in this industry is like 3 months. RAG? That's so 2Q 2025. Pinecone who got famous as a vector database serving RAG use cases, now pushes knowledge upstream into artifacts used by agents. https://t.co/yOFcvZmsKR

By Richard Seroter

Social•May 8, 2026

AI Disrupts SaaS Freemium, Demands Resilient Front‑Ends and AI‑Powered Coding

Seroter Daily Reading List – May 7, 2026 (#779): Today’s links look at why SaaS freemium playbooks don’t work in AI, how to design front-end systems for cloud failure, and why coding with AI agents is a baseline expectations for tech...

By Richard Seroter

Social•May 7, 2026

Avoid Circular Dependencies: Separate Monitoring From Observed Systems

Your status page or observability system probably shouldn't be using the same system it's supposed to be monitoring. Airbnb found some circular dependencies and made changes to ensure they could do monitoring reliably at scale. https://t.co/rmSAeYTZF2

By Richard Seroter

Social•May 7, 2026

Documenting Today’s Effective Agentic Coding Lessons

These are solid lessons for agentic coding, from @dbreunig. Will these be the *same* lessons in six months? Who knows, but it's good to document what works today ... https://t.co/DzTFXwI2qm

By Richard Seroter

Social•May 7, 2026

Gemini 3.1 Flash‑Lite Launches: Fast, Affordable Premium AI

Gemini 3.1 Flash-Lite is GA and ready to use. It's a fast and cost-effective way to access premium AI. https://t.co/ELGpnllkIv

By Richard Seroter

Social•May 7, 2026

AlphaEvolve Proves AI Can Create Novel, Real‑world Algorithms

"LLMs just parrot back what they've been trained on and don't create anything novel." I hear that, but the @GoogleDeepMind AlphaEvolve agent is designing advanced algorithms that are making an impact in the real world. Powerful update ... https://t.co/EwszAjFzch

By Richard Seroter

Social•May 7, 2026

GKE Pod Snapshots Eliminate Cold Starts for Massive Models

Facing a cold start when your XX billion parameter model is loading up for inference on Kubernetes? You might love the new @googlecloud GKE pod snapshots. All your app state, along with most file system and networking state, get saved and...

By Richard Seroter

Social•May 7, 2026

Prefer REST APIs; Reserve MCP/A2A for Complex Reasoning

Choosing between APIs, MCP, and Agent-to-Agent architectures https://t.co/4TnEq1Dbq6 < @lak_luster says to default to function tools that use REST APIs, and only bring in MCP or A2A when there's more reasoning involved, or dynamic use cases. https://t.co/Am9DfWv1Qy

By Richard Seroter

Social•May 7, 2026

Gate AI Features by Usage, Outcomes, and Compute

Give people a taste of your product and then reveal the paywall to access premium features. SaaS products have used that approach for a while, but it's different with AI. Instead, AI products should gate by usage intensity, outcomes, and compute. https://t.co/V7YSfrBgZo

By Richard Seroter

Social•May 7, 2026

Managers Must Get Hands‑On to Lead AI Transformation

"Therefore, one cannot drive AI transformation without being an active AI practitioner. But what does being an active AI practitioner look like for a manager?" - @dblockdotorg https://t.co/0qeplV10G6 < go hands on as managers. You might rediscover your passion for...

By Richard Seroter

Social•May 6, 2026

AI-Driven TensorFlow-to-JAX Migration Mirrors Deterministic Coding Loop

That post today about how Google did an AI-assisted code migration from TensorFlow to JAX? https://t.co/EcyofYfoPB The Planner-Orchestrator-Coder pattern we applied looks a lot like what @CTOAdvisor calls "deterministic coding in a loop." https://t.co/gkGb07wQnq https://t.co/IMxYmPjS5J

By Richard Seroter

Social•May 6, 2026

Connect Your Agent Gateway to Existing Security Providers

Is your agent gateway connected to your existing security providers? Should be! We're making a big ecosystem bet on the Agent Gateway, which is part of the @googlecloud Gemini Enterprise Agent Platform. https://t.co/8ZDnmwuzmR https://t.co/OlAxPDfC88

By Richard Seroter

Social•May 6, 2026

AI-Native Teams Prioritize Prototypes, Small Squads, Rapid Cycles

What are AI-native engineering orgs doing? ➡️ Prototypes over PRDs ➡️ Small, high-leverage squads ➡️ Faster release cycles to facilitate learning ➡️ Shortened planning horizons https://t.co/VGEvQU0lRk

By Richard Seroter

Social•May 6, 2026

Honest Feedback Loops Beat Fancy Eval Dashboards

"The teams that win the next phase of AI engineering won’t be the ones with the most elaborate eval dashboards; they’ll be the ones with the most honest feedback loops." https://t.co/yx5BRTSHCV < invest in evals, says @mjasay

By Richard Seroter

Social•May 6, 2026

Google Cloud IAM Gets Major New Features and Guardrails

All of a sudden, identity management is a vibrant and exciting space. We made a ton of @googlecloud IAM improvements lately across agent identities, gateways, guardrails, and more. Check out this recap: https://t.co/ShLXPI78Md https://t.co/C1IBJ1ylrN

By Richard Seroter

Social•May 6, 2026

Add Human‑in‑the‑Loop to Pause AI Code Generation

[blog] How to force your custom agent to stop and seek human approval https://t.co/fPu5tjkMQ4 < in which I use the new human-in-the-loop feature of the Agent Development Kit to approve AI-generated code tutorials. https://t.co/0xkrl1CKmj

By Richard Seroter

Social•May 6, 2026

Empathy Enhances Direct Communication, Not Weakness

Can you be direct without being a jerk? Is being "strategic" just code for being soft or manipulative? No. Having empathy doesn't make you weak. It gives you a better chance of actually landing your (direct) message with the recipient. Good...

By Richard Seroter

Social•May 5, 2026

Netflix Routes Millions of Requests to Thousands of Models

Wow. The Netflix ML serving platform serves hundreds of model types/versions, and around 1 million requests per second. How do they route traffic to the right model in such a large-scale service system? Deep dive post: https://t.co/1sU4Jtt5l8 https://t.co/ANPlcxqpSA

By Richard Seroter

Social•May 5, 2026

Start Small, Personalize Fast, Empower Others to Build

Some cool lessons learned from Stripe in this podcast in the @lennysan network. ✅ Don't engineer the platform right away. You can start with what you have around ✅ Quickly personalize with your context ✅ Let more people build to unblock themselves https://t.co/ful1vfbMFG

By Richard Seroter

Social•May 5, 2026

Local LLMs Offer Cost‑Effective Compute and Data Sovereignty

Run LLMs on your own hardware? This scenario didn't get me fired up for a while, but now I get it. Whether offloading token-hungry tasks to self-hosted LLMs where you only pay for compute, or satisfying unique sovereignty needs, local AI...

By Richard Seroter

Social•May 5, 2026

Newsletter as Personal Discipline, Not Just Audience Reach

I have a confession. My daily newsletter of links and commentary (https://t.co/okv3uDONrs) is mostly for me. Oh, I love that plenty of people read it. But my main goal is forcing a reading and publishing discipline. @allen_hutchison is doing it too...

By Richard Seroter

Social•May 5, 2026

Flip the Ratio: Cheap Assessment, Board‑

"The answer probably starts with flipping the ratio: making assessment as cheap as generation, paying for fixes instead of just finds, and treating supply chain security as the board-level priority it has been pretending to be."

By Richard Seroter

Social•May 5, 2026

Uber Solved AI Scaling with Centralized Gateways and Registry

What problems did Uber face when scaling AI? There wasn't a shared way of building, security was inconsistent, opaque visibility into call patterns, and discovery was unmanaged. They added centralized MCP gateways and a registry. More ... https://t.co/pVTbFSPRR9

By Richard Seroter

Social•May 4, 2026

Webhooks Deliver Real-Time Gemini API Operation Completions

"Webhooks allow the Gemini API to push real-time notifications to your server when asynchronous or Long-Running Operations (LROs) complete." That's a BFD.

By Richard Seroter

Social•May 4, 2026

Diffusion-Style Decoding Triples LLM Speed on TPUs

Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-style speculative decoding https://t.co/j1QPASxHKP < some pretty amazing increases in tokens per second

By Richard Seroter

Social•May 4, 2026

Treat Skills as Programs: Architecture Drives Runtime Loading

"Skills are programs, not prompts. How the skills runtime actually loads, and why the architecture is everything." https://t.co/oojCUzACLQ < I liked this post which promoted a thoughtful approach to designing (and testing) skills

By Richard Seroter

Social•May 4, 2026

Startups Favor Flutter Over Native for Faster Scaling

Why Startups Are Choosing Flutter Over Native in 2026: A CTO’s Perspective https://t.co/RCOd4dDYln < fairly strong case here for when to use a cross-platform framework like @FlutterDev and when to go native

By Richard Seroter

Social•May 4, 2026

Git Worktrees: Essential Tool for Modern Development

Do you know what git worktrees are? Do you use them? Does your agent? @kweinmeister explains them well here, and why they're valuable to modern developers ... https://t.co/vbJXFxSgQ2

By Richard Seroter

Social•May 1, 2026

Stop Redundant Map Lookups to Save CPU

You might not notice, but this way you're accessing maps in code is wasting CPU cycles. They add up! Here's quick advice on avoiding redundant lookups ... https://t.co/uOpmIOwM5r

By Richard Seroter

Social•May 1, 2026

Agents Disrupt Database Contracts; Build Defensive Data Layers

Have agents broken the unspoken contract we had with our databases? You know, human-authored apps running deterministic code and predictable queries? @arpit_bhayani wrote a terrific post for how to create a defensively designed data layer ... https://t.co/8OOXrmkD6g

By Richard Seroter

Social•May 1, 2026

Human Infrastructure Matters as Much as AI Tools

"It is impossible to predict at this point just how AI will transform the workplace. However, one thing is certain: understanding and building the right human infrastructure will be as important as picking the right AI tools." https://t.co/sm48Z9f6lH

By Richard Seroter

Social•May 1, 2026

AI Investment Requires Systems, Not Just Tool Licenses

"The most important takeaway is that AI investment is a systems decision, not a tooling decision. Buying licenses without investing in the surrounding platform, data, and governance simply accelerates the rate at which existing dysfunction shows up in production." https://t.co/dng1pr7LZr

By Richard Seroter

Social•Apr 30, 2026

Fully‑managed Remote MCP Servers Accelerate Google Cloud Integration

0-to-50 in record time. We've now got dozens of fully-managed remote MCP servers that let your agents easily interact with your favorite @googlecloud services. Infra, AI, databases, ops, security, docs, Workspace, you name it. https://t.co/u7LZhEnT1B https://t.co/wt2CQODKq4

By Richard Seroter

Social•Apr 30, 2026

AI Productivity Gains Modest Now, Will Surge with New Models

AI productivity gains: More modest than expected https://t.co/xW6Scx6ATH < checks out, so far. But once the agentic operating model really takes hold, team structures change, and better platforms improve the build-to-prod stages, these numbers will jump.

By Richard Seroter

Social•Apr 30, 2026

Rapid Storage Integration with PyTorch Keeps GPUs Busy

"By integrating Rapid Storage, powered by Google’s Colossus storage architecture, directly with PyTorch via the industry-standard fsspec interface, we are enabling researchers and developers to keep their GPUs busier than ever before." https://t.co/T0oMWbNKkc https://t.co/xS9wXRcmyi

By Richard Seroter

Social•Apr 30, 2026

Evaluation Costs Outpace Training for Modern ML Agents

"For neural operators, ML research agents, and replication benchmarks, the ratio has flipped: a credible evaluation can cost more than training the candidate model." https://t.co/BNE4tmQS7q < @huggingface post says static eval costs were tolerable, but agent evals aren't cheap.

By Richard Seroter

Social•Apr 30, 2026

Managing State and Coordination for Long‑Running Agents

Long-running agents pose fresh challenges. Where and how do you persist state? Who does compute coordination and completion verification? @addyosmani wrote a fantastic deep dive that lists the patterns, solutions, and limits. A must read: https://t.co/Gg7apYKJuG https://t.co/XEBSbYnzTe

By Richard Seroter

Social•Apr 29, 2026

Agents Build Agents, Generative UI, 1,000 Real AI Use Cases

Seroter Daily Reading List – April 29, 2026 (#773): Today’s links look at using agents to create agents, what generative UI is actually about, and a thousand real-world AI use cases from actual companies. https://t.co/xbjkX63V0c

By Richard Seroter

Social•Apr 29, 2026

Workday’s Service Empire May Expose Vulnerability

"A $30B company with 10,000+ customers and a services cartel that’s arguably bigger than the product itself will not roll over quietly." https://t.co/eBzsJxBGUV < is Workday finally vulnerable? They've got an enviable footprint and new focus. All platforms need to...

By Richard Seroter

Social•Apr 29, 2026

Putting Payroll in the Field for Lean Operations

"We are going to become an organization that puts its payroll in the field." https://t.co/69FZ6RHusY < The future involves a lean back office. Pay people who do the work, and let agents do the work about work.

By Richard Seroter

Social•Apr 29, 2026

A2UI Shows How Simple Component Schemas Enable Generative UI

I'm somewhat embarrassed to admit that the whole "generative UI" thing hasn't clicked with me. But after reading this post, the lightbulb went off. For something like A2UI, you give the agent the components and a schema, and it does the...

By Richard Seroter

Social•Apr 28, 2026

Winning Agentic AI Requires Clean Data, Rigor, Not Flashy Bots

"If you’re trying to pick winners in agentic AI, don’t look for those with the cleverest agents. Instead, look to the companies with the cleanest data contracts, the best evaluation discipline, the most coherent identity model, and the least tolerance...

By Richard Seroter

Social•Apr 28, 2026

On‑Device AI Thrives with LiteRT and NPU

Building real-world on-device AI with LiteRT and NPU https://t.co/8PbZfxIKiW < you can do some pretty amazing things with on-device AI right now. Here are some real-world examples.

By Richard Seroter

Social•Apr 28, 2026

Clarifying AI Agent Memory: Files, Blocks, or Services?

Where can you store "memory" for an AI agent? This post calls out files, memory blocks, and skill as core options. Would memory services (like the one in our Agent Platform) be categorized as a memory block? Or is that another type...

By Richard Seroter

Richard Seroter

AI Shifts Engineering Management Focus, Not Playbook

Keep Coding Actively, Don’t Let AI Dull Your Skills

Kubernetes 1.36 Launches with 70+ Upgrades, Now on GKE

Tech Hiring Spikes to Three-Year High, 271k New Posts

Will AI Ever Produce Trustworthy, Fully Autonomous Code?

Google Cloud Cuts Cold Starts, Adds Sub‑ms Bigtable Tier

Pinecone Moves Beyond RAG to Upstream Knowledge

AI Disrupts SaaS Freemium, Demands Resilient Front‑Ends and AI‑Powered Coding

Avoid Circular Dependencies: Separate Monitoring From Observed Systems

Documenting Today’s Effective Agentic Coding Lessons

Gemini 3.1 Flash‑Lite Launches: Fast, Affordable Premium AI

AlphaEvolve Proves AI Can Create Novel, Real‑world Algorithms

GKE Pod Snapshots Eliminate Cold Starts for Massive Models

Prefer REST APIs; Reserve MCP/A2A for Complex Reasoning

Gate AI Features by Usage, Outcomes, and Compute

Managers Must Get Hands‑On to Lead AI Transformation

AI-Driven TensorFlow-to-JAX Migration Mirrors Deterministic Coding Loop

Connect Your Agent Gateway to Existing Security Providers

AI-Native Teams Prioritize Prototypes, Small Squads, Rapid Cycles

Honest Feedback Loops Beat Fancy Eval Dashboards

Google Cloud IAM Gets Major New Features and Guardrails

Add Human‑in‑the‑Loop to Pause AI Code Generation

Empathy Enhances Direct Communication, Not Weakness

Netflix Routes Millions of Requests to Thousands of Models

Start Small, Personalize Fast, Empower Others to Build

Local LLMs Offer Cost‑Effective Compute and Data Sovereignty

Newsletter as Personal Discipline, Not Just Audience Reach

Flip the Ratio: Cheap Assessment, Board‑

Uber Solved AI Scaling with Centralized Gateways and Registry

Webhooks Deliver Real-Time Gemini API Operation Completions

Diffusion-Style Decoding Triples LLM Speed on TPUs

Treat Skills as Programs: Architecture Drives Runtime Loading

Startups Favor Flutter Over Native for Faster Scaling

Git Worktrees: Essential Tool for Modern Development

Stop Redundant Map Lookups to Save CPU

Agents Disrupt Database Contracts; Build Defensive Data Layers

Human Infrastructure Matters as Much as AI Tools

AI Investment Requires Systems, Not Just Tool Licenses

Fully‑managed Remote MCP Servers Accelerate Google Cloud Integration

AI Productivity Gains Modest Now, Will Surge with New Models

Rapid Storage Integration with PyTorch Keeps GPUs Busy

Evaluation Costs Outpace Training for Modern ML Agents

Managing State and Coordination for Long‑Running Agents

Agents Build Agents, Generative UI, 1,000 Real AI Use Cases

Workday’s Service Empire May Expose Vulnerability

Putting Payroll in the Field for Lean Operations

A2UI Shows How Simple Component Schemas Enable Generative UI

Winning Agentic AI Requires Clean Data, Rigor, Not Flashy Bots

On‑Device AI Thrives with LiteRT and NPU

Clarifying AI Agent Memory: Files, Blocks, or Services?

Technology Pulse

Kubernetes 1.36 Launches with 70+ Upgrades, Now on GKE