Aleksei Petrov

Creator

CTO at QuantFlow; builds AI agents that integrate with CI and issue trackers to automate coding and delivery with telemetry and controls.

Anthropic's Upcoming Autonomous Agent Mirrors Pilot
Social · Apr 14, 2026

Anthropic’s new tool is on the horizon, and it looks pretty close to what Pilot is: an autonomous agent. Let’s see if it’s a good one ✌️

By Aleksei Petrov
Pilot, ClaudeCode, GLM 5.1 Hit 74.2% Success
Social · Apr 14, 2026

Pilot + ClaudeCode + GLM 5.1: 74.2% success rate on a full Terminal-Bench 2.0 run. I still have to check that we didn't violate any rules 😁

By Aleksei Petrov
Opus Plummets to 10th as Token Costs Rise
Social · Apr 13, 2026

Opus dropped from 2nd to 10th place on the benchmark. Limits are tighter and tokens are more expensive now; this is how it all goes. Investment needs to be returned. Good thing I don’t care, as my limits are cooked till the end...

By Aleksei Petrov
GLM Tackles Tough
Social · Apr 12, 2026

GLM caught the “chess-best-move” trial on this test run — the nasty one. It’s been grinding on it for ~40 minutes already, ~20 left on the clock. Let’s see if it clears it; Opus only managed to pull it off a...

By Aleksei Petrov
Reduced to One Worker, Now Runs Overnight
Social · Apr 11, 2026

Scaled infra down to a single worker. Last run was burning tokens way too fast. Now it’s crawling… so this one runs overnight.

By Aleksei Petrov
AI Agent Automates Dev Issue Resolution on AWS
Social · Apr 11, 2026

AI agent resolving development issues, live online. Join the thread 🍿 AWS infra. ClaudeCode. Pilot. Hit the star: https://github.com/qf-studio/pilot

By Aleksei Petrov
Design a Website with One AI
Social · Apr 9, 2026

Website design with Claude is so much fun. NO SKILLS NEEDED. Sonnet 4.6 made a quick sketch to refresh our website. Not bad for a single prompt 👏 Screenshot: product section, with an animated terminal.

By Aleksei Petrov
Solo AI‑Built Pilot Tops Terminal‑Bench 2.0
Social · Apr 9, 2026

Pilot: #1 on Terminal-Bench 2.0. 82.9% accuracy. 124 entries. Claude Opus 4.6. Built by a single person + AI in Montenegro. No VC. No cluster. Standard infra. Leaderboard is live: https://www.tbench.ai/leaderboard/terminal-bench/2.0 Open source: https://pilot.quantflow.studio

By Aleksei Petrov
$5K/Mo Buys Full AI‑Powered Dev Studio
Social · Apr 6, 2026

QuantFlow Studio is open for subscriptions 🎉 $5K/mo — one EU dev's cost — buys a whole studio's output. Engineering, design, AI integrations. 2 engineers orchestrating self-made agents, end-to-end. Proof we're not LARPing: • Pilot — 82% on Terminal Bench 2.0 (built in...

By Aleksei Petrov
14 Releases in One Day, Delivery Fully Automated
Social · Apr 5, 2026

14 releases in one day. Delivery on autopilot 🛩️ Just checked the reports: Claude and Pilot are building.

By Aleksei Petrov
AI Agents Delivered Fully Tested Code Overnight
Social · Apr 4, 2026

Set up two AI agents before bed last night.
- Pilot (executor): picks GitHub issues, writes code, ships.
- ClaudeCode (/loop to monitor): checks status every 30 min, reports.
Morning: everything wired, tested, parity checks passing. I review with a coffee...

By Aleksei Petrov
Pilot Ships with Short Video and GIF Demos
Social · Apr 2, 2026

Pilot on delivery duty today. Cutting short videos and gifs to show how it ships. https://github.com/qf-studio/pilot

By Aleksei Petrov
Pilot v2.86.3 Adds Crash Cleanup, Dashboard Graph, Repo Migration
Social · Mar 31, 2026

Pilot v2.86.3 released.
Fixed:
- Stale worktrees after OOM/SIGKILL never cleaned up (818 MB each)
- Squash merges dropped PR titles → broke release tagging
- GoReleaser pointed to old repo after migration
New:
- Dashboard git graph follows active task's project
- Worktree cleanup on crash and...

By Aleksei Petrov
Top AI Coding Agent Ignored Despite Benchmark Victory
Social · Mar 30, 2026

Built an AI agent that took #1 on Terminal-Bench 2.0 — "the industry benchmark for coding agents". 82.0% across 445 trials. Validated by the maintainer 3 days ago. "Ready to merge." Still not on the leaderboard. LinkedIn DM — no response. Discord —...

By Aleksei Petrov