
Shipmas Day 10: The AI Reverse Engineering Workflow
The video walks through a hands‑on example of reverse‑engineering the popular Opus Clip service, showing how to recreate its short‑form video generation pipeline with readily available AI tools. The creator downloads a YouTube source with yt‑dlp, extracts the audio, and feeds it to Whisper for a timestamped transcript, then uses Gemini 3 to parse the text and suggest highlight moments. The key technical steps are laid out in order: a JSON timeline is built containing start/end times, clip type and text; ffmpeg slices the original MP4 accordingly; YOLO detects faces to isolate the speaking presenter; the clip is reframed to a vertical 9:16 aspect ratio; and captions are burned in with ffmpeg. The workflow is orchestrated through Claude Code, which generates a step‑by‑step plan that the presenter then runs in Python. The demo includes concrete examples, such as generating a “funny” clip titled “Trump won a UFC fight at the White House” and showing how the system asks clarifying questions about manual versus automatic clip selection and speaker detection. The host also highlights the use of a Gemini 3 documentation folder and the iterative debugging of the YOLO model integration. The broader takeaway is a template creators can use to build their own end‑to‑end video‑clipping pipelines without relying on proprietary services. By stitching together existing AI models and scripting tools, content producers can automate the repurposing of long‑form footage into platform‑ready vertical snippets, cutting costs and turnaround time.
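The timeline-to-ffmpeg step described above can be sketched roughly as follows. This is a minimal illustration, not the code from the video: the JSON field names (`start`, `end`, `type`, `text`), the sample values, the output file naming, and the helper names `build_cut_command` and `build_all` are all assumptions. The sketch only builds the ffmpeg argument lists and does not run ffmpeg.

```python
import json

# Hypothetical timeline in the shape the summary describes
# (start/end times in seconds, clip type, text). Values are made up.
TIMELINE_JSON = """
[
  {"start": 12.5, "end": 41.0, "type": "funny", "text": "First highlight"},
  {"start": 95.0, "end": 130.25, "type": "insight", "text": "Second highlight"}
]
"""


def build_cut_command(source, clip, index):
    """Return an ffmpeg argument list that cuts one clip out of `source`."""
    duration = clip["end"] - clip["start"]
    if duration <= 0:
        raise ValueError(f"clip {index} has a non-positive duration")
    output = f"clip_{index:02d}_{clip['type']}.mp4"  # assumed naming scheme
    return [
        "ffmpeg", "-y",
        "-ss", f"{clip['start']:.3f}",  # seek to the clip start
        "-i", source,
        "-t", f"{duration:.3f}",        # keep only the clip's duration
        "-c:v", "libx264", "-c:a", "aac",  # re-encode for accurate cuts
        output,
    ]


def build_all(source, timeline_json):
    """Parse the JSON timeline and build one command per clip."""
    clips = json.loads(timeline_json)
    return [build_cut_command(source, c, i) for i, c in enumerate(clips, start=1)]


if __name__ == "__main__":
    for cmd in build_all("source.mp4", TIMELINE_JSON):
        print(" ".join(cmd))
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` once ffmpeg is installed. Re-encoding is chosen here over stream copy so that cuts do not have to land on keyframes.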

Shipmas Day 9: How I Use AI Video To Get 10+ Million Views
The video walks viewers through a hands‑on demonstration of an AI‑driven workflow that can churn out vertical videos capable of attracting tens of millions of views. The creator starts by explaining the premise – a simple loop that stitches together...

Shipmas Day 8: Image to Code AI Subagents in Claude Code
The video showcases a new workflow built around Claude Code’s sub‑agent architecture, aimed at turning a visual specification into a fully‑styled web page for a “Titan Bear” build guide in Path of Exile 2. The presenter walks through how a single user prompt—"create...

Shipmas Day 7: Create Personal Software Tools With Claude Code
The video walks viewers through building a lightweight web app that automatically trims silent sections from uploaded MP4 videos. Leveraging Claude Code, an AI‑driven code generator, the creator outlines a plan to use the Vercel stack and FFmpeg to detect silence...
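One way the trimming logic could work, assuming silence intervals have already been detected (for example with ffmpeg's `silencedetect` filter, which reports `silence_start` and `silence_end` timestamps): invert the silent intervals into the segments worth keeping. The function name `keep_segments` and the `min_keep` threshold are hypothetical, and the summary does not say how the app in the video actually does this.

```python
def keep_segments(duration, silences, min_keep=0.1):
    """Invert silent (start, end) intervals into the non-silent segments.

    duration: total video length in seconds
    silences: list of (start, end) tuples in seconds
    min_keep: drop kept segments shorter than this (assumed threshold)
    """
    segments = []
    cursor = 0.0
    for start, end in sorted(silences):
        # Keep the audible stretch between the previous silence and this one.
        if start - cursor >= min_keep:
            segments.append((cursor, start))
        cursor = max(cursor, end)
    # Keep whatever remains after the last silence.
    if duration - cursor >= min_keep:
        segments.append((cursor, duration))
    return segments


if __name__ == "__main__":
    print(keep_segments(10.0, [(2.0, 3.0), (6.0, 7.5)]))
```

The resulting segments could then be cut and concatenated with ffmpeg to produce the trimmed video.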

Shipmas Day 6: Bring Any Idea To Life App (Nano Banana Pro API)
The video showcases the “Bring Any Idea To Life” application built on the Nano Banana Pro API, leveraging Whisper for speech‑to‑text transcription and Gemini 3 for generative analysis. The host walks viewers through a workflow where a user speaks an idea,...

Shipmas Day 5: Autonomous Real Time Webpage (AI Automation)
The video walks viewers through the creation of an autonomous, real‑time web page that continuously curates and publishes content from Reddit. Using a custom MCP server, the creator fetches new Reddit posts every five minutes, then employs Google Gemini to...

Shipmas Day 4: LLM Group Chat (OpenRouter)
The video walks viewers through building a multi‑model group chat using the OpenRouter API, which aggregates dozens of large language models (LLMs) under a single endpoint. The creator selects models such as Claude Haiku, Gemini, GPT‑4.5, and Grok‑4.1, wiring them...

Shipmas Day 3: Change Any Scene In Your Favorite Movie With AI
The video walks viewers through a hands‑on demo of a “scene changer” app built on the Kling 2.6 image‑to‑video model. By uploading a short clip, extracting a single frame with ffmpeg, and feeding that frame plus a natural‑language prompt into the...

Shipmas Day 2: Podcast Anything App (Gemini 3)
The developer built a web app that converts uploaded documents (PDFs, markdown, text) into multi-voice podcast episodes by using Gemini 3 to generate scripts and a multi-speaker TTS API to produce audio. The interface offers controls for tone (roast, steelman,...

Shipmas Day 1: Autonomous AI Social Media Video Converter App
A developer demonstrated building an autonomous app that converts landscape (16:9) videos into vertical (9:16) social clips by combining YOLO face detection, MediaPipe speaking detection, smoothing logic, and FFmpeg cropping. They used Claude Code and Opus 4.5 agents to plan,...
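The cropping and smoothing step can be illustrated with a short sketch. It assumes a face detector has already produced a horizontal face-center coordinate per frame; the helper names, the exponential-smoothing approach, and the `alpha` value are assumptions for illustration, since the summary does not specify which smoothing logic the app uses. The output is a string in the syntax of ffmpeg's `crop` filter (`crop=w:h:x:y`).

```python
def vertical_crop(frame_w, frame_h, face_cx):
    """Return (w, h, x, y) of a 9:16 crop window centered on face_cx."""
    crop_h = frame_h
    crop_w = (frame_h * 9 // 16) // 2 * 2  # 9:16 width, rounded down to even
    crop_w = min(crop_w, frame_w)
    x = int(round(face_cx - crop_w / 2))
    x = max(0, min(x, frame_w - crop_w))   # clamp the window inside the frame
    return crop_w, crop_h, x, 0


def smooth_centers(centers, alpha=0.2):
    """Exponentially smooth per-frame face centers to reduce crop jitter."""
    smoothed = []
    prev = None
    for c in centers:
        prev = c if prev is None else alpha * c + (1 - alpha) * prev
        smoothed.append(prev)
    return smoothed


def crop_filter(frame_w, frame_h, face_cx):
    """Format the crop window as an ffmpeg crop filter expression."""
    w, h, x, y = vertical_crop(frame_w, frame_h, face_cx)
    return f"crop={w}:{h}:{x}:{y}"


if __name__ == "__main__":
    # Hypothetical face centers for a 1920x1080 source.
    for cx in smooth_centers([960, 1000, 1400, 1380]):
        print(crop_filter(1920, 1080, cx))
```

A lower `alpha` makes the crop window follow the speaker more slowly, which trades responsiveness for a steadier frame.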

Claude Opus 4.5 Just Changed Video Automation FOREVER
A creator demonstrates an end-to-end automated video production workflow powered by Claude (Opus 4.5) and complementary tools: Whisper for transcription, ElevenLabs for synthetic voice, FFmpeg for editing, and AI image generators to fill visual gaps. The system ingests source footage,...

AI Game Devlog #1: Opus 4.5 Is UNREAL - MW2 FPS Clone
An indie developer used Anthropic’s Claude Opus 4.5 to rapidly prototype a Modern Warfare 2–inspired FPS called “360 No Scope,” demonstrating kill cams, sniper and knife mechanics, AI bots, instant replays and a simple best-of-five game loop. He showed a...

5 Nano Banana Pro "Million Dollar" Apps I Built In ONE Day
A developer used Google’s new Nano Banana Pro model and Gemini 3 in Google AI Studio to prototype five consumer app ideas in a single day, demonstrating rapid end-to-end app generation. Demonstrations included a random celebrity selfie generator that blends...

Gemini 3 PRO: The Best LLM Ever Made | 7 Use Cases TESTED
The reviewer tests Google’s newly released Gemini 3 across seven hands-on use cases rather than benchmarks, including a cloud-based Linux terminal, drone control, UI replication, a game clone, image understanding, video I/O, and a personal Path of Exile 2 benchmark....