AI Videos

Video•Dec 21, 2025

NVIDIA’s AI Finally Solved Walking In Games

The video spotlights a breakthrough from NVIDIA that replaces traditional capsule‑based NPC movement with fully physically simulated humanoids. By coupling a diffusion‑based path planner called Trace with a joint‑control system dubbed Pacer, the researchers enable agents to generate and follow realistic walking trajectories in real time, eliminating the classic “moon‑walking” foot‑slip bugs that plague many games. Key technical insights include the use of roughly 20 motor‑driven joints per character, a diffusion model that denoises noisy path predictions into smooth, anticipatory routes, and an adversarial reinforcement‑learning loop where a discriminator judges the naturalness of each step. Over three days, more than 2,000 parallel humanoids performed billions of attempts, learning to balance, swing arms, and adapt to stairs, slopes, and uneven terrain without any handcrafted animation clips. The demo is peppered with vivid examples: agents shouting “holy crap, help me!” when a foot slips, crowds that organically weave around obstacles instead of following rigid “if neighbor is close, turn left” rules, and the ability to prompt the diffusion model to make groups walk side‑by‑side. The system even handles diverse body types—short, tall, plump—without extra tuning, and it can generate messy pedestrian behavior useful for testing autonomous‑vehicle algorithms. Implications are twofold. For game developers, the technology promises a dramatic reduction in animation labor while delivering more lifelike crowds that react naturally to complex geometry. For the broader AI and automotive sectors, the open‑source framework provides a scalable way to populate virtual cities with realistic, physics‑grounded pedestrians, improving the fidelity of simulation‑based safety testing for self‑driving cars.

By Two Minute Papers

Video•Dec 21, 2025

Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning

Google unveiled T5 Gemma 2, the latest iteration of its encoder‑decoder AI family built on the Gemma 3 architecture, positioning it as a purpose‑built engine for long‑form text and multimodal reasoning. The announcement highlights a shift from the dominant decoder‑only “ChatGPT‑style” models toward...

By Analytics Vidhya

Video•Dec 20, 2025

Are AI Benchmarks Telling The Full Story? [SPONSORED]

The video critiques the current reliance on technical AI benchmarks, arguing that they miss the human‑centric aspects of large language model (LLM) performance. Andrew Gordon and Nora Petrova of Prolific explain that while models may ace exams like MMLU or...

By Machine Learning Street Talk

Video•Dec 20, 2025

Exploring the MTEB Leaderboard | Vector Databases for Beginners | Part 6

The video walks viewers through the MTEB (Massive Text Embedding Benchmark) leaderboard, positioning it as a practical guide for selecting open‑source embedding models and tuning modules for vector‑search applications. The presenter highlights recent UI changes—new benchmarks, language options, and domain‑specific...

By Data Science Dojo

Video•Dec 20, 2025

Shipmas Day 15: Claude Code Skills Will Dominate 2026

In the latest Shipmas Day 15 broadcast, the host walks viewers through a “skill” framework for Anthropic’s Claude model, arguing that modular skill files will become the dominant way developers harness AI code generation by 2026. The workflow hinges on a...

By All About AI

Video•Dec 19, 2025

AI Still Hallunicates Can We Trust It, And To What Extent | Joshua Starmer X Data Science

The video centers on the persistent problem of AI hallucinations—instances where large language models generate plausible‑but‑incorrect information—and asks how much trust users can place in these systems. Joshua Starmer, speaking alongside Data Science, argues that while the technology will improve,...

By Data Science Dojo

Video•Dec 19, 2025

Choosing the Right Embedding Model | Vector Databases for Beginners | Part 5

The video walks viewers through the decision‑making process for selecting an embedding model, a critical component in building vector‑database‑driven applications. It contrasts two concrete examples—a modern open‑source BERT‑base model and a proprietary OpenAI offering—while acknowledging the overwhelming variety of alternatives...

By Data Science Dojo

Video•Dec 19, 2025

Training a Unitree G1 to Walk W/ Reinforcement Learning

The video chronicles a creator’s effort to teach a Unitree G1 quadruped to walk using reinforcement‑learning techniques, emphasizing the transition from pure simulation (Sim2Sim) to real‑world deployment (Sim2Real). After years of attempting Sim2Real, the presenter finally succeeded thanks to advances...

By Harrison Kinsley

Video•Dec 19, 2025

If You're Doing a Repeated Task Every Week, Spend that Time Automating It Instead

The video introduces Exec Prep GPT, a generative‑AI assistant built to automate the preparation and feedback of “tee‑up” documents that executives use to surface decisions. The presenter feeds the model a deliberately weak tee‑up—lacking clear purpose, approver, and background—to showcase how the...

By How I AI

Video•Dec 19, 2025

How to Run LLMs Locally - Full Guide

The video provides a step‑by‑step guide for developers who want to run large language models (LLMs) on their own hardware, focusing on two primary approaches: the open‑source Ollama tool and Docker’s model runner. It begins by positioning local inference as...

By Tech With Tim

Video•Dec 19, 2025

Mistral OCR 3: AI That Can Actually Read Documents

Mistral AI unveiled its latest offering, Mistral OCR 3, a next‑generation optical character recognition model that promises to bridge the gap between raw document images and actionable data. The announcement positions the technology as a catalyst for a new wave...

By Analytics Vidhya

Video•Dec 18, 2025

What Is Sycophancy in AI Models?

The video, presented by Kyra from Anthropic’s safeguards team, introduces the concept of “sycophancy” in AI—when a model tells users what they want to hear rather than what is accurate or helpful. Drawing on her background in psychiatric epidemiology, Kyra...

By Anthropic

Video•Dec 18, 2025

Shipmas Day 14: Can AI Agents "Dream" In a Simulation?

The video showcases a prototype social simulation built on Google’s Gemini 3 Flash model, where three AI agents—Jack, a barista at the Daily Grind; Claude, a barista at Bean There; and Erica, a shared customer—interact through a gossip‑style conduit. By capturing each agent’s...

By All About AI

Video•Dec 18, 2025

Let Claude Handle Work in Your Browser

The video introduces a new browser‑based integration of Anthropic’s Claude, positioning the AI as a hands‑free assistant that can take over routine web‑based work. By embedding Claude directly into a sidebar, users can invoke the model to read, summarize, and...

By Anthropic

Video•Dec 18, 2025

AI Will Take My Job. Here's 5 Things I'm Doing About It

AI is reshaping the labor market at breakneck speed, and the video’s creator argues that the real threat isn’t a robot apocalypse but the inability to keep pace with relentless change. He frames the next two‑year window as a rare...

By Ken Jee

Video•Dec 18, 2025

We Gave AI Control of a Real Business

Project VEND is Anthropic’s live experiment in which its Claude model was tasked with running a small vending‑machine business from the company’s office. The AI, personified as “Claudius,” handled everything from Slack‑based customer requests and wholesale sourcing to pricing,...

By Anthropic

Video•Dec 18, 2025

Two Futures | Runtime 2025

The video titled “Two Futures” (runtime 2025) serves as a high‑concept launch narrative for a next‑generation artificial‑intelligence platform, positioning it as the foundational “fuel” for creating “infinite universes” of innovation. It frames the technology as the most complex and large‑scale...

By Andreessen Horowitz (a16z)

Video•Dec 17, 2025

Binti Helps Social Workers License Foster Families Faster with Claude

The video spotlights Binti, a technology platform designed to accelerate the licensing of foster and adoptive families, leveraging Anthropic’s Claude AI to automate paperwork for social workers. The speaker, a veteran social worker with eleven years of experience, explains that...

By Anthropic

Video•Dec 17, 2025

From Word2Vec to Transformers | Vector Databases for Beginners | Part 4

The video “From Word2Vec to Transformers | Vector Databases for Beginners | Part 4” walks viewers through the historical shift from static, word‑level embeddings to context‑aware transformer‑based models. It opens by recapping the shortcomings of early techniques like Word2Vec—namely their...

By Data Science Dojo

Video•Dec 17, 2025

Make Your AI Agents Production-Ready with Nvidia’s NeMo Toolkit

The video introduces NVIDIA’s NeMo Agent Toolkit (NAT), an open‑source suite designed to harden AI agents for production use. Hosted by NVIDIA engineer Brian McBear, the course walks viewers through transforming a proof‑of‑concept chatbot into a reliable, scalable service, emphasizing...

By Andrew Ng

Video•Dec 17, 2025

Gemini 3.0 Flash (Tested): Google's NEW Model Is INTERESTING...

Google unveiled Gemini 3.0 Flash, a low‑latency, cost‑optimized sibling of the Gemini 3 Pro model. While the official blog post is pending, the model is already accessible via platforms like Zenmux and OpenRouter. Priced at $0.30 per million input tokens...

By AICodeKing

Video•Dec 17, 2025

How to Get a Machine Learning Engineer Job Fast - Without a Uni Degree

In the video, the creator outlines a step‑by‑step roadmap for becoming a machine‑learning (ML) engineer by 2026 without a university degree, emphasizing the specific technical competencies and practical tools needed to break into the role. The guide is framed as...

By Tech With Tim

Video•Dec 17, 2025

Manus 1.6 Just Leveled Up AI Agents — They Actually Get Work Done

The video announces the launch of Manus 1.6, a major upgrade to the company’s autonomous AI‑agent platform, and introduces a premium tier called Manus 1.6 Max. The new version is positioned as a “digital worker” that can take a task from initial concept...

By Analytics Vidhya

Video•Dec 16, 2025

Introducing SAM Audio: The First Unified Multimodal Model for Audio Separation | AI at Meta

Introducing SAM Audio, Meta’s latest AI breakthrough, is positioned as the first unified multimodal model capable of separating audio sources across music, speech, and ambient sounds. The system allows users to isolate a specific sound by issuing text prompts—such as...

By AI at Meta

Video•Dec 16, 2025

Shipmas Day 12: AI Music Video Generator App

The video walks viewers through a hands‑on workflow for building an AI‑powered music‑video generator, stitching together image creation, lyric writing, audio synthesis, and video rendering using a suite of emerging models. The presenter starts with a prompt‑driven image generator (Nano...

By All About AI

Video•Dec 16, 2025

Day 4-Live Session-Getting Started With Generative And Agentic AI In 2026

The live session titled “Day 4‑Live Session‑Getting Started With Generative And Agentic AI In 2026” opened with the presenter outlining a comprehensive roadmap for anyone looking to break into AI, from fresh graduates to senior executives. He emphasized that the...

By Krish Naik

Video•Dec 16, 2025

Automate Your Weekly Meeting Prep with AI Agents

The video introduces an AI‑driven workflow designed to automate the preparation for weekly meetings by acting as a personal “second brain.” The presenter explains that the agent first scans the user’s calendar, flags meetings that require advance work, and then...

By How I AI

Video•Dec 16, 2025

OpenCode Desktop: RIP Claude Code? Is It REALLY SPECIAL?

The video reviews the newly released OpenCode Desktop, a graphical front‑end for the OpenCode AI coding agent that aims to bring terminal‑centric functionality to a broader, non‑technical audience. The presenter walks through the beta installation, the layout of the sidebar,...

By AICodeKing

Video•Dec 16, 2025

Speech to Text Is Harder Than You Think

The video tackles a misconception that speech‑to‑text (STT) is merely a matter of converting audio into words. It argues that for production voice agents, transcription is only the first step; the real battle lies in extracting precise entities, handling latency,...

By Louis Bouchard

Video•Dec 16, 2025

Open-Source AI Just Crushed One of the Hardest Math Exams

Open‑source researchers at Noise announced that their new 30‑billion‑parameter model, Normus‑1, achieved an 87‑out of‑120 score on the 2025 Putnam Mathematical Competition – a result that places the system within elite human performance on one of the world’s toughest undergraduate...

By Analytics Vidhya

Video•Dec 15, 2025

The Nano Banana AI Business That's Making People RICH ($960+/Day)

The video walks viewers through a turnkey business model that leverages Google’s newly released Nano Banana Pro image model to produce high‑quality, custom pet artwork for print‑on‑demand merchandise. By pairing the AI’s ability to replicate a simple cartoon‑hand‑drawn style with a seasonal...

By Alek

Video•Dec 15, 2025

NVIDIA Nemotron 3 Nano 30B First Impression - Shipmas Day 11

The video showcases NVIDIA’s newly released Nemotron 3 Nano 30B, a hybrid mixture‑of‑experts large language model that packs 30 billion parameters while activating only 3 billion at a time. Hosted on Hugging Face and other platforms, the model is fully open‑weight and boasts a massive 1 million...

By All About AI

Video•Dec 15, 2025

Why Josh Always Asks, “Can A Topic Be Any Simpler Than This?” | Joshua Starmer X Data Science Dojo

In a candid conversation with Data Science Dojo, Joshua Starmer explains the guiding principle behind his instructional videos: constantly asking, “Can a topic be any simpler without dumbing it down?” He frames this question as a litmus test for clarity,...

By Data Science Dojo

Video•Dec 15, 2025

This Google Game Secretly Teaches You Perfect AI Image Prompts

The video spotlights Google’s new interactive experiment, “Say What You See,” a gamified tool that trains users to craft precise AI image prompts. By presenting an AI‑generated picture and challenging players to describe it in fewer than 120 characters, the...

By Analytics Vidhya

Video•Dec 15, 2025

How I Personally Use AI Browsers

The video showcases how the creator has adopted Perplexity’s AI‑powered browser, Comet, as his default web tool, demonstrating its real‑time, context‑aware capabilities. He walks viewers through several everyday tasks—shopping for a Christmas gift, extracting specific segments from YouTube videos, translating...

By Matt Wolfe

Video•Dec 15, 2025

Genspark's Super AI Agent Is INSANE

The video introduces GenSpark, a rapidly emerging AI platform marketed as a “super agent” that consolidates a wide array of generative capabilities into a single workspace. The presenter walks viewers through the UI, highlighting integrations with Gmail, Google Drive, Calendar,...

By Tech With Tim

Video•Dec 15, 2025

The Hidden Skill Boost Behind Posting Online

The video explores the often‑overlooked benefit of publishing content online: it serves as a powerful learning accelerator. The creator explains that his initial foray into content creation wasn’t driven by audience size, revenue, or virality, but by a desire to...

By Louis Bouchard

Video•Dec 15, 2025

GPT-5.2 Is Here: OpenAI’s Biggest Leap Toward Real AI Work

OpenAI unveiled GPT‑5.2, positioning it as the company’s most powerful model to date and a decisive step toward an AI that can perform real‑world work rather than merely converse. The announcement frames the release as a “biggest leap” in the...

By Analytics Vidhya

Video•Dec 14, 2025

Titans: Learning to Memorize at Test Time (Paper Analysis)

The video reviews Google Research’s “Titans: Learning to Memorize at Test Time,” a NeurIPS paper that proposes a novel architecture enabling language models to retain information beyond their fixed context window. The presenter explains that the model treats the keys...

By Yannic Kilcher

Video•Dec 14, 2025

Shipmas Day 10: The AI Reverse Engineering Workflow

The video walks viewers through a hands‑on example of reverse‑engineering the popular Opus Clip service, showing how to recreate its short‑form video generation pipeline using open‑source AI tools. The creator starts by downloading a YouTube source with yt‑dlp, extracting the audio,...

By All About AI

Video•Dec 14, 2025

Production-Grade AI Agent - Full Tutorial W/ Python, Inngest, BrightData & More

In this tutorial the creator walks viewers through building a production‑grade AI web agent that can ingest live web data and serve millions of users. Using Python as the core language, the stack combines Ingest for orchestration, Bright Data’s SERP...

By Tech With Tim

Video•Dec 14, 2025

Shadcn Create + Opus 4.5 / Gemini 3 Pro: This Is THE BEST WAY to Make BEAUTIFUL APPS with AI!

The video spotlights ShadCN’s newly released “Create” builder, a visual interface that lets developers customize the look and feel of the popular open‑source UI component library and instantly scaffold a project with a single command. By pairing this tool with...

By AICodeKing

Video•Dec 14, 2025

Researchers Built a Tiny Economy. AIs Broke It Immediately

The research team behind SimWorld unveiled a procedurally generated video‑game city populated by autonomous agents—vehicles, robots and humans—each powered by leading large language models such as ChatGPT, Gemini, DeepSeek, Claude and a legacy GPT‑4‑mini. The experiment tasked these agents...

By Two Minute Papers

Video•Dec 14, 2025

Top 5 AI Chrome Extensions That Work Like Real Agents! 🤖🔥

AI Chrome extensions are emerging as lightweight, on‑demand agents that can read, summarize, scrape and even execute workflows directly within the browser. The video spotlights five tools—HardPiAI, Body, Axiom Browser Automation, Perplexity AI Companion, and Toxiate AI Agents—each promising to...

By Analytics Vidhya

Video•Dec 13, 2025

The Mathematical Foundations of Intelligence [Professor Yi Ma]

In a recent interview, Professor Yi Ma, a leading figure in deep learning and the author of *Learning Deep Representations of Data Distributions*, outlines a new mathematical framework for intelligence built on two core principles – parsimony and self‑consistency. He...

By Machine Learning Street Talk

Video•Dec 13, 2025

Shipmas Day 9: How I Use AI Video To Get 10+ Million Views

The video walks viewers through a hands‑on demonstration of an AI‑driven workflow that can churn out vertical videos capable of attracting tens of millions of views. The creator starts by explaining the premise – a simple loop that stitches together...

By All About AI

Video•Dec 13, 2025

Exploring the Origins with Word2Vec | Vector Databases for Beginners | Part 3

The video "Exploring the Origins with Word2Vec | Vector Databases for Beginners | Part 3" walks viewers through the historical breakthrough that introduced word embeddings, focusing on the Word2Vec model and its role in turning raw text into numeric vectors....

By Data Science Dojo

Video•Dec 13, 2025

Google DeepMind: "The Arrival of AGI"

The video examines the accelerating discourse around artificial general intelligence (AGI) as it moves from speculative theory to concrete business planning. It highlights a Federal Reserve Bank of Dallas chart that predicts two divergent outcomes before 2035: a benign singularity...

By Wes Roth

Video•Dec 13, 2025

Microsoft’s GigaTIME: $10 Slides → Lab-Level Cancer Insights 🔬🚀

Microsoft unveiled GigaTime, an open‑source artificial‑intelligence model that can turn a routine $10 hematoxylin‑eosin (H&E) pathology slide into a high‑resolution immune‑cell map traditionally produced only through costly, multi‑day multiplexed immunofluorescence (MIF) assays. By learning from a massive paired dataset...

By Analytics Vidhya

Video•Dec 13, 2025

Why Everyone Hates The McDonald's AI Ad

The video dissects the recent McDonald’s commercial that was entirely AI‑generated, a piece that quickly went viral for its bizarre premise – a montage of people lamenting Christmas and suffering slapstick misfortunes, all rendered by artificial intelligence. The creator explains...

By Matt Wolfe

NVIDIA’s AI Finally Solved Walking In Games

Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning

Are AI Benchmarks Telling The Full Story? [SPONSORED]

Exploring the MTEB Leaderboard | Vector Databases for Beginners | Part 6

Shipmas Day 15: Claude Code Skills Will Dominate 2026

AI Still Hallunicates Can We Trust It, And To What Extent | Joshua Starmer X Data Science

Choosing the Right Embedding Model | Vector Databases for Beginners | Part 5

Training a Unitree G1 to Walk W/ Reinforcement Learning

If You're Doing a Repeated Task Every Week, Spend that Time Automating It Instead

How to Run LLMs Locally - Full Guide

Mistral OCR 3: AI That Can Actually Read Documents

What Is Sycophancy in AI Models?

Shipmas Day 14: Can AI Agents "Dream" In a Simulation?

Let Claude Handle Work in Your Browser

AI Will Take My Job. Here's 5 Things I'm Doing About It

We Gave AI Control of a Real Business

Two Futures | Runtime 2025

Binti Helps Social Workers License Foster Families Faster with Claude

From Word2Vec to Transformers | Vector Databases for Beginners | Part 4

Make Your AI Agents Production-Ready with Nvidia’s NeMo Toolkit

Gemini 3.0 Flash (Tested): Google's NEW Model Is INTERESTING...

How to Get a Machine Learning Engineer Job Fast - Without a Uni Degree

Manus 1.6 Just Leveled Up AI Agents — They Actually Get Work Done

Introducing SAM Audio: The First Unified Multimodal Model for Audio Separation | AI at Meta

Shipmas Day 12: AI Music Video Generator App

Day 4-Live Session-Getting Started With Generative And Agentic AI In 2026

Automate Your Weekly Meeting Prep with AI Agents

OpenCode Desktop: RIP Claude Code? Is It REALLY SPECIAL?

Speech to Text Is Harder Than You Think

Open-Source AI Just Crushed One of the Hardest Math Exams

The Nano Banana AI Business That's Making People RICH ($960+/Day)

NVIDIA Nemotron 3 Nano 30B First Impression - Shipmas Day 11

Why Josh Always Asks, “Can A Topic Be Any Simpler Than This?” | Joshua Starmer X Data Science Dojo

This Google Game Secretly Teaches You Perfect AI Image Prompts

How I Personally Use AI Browsers

Genspark's Super AI Agent Is INSANE

The Hidden Skill Boost Behind Posting Online

GPT-5.2 Is Here: OpenAI’s Biggest Leap Toward Real AI Work

Titans: Learning to Memorize at Test Time (Paper Analysis)

Shipmas Day 10: The AI Reverse Engineering Workflow

Production-Grade AI Agent - Full Tutorial W/ Python, Inngest, BrightData & More

Shadcn Create + Opus 4.5 / Gemini 3 Pro: This Is THE BEST WAY to Make BEAUTIFUL APPS with AI!

Researchers Built a Tiny Economy. AIs Broke It Immediately

Top 5 AI Chrome Extensions That Work Like Real Agents! 🤖🔥

The Mathematical Foundations of Intelligence [Professor Yi Ma]

Shipmas Day 9: How I Use AI Video To Get 10+ Million Views

Exploring the Origins with Word2Vec | Vector Databases for Beginners | Part 3

Google DeepMind: "The Arrival of AGI"

Microsoft’s GigaTIME: $10 Slides → Lab-Level Cancer Insights 🔬🚀

Why Everyone Hates The McDonald's AI Ad

AI Pulse