Clément Delangue

Creator

2 followers

Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders

Social•May 29, 2026

Avoid Re‑encoding Decoded Tokens for Stable RL Training

Most people training agentic LLMs with RL right now have a silently broken training loop and have no idea. Here's the trap: single-turn RL works beautifully. Clean curves, sane rewards, everything converges. Then you add tools so the model can act mid-rollout, and things get weird. Loss spikes for no reason. Eventually a shape-mismatch error. The culprit: every time you parse the model's output to detect a tool call, then re-tokenize the updated conversation for the next turn, you're rolling the dice. Usually the round-trip gives back the same tokens. Sometimes it doesn't and your gradient lands on a sequence the model never actually sampled. No crash. Just quietly wrong math and a useless gradient signal. The fix is one rule: never re-encode tokens you've decoded. Keep the sampled tokens in one buffer, never re-render them, and both failure modes disappear. That's Token-In, Token-Out done right. Our team just published a beautiful deep-dive on exactly this, including an audit across the major open-weights model families showing most chat templates already support it. Required reading if you're doing multi-turn RL 🤗🔥 https://t.co/zmx0EQl3jM

By Clément Delangue

Social•May 18, 2026

On‑Prem AI with Hugging Face Solves GPU Shortage

I believe on-prem and local AI - based on Hugging Face open-source models - will be an important answer to the GPU shortages this year (because they are cheaper, faster, safer than cloud APIs)! Great collaboration between Hugging Face &...

By Clément Delangue

Social•May 15, 2026

Unified, Affordable Storage for AI Models and Data

AI teams shouldn’t have to choose between expensive object storage and painful git workflows. @huggingface Storage is built for model weights, datasets, checkpoints and artifacts: - simple per-TB pricing - built-in CDN - Xet deduplication - private by default when needed Store your AI data where...

By Clément Delangue

Social•May 14, 2026

Scaling Laws Validate Performance Gains in Time‑Series Models

Are scaling laws finally working for time series foundation models? Today, @datadoghq is releasing Toto 2.0 weights in Apache 2.0 on @huggingface. It's a family of open-weights TSFMs from 4M to 2.5B parameters, where every size beats the last from a...

By Clément Delangue

Social•May 12, 2026

Prices Rise June 1; Grab Early‑Bird Deal Now

Reachy is mad, but RAM costs + tariffs are forcing our hand. Prices will go up on June 1st! Still at the early bird price until then though if you were looking for an excuse to get one now: https://t.co/veqPEwFIaP! https://t.co/UP45svdMr8

By Clément Delangue

Social•May 7, 2026

Create Reachy Mini App with 1920s Posh Voice

Can someone build the Reachy Mini app that lets it speak like Talkie from Alec Radford in a posh pre-1931 British voice? I want to hear a next-gen robot channeling the 1920s! Space: https://lnkd.in/eugi77Xv How to build a Reachy Mini...

By Clément Delangue

Social•May 5, 2026

Governments Should Adopt Open‑Source Sovereign AI

More governments and public agencies should use HF and open-source AI in general. Let’s go sovereign AI!

By Clément Delangue

Social•May 5, 2026

Share Datasets on Hugging Face to Unlock Collective Insights

By sharing datasets on @huggingface, you help agents being able to analyze them, giving everyone the ability to make sense of complex data. For example, this is some of the insights I uncovered from @jamiequint's fascinating data: https://t.co/PW4GH7Uxna What other...

By Clément Delangue

Social•May 1, 2026

AI Labs Use Web Distillation, Then Block Competition

I think the expression is “pulling the ladder”! All labs trained their models by distilling (at the very least distilling the web) which allowed them to become the fastest growing businesses in the history of humanity and now that they...

By Clément Delangue

Social•Apr 30, 2026

Distillation Deserves Fair‑use Protection for Open‑source AI

What people call "distillation" is a super common practice (you use other models to benchmark your model, to evaluate your inputs or to add a little bit to your datasets) that in my opinion should be covered by fair use...

By Clément Delangue

Social•Apr 28, 2026

Real‑time Training Metrics Now Built Into ML Intern

Added native metric logging + @TrackioApp integration to ml intern so that you can follow every training run it kicks off in real time. Try it by asking "train a tiny model on a tiny dataset, find something super small/super...

By Clément Delangue

Social•Apr 27, 2026

Top 3 Trending HF Models: DeepSeek AI, OpenAI, Qwen

Top 3 trending models of the week on HF: DeepSeek AI , OpenAI & Qwen !

By Clément Delangue

Social•Apr 27, 2026

Llamacpp: Local, Free, Fast, Secure AI Future

llamacpp is the future of AI (local + free + fast + secure + powerful)!

By Clément Delangue

Social•Apr 24, 2026

Beyond Growth: Prioritizing Moats and Quality in AI

I'm always baffled how most investors seem to obsess over top revenue growth numbers these days and seem to take that as a universal prediction of future success in AI. Maybe we're finally starting to get back to more sanity...

By Clément Delangue

Social•Apr 24, 2026

Model Races to #1 on Hugging Face in Minutes

500+ likes in 28 mins. On their way to be the fastest model ever to get to #1 trending on HF! https://t.co/kxmwUEnwyY https://t.co/lvkbh2gXGi

By Clément Delangue

Clément Delangue

Avoid Re‑encoding Decoded Tokens for Stable RL Training

On‑Prem AI with Hugging Face Solves GPU Shortage

Unified, Affordable Storage for AI Models and Data

Scaling Laws Validate Performance Gains in Time‑Series Models

Prices Rise June 1; Grab Early‑Bird Deal Now

Create Reachy Mini App with 1920s Posh Voice

Governments Should Adopt Open‑Source Sovereign AI

Share Datasets on Hugging Face to Unlock Collective Insights

AI Labs Use Web Distillation, Then Block Competition

Distillation Deserves Fair‑use Protection for Open‑source AI

Real‑time Training Metrics Now Built Into ML Intern

Top 3 Trending HF Models: DeepSeek AI, OpenAI, Qwen

Llamacpp: Local, Free, Fast, Secure AI Future

Beyond Growth: Prioritizing Moats and Quality in AI

Model Races to #1 on Hugging Face in Minutes

Technology Pulse