
ServiceNow‑AI converted its 15B-parameter attention‑based reasoning model into a hybrid Mamba architecture, achieving 2.1× throughput with negligible quality loss. The breakthrough came from distilling on the teacher's high‑quality SFT reasoning traces rather than generic pretraining data, and from using reverse KL divergence as the loss. A three‑stage replacement process (identifying low‑impact layers, progressive Mamba substitution, and final SFT fine‑tuning) produced the Apriel‑H1‑15b‑Thinker‑SFT checkpoint, which outperforms the teacher on several reasoning benchmarks while cutting latency.
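For context on the loss, here is a minimal PyTorch sketch of a reverse KL distillation objective; the tensor names and temperature handling are illustrative assumptions, not ServiceNow‑AI's actual training code:

```python
import torch.nn.functional as F

def reverse_kl_loss(student_logits, teacher_logits, temperature: float = 1.0):
    """Reverse KL, D_KL(student || teacher): mode-seeking, so the student
    concentrates mass on the teacher's high-probability reasoning tokens."""
    s_logp = F.log_softmax(student_logits / temperature, dim=-1)
    t_logp = F.log_softmax(teacher_logits / temperature, dim=-1)
    s_p = s_logp.exp()
    # Sum over the vocabulary, average over batch and sequence positions.
    return (s_p * (s_logp - t_logp)).sum(dim=-1).mean()
```

Unlike forward KL, the reverse direction penalizes the student for putting probability where the teacher puts little, which is one reason it pairs well with curated SFT traces.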
EvE Bio has released the “pharmome map,” the largest public drug‑target interaction dataset to date. It measures 1,397 FDA‑approved small‑molecule drugs against nuclear receptors, GPCRs, and protein kinases using standardized high‑throughput assays. The dataset is openly accessible, refreshed bi‑monthly, and...
The post explains how to build, test, and share ROCm‑compatible GPU kernels—specifically a high‑performance FP8 GEMM kernel—using Hugging Face’s kernel‑builder and kernels libraries, with a focus on reproducible builds via a flake.nix environment. It walks through project layout, configuration files...
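On the consumer side, loading a published kernel from the Hub with the kernels library looks roughly like this; the repo id "your-org/fp8-gemm" and the `gemm` entry point are placeholders, not the post's actual artifact:

```python
import torch
from kernels import get_kernel  # Hugging Face `kernels` library

# Pull a pre-built kernel straight from the Hub (placeholder repo id).
fp8_gemm = get_kernel("your-org/fp8-gemm")

# The exposed function name and signature depend on how the kernel
# registers its ops in build.toml; `gemm` here is hypothetical.
a = torch.randn(128, 256, device="cuda", dtype=torch.float16)
b = torch.randn(256, 512, device="cuda", dtype=torch.float16)
out = fp8_gemm.gemm(a, b)
```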

This might be the biggest AI hackathon ever:
* >6,300 registrants
* Runs for 2 weeks (Nov. 14-30)
* Open to anyone, anywhere virtually
* $20,000 in cash prizes + $3.5M+ in sponsor credits
Hosted by @Anthropic and @Gradio, along with 10 sponsors, join...

AMD, Hugging Face, and Data Monsters are launching the AMD Open Robotics Hackathon with in‑person events in Tokyo (December 5‑7, 2025) and Paris (December 12‑14, 2025). Teams of up to four participants will complete two missions—setting up the LeRobot development environment and creating a...
The post announces a deeper partnership between Hugging Face and Google Cloud aimed at making it easier for companies to build and customize AI using open models. It highlights integrated services such as Vertex AI Model Garden, GKE, Cloud Run,...
Best practices for testing your Gradio app 👇
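A common pattern from such guides, sketched here under the assumption of a hypothetical app.py that defines a `reverse_text` function and a gr.Interface named `demo` wrapping it: unit-test the prediction function directly, then hit the running app through gradio_client for an end-to-end check.

```python
# test_app.py -- illustrative; `app.reverse_text` and `app.demo` are assumed names
from gradio_client import Client
from app import demo, reverse_text

def test_prediction_function():
    # Unit test: exercise the core logic without launching any server.
    assert reverse_text("gradio") == "oidarg"

def test_end_to_end():
    # Integration test: launch locally without blocking, call it like a user would.
    _, url, _ = demo.launch(prevent_thread_lock=True)
    result = Client(url).predict("gradio", api_name="/predict")
    assert result == "oidarg"
    demo.close()
```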
NVIDIA released Isaac for Healthcare v0.4 with an end‑to‑end SO‑ARM starter workflow that takes developers from mixed simulation and real‑world data collection through fine‑tuning and real‑time deployment of surgical assistant robots. The pipeline fine‑tunes GR00T N1.5 on predominantly synthetic data...
Hugging Face researchers propose a “voice consent gate” to allow voice cloning only after an explicit, context‑specific spoken consent, and provide a demo and modular code to demonstrate the approach. The system combines autogenerated consent sentences, automatic speech recognition to...
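In spirit (not the authors' actual code), the gate reduces to: generate a unique consent sentence, transcribe the submitted recording with ASR, and only unlock cloning if the transcript matches. A minimal sketch with an illustrative Whisper checkpoint:

```python
import secrets
from transformers import pipeline

asr = pipeline("automatic-speech-recognition", model="openai/whisper-small")

def make_consent_sentence() -> str:
    # Context-specific, hard-to-replay wording (illustrative).
    return f"I consent to my voice being cloned for this session, code {secrets.token_hex(3)}."

def consent_gate(audio_path: str, expected_sentence: str) -> bool:
    def normalize(s: str) -> list[str]:
        return "".join(c for c in s.lower() if c.isalnum() or c.isspace()).split()
    transcript = asr(audio_path)["text"]
    # Unlock cloning only if the spoken words match the generated sentence.
    return normalize(transcript) == normalize(expected_sentence)
```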
Hugging Face has released huggingface_hub v1.0 after five years of development, positioning the library as the mature Python backbone for its Hub and the broader ML ecosystem. The package now powers roughly 200,000 dependent libraries and provides access to more...
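For readers who have not used the library directly, its bread-and-butter calls look like this (standard huggingface_hub API; the repo id and author filter are just familiar examples):

```python
from huggingface_hub import HfApi, hf_hub_download

# Download a single file from a model repo (cached locally).
config_path = hf_hub_download(repo_id="gpt2", filename="config.json")

# Query the Hub programmatically.
api = HfApi()
for model in api.list_models(author="openai-community", limit=5):
    print(model.id)
```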

Hugging Face announced major backend improvements to its datasets and huggingface_hub libraries that make streaming multi‑TB training data far more efficient and reliable using the same load_dataset(streaming=True) API. Changes—including a persistent data files cache, optimized resolution logic, Parquet prefetching and...
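The user-facing call is unchanged; a short sketch of the streaming path (the dataset id is illustrative):

```python
from datasets import load_dataset

# Same API as before; streaming=True returns an IterableDataset that reads
# Parquet shards lazily instead of downloading the whole corpus first.
ds = load_dataset("HuggingFaceFW/fineweb", split="train", streaming=True)

for example in ds.take(3):  # shards are prefetched under the hood in recent releases
    print(example["text"][:80])
```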
Hugging Face released LeRobot v0.4.0, a major upgrade to its open‑source robotics stack that introduces LeRobotDataset v3.0 (chunked episodes and streaming to handle OXE‑scale datasets >400 GB), new VLA models (PI0.5 and GR00T N1.5), and a plugin system for easier...
Meta and Hugging Face launched the OpenEnv Hub, an open community repository and 0.1 RFC standard for “agentic environments” that package tools, APIs, credentials and execution context into secure, sandboxed interfaces for training and deployment. The Hub—seeded with initial environments...
Hugging Face announced it is formally taking stewardship of the popular open‑source Sentence Transformers library — maintained by Tom Aarsen since 2023 — transitioning the project from TU Darmstadt’s UKP Lab to Hugging Face while retaining its Apache 2.0 license...
Hugging Face has partnered with VirusTotal to continuously scan all 2.2M+ public model and dataset repositories on the Hugging Face Hub, checking file hashes against VirusTotal’s threat‑intelligence database to surface prior detections and related metadata. The integration retrieves status (clean...

Hugging Face updated its open‑source AI Sheets tool to add full vision support, letting users view, extract, analyze, generate and edit images directly inside spreadsheets using thousands of open models via Inference Providers. The release enables tasks from receipt line‑item...
A new practical guide maps the rapidly evolving landscape of open‑weight vision‑language OCR models, explaining when to fine‑tune versus use off‑the‑shelf models and how to move beyond basic transcription to multimodal retrieval and document QA. It compares leading open models...
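As a point of reference for the "off-the-shelf" end of that spectrum, a plain transformers pipeline already handles basic transcription; the checkpoint below is an illustrative printed-text OCR model, not one of the guide's specific recommendations:

```python
from transformers import pipeline

# Off-the-shelf baseline: printed-text OCR with no fine-tuning.
ocr = pipeline("image-to-text", model="microsoft/trocr-base-printed")
print(ocr("receipt.png")[0]["generated_text"])
```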
Intel and Hugging Face benchmarked OpenAI’s GPT OSS on Google Cloud’s new C4 VMs (Intel Xeon 6/Granite Rapids) and report a 1.7x improvement in total cost of ownership versus prior-generation C3 instances. The C4 machines delivered 1.4x–1.7x better throughput per...
Get your VLM running in 3 simple steps on Intel CPUs (published October 15, 2025).