Linux's Sched_ext Will Prioritize Idle SMT Siblings For Better Performance

Linux's Sched_ext Will Prioritize Idle SMT Siblings For Better Performance

Phoronix
PhoronixMar 23, 2026

Key Takeaways

  • sched_ext now prefers idle SMT sibling before other CPUs
  • Improves CPU‑bound workload throughput by 2–3%
  • Reduces L1 cache misses and wake‑up latency
  • Patch targets Linux 7.1 merge window
  • Enables BPF‑based scheduler customizations

Summary

The Linux kernel’s extensible scheduler class sched_ext will now prioritize idle SMT siblings before other idle CPUs, altering the idle‑CPU selection policy. The change, slated for the Linux 7.1 merge window, first checks a sibling hyper‑thread, then same‑LLC, same‑NUMA, and finally any idle core. NVIDIA’s Andrea Righi measured a 2–3 % throughput boost on CPU‑bound workloads with this tweak. The patch resides in the sched_ext.git's “for‑next” branch and targets next month’s merge window.

Pulse Analysis

The Linux kernel’s extensible scheduler class, sched_ext, has long allowed developers to inject custom scheduling logic through BPF programs. In the upcoming 7.1 release, a modest yet strategic tweak redirects the idle‑CPU selection algorithm to favor an idle simultaneous‑multithreading (SMT) sibling before scanning other cores within the same last‑level cache (LLC). By checking the sibling first, the scheduler keeps a task on the same physical core, preserving the L1 data and instruction caches that would otherwise be flushed during a cross‑core migration. This change lands in the ‘for‑next’ branch of the sched_ext.git repository and is slated for the Linux 7.1 merge window.

NVIDIA’s Andrea Righi measured the impact on pure CPU‑bound workloads, reporting a consistent 2–3 % uplift in throughput when the system runs a number of threads equal to the count of SMT cores. The gain stems from two factors: reduced wake‑up latency because the sibling core is already powered and ready, and fewer cache misses as the task continues to use the same L1 cache lines. In large‑scale servers that rely on hyper‑threaded CPUs, even a few percentage points translate into noticeable energy savings and higher job completion rates.

The patch underscores how BPF‑driven scheduler extensions can deliver tangible performance improvements without altering the core kernel codebase. As Linux 7.1 approaches, cloud providers and high‑performance computing clusters are likely to adopt the new policy to squeeze extra efficiency from existing hardware. Moreover, the success of this targeted optimization may encourage further BPF experiments—such as workload‑aware core selection or dynamic frequency scaling—positioning Linux as a flexible platform for next‑generation data‑center orchestration.

Linux's sched_ext Will Prioritize Idle SMT Siblings For Better Performance

Comments

Want to join the conversation?