NIH-Supported Study Develops AI Algorithm Trained on EHR Data to Predict Rare Disease

NIH-Supported Study Develops AI Algorithm Trained on EHR Data to Predict Rare Disease

HTN – Health Tech Newspaper (UK)
HTN – Health Tech Newspaper (UK)Apr 20, 2026

Why It Matters

WEST demonstrates that weakly supervised AI can overcome data quality challenges in rare‑disease detection, promising earlier diagnoses and more efficient clinical workflows. Its scalability could reshape how health systems, especially the NHS, integrate AI to cut non‑clinical workload and improve patient outcomes.

Key Takeaways

  • WEST algorithm predicts rare lung diseases from noisy EHR data.
  • Outperformed baseline models for pulmonary hypertension and severe asthma detection.
  • Uses weakly supervised learning to train on both diagnosed and undiagnosed patients.
  • Scalable to longitudinal data for early disease onset and treatment response.
  • Supports NHS AI adoption, aiming to cut non‑clinical tasks by 10%.

Pulse Analysis

The National Institutes of Health‑backed study introduces the WEakly Supervised Transformer (WEST), a novel AI model that can extract diagnostic signals from the imperfect, fragmented data typical of electronic health records. By leveraging weak supervision, WEST learns from both confirmed cases and patients lacking a formal diagnosis, sidestepping the need for large, perfectly labeled datasets. This capability is especially valuable for rare diseases, where patient numbers are low and high‑quality annotations are scarce, positioning the algorithm as a potential game‑changer in clinical informatics.

In validation trials, WEST was applied to electronic records of patients at risk for pulmonary hypertension and severe asthma—two low‑prevalence lung conditions. The model achieved the highest predictive scores among all benchmark classifiers, correctly flagging cases that clinicians later confirmed. By outperforming traditional supervised approaches, WEST demonstrates that weakly supervised transformers can reduce the diagnostic lag that often plagues rare‑disease patients, potentially shortening the average time to diagnosis from years to months and improving eligibility for early interventions.

The next phase will extend WEST to longitudinal data streams, enabling predictions of disease onset and individualized treatment response. Such foresight aligns with the NHS’s digital agenda, which seeks to embed AI tools that trim non‑clinical workload by roughly 10 % and boost digital skill adoption. If scaled nationally, the algorithm could streamline rare‑disease pathways, lower healthcare costs, and provide a template for AI‑driven precision medicine across other specialties, reinforcing the broader momentum toward cloud‑based, secure health‑tech ecosystems.

NIH-supported study develops AI algorithm trained on EHR data to predict rare disease

Comments

Want to join the conversation?

Loading comments...