
The catalog fills a long‑standing gap in genomic reference data, allowing clinicians and scientists to pinpoint repeat expansions that underlie dozens of genetic disorders and to develop targeted therapies.
Tandem repeats—short DNA motifs that are copied head‑to‑tail—constitute roughly 3 % of the human genome and are a major source of genetic diversity. While single‑nucleotide variants have been cataloged exhaustively, repeat expansions remain under‑represented in reference databases, despite their proven role in disorders such as Huntington’s disease, myotonic dystrophy, and certain cancers. The difficulty of accurately measuring repeat length with short‑read sequencing has historically limited large‑scale analyses, leaving a critical blind spot for both basic research and clinical genomics.
The newly released Human Genome Tandem Repeat Catalog addresses that blind spot by aggregating high‑resolution repeat profiles from long‑read assemblies, optical mapping, and population‑scale short‑read datasets. Over one million repeat loci are annotated with motif composition, copy‑number distribution, and allele frequencies derived from more than 100 k genomes spanning diverse ancestries. The resource is hosted on an open‑access portal that syncs with UCSC, Ensembl, and the NCBI Genome Data Viewer, while a RESTful API enables bulk queries and integration into bioinformatics pipelines.
By making repeat variation transparent, the catalog empowers researchers to link previously ambiguous loci to disease phenotypes, refine genome‑wide association studies, and design CRISPR‑based interventions with greater precision. Clinical laboratories can now incorporate repeat allele frequency thresholds into diagnostic reporting, reducing false‑positive calls for pathogenic expansions. Looking ahead, the consortium plans quarterly updates as new sequencing cohorts become available, ensuring the catalog remains a living reference that will shape genomics, drug discovery, and personalized medicine for years to come.
Comments
Want to join the conversation?
Loading comments...