DNAcrypt-AI: A Genome-Scale Bioinformatics Framework for Coordinate-Based Cryptography Using Artificial Intelligence

DNAcrypt-AI: A Genome-Scale Bioinformatics Framework for Coordinate-Based Cryptography Using Artificial Intelligence

Research Square – News/Updates
Research Square – News/UpdatesJun 16, 2026

Why It Matters

By turning the human reference genome into a high‑entropy cryptographic substrate, DNAcrypt‑AI offers a novel, scalable method for secure data concealment that could reshape bio‑informatics‑adjacent security solutions.

Key Takeaways

  • DNAcrypt-AI encodes data across random genome coordinates.
  • Uses Covary ML to map coordinates to characters without alignment.
  • Supports 6–90 character messages with constant encoded size.
  • Open-source on GitHub, leverages hg19/hg38 reference genomes.

Pulse Analysis

The concept of biological entropy has long intrigued researchers seeking unconventional data reservoirs. Traditional genome‑based encoding schemes focus on narrow features such as short tandem repeats or single‑nucleotide polymorphisms, leaving the vast structural complexity of the human genome untapped. DNAcrypt‑AI flips this paradigm by treating the entire reference assembly as a high‑entropy canvas, distributing information across dispersed chromosomal loci. This approach not only maximizes the entropy pool but also sidesteps the need for synthetic DNA synthesis, lowering barriers to practical deployment.

At the core of DNAcrypt‑AI is a three‑stage pipeline. First, a genomic keyring randomly selects coordinates, creating a dispersed “key” that is inherently resistant to pattern analysis. Second, the FAS2rDNA reconstruction engine expands these points into multi‑FASTA corpora, preserving the native genomic context. Finally, Covary—a translation‑aware, alignment‑free machine‑learning model—learns coordinate‑to‑character relationships, enabling deterministic decoding without explicit nucleotide constraints. The system consistently encodes and retrieves alphanumeric strings ranging from six to ninety characters, while the physical data footprint remains fixed, demonstrating true size‑independent encoding.

The implications extend beyond academic curiosity. By leveraging an existing, universally accessible reference genome, DNAcrypt‑AI offers a cost‑effective, scalable alternative to conventional cryptographic primitives, especially for applications where data must be hidden in plain sight within biological datasets. Its open‑source nature invites community‑driven enhancements, such as integration with cloud‑based genomic storage or expansion to other reference assemblies. As AI‑driven bioinformatics continues to mature, frameworks like DNAcrypt‑AI could catalyze a new class of security tools that blend computational genomics with information theory.

DNAcrypt-AI: A genome-scale bioinformatics framework for coordinate-based cryptography using artificial intelligence

Comments

Want to join the conversation?

Loading comments...