Scalable design of orthogonal DNA barcode libraries

Gokul Gowri; Kuanwei Sheng; Peng Yin
Nat. Comput. Sci., 2024
https://doi.org/10.1038/s43588-024-00646-z

Abstract

Orthogonal DNA barcode library design is an essential task in bioengineering. Here we present seqwalk, an efficient method for designing barcode libraries that satisfy a sequence symmetry minimization (SSM) heuristic for orthogonality, with theoretical guarantees of maximal or near-maximal library size under certain design constraints. Seqwalk encodes SSM constraints in a de Bruijn graph representation of sequence space, enabling the application of recent advances in discrete mathematics1 to the problem of orthogonal sequence design. We demonstrate the scalability of seqwalk by designing a library of >106 SSM-satisfying barcode sequences in less than 20 s on a standard laptop.

logo
logo