Abstract

Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Using mutagenesis typically requires screening sizable random DNA libraries, which limits the designs to span merely a short section of the promoter and restricts their control of gene expression. Here, we prototype a deep learning strategy based on generative adversarial networks (GAN) by learning directly from genomic and transcriptomic data. Our ExpressionGAN can traverse the entire regulatory sequence-expression landscape in a gene-specific manner, generating regulatory DNA with prespecified target mRNA levels spanning the whole gene regulatory structure including coding and adjacent non-coding regions. Despite high sequence divergence from natural DNA, in vivo measurements show that 57% of the highly-expressed synthetic sequences surpass the expression levels of highly-expressed natural controls. This demonstrates the applicability and relevance of deep generative design to expand our knowledge and control of gene expression regulation in any desired organism, condition or tissue.

Keywords

biokemija;izražanje genov;bioinformatika;globoko učenje;sintetična biologija;

Data

Language: English
Year of publishing:
Typology: 1.01 - Original Scientific Article
Organization: NIB - National Institute of Biology
UDC: 577
COBISS: 120155139 Link will open in a new window
ISSN: 2041-1723
Views: 9
Downloads: 3
Average score: 0 (0 votes)
Metadata: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Other data

Secondary language: Slovenian
Secondary keywords: biokemija;izražanje genov;bioinformatika;globoko učenje;sintetična biologija;
Source comment: Nasl. z nasl. zaslona; Opis vira z dne 5. 9. 2022; Št. članka: 5099;
Pages: str. 1-17
Issue: ǂVol. ǂ13
Chronology: 30 Aug. 2022
DOI: 10.1038/s41467-022-32818-8
ID: 24543206