Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility. / Salvatore, Marco; Horlacher, Marc; Marsico, Annalisa; Winther, Ole; Andersson, Robin.

I: NAR Genomics and Bioinformatics, Bind 5, Nr. 2, 026, 2023.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Salvatore, M, Horlacher, M, Marsico, A, Winther, O & Andersson, R 2023, 'Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility', NAR Genomics and Bioinformatics, bind 5, nr. 2, 026. https://doi.org/10.1093/nargab/lqad026

APA

Salvatore, M., Horlacher, M., Marsico, A., Winther, O., & Andersson, R. (2023). Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility. NAR Genomics and Bioinformatics, 5(2), [026]. https://doi.org/10.1093/nargab/lqad026

Vancouver

Salvatore M, Horlacher M, Marsico A, Winther O, Andersson R. Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility. NAR Genomics and Bioinformatics. 2023;5(2). 026. https://doi.org/10.1093/nargab/lqad026

Author

Salvatore, Marco ; Horlacher, Marc ; Marsico, Annalisa ; Winther, Ole ; Andersson, Robin. / Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility. I: NAR Genomics and Bioinformatics. 2023 ; Bind 5, Nr. 2.

Bibtex

@article{16803ffa02ca4baba7ad6992cfc958f0,
title = "Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility",
abstract = "Dysfunction of regulatory elements through genetic variants is a central mechanism in the pathogenesis of disease. To better understand disease etiology, there is consequently a need to understand how DNA encodes regulatory activity. Deep learning methods show great promise for modeling of biomolecular data from DNA sequence but are limited to large input data for training. Here, we develop ChromTransfer, a transfer learning method that uses a pre-trained, cell-type agnostic model of open chromatin regions as a basis for fine-tuning on regulatory sequences. We demonstrate superior performances with ChromTransfer for learning cell-type specific chromatin accessibility from sequence compared to models not informed by a pre-trained model. Importantly, ChromTransfer enables fine-tuning on small input data with minimal decrease in accuracy. We show that ChromTransfer uses sequence features matching binding site sequences of key transcription factors for prediction. Together, these results demonstrate ChromTransfer as a promising tool for learning the regulatory code.",
keywords = "TRANSCRIPTION FACTORS, GENE-EXPRESSION, ENHANCERS, BINDING, GENOME",
author = "Marco Salvatore and Marc Horlacher and Annalisa Marsico and Ole Winther and Robin Andersson",
year = "2023",
doi = "10.1093/nargab/lqad026",
language = "English",
volume = "5",
journal = "NAR Genomics and Bioinformatics",
issn = "2631-9268",
publisher = "Oxford University Press",
number = "2",

}

RIS

TY - JOUR

T1 - Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility

AU - Salvatore, Marco

AU - Horlacher, Marc

AU - Marsico, Annalisa

AU - Winther, Ole

AU - Andersson, Robin

PY - 2023

Y1 - 2023

N2 - Dysfunction of regulatory elements through genetic variants is a central mechanism in the pathogenesis of disease. To better understand disease etiology, there is consequently a need to understand how DNA encodes regulatory activity. Deep learning methods show great promise for modeling of biomolecular data from DNA sequence but are limited to large input data for training. Here, we develop ChromTransfer, a transfer learning method that uses a pre-trained, cell-type agnostic model of open chromatin regions as a basis for fine-tuning on regulatory sequences. We demonstrate superior performances with ChromTransfer for learning cell-type specific chromatin accessibility from sequence compared to models not informed by a pre-trained model. Importantly, ChromTransfer enables fine-tuning on small input data with minimal decrease in accuracy. We show that ChromTransfer uses sequence features matching binding site sequences of key transcription factors for prediction. Together, these results demonstrate ChromTransfer as a promising tool for learning the regulatory code.

AB - Dysfunction of regulatory elements through genetic variants is a central mechanism in the pathogenesis of disease. To better understand disease etiology, there is consequently a need to understand how DNA encodes regulatory activity. Deep learning methods show great promise for modeling of biomolecular data from DNA sequence but are limited to large input data for training. Here, we develop ChromTransfer, a transfer learning method that uses a pre-trained, cell-type agnostic model of open chromatin regions as a basis for fine-tuning on regulatory sequences. We demonstrate superior performances with ChromTransfer for learning cell-type specific chromatin accessibility from sequence compared to models not informed by a pre-trained model. Importantly, ChromTransfer enables fine-tuning on small input data with minimal decrease in accuracy. We show that ChromTransfer uses sequence features matching binding site sequences of key transcription factors for prediction. Together, these results demonstrate ChromTransfer as a promising tool for learning the regulatory code.

KW - TRANSCRIPTION FACTORS

KW - GENE-EXPRESSION

KW - ENHANCERS

KW - BINDING

KW - GENOME

U2 - 10.1093/nargab/lqad026

DO - 10.1093/nargab/lqad026

M3 - Journal article

C2 - 37007588

VL - 5

JO - NAR Genomics and Bioinformatics

JF - NAR Genomics and Bioinformatics

SN - 2631-9268

IS - 2

M1 - 026

ER -

ID: 344368575