Discovering functionally important sites in proteins

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Discovering functionally important sites in proteins. / Cagiada, Matteo; Bottaro, Sandro; Lindemose, Søren; Schenstrøm, Signe M.; Stein, Amelie; Hartmann-Petersen, Rasmus; Lindorff-Larsen, Kresten.

I: Nature Communications, Bind 14, Nr. 1, 4175, 2023.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Cagiada, M, Bottaro, S, Lindemose, S, Schenstrøm, SM, Stein, A, Hartmann-Petersen, R & Lindorff-Larsen, K 2023, 'Discovering functionally important sites in proteins', Nature Communications, bind 14, nr. 1, 4175. https://doi.org/10.1038/s41467-023-39909-0

APA

Cagiada, M., Bottaro, S., Lindemose, S., Schenstrøm, S. M., Stein, A., Hartmann-Petersen, R., & Lindorff-Larsen, K. (2023). Discovering functionally important sites in proteins. Nature Communications, 14(1), [4175]. https://doi.org/10.1038/s41467-023-39909-0

Vancouver

Cagiada M, Bottaro S, Lindemose S, Schenstrøm SM, Stein A, Hartmann-Petersen R o.a. Discovering functionally important sites in proteins. Nature Communications. 2023;14(1). 4175. https://doi.org/10.1038/s41467-023-39909-0

Author

Cagiada, Matteo ; Bottaro, Sandro ; Lindemose, Søren ; Schenstrøm, Signe M. ; Stein, Amelie ; Hartmann-Petersen, Rasmus ; Lindorff-Larsen, Kresten. / Discovering functionally important sites in proteins. I: Nature Communications. 2023 ; Bind 14, Nr. 1.

Bibtex

@article{ab7dcc3bced44b65b0e5685f95689654,
title = "Discovering functionally important sites in proteins",
abstract = "Proteins play important roles in biology, biotechnology and pharmacology, and missense variants are a common cause of disease. Discovering functionally important sites in proteins is a central but difficult problem because of the lack of large, systematic data sets. Sequence conservation can highlight residues that are functionally important but is often convoluted with a signal for preserving structural stability. We here present a machine learning method to predict functional sites by combining statistical models for protein sequences with biophysical models of stability. We train the model using multiplexed experimental data on variant effects and validate it broadly. We show how the model can be used to discover active sites, as well as regulatory and binding sites. We illustrate the utility of the model by prospective prediction and subsequent experimental validation on the functional consequences of missense variants in HPRT1 which may cause Lesch-Nyhan syndrome, and pinpoint the molecular mechanisms by which they cause disease.",
author = "Matteo Cagiada and Sandro Bottaro and S{\o}ren Lindemose and Schenstr{\o}m, {Signe M.} and Amelie Stein and Rasmus Hartmann-Petersen and Kresten Lindorff-Larsen",
note = "Publisher Copyright: {\textcopyright} 2023. The Author(s).",
year = "2023",
doi = "10.1038/s41467-023-39909-0",
language = "English",
volume = "14",
journal = "Nature Communications",
issn = "2041-1723",
publisher = "nature publishing group",
number = "1",

}

RIS

TY - JOUR

T1 - Discovering functionally important sites in proteins

AU - Cagiada, Matteo

AU - Bottaro, Sandro

AU - Lindemose, Søren

AU - Schenstrøm, Signe M.

AU - Stein, Amelie

AU - Hartmann-Petersen, Rasmus

AU - Lindorff-Larsen, Kresten

N1 - Publisher Copyright: © 2023. The Author(s).

PY - 2023

Y1 - 2023

N2 - Proteins play important roles in biology, biotechnology and pharmacology, and missense variants are a common cause of disease. Discovering functionally important sites in proteins is a central but difficult problem because of the lack of large, systematic data sets. Sequence conservation can highlight residues that are functionally important but is often convoluted with a signal for preserving structural stability. We here present a machine learning method to predict functional sites by combining statistical models for protein sequences with biophysical models of stability. We train the model using multiplexed experimental data on variant effects and validate it broadly. We show how the model can be used to discover active sites, as well as regulatory and binding sites. We illustrate the utility of the model by prospective prediction and subsequent experimental validation on the functional consequences of missense variants in HPRT1 which may cause Lesch-Nyhan syndrome, and pinpoint the molecular mechanisms by which they cause disease.

AB - Proteins play important roles in biology, biotechnology and pharmacology, and missense variants are a common cause of disease. Discovering functionally important sites in proteins is a central but difficult problem because of the lack of large, systematic data sets. Sequence conservation can highlight residues that are functionally important but is often convoluted with a signal for preserving structural stability. We here present a machine learning method to predict functional sites by combining statistical models for protein sequences with biophysical models of stability. We train the model using multiplexed experimental data on variant effects and validate it broadly. We show how the model can be used to discover active sites, as well as regulatory and binding sites. We illustrate the utility of the model by prospective prediction and subsequent experimental validation on the functional consequences of missense variants in HPRT1 which may cause Lesch-Nyhan syndrome, and pinpoint the molecular mechanisms by which they cause disease.

U2 - 10.1038/s41467-023-39909-0

DO - 10.1038/s41467-023-39909-0

M3 - Journal article

C2 - 37443362

AN - SCOPUS:85164843304

VL - 14

JO - Nature Communications

JF - Nature Communications

SN - 2041-1723

IS - 1

M1 - 4175

ER -

ID: 360248935