DeepLoc 2.1: multi-label membrane protein type prediction using protein language models

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 833 KB, PDF-dokument

DeepLoc 2.0 is a popular web server for the prediction of protein subcellular localization and sorting signals. Here, we introduce DeepLoc 2.1, which additionally classifies the input proteins into the membrane protein types Transmembrane, Peripheral, Lipid-anchored and Soluble. Leveraging pre-trained transformer-based protein language models, the server utilizes a three-stage architecture for sequence-based, multi-label predictions. Comparative evaluations with other established tools on a test set of 4933 eukaryotic protein sequences, constructed following stringent homology partitioning, demonstrate state-of-the-art performance. Notably, DeepLoc 2.1 outperforms existing models, with the larger ProtT5 model exhibiting a marginal advantage over the ESM-1B model. The web server is available at https://services.healthtech.dtu.dk/services/DeepLoc-2.1.
OriginalsprogEngelsk
TidsskriftNucleic Acids Research
Vol/bind52
Udgave nummerW1
Sider (fra-til)W215-W220
Antal sider6
ISSN0305-1048
DOI
StatusUdgivet - 2024

Bibliografisk note

Publisher Copyright:
© 2024 The Author(s).

ID: 399110403