Skip to main navigation Skip to search Skip to main content

Improving protein succinylation sites prediction using embeddings from protein language model

  • Suresh Pokharel
  • , Pawel Pratyush
  • , Michael Heinzinger
  • , Robert H Newman
  • , Dukka B. Kc
  • Michigan Technological University
  • Technische Universität München
  • The University of North Carolina at Chapel Hill

Research output: Contribution to journalArticlepeer-review

65 Scopus citations

Abstract

Protein succinylation is an important post-translational modification (PTM) responsible for many vital metabolic activities in cells, including cellular respiration, regulation, and repair. Here, we present a novel approach that combines features from supervised word embedding with embedding from a protein language model called ProtT5-XL-UniRef50 (hereafter termed, ProtT5) in a deep learning framework to predict protein succinylation sites. To our knowledge, this is one of the first attempts to employ embedding from a pre-trained protein language model to predict protein succinylation sites. The proposed model, dubbed LMSuccSite, achieves state-of-the-art results compared to existing methods, with performance scores of 0.36, 0.79, 0.79 for MCC, sensitivity, and specificity, respectively. LMSuccSite is likely to serve as a valuable resource for exploration of succinylation and its role in cellular physiology and disease.
Original languageEnglish
Article number16933
JournalScientific Reports
Volume12
Issue number1
DOIs
StatePublished - Dec 1 2022

Fingerprint

Dive into the research topics of 'Improving protein succinylation sites prediction using embeddings from protein language model'. Together they form a unique fingerprint.

Cite this