This site needs JavaScript to work properly. Please enable it to take advantage of the complete set of features!
Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log in
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 23;25(6):bbae583.
doi: 10.1093/bib/bbae583.

ToxGIN: an In silico prediction model for peptide toxicity via graph isomorphism networks integrating peptide sequence and structure information

Affiliations

ToxGIN: an In silico prediction model for peptide toxicity via graph isomorphism networks integrating peptide sequence and structure information

Qiule Yu et al. Brief Bioinform. .

Abstract

Peptide drugs have demonstrated enormous potential in treating a variety of diseases, yet toxicity prediction remains a significant challenge in drug development. Existing models for prediction of peptide toxicity largely rely on sequence information and often neglect the three-dimensional (3D) structures of peptides. This study introduced a novel model for short peptide toxicity prediction, named ToxGIN. The model utilizes Graph Isomorphism Network (GIN), integrating the underlying amino acid sequence composition and the 3D structures of peptides. ToxGIN comprises three primary modules: (i) Sequence processing module, converting peptide 3D structures and sequences into information of nodes and edges; (ii) Feature extraction module, utilizing GIN to learn discriminative features from nodes and edges; (iii) Classification module, employing a fully connected classifier for toxicity prediction. ToxGIN performed well on the independent test set with F1 score = 0.83, AUROC = 0.91, and Matthews correlation coefficient = 0.68, better than existing models for prediction of peptide toxicity. These results validated the effectiveness of integrating 3D structural information with sequence data using GIN for peptide toxicity prediction. The proposed ToxGIN and data can be freely accessible at https://github.com/cihebiyql/ToxGIN.

Keywords: 3D structure of peptides; computational toxicology; deep learning; graph isomorphism network (GIN); peptide toxicity prediction; protein language models.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Flowchart of the proposed ToxGIN model, comprising three main modules: (A) sequence processing module, which extracts information about nodes and edges from the peptide's 3D structure and sequence; (B) feature extraction module, which employs GIN to aggregate information from neighboring nodes and edges; and (C) classification module, which uses these features to generate the toxicity probability.
Figure 2
Figure 2
The ten-fold cross validation results of ESM2, AAindex, AAC, ASDC, Pse-AAC and APAAC are based on the six basic classifiers. (A) Results based on RF. (B) Results based on SVM. (C) Results based on GNB. (D) Results based on LightGBM. (E) Results based on LR. (F) Results based on KNN.
Figure 3
Figure 3
Feature visualization of ESM2, AAindex and other four hand-crafted features. (A) Feature visualization of ESM2. (B) Feature visualization of AAindex. (C) Feature visualization of AAC. (D) Feature visualization of ASDC. (E) Feature visualization of Pse-AAC. (F) Feature visualization of APAAC.
Figure 4
Figure 4
Performance comparison of ToxGIN and its variants across evaluation metrics. The figure displays the effectiveness of the full model (ToxGIN with ESM2_t36) alongside its variants: Excluding 3D structures (w/o structures), ESM2 features (w/o ESM2), and AAindex properties (w/o AAindex), as well as results from various ESM2 model sizes (ESM2_t33, ESM2_t30, ESM2_t12).

References

    1. Albericio F, Kruger HG. Therapeutic peptides. Future Med Chem 2012;4:1527–31. 10.4155/fmc.12.94. - DOI - PubMed
    1. Guntuboina C, Das A, Mollaei P. et al. . PeptideBERT: A language model based on transformers for peptide property prediction. J Phys Chem Lett 2023;14:10427–34. 10.1021/acs.jpclett.3c02398. - DOI - PMC - PubMed
    1. Chiangjong W, Chutipongtanate S, Hongeng S. Anticancer peptide: Physicochemical property, functional aspect and trend in clinical application (review). Int J Oncol 2020;57:678–96. 10.3892/ijo.2020.5099. - DOI - PMC - PubMed
    1. Lei J, Sun LC, Huang S. et al. . The antimicrobial peptides and their potential clinical applications. Am J Transl Res 2019;11:3919–31. - PMC - PubMed
    1. Craik DJ, Fairlie DP, Liras S. et al. . The future of peptide-based drugs. Chem Biol Drug Des 2013;81:136–47. 10.1111/cbdd.12055. - DOI - PubMed
Cite

AltStyle によって変換されたページ (->オリジナル) /