Name	Name	Last commit message	Last commit date
Latest commit History 2 Commits
data	data
models--facebook--esm2_t30_150M_UR50D	models--facebook--esm2_t30_150M_UR50D
models	models
pkl	pkl
.gitattributes	.gitattributes
ESM2-AFPpred.py	ESM2-AFPpred.py
README.md	README.md
c_AFPs-Gen.py	c_AFPs-Gen.py
environment.yaml	environment.yaml
readpkl.py	readpkl.py
writepkl.py	writepkl.py

Name

Last commit message

Last commit date

Latest commit

History

2 Commits

data

models--facebook--esm2_t30_150M_UR50D

pkl

Accelerating de novo design of antifungal peptides using pre-trained protein language models by Kedong Yin, Ruifang Li et al.

This repository consists of two parts: 1. A method for generating candidate antifungal peptides (AFPs) sequences based on recombining dominant amino acids (dipeptide components). 2. A method for predicting AFP activity based on the ESM-2 pre-trained model (ESM2-AFPpred). The combination of these two methods can accelerate the de novo design of AFPs.

It is based on the article "Deep learning combined with quantitative structure - activity relationship accelerates de novo design of antifungal peptides" by Kedong Yin, Ruifang Li, and others. This repository includes Python code, and weight files for ESM2-AFPpred.

1.Download pre trained models and cache them locally

Please download the pre trained model cache used in this study from Hugging Face (facebook/esm2_t30_150M_UR50D at main) and store it in .\models--facebook--esm2_t30_150M_UR50D.

The cache files that need to be downloaded include:

config.json
pytorch_model.bin
special_tokens_map.json
tokenizer_config.json
vocab.txt

2.Generation of candidate antifungal peptides

python3 c_AFPs-Gen.py

The dominant amino acids need to be set at n1-ni. Please refer to the main text for the calculation of dominant amino acids. Use MySQL database to store the generated candidate antifungal peptide sequences and their corresponding physicochemical properties. Users need to set parameters such as user, password, host, database, port, etc. themselves. 'readpkl.py' and 'writepkl.py' are used to read and write the weights of amino acid physicochemical properties.

3.Prediction of candidate antifungal peptides

python3 ESM2-AFPpred.py

Enter the peptide sequence to be predicted in the 'data\input.csv' file, run the code, and the result will be stored in the 'data\output.csv' file.

4.environment

environment.yaml

All the environments and packages that this project relies on have been packaged into 'environment.yaml'.

If you need to access the dataset or other code, please contact Kedong Yin( 2703937842@qq.com ).

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

shallFun4Learning/AFP_DL

Folders and files

Latest commit

History

Repository files navigation

Accelerating de novo design of antifungal peptides using pre-trained protein language models by Kedong Yin, Ruifang Li et al.

1.Download pre trained models and cache them locally

2.Generation of candidate antifungal peptides

3.Prediction of candidate antifungal peptides

4.environment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Accelerating de novo design of antifungal peptides using pre-trained protein language models by Kedong Yin, Ruifang Li et al.

1.Download pre trained models and cache them locally

2.Generation of candidate antifungal peptides

3.Prediction of candidate antifungal peptides

4.environment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages