-
Loading metrics
Open Access
Peer-reviewed
Methods
TastepepAI: An artificial intelligence platform for taste peptide de novo design
TastepepAI: An artificial intelligence platform for taste peptide de novo design
- Jianda Yue,
- Tingting Li,
- Jian Ouyang,
- Jiawei Xu,
- Hua Tan,
- Zihui Chen,
- Changsheng Han,
- Huanyu Li,
- Songping Liang,
- Zhonghua Liu
-
- Published: October 16, 2025
- https://doi.org/10.1371/journal.pcbi.1013602
Peer Review History
| Original SubmissionAugust 6, 2025 |
|---|
|
|
Decision Letter
-
Arne Elofsson, Editor, Mohammad Sadegh Taghizadeh, Editor
TastepepAI:An artificial intelligence platform for taste peptide de novo design PLOS Computational Biology Dear Dr. Wang, Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript within 60 days (November 09, 2025 at 23:59 PM). If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter We look forward to receiving your revised manuscript. Kind regards, Mohammad Sadegh Taghizadeh, Ph.D. Academic Editor PLOS Computational Biology Arne Elofsson Section Editor PLOS Computational Biology Journal Requirements: 1) Please provide an Author Summary. This should appear in your manuscript between the Abstract (if applicable) and the Introduction, and should be 150-200 words long. The aim should be to make your findings accessible to a wide audience that includes both scientists and non-scientists. Sample summaries can be found on our website under Submission Guidelines: https://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-parts-of-a-submission 2) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines: https://journals.plos.org/ploscompbiol/s/figures 3) We have noticed that you have uploaded Supporting Information files, but you have not included a list of legends. Please add a full list of legends for your Supporting Information files after the references list. 4) Some material included in your submission may be copyrighted. According to PLOSu2019s copyright policy, authors who use figures or other material (e.g., graphics, clipart, maps) from another author or copyright holder must demonstrate or obtain permission to publish this material under the Creative Commons Attribution 4.0 International (CC BY 4.0) License used by PLOS journals. Please closely review the details of PLOSu2019s copyright requirements here: PLOS Licenses and Copyright. If you need to request permissions from a copyright holder, you may use PLOS's Copyright Content Permission form. Please respond directly to this email and provide any known details concerning your material's license terms and permissions required for reuse, even if you have not yet obtained copyright permissions or are unsure of your material's copyright compatibility. Once you have responded and addressed all other outstanding technical requirements, you may resubmit your manuscript within Editorial Manager. Potential Copyright Issues: - Figures 1 and 6. Please confirm whether you drew the images / clip-art within the figure panels by hand. If you did not draw the images, please provide (a) a link to the source of the images or icons and their license / terms of use; or (b) written permission from the copyright holder to publish the images or icons under our CC BY 4.0 license. Alternatively, you may replace the images with open source alternatives. See these open source resources you may use to replace images / clip-art: - https://commons.wikimedia.org 5) Please ensure that the funders and grant numbers match between the Financial Disclosure field and the Funding Information tab in your submission form. Note that the funders must be provided in the same order in both places as well. - State the initials, alongside each funding source, of each author to receive each grant. For example: "This work was supported by the National Institutes of Health (####### to AM; ###### to CJ) and the National Science Foundation (###### to AM)." - State what role the funders took in the study. If the funders had no role in your study, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.". If you did not receive any funding for this study, please simply state: u201cThe authors received no specific funding for this work.u201d Reviewers' comments: Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: Yue et al. developed an artificial intelligence platform for designing taste peptides. This manuscript addresses a topic that may be of interest to ploscb as a methods paper. This work provides an artificial intelligence tool for generating target peptides with the desired taste profiles. This approach allows for a reduction in time and resources, thus enabling broader application in the food industry. Although generally well-structured and presented, there are two points that could be further clarified to support this work. In the results, section 2.2 LA-VAE: a loss-supervised adaptive variational autoencoder with contrastive learning for controlled taste peptide generation paragraph, it's not clear what has been done: ‘To meet the demands for precise control of specific taste properties (e.g., bitter taste elimination40,53) in practical applications, we incorporated a contrastive learning-based properties, enabling bilateral partitioning of training data: sequences with target taste avoidance mechanism into the LA-VAE architecture (Figure 3B) This mechanism allows users to explicitly specify desired (preferred) and avoided (aversive) taste features constitute the positive set, while those with avoided features form the negative set. Through parallel training on these complementary datasets, LA-VAE establishes a structured contrastive representation framework in the latent space.‘ More specifically, what is meant by "bilateral partitioning of the training data"? Furthermore, does this "partitioning" reduce the initial dataset and thus the model training? How does this partitioning ultimately impact decision-making or predictions? In the results section, 2.4. Design and validation of safe taste peptides via TastePepAI paragraph: If the training data with "currently no bitterness" were removed, would this have a negative or positive impact on the model's learning? ‘This limited bitterness suppression might be attributed to residual impurities from peptide synthesis (e.g., trifluoroacetic acid, sodium acetate) potentially interfering with sensory evaluation10,56,57, coupled with possible taste evaluation blind spots in training data labeled as ‘currently without bitterness’58,59. Consequently, accurate assessment of bitterness suppression requires further validation.’ How do the authors intend to proceed in validating the suppression of bitterness? Reviewer #2: The manuscript introduces TastePepAI, an innovative AI framework for customized taste peptide design and safety assessment. The integration of LA-VAE with a toxicity prediction module represents a promising advance in computational peptide engineering and addresses a timely challenge in food science. However, the work suffers from insufficient methodological detail and limited validation. 1. During LA-VAE training, amino acid sequences are reconstructed via an autoencoder to obtain latent representations. However, it remains unclear which two distributions are compared in the KL divergence term. The paper mentions contrastive learning, but does not specify how the contrastive loss is calculated. Please clearly define all loss functions used, together with their weighting scheme in the total loss. 2. The inference stage is insufficiently explained. What is the input during inference? If it is only the desired taste profile (e.g., a character string of taste attributes), how are the corresponding candidate peptide sequences generated from this input? 3. Some hyperparameters lack justification, including latent space dimensionality (2000), K value in KNN (k = 5), and the model weights in SpepToxPred. Please explain the rationale or tuning procedure for these parameters. 4. The criterion that "sequences that simultaneously satisfy both ‘positive sample affinity’ and ‘negative sample repulsion’ are selected as final outputs" is unclear. Please provide a quantitative definition of this criterion. 5. The training dataset contains only 1,131 sequences but encompasses many taste combinations. For certain combinations, the number of available sequences may be very small. Please discuss how the model ensures sufficient positive and negative samples for effective learning. 6. The current validation is insufficient, as the study only demonstrates the generation of bitter-avoidance peptides. It is recommended to include simpler baseline experiments, such as generating peptides with a single taste or with simple dual-taste combinations, and report the success rate. 7. While the experimental verification is commendable, demonstrating the validity of this computational design method based solely on a single experimental set may be insufficient. Ideally, the method should be systematically evaluated using independent test sets and quantitative metrics (e.g., sequence recovery rates analogous to protein design studies). If this is challenging, at minimum, a confidence or uncertainty measure for the generated sequences should be provided to allow users to assess their reliability. 8. The term "losstol" appears multiple times but is not defined. Please provide a clear explanation. 9. The description of the dataset (sources, size, preprocessing and filtering criteria) should be provided in the Methods section. The Results section can present statistical characteristics or distributional analyses. 10. Regarding uncertain taste labels ("x"), the paper introduces this notation but does not explain how the model processes such labels. 11. Ensure consistent terminology when describing taste categories (e.g., always use the plural form: "sweet, salty, and umami tastes"). Reviewer #3: The paper proposes a computational pipeline for the generation of taste peptides. The main components of the methodology are loss-supervised adaptive variational autoencoder (LA-VAE) for the generation of peptides, combined with a toxicity prediction model. The approach has been validated in vitro (by testing synthesized peptides on cell cultures) and through an electronic tongue. A collection of generated taste peptides has been made available through the development of a public database. The paper is interesting, methodologically sound and well written. The approach is original and represents a significant advancement of the state of the art. Consequently, my overall opinion on the paper is positive. I only have a few minor comments: - The github repository with the source code is available, but the web pages at the domain http://www.wang-subgroup.com/ (which seem to contain a lot of necessary stuff) are not available. When I try to connect (from Europe) I get an error page with text in (probably) Chinese language. Maybe this is a small technical problem, easy to fix (this is why I suggest "minor" revision), but the availability of these web pages is crucial for the acceptance of this paper - Why the "all_test.fast" file in the github repository contains only very short sequences? - In the abstract: "to efficiently optimizes" -> "to efficiently optimize" and also "facilitates" -> "facilitate" - In the introduction: "Here, this paper" sounds weird ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: None Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files. If you choose "no", your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy . Reviewer #1: No Reviewer #2: No Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: Reproducibility: ?> |
| Revision 1 |
|
|
Decision Letter
-
Arne Elofsson, Editor, Mohammad Sadegh Taghizadeh, Editor
PCOMPBIOL-D-25-01523R1 TastepepAI:An artificial intelligence platform for taste peptide de novo design PLOS Computational Biology Dear Dr. Wang, Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript within 30 days (November 01, 2025, at 23:59). If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. We look forward to receiving your revised manuscript. Kind regards, Mohammad Sadegh Taghizadeh, Ph.D. Academic Editor PLOS Computational Biology Arne Elofsson Section Editor PLOS Computational Biology Journal Requirements: 1) Please revise your current Competing Interest statement to the standard "The authors have declared that no competing interests exist." 2) Please ensure that the links provided in the Data Availability statement are working. Note: If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise. Reviewers' comments: Reviewer's Responses to Questions Reviewer #1: Yue et al. responded appropriately to the recommendations and revised the manuscript to clarify critical aspects. They also clarified the impact on decision-making or model predictions under other conditions as requested. Reviewer #2: The authors have addressed my concerns regarding the inference stage by clarifying the input format and providing details on how training data are selected and processed based on user-defined patterns. However, one aspect remains insufficiently clear. In the revised text, the inference stage is described as: "Subsequently, the LA-VAE undergoes end-to-end training using the filtered training data to learn latent space representations of target taste features." While this explains how the model learns latent representations, it does not specify how latent vectors are subsequently sampled and decoded into candidate peptide sequences. I suggest that the authors provide additional details on the inference workflow, particularly how sampling is performed in the latent space and how these samples are transformed into peptide sequences. Reviewer #3: The authors addressed my comments in a satisfactory way in the new version of the paper. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files. If you choose "no", your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy . Reviewer #1: No Reviewer #2: No Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: Reproducibility: ?> |
| Revision 2 |
|
|
Decision Letter
-
Arne Elofsson, Editor, Mohammad Sadegh Taghizadeh, Editor
Dear Dr. Wang, We are pleased to inform you that your manuscript 'TastepepAI:An artificial intelligence platform for taste peptide de novo design' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Mohammad Sadegh Taghizadeh, Ph.D. Academic Editor PLOS Computational Biology Arne Elofsson Section Editor PLOS Computational Biology *********************************************************** Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #2: The authors have addressed all my concerns. I have no more questions. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files. If you choose "no", your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy . Reviewer #2: No |
| Formally Accepted |
|
Acceptance Letter
-
Arne Elofsson, Editor, Mohammad Sadegh Taghizadeh, Editor
PCOMPBIOL-D-25-01523R2 TastepepAI:An artificial intelligence platform for taste peptide de novo design Dear Dr Wang, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. For Research, Software, and Methods articles, you will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Anita Estes PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .