Introducing EzAAI: a pipeline for high throughput calculations of prokaryotic average amino acid identity
- PMID: 33907973
- DOI: 10.1007/s12275-021-1154-0
Introducing EzAAI: a pipeline for high throughput calculations of prokaryotic average amino acid identity
Erratum in
-
Erratum to: Introducing EzAAI: A Pipeline for High Throughput Calculations of Prokaryotic Average Amino Acid Identity.Kim D, Park S, Chun J. Kim D, et al. J Microbiol. 2023 Sep;61(9):879. doi: 10.1007/s12275-023-00075-z. J Microbiol. 2023. PMID: 37707763 No abstract available.
Abstract
The average amino acid identity (AAI) is an index of pairwise genomic relatedness, and multiple studies have proposed its application in prokaryotic taxonomy and related disciplines. AAI demonstrates better resolution in elucidating taxonomic structure beyond the species rank when compared with average nucleotide identity (ANI), which is a standard criterion in species delineation. However, an efficient and easy-to-use computational tool for AAI calculation in large-scale taxonomic studies is not yet available. Here, we introduce a bioinformatic pipeline, named EzAAI, which allows for rapid and accurate AAI calculation in prokaryote sequences. The EzAAI tool is based on the MMSeqs2 program and computes AAI values almost identical to those generated by the standard BLAST algorithm with significant improvements in the speed of these evaluations. Our pipeline also provides a function for hierarchical clustering to create dendrograms, which is an essential part of any taxonomic study. EzAAI is available for download as a standalone JAVA program at http://leb.snu.ac.kr/ezaai .
Keywords: average amino acid identity; comparative genomics; phylogeny; software suite.
References
-
- Hunter, J.D. 2007. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 9, 90–95. - DOI
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials