Life Science Database Archive
PGDBj - Ortholog DB

Cluster (Viridiplantae)

Data detail

info Data name Cluster (Viridiplantae)
info Version info Description of data contents

Clusters of amino acid sequences of Viridiplantae (green plants) obtained from the NCBI Reference Sequence Database. Along a phylogenetic tree, clusters were generated in Viridiplantae taxon and in each sub-taxon of Viridiplantae by using the results of all-against-all BLAST searches among the amino acid sequences. An amino acid sequence belongs to only one cluster in a taxon.

info Data file
File name:
pgdbj_ortholog_db_viridiplantae_cluster.zip
File size:
16.6 MB
info Simple search URL info Data acquisition method

Data in "Protein (Viridiplantae)" was used.

info Data analysis method

Along a phylogenetic tree obtained from the NCBI Taxonomy Database, clusters in lower taxa (subclusters) were recursively aggregated to form clusters in a taxon (superclusters).

info Number of data entries

2,310,444 entries

Data itemDescription
Cluster ID The cluster ID is composed of a Taxonomy ID and a serial number beginning with "0". For instance, "cluster ID: 33090.0" means the protein belongs to the cluster ranked 0th among the clusters in the "taxon: 33090". This cluster ID is uniquely-assigned by the PGDBj Ortholog Database.
Cluster size Number of proteins affiliated with the Cluster
Supercluster Next supercluster
Subcluster Next subcluster

AltStyle によって変換されたページ (->オリジナル) /