Life Science Database Archive
ASTRA

Exon

Data detail

info Data name Exon
info DOI
info Description of data contents

Exons in variants

info Data file
File name:
astra_exon.zip
File size:
5.9 MB
info Simple search URL
info Data acquisition method

For the five organisms (H. sapiens, M. musculus, D. melanogaster, and A. thaliana, C. elegans) other than O. sativa, cDNA sequences were obtained from UniGene database. For the UniGene cDNAs, we chose those sequences that presumably code for mature protein coding sequences (CDSs) according to the annotation. For O. sativa, a full-length 32 k cDNA clone set and information of coding sequences were obtained from the Laboratory of Gene Expression, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna01.dna.affrc.go.jp/pub/data/CURRENT).
The genomic sequences of H. sapiens, M. musculus, D. melanogaster, and A. thaliana were obtained from NCBI (ftp://ftp.ncbi.nih.gov/genomes/). The genomic sequences of C. elegans and the draft contigs of O. sativa were obtained from Sanger Center (ftp://ftp.sanger.ac.uk/pub/) and TIGR Institute (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotaion_dbs/pseudomolecules/version_3.0/), respectively.

info Data analysis method

As the first, mapping between full-length cDNAs and genome sequences by MEGABLAST. Following that, convertion to mapping data into bit arrays, detection of splicing patterns and distribution to the types.

info Number of data entries

676,111 entries

Data itemDescription
Exon ID Specific exon ID for this database
Species Species name
Locus ID Specific locus ID for this database
cDNA ID Specific cDNA ID for this database
Left in cDNA The left position of exon onto cDNA
Right in cDNA The right position of exon onto cDNA
Left in genome The left position of exon onto genome
Right in genome The right position of exon onto genome

AltStyle によって変換されたページ (->オリジナル) /