Life Science Database Archive
ASTRA

cDNA

Data detail

info Data name cDNA
info DOI
info Description of data contents

List of cDNA in locus

info Data file
File name:
astra_cdna.zip
File size:
3.3 MB
info Simple search URL
info Data acquisition method

For the five organisms (H. sapiens, M. musculus, D. melanogaster, and A. thaliana, C. elegans) other than O. sativa, cDNA sequences were obtained from UniGene database. For the UniGene cDNAs, we chose those sequences that presumably code for mature protein coding sequences (CDSs) according to the annotation. For O. sativa, a full-length 32 k cDNA clone set and information of coding sequences were obtained from the Laboratory of Gene Expression, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna01.dna.affrc.go.jp/pub/data/CURRENT).
The genomic sequences of H. sapiens, M. musculus, D. melanogaster, and A. thaliana were obtained from NCBI (ftp://ftp.ncbi.nih.gov/genomes/). The genomic sequences of C. elegans and the draft contigs of O. sativa were obtained from Sanger Center (ftp://ftp.sanger.ac.uk/pub/) and TIGR Institute (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotaion_dbs/pseudomolecules/version_3.0/), respectively.

info Data analysis method

As the first, mapping between full-length cDNAs and genome sequences by MEGABLAST. Following that, convertion to mapping data into bit arrays, detection of splicing patterns and distribution to the types. Genes were identificatified by cDNA annotations

info Number of data entries

84,438 entries

Data itemDescription
cDNA ID Specific cDNA ID for this database
Locus ID Specific locus ID for this database
Species Species name
Chr. No. Chromosome No.
Strand Strand
Gene name Gene name
UniGene ID UniGene ID
cDNA UniGene ID UniGene ID of this cDNA
Genbank accession Genbank accession number
GI Genbank ID
CDS left The left position of coding region onto this cDNA
CDS right The right position of coding region onto this cDNA
Splicing pattern Splicing pattern (gene model)
cDNA sequence length cDNA sequence length
DOU start Premature (pseudo) start codon (true) or not (false)
DOU end Premature (pseudo) stop codon (true) or not (false)
NMD Nonsense-mediated mRNA Decay (NMD) might happen (true) or not (false)
Number of exons Number of exons in this variant
Number of splicing regions Number of alternative splicing/transcriptional initiation regionss

AltStyle によって変換されたページ (->オリジナル) /