This site needs JavaScript to work properly. Please enable it to take advantage of the complete set of features!
Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log in
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jan;39(Database issue):D487-93.
doi: 10.1093/nar/gkq1057. Epub 2010 Nov 3.

SAHG, a comprehensive database of predicted structures of all human proteins

Affiliations

SAHG, a comprehensive database of predicted structures of all human proteins

Chie Motono et al. Nucleic Acids Res. 2011 Jan.

Abstract

Most proteins from higher organisms are known to be multi-domain proteins and contain substantial numbers of intrinsically disordered (ID) regions. To analyse such protein sequences, those from human for instance, we developed a special protein-structure-prediction pipeline and accumulated the products in the Structure Atlas of Human Genome (SAHG) database at http://bird.cbrc.jp/sahg. With the pipeline, human proteins were examined by local alignment methods (BLAST, PSI-BLAST and Smith-Waterman profile-profile alignment), global-local alignment methods (FORTE) and prediction tools for ID regions (POODLE-S) and homology modeling (MODELLER). Conformational changes of protein models upon ligand-binding were predicted by simultaneous modeling using templates of apo and holo forms. When there were no suitable templates for holo forms and the apo models were accurate, we prepared holo models using prediction methods for ligand-binding (eF-seek) and conformational change (the elastic network model and the linear response theory). Models are displayed as animated images. As of July 2010, SAHG contains 42,581 protein-domain models in approximately 24,900 unique human protein sequences from the RefSeq database. Annotation of models with functional information and links to other databases such as EzCatDB, InterPro or HPRD are also provided to facilitate understanding the protein structure-function relationships.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
SAHG prediction systems. ‘Structure prediction pipeline’ and ‘Other structure and function predictions’ are shown in the right pink regions and bottom light-blue regions, respectively. The center panel illustrates each procedure in the flow of the structure prediction pipeline, showing how the results of systems are integrated. SWPPA: Smith–Waterman profile–profile alignment method; ID: intrinsically disordered; ENM: elastic network model.
Figure 2.
Figure 2.
(A) Example view of SAHGs detailed information page [RefSeqID: NP_002834.3, protein tyrosine phosphatase, receptor type, J isoform 1 precursor (48)]. Labels I, II, III, IV, V and VI indicate the ‘Protein information’ panel, the ‘Complex’ button, the ‘bar indicator’, the ‘Domain information’ panel, the ‘Jmol Window’ and the ‘Catalytic residue’ pin on the bar indicator, respectively. (B) Example view of a ‘Complex information’ page (NP_002834.3). For this protein, only one complex structure in a homo-trimeric form was predicted.

References

    1. Nelson KE, Weinstock GM, Highlander SK, Worley KC, Creasy HH, Wortman JR, Rusch DB, Mitreva M, Sodergren E, Chinwalla AT, et al. A catalog of reference genomes from the human microbiome. Science. 2010;328:994–999. - PMC - PubMed
    1. Drmanac R, Sparks AB, Callow MJ, Halpern AL, Burns NL, Kermani BG, Carnevali P, Nazarenko I, Nilsen GB, Yeung G, et al. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science. 2010;327:78–81. - PubMed
    1. Zhang W, Dolan ME. Impact of the 1000 genomes project on the next wave of pharmacogenomic discovery. Pharmacogenomics. 2010;11:249–256. - PMC - PubMed
    1. Metzker ML. Sequencing technologies - the next generation. Nat. Rev. Genet. 2010;11:31–46. - PubMed
    1. MacLean D, Jones JD, Studholme DJ. Application of ‘next-generation’ sequencing technologies to microbial genetics. Nat. Rev. Microbiol. 2009;7:287–296. - PubMed

Publication types

Cite

AltStyle によって変換されたページ (->オリジナル) /