GIGADB DATASETS
GigaDB contains 2687 discoverable, trackable, and citable datasets that have been assigned DOIs and are available for public download and use.
Datasets and tools
Supporting data for "Population-level allelic dispersion modelling by maelstRom yields genome-wide maps of allele-specific dysregulation during early carcinogenesis."
Supporting data for "ColoPola: A polarimetric imaging dataset for colorectal cancer detection"
Supporting data for "Complete end-to-end learning from protein feature representation to protein interactome inference"
Supporting data for "Towards a Standardized Framework for Pangenome Graph Evaluation: Assessing Crop Plant Pangenome Variation Graph Construction from Multiple Assemblies"
Supporting data for "Comparative genomics and multi-omics analyses reveal the evolution and physiological basis of rubber biosynthesis in Hevea species"
Supporting data for "DUNE: a versatile neuroimaging encoder captures brain complexity across three major diseases: cancer, dementia and schizophrenia"
Supporting data for "A Retrieval-Augmented Knowledge Mining Method with Deep Thinking LLMs for Biomedical Research and Clinical Support"
Supporting data for "The complete genome assembly of Astragalus membranaceus: enabling more accurate genetic research"
Supporting data for "The Great Genotyper: A Graph-Based Method for Population Genotyping of Small and Structural Variants"
Supporting data for "Unsupervised multi-scale clustering of single-cell transcriptomes to identify hierarchical structures of cell subtypes"
Supporting data for "Foundation Model of Electronic Medical Records for Adaptive Risk Estimation"
Supporting data for "Haplotype-resolved reference genomes of the sea turtle clade unveil ultra-syntenic genomes with hotspots of divergence"
Latest news
Beyond Open Access: GigaScience Press is Your Open Science Solution
Watch this video about sharing research with GigaScience Press including hosting your data in GigaDB.
What is Open Science and why is it important? GigaScience Press explains more, and here provides some insight into how our unique journals GigaScience and GigaByte provide a single integrated open science solution for researchers to make published research and all of the supporting data, code and methods open and stabling shared. Helping meet new global standards and policies for sharing research outputs. Having a human in the loop, our team of biocurators are on standby to help authors follow current best practices in data stewardship. Offering storage and curation in our GigaDB repository if required.
Dataset types
- Genomic datasets (1597)
- Software datasets (553)
- Transcriptomic datasets (397)
- Imaging datasets (1272)
- Neuroscience datasets (98)
- Epigenomic datasets (64)
- Metagenomic datasets (114)
-
Genome mapping datasets(19)
- Workflow datasets (120)
- Proteomic datasets (49)
- Metabarcoding datasets (9)
- Metadata datasets (49)
- Climate datasets (2)
- Network-Analysis datasets (14)
- EEG datasets (4)
- Phenotyping datasets (29)
- Metabolomic datasets (29)
- Lipidomic datasets (4)
- Ecology datasets (15)
- Virtual-Machine datasets (11)
RSS
-
New dataset added on 2025年09月30日: 10.5524/102761 Supporting data for "Population-level allelic dispersion modelling by maelstRom yields genome-wide maps of allele-specific dysregulation during early carcinogenesis."
-
New dataset added on 2025年09月23日: 10.5524/102759 Supporting data for "Complete end-to-end learning from protein feature representation to protein interactome inference"
-
New dataset added on 2025年09月23日: 10.5524/102763 Supporting data for "ColoPola: A polarimetric imaging dataset for colorectal cancer detection"
-
New dataset added on 2025年09月18日: 10.5524/102758 Supporting data for "Towards a Standardized Framework for Pangenome Graph Evaluation: Assessing Crop Plant Pangenome Variation Graph Construction from Multiple Assemblies"
-
New dataset added on 2025年09月11日: 10.5524/102755 Supporting data for "Comparative genomics and multi-omics analyses reveal the evolution and physiological basis of rubber biosynthesis in Hevea species"
-
New dataset added on 2025年09月08日: 10.5524/102757 Supporting data for "DUNE: a versatile neuroimaging encoder captures brain complexity across three major diseases: cancer, dementia and schizophrenia"
-
New dataset added on 2025年09月03日: 10.5524/102756 Supporting data for "A Retrieval-Augmented Knowledge Mining Method with Deep Thinking LLMs for Biomedical Research and Clinical Support"
-
New dataset added on 2025年08月29日: 10.5524/102751 Supporting data for "The complete genome assembly of Astragalus membranaceus: enabling more accurate genetic research"
-
New dataset added on 2025年08月22日: 10.5524/102749 Supporting data for "The Great Genotyper: A Graph-Based Method for Population Genotyping of Small and Structural Variants"
-
New dataset added on 2025年08月20日: 10.5524/102753 Supporting data for "Unsupervised multi-scale clustering of single-cell transcriptomes to identify hierarchical structures of cell subtypes"