Cloud computing assumes a pivotal role in biomedical research, offering a range of critical benefits. It provides the flexibility of scalable computing power and storage, access to extensive biomedical datasets, cutting-edge software and hardware resources, and user-friendly data sharing capabilities. Particularly noteworthy is its ability to level the playing field for institutions with limited on-premises computing facilities. Many NIH researchers stand to gain significantly from cloud computing technology, as it addresses their evolving computational needs. Additionally, many NIH research projects can experience transformative enhancements by harnessing the innovative capabilities of the cloud.
To take the advantage of the opportunities, the NIH Office of Data Science Strategy launched this program to support NIH researchers in leveraging cloud resources in their research activities.
NOSIs:
- 2023: NOT-OD-23-070, expired on 4/12/2023
- 2024, 2025, 2026: NOT-OD-24-078, expiration date 6/19/2026
High-Value Datasets (HVD) Program:
- HVD 2020, expired on 1/15/2020
- HVD 2021, expired on 1/10/2021
- HVD 2022, expired on 2/1/2022
- HVD 2023, expired on 2/1/2023
- HVD 2024, expired on 2/13/2024
- HVD 2025, expired on 2/5/2025
STRIDES Cloud Credits Program (SCC):
- SCC 2025, round 1, expired on 11/15/2024
PI meetings:
Awardee projects and their descriptions are available below.
Principal Investigator | Project Title | NIH IC |
---|---|---|
Sandhya Xirasagar | Using the AWS Cloud for Improved PII Data Security and Cross-IC Collaboration to Develop Gene Prioritization and Text Mining Pipelines for the Genome Research Integration System | NIAID |
Hari Shroff | Evaluation of cloud computing for imaging and microscopy datasets | NEI & NIBIB |
Valentina Di Francesco | AnVIL & STRIDES | NHGRI |
Jim Gnadt | Zebrafish Dataset Hosting Supplement | NINDS |
Krista Zanetti | COMETS Analytics | NCI |
Geoffrey Tobias | DCEG Analytic Tools Suite | NCI |
Jonathan Kaltman | Migrating imaging datasets and tools to the NHLBI BioData Catalyst | NHLBI |
Weiniu Gan | Single-cell Omics based Reference Lung resource (RefLung) | NHLBI |
Keyvan Farahani | A Sustainable Medical Imaging Challenge Cloud Infrastructure (MedICCI) | NCI |
Jeff Shilling | NCI IRP Cloud Migration: Genomics, Chemistry, and Imaging | NCI |
Quan Chen | Facilitating access of immunological data in ImmPort for analyses | NIAID |
Principal Investigator | Project Title | NIH IC |
---|---|---|
Charlene Schramm | Using Seven Bridges’ CAVATICA to empower use of the INCLUDE DCC platform | OD |
Adrienne Campbell | Inline image reconstruction of dynamic 3D data using a GPU-enabled cloud implementation | NHLBI |
Greg Farber | Migrating Human Connectome Analysis Tools to the Cloud | NIMH |
Deborah Duran | NIMHD Algorithmic Auditing Tool and Centralized Research Collaboration Platform to Assess and Mitigate Healthcare Decision Biases Impacting Marginalized Populations | NIMHD |
Kim Pruitt | SRA RNA-seq precomputed alignments and gene expression counts | NLM |
Daniel Veltri, Andrew Oler | Develop a Scalable and Reusable Framework for State-of-the-Art Structural Variant Calling of Whole Genome Sequencing Data | NIAID |
Daniel Reich | Medical Image processing and structured storage | NINDS |
Wang, Xujing | Cloud migration of data and data analysis platform of The Environmental Determinants of Diabetes in The Young Study (TEDDY) | NIDDK |
Tanja Davidsen | NCI CRDC Cloud Transfer of TP53 Website and Database | NCI |
Wang, Xujing | Migration of Core Applications from the NIDDK information Network (dkNET) | NIDDK |
Jeff Shilling | NCI IRP Cloud Migration: Imaging and Chemistry | NCI |
Valentina Di Francesco | Telomere-to-Telomere Consortium Analyses on the NHGRI AnVIL | NHGRI |
Arvydas Maminishkis | Artificial Intelligence Based Morphometric Analysis of Cells | NEI |
Mehdi Pirooznia | Cloud implementation of TensorFlow machine learning framework for SNP and indel variant calling on exomes and whole genomes sequencing | NHLBI |
Debra Babcock | Parkinson's Disease Biomarkers Program (PDBP) | NINDS |
Weiniu Gan | National Sleep Research Resource | NHLBI |
Principal Investigator | Project Title | NIH IC |
---|---|---|
Andrew Singlet, Cornelis Blauwendraat | Long-read DNA sequencing of neurodegenerative disorders | NIA |
Qian Zhu | Rare Disease Alert System | NCATS |
Ewy Mathé | Public Substance Registration Using the Global Substance Registration System (GSRS) | NCATS |
Valentina Di Francesco | Text mining in the Cloud | NHGRI |
Ronald M. Summers | Small Bowel Segmentation | CC |
Javed Khan MD | Oncogenomics Pipeline & Databases for Childhood Cancer Data Initiative (CCDI) and other Pediatric Cancers | NCI |
Valentina Di Francesco | Long Read Variant Frequency Database on AnVIL | NHGRI |
Janelle Cortner | Accelerating External Sharing of NCI IRP Imaging Data & AI Models via Platform Connectivity | NCI |
Iman Martin | Building a cross-study data set for the PRIMED consortium | NHGRI |
Johnny Tam, Hari Shroff | Cloud Computing for Optical Image Restoration and Intramural Training | NEI & NIBIB |
Kirsten Herrick, Christie Kaefer | Automated Self-Administered 24-hour Dietary Assessment Tool (ASA24) | NCI |
Janelle Cortner | Accelerating External Sharing of NCI IRP Genomic Data via Platform Connectivity | NCI |
Janelle Cortner | Leveraging Intramural NCI Data Platforms for Accelerated Data Sharing | NCI |
Lisa Cunningham, PhD | Generation of an NIH-wide Clinical Database of Hearing and Balance Function | NIDCD |
Principal Investigator | Project Title | NIH IC |
---|---|---|
Zhiyong Lu | Scaling up literature annotations with cloud computing in PubTator 3.0 | NLM |
Keith Shockley, Alison Motsinger-Reif | Genome-wide analysis using cloud computing in the All of Us Researcher Workbench | NIEHS |
Stephen Brooks, Darrick Akiyama | NIAMS Hybrid Cloud Computing Pilot | NIAMS |
Elizabeth Powell | Alcoholism Solutions: Synthesizing Information to Support Treatments (ASSIST 2.0) | NIAAA |
Scott Auerbach | ToxPipe: Semi-Autonomous AI Integration of Diverse Toxicological Data Streams | NIEHS |
Joseph Marcotrigiano | Cryo-EM data processing on the cloud | NIAID |
Javed Khan | Oncogenomics Pipeline & Databases for Childhood Cancer Data Initiative (CCDI) and other Pediatric Cancers | NCI |
Principal Investigator | Project Title | NIH IC |
---|---|---|
Yang Fann | Exploring the Potential of Large Language Models for IRP Operations and Research | NINDS |
Qian Zhu | Accelerating Rare Disease Research via Rare Disease Alert System (RDAS) | NCATS |
Joseph Marcotrigiano | Optimization of cloud computing for cryo-EM data processing | NIAID |
Robert J. Lederman | Optimization of Cardiovascular Interventional MRI Devices via Cloud-Based High-Fidelity Electromagnetic Simulations | NHLBI |
Cliff Wong, Emily Greenspan | HPC MuMMI for Commercial AWS Cloud Proof of Concept | NCI |
Mitchell Machiela | LDlink | NCI |
Vipul Periwal | Faithful low-dimensional representations for modeling imbalanced data sets | NIDDK |
Evan Bolton | PubChemRDF in Cloud | NLM |
Jeffrey Beck | Using Generative AI in the cloud to develop a training set for new MeSH terms for the MEDLINE Medical Text Indexing System | NLM |
Naoko Mizuno | Exploration of cloud computing for cryo-EM/ET analysis pipeline | NHLBI |
Rebecca Troisi | Development of epidemiological study data platforms to support FAIR research practices | NCI |
Peter Kraft | FlowIQ: A workflow cloud migration toolkit | NCI |
Johnny Tam | Large scale image data handling in the cloud | NEI |
Javed Khan, MD | Oncogenomics Pipeline & Databases for Childhood Cancer Data Initiative (CCDI) and other Pediatric Cancers | NCI |
Xiaofang Jiang | Predicting Phage-Host Range Using Deep-Learning | NLM |
Ronald M. Summers | Processing Radiology Reports with GPT-4 | CC |
Anirban Banerjee | Structural studies of integral membrane enzymes and transmembrane transporters | NICHD |
Neil Hanchard | Cloud-computing to enable global admixture mapping in childhood hypertension | NHGRI |
Keith Shockley, Alison Motsinger-Reif | Variant Calling and Data Dissemination in the All of Us Cloud Computing Resource | NIEHS |
Principal Investigator | Project Title | NIH IC |
---|---|---|
Granger Sutton | Cloud-based Medical Imaging Data Analysis Using Artificial Intelligence | NCI |
Matthew McAuliffe | BRICS Cloud | CIT |
Jenny Hinshaw | Cloud computing for structural studies of large dynamin helical assemblies | NIDDK |
Anand Swaroop | Cloud-Based Training of Large Language Models for Retinal Transcriptome Analysis | NEI |
Fausto Vela | REShAPE3D: Expanding Dimensions in RPE Analysis | NEI |
Cliff Wong (COR) Emily Greenspan (PI) | HPC MuMMI for Commercial AWS Cloud Phase 2 – Full Feature Proof of Concept | NCI |
Mia Gaudet | Centralized Orchestration of Serverless Analytics Pipelines Using Apache Airflow in Google Cloud | NCI |
Di Xia | Structural Mechanisms of Cellular Drug Resistance | NCI |
Ivan Ovcharenko | Advanced AI models of disease-causative genome variants. | NLM |
Zhiyong Lu | An LLM Powered Tool for Expert Search within NIH IRP | NLM |
Haiming Cao | Development of an Artificial Intelligence (AI) Agent for Elucidating the Molecular and Disease Mechanisms of Human Long Non-Coding RNAs (lncRNAs) | NHLBI |
Keith Shockley, Alison Motsinger-Reif | Support for bioinformatics pipelines for high value genotype calling at biobank scale in the All of Us Researcher Workbench | NIEHS |
Mario J. Borgnia | One stop resource for cryo EM data collection and processing | NIEHS |
Justine Buschman | NIAMS EP DEA Specialized AI Agents Pilot | NIAMS |
View SCC 2025 Round 1 Awardees
Principal Investigator | Project Title | NIH IC |
---|---|---|
Robert Lederman | AI-powered real-time 3D visualization for X-ray cardiac intervention | NHLBI |
Mario Borgnia | Determine structures of macromolecular complexes using cryogenic electron microscopy (CryoEM) | NIEHS |
Dimitrios Metaxotos | Leverag GPU-accelerated frameworks to optimize large-scale data and network analysis workflows | NCATS |
Naomi Ohashi | Atom Modeling PipeLine (AMPL) and Generalized Generative Molecular Design (GMD) | NHLBI |
Gayla Poling | Expand nHEAR by integrating additional specialized hearing/vestibular data from the AVCRS and to further streamline data migration processes from ongoing and future projects | NIDCD |
Brad Bower | Understand the Impact of NIH Investment on FDA Cleared Products | NIBIB |
Darrick Akiyama | NIAMS Artificial Intelligence Pilot | NIAMS |
Saman Moshafi | Computational Resources for Cancer Research (CRCR) information system | NCI |
Johanna Goderre Jones | National Childhood Cancer Registry (NCCR) data platform | NCI |
Mitchell Machiela | Assess linkage disequilibrium by querying SNPs/indels | NCI |
Joseph Marcotrigiano | Cryo Electron Microscope Data Processing | NIAID |
Principal Investigator | Institution | Project Title | NIH IC |
---|---|---|---|
DURA-BERNAL, SALVADOR | SUNY DOWNSTATE MEDICAL CENTER | Dissemination of a tool for data-driven multiscale modeling of brain circuits | NIBIB |
KUMAR, POORNIMA | MCLEAN HOSPITAL | Building Reinforcement Learning and Normative Models in the Cloud | NIMH |
OBUA, CELESTINO | MBARARA UNIVERSITY/SCIENCE/TECHNOLOGY | MUST Data Science Research Hub (MUDSReH) - Democratized Trusted Research Environment (dTRE) | FIC |
MORSE, GENE D | STATE UNIVERSITY OF NEW YORK AT BUFFALO | Exploration of Cloud Solutions to Enhance Global Infectious Diseases Research Training Program Activities | FIC |
CHERRY, J. MICHAEL | STANFORD UNIVERSITY | Support for the use and evaluation of large cloud-based genomic datasets | NHGRI |
LIU, TIEMING | OKLAHOMA STATE UNIVERSITY STILLWATER | Empowering Cloud Computing for Non-image-based Diabetic Retinopathy Screening by Designing an EHR-oriented Incremental Learning Framework | NEI |
KESSELMAN, CARL | UNIVERSITY OF SOUTHERN CALIFORNIA | Hybrid- and Multi-Cloud Storage Strategies for Cost-effective Deployment of Data Resources | NIDCR |
ZORN, AARON M | CINCINNATI CHILDRENS HOSP MED CTR | Cloud implementation of Xenbase the Xenopus model organism knowledgebase | NICHD |
RESNICK, ADAM CAIN | CHILDREN'S HOSP OF PHILADELPHIA | Enhancing CBTN digital pathology processing pipeline through the use AWS cloud-based services to enable automation, parallel processing, and rapid use of AI/ML analytics | NICHD |
VALERIUS, MICHAEL TODD | BRIGHAM AND WOMEN'S HOSPITAL | ATLAS-D2K - Exploring Cloud Optimization | NIDDK |
BOBASHEV, GEORGIY | RESEARCH TRIANGLE INSTITUTE | Supplement for Cloud Computing: Opioid Policy Models | NIDA |
POWDERLY, WILLIAM G. | WASHINGTON UNIVERSITY | Exploration of Cloud-based High Performance Computing | NCATS |
YIN, YANBIN | UNIVERSITY OF NEBRASKA LINCOLN | Exploration of cloud computing for CAZyme research | NIGMS |
SAFO, SANDRA E | UNIVERSITY OF MINNESOTA | MultiViewPortal: Towards a Scalable Web Application for Multiview Learning | NIGMS |
HOWE, WILLIAM MATTHEW | VIRGINIA POLYTECHNIC INST AND ST UNIV | An evaluation of the costs and benefits of cloud computing for modern systems neuroscience | NIDDK |
BROOKS, STEPHEN | NATIONAL INSTITUTE OF ARTHRITIS AND MUSCULOSKELETAL AND SKIN DISEASES | NIAMS Hybrid Cloud Computing Pilot | NIAMS |
POWELL, ELIZABETH | NATIONAL INSTITUTE ON ALCOHOL ABUSE AND ALCOHOLISM | Alcoholism Solutions: Synthesizing Information to Support Treatments (ASSIST 2.0) | NIAAA |
ARNAOUT, RIMA | UNIVERSITY OF CALIFORNIA, SAN FRANCISCO | Developing FAIR practices for cloud-enabled AI deployment for prospective testing | NHLBI |
YU, BING | UNIVERSITY OF TEXAS HLTH SCI CTR HOUSTON | Development of a cloud-based analytical tool for polygenic risk score and its implication in heart failure research | NHLBI |
KOBER, KORD MICHAEL | UNIVERSITY OF CALIFORNIA, SAN FRANCISCO | An Evaluation of Cloud Computing for Symptom Science Research: Moving Genomics and Machine Learning Analyses of Cancer Chemotherapy-Related Fatigue to the Cloud | NCI |
TAVTIGIAN, SEAN VAHRAM | UNIVERSITY OF UTAH | Cloud Enabled, Rigorous, Functional Assay Calibration (CERFAC) | NCI |
KIEHL, KENT A | LOVELACE BIOMEDICAL RESEARCH INSTITUTE | Cloud based neuroimaging analysis for identifying traumatic brain injuries and related changes | NINDS |
RUEBEL, OLIVER | UNIVERSITY OF CALIF-LAWRENC BERKELEY LAB | Evaluation and optimization of NWB neurophysiology software and data in the cloud | NINDS |
LU, ZHIYONG | NATIONAL LIBRARY OF MEDICINE | Scaling up literature annotations with cloud computing in PubTator 3.0 | NLM |
SHOCKLEY, KEITH/MOTSINGER-REIF, ALISON | NATIONAL INSTITUTE OF ENVIRONMENTAL HEALTH SCIENCES | Genome-wide analysis using cloud computing in the All of Us Researcher Workbench | NIEHS |
AUERBACH, SCOTT | NATIONAL INSTITUTE OF ENVIRONMENTAL HEALTH SCIENCES | ToxPipe: Semi-Autonomous AI Integration of Diverse Toxicological Data Streams | NIEHS |
MARCOTRIGIANO, JOSEPH | NATIONAL INSTITUTE OF ALLERGY AND INFECTIOUS DISEASES | Cryo-EM data processing on the cloud | NIAID |
COOPER, LEE | NORTHWESTERN UNIVERSITY AT CHICAGO | Cloud strategies for improving cost, scalability, and accessibility of a machine learning system for pathology images | NLM |
KARRIKER-JAFFE, KATHERINE J. | RESEARCH TRIANGLE INSTITUTE | Supplement for Cloud Computing: Alcohol Use Disorder Treatment Simulation | NIAAA |
KHAN, JAVED | NATIONAL CANCER INSTITUTE | Oncogenomics Pipeline & Databases for Childhood Cancer Data Initiative (CCDI) and other Pediatric Cancers | NCI |
KUMAR, VIVEK | JACKSON LABORATORY | Google Cloud Pipeline for mouse behavior and frailty assessment for the aging research community | NIA |