Supported by CCR Office of Science and Technology Resources (OSTR)
ncibtep@nih.gov

Bioinformatics Training and Education Program

Featured

Upcoming Classes & Events

April

Organized by
CIT
Description

If you are a new user unsure about how to get started, please join us at one of our biweekly 30-minute new user orientations. The sessions cover basic topics such as:

  • What is Biowulf?
  • Biowulf's architecture
  • How to connect to Biowulf
  • How to transfer files to Biowulf
  • How to access training and obtain support

Email 

If you are a new user unsure about how to get started, please join us at one of our biweekly 30-minute new user orientations. The sessions cover basic topics such as:

  • What is Biowulf?
  • Biowulf's architecture
  • How to connect to Biowulf
  • How to transfer files to Biowulf
  • How to access training and obtain support

Email staff@hpc.nih.gov for the meeting link.

Organized by
CIT Technology Training Program
Description

Join us for a quick tour of a “day in the life” with Microsoft 365 Copilot. In this 60-minute overview, see how M365 Copilot helps you manage emails, prep for meetings, and create documents effortlessly in Outlook, Teams, Word, Excel, and PowerPoint. Boost your productivity and make every day easier! Imagine starting your day with a clear inbox, joining meetings fully prepared, and creating polished documents in record time with the Read More

Join us for a quick tour of a “day in the life” with Microsoft 365 Copilot. In this 60-minute overview, see how M365 Copilot helps you manage emails, prep for meetings, and create documents effortlessly in Outlook, Teams, Word, Excel, and PowerPoint. Boost your productivity and make every day easier! Imagine starting your day with a clear inbox, joining meetings fully prepared, and creating polished documents in record time with the help of M365 Copilot. Join us to see how Copilot transforms everyday tasks into effortless productivity! 

Organized by
ODSS
Description

This webinar will explore the data reuse journey, providing perspective from a researcher who successfully conducted secondary analyses using existing datasets. Guest speaker Ravi Mathur, PhD, of RTI International, is a researcher specializing in biomedical data science, real-world data analysis, and cloud-based research platform use.

This webinar will explore the data reuse journey, providing perspective from a researcher who successfully conducted secondary analyses using existing datasets. Guest speaker Ravi Mathur, PhD, of RTI International, is a researcher specializing in biomedical data science, real-world data analysis, and cloud-based research platform use.

Distinguished Speakers Seminar Series

Join Meeting
Organized by
BTEP
Description

The ability to measure gene expression levels for individual cells (vs. pools of cells) and with spatial resolution is crucial to address many important biological and medical questions, such as the study of stem cell differentiation, the discovery of cellular subtypes in the brain, and cancer diagnosis and treatment. Single-cell transcriptome sequencing (RNA-Seq) allows the high-throughput measurement of gene expression levels for entire genomes at the resolution of single cells. Spatially-resolved Read More

The ability to measure gene expression levels for individual cells (vs. pools of cells) and with spatial resolution is crucial to address many important biological and medical questions, such as the study of stem cell differentiation, the discovery of cellular subtypes in the brain, and cancer diagnosis and treatment. Single-cell transcriptome sequencing (RNA-Seq) allows the high-throughput measurement of gene expression levels for entire genomes at the resolution of single cells. Spatially-resolved transcriptomics further allows the measurement of gene expression levels along with the location of the RNA molecules within a tissue. Transcriptomics exemplifies the range of issues one encounters in a data science workflow, where the data are complex in a variety of ways, questions are not always clearly formulated, there are multiple analysis steps, and drawing on rigorous statistical principles and methods is essential to derive meaningful and reliable biological results. 

In this talk, Dr. Dudoit will provide a survey of statistical questions related to the analysis of single-cell transcriptome sequencing data to investigate the differentiation of stem cells in the brain, including, exploratory data analysis, expression quantitation, cluster analysis, and the inference of cellular lineages. She will also address differential expression analysis in spatial transcriptomics.

Join Meeting
Organized by
NLM
Description

The NLM Colloquia on Biomedical Data Science and Computational Biology Research is a series of scientific lectures featuring experts from across the bioinformatics community who present their research and discuss how it contributes to advancing biomedical discovery. This series is presented by NLM’s DIR a premier hub of innovation for computational biology and biomedical data science.

The NLM Colloquia on Biomedical Data Science and Computational Biology Research is a series of scientific lectures featuring experts from across the bioinformatics community who present their research and discuss how it contributes to advancing biomedical discovery. This series is presented by NLM’s DIR a premier hub of innovation for computational biology and biomedical data science.

Organized by
NCI Rising Scholars: Cancer Research Seminar Series
Description

Dr. Greenwald developed machine learning tools to profile highly multiplexed imaging data. He then used these tools to characterize the tumor microenvironment in breast cancer patient samples, combining this spatial information with paired DNA and RNA sequencing data to predict patient response to immunotherapy.

Dr. Greenwald developed machine learning tools to profile highly multiplexed imaging data. He then used these tools to characterize the tumor microenvironment in breast cancer patient samples, combining this spatial information with paired DNA and RNA sequencing data to predict patient response to immunotherapy.

Join Meeting
Organized by
NIH Rare Disease Informatics SIG
Description

The NIH Rare Disease Informatics Special Interest Group (RDI SIG) is a trans-NIH group focused on informatics approaches for curating, harmonizing, standardizing, and analyzing biomedical data from diverse sources, including gene sequences, bioassays, electronic health records, other forms of real-world data, and scientific publications, to support clinical, biological, and public health research in rare diseases.

The NIH Rare Disease Informatics Special Interest Group (RDI SIG) is a trans-NIH group focused on informatics approaches for curating, harmonizing, standardizing, and analyzing biomedical data from diverse sources, including gene sequences, bioassays, electronic health records, other forms of real-world data, and scientific publications, to support clinical, biological, and public health research in rare diseases.

Organized by
OCIO| NIH Library| CIT
Description

ChatGPT 102 training is part 2 of a three-part series.  

This one-hour online training led by OpenAI experts will dive deeper into intermediate features and strategies for maximizing ChatGPT Enterprise in NIH workflows. Building on the fundamentals from ChatGPT 101, this training will focus on intermediate features including Custom GPTs, Projects, Data Analysis, coding in Canvas, and Deep Research to enable broader value creation Read More

ChatGPT 102 training is part 2 of a three-part series.  

This one-hour online training led by OpenAI experts will dive deeper into intermediate features and strategies for maximizing ChatGPT Enterprise in NIH workflows. Building on the fundamentals from ChatGPT 101, this training will focus on intermediate features including Custom GPTs, Projects, Data Analysis, coding in Canvas, and Deep Research to enable broader value creation and collaboration with ChatGPT. Attendees will also learn how to integrate ChatGPT into specialized tasks and optimize outputs for NIH-specific use cases. 

By the end of this training, attendees will be able to: 

  • Create and customize GPTs and projects to serve as tailored assistants for NIH-specific initiatives and domains. 
  • Utilize additional intermediate features including Data Analysis, coding in Canvas, and Deep Research, to handle complex tasks and collaborative workflows. 
  • Implement best practices for integrating ChatGPT into broader NIH processes while maintaining compliance and security standards. 

Attendees are expected to be familiar with the basic functions of ChatGPT to be successful in this training (gained by attending ChatGPT 101), attending another relevant training, and/or using ChatGPT previously). 

Organized by
Ryan O'Neill (NHLBI)
Description

The Replication Gap: Moving NIH Beyond Computational Reproducibility

The Replication Gap: Moving NIH Beyond Computational Reproducibility

Organized by
NIH Library
Description

This one-hour online training, is the first of a two-part series, which introduces participants to cleaning and exploring a patient health dataset using Python and pandas. Attendees will load tabular data, inspect structure and data types, summarize columns, and identify common data quality problems such as missing values, inconsistent formats, and duplicate records. They will then apply practical fixes, including standardizing height and weight units, parsing and normalizing dates of birth, splitting combined fields, Read More

This one-hour online training, is the first of a two-part series, which introduces participants to cleaning and exploring a patient health dataset using Python and pandas. Attendees will load tabular data, inspect structure and data types, summarize columns, and identify common data quality problems such as missing values, inconsistent formats, and duplicate records. They will then apply practical fixes, including standardizing height and weight units, parsing and normalizing dates of birth, splitting combined fields, and using Boolean masks to flag or correct implausible values.​

By the end of this session students will be able to:

  • Import CSV data into pandas DataFrames and quickly understand column types, basic statistics, and overall data quality.​
  • Identify duplicate or repeated patient records and decide whether to keep, correct, or remove them.​
  • Detect and handle missing or inconsistent values using methods such as isna, fillna, filtering, and conditional replacement.​
  • Standardize mixed formats (for example, heights with and without units, date strings in different formats, and numeric values embedded in text).​
  • Create derived columns such as systolic and diastolic blood pressure, and use logical conditions to flag questionable or out-of-range values.​

Attendees are expected to have:

  • Basic Python coding knowledge
  • Familiarity with an IDE and loading script and data files into the IDE. (Colab, Jupyter Notebooks) 

Requirements: 

  • Participants will receive a script file and data files prior to the training. These should be loaded and ready to use before the training session begins. 
Organized by
NIH Library
Description

This one-hour online training, the second session of the two-part series,  focuses on reshaping and enriching the cleaned patient dataset to prepare it for analysis and reporting. Attendees will practice splitting and recombining columns (for example, separating full names into first and last names), converting columns to appropriate data types, and engineering new fields such as outlier indicators and blood pressure status labels. The session also covers merging multiple tables (patient details, contact Read More

This one-hour online training, the second session of the two-part series,  focuses on reshaping and enriching the cleaned patient dataset to prepare it for analysis and reporting. Attendees will practice splitting and recombining columns (for example, separating full names into first and last names), converting columns to appropriate data types, and engineering new fields such as outlier indicators and blood pressure status labels. The session also covers merging multiple tables (patient details, contact information, and subsets of records) and filtering or subsetting data to answer specific analytical questions.​

By the end of this training, attendees will be able to:

  • Reshape and restructure data by splitting and combining columns, changing data types, and reordering or selecting relevant fields.​
  • Engineer clinically useful features, including z-score–based outlier flags, hypertension indicators, and combined status columns for downstream models or dashboards.​
  • Merge and join DataFrames using common keys (such as patient ID) to bring together core data with supplemental tables like contact information.​
  • Filter and subset records based on multiple conditions (for example, patients with diabetes and abnormal blood pressure) to create analysis-ready datasets.​

Attendees are expected to have:

  • To have attended Intro to Data Wrangling Using Python - Part 1 of the series
  • Basic Python coding knowledge

Familiarity with an IDE and loading script and data files into the IDE. (Colab, Jupyter Notebooks) 

Requirements: 

  • Participants will receive a script file and data files prior to the training. These should be loaded and ready to use before the training session begins. 
Organized by
CIT Technology Training Program
Description

Discover how Copilot can help you build compelling presentations faster than ever. During this 60-minute class, you’ll explore how to use simple, natural language prompts to generate high‑quality slide content, enhance your visuals, and craft engaging speaker notes. You’ll also learn how Copilot can streamline revisions, tighten your message, and adapt your presentation to different audiences with just a few quick commands. By the end,&Read More

Discover how Copilot can help you build compelling presentations faster than ever. During this 60-minute class, you’ll explore how to use simple, natural language prompts to generate high‑quality slide content, enhance your visuals, and craft engaging speaker notes. You’ll also learn how Copilot can streamline revisions, tighten your message, and adapt your presentation to different audiences with just a few quick commands. By the end, you’ll be able to create polished, professional PowerPoint decks with confidence—and in a fraction of the usual time.

Organized by
CIT Technology Training Program
Description

Note: We highly recommend taking the Getting Started with AI Productivity Double Feature class prior to this. 

 If you have begun experimenting with AI tools to draft emails, summarize information, or brainstorm ideas, but are wondering where the big productivity gain are, this class is for you. The real productivity gains come when AI is used not just for single tasks but for&Read More

Note: We highly recommend taking the Getting Started with AI Productivity Double Feature class prior to this. 

 If you have begun experimenting with AI tools to draft emails, summarize information, or brainstorm ideas, but are wondering where the big productivity gain are, this class is for you. The real productivity gains come when AI is used not just for single tasks but for structured workflows that guide complex work from start to finish. This class will show you how to design practical AI workflows for common NIH activities such as literature reviews, grant and application summaries, meeting-to-action tracking, portfolio analysis, and policy brief development. This fast-paced session will teach you how to break complex work into AI-assisted steps, build reusable prompts, create verification checkpoints, and ensure output remain accurate and responsible. By the end of the session, you will be able to transform AI from a helpful tool into a repeatable productivity system that supports research, program, and policy work at NIH. 

Organized by
CIT Technology Training Program
Description

Join us for a quick tour of a “day in the life” with Microsoft 365 Copilot. In this 60-minute overview, see how M365 Copilot helps you manage emails, prep for meetings, and create documents effortlessly in Outlook, Teams, Word, Excel, and PowerPoint. Boost your productivity and make every day easier! Imagine starting your day with a clear inbox, joining meetings fully prepared, and creating polished documents in record time with the Read More

Join us for a quick tour of a “day in the life” with Microsoft 365 Copilot. In this 60-minute overview, see how M365 Copilot helps you manage emails, prep for meetings, and create documents effortlessly in Outlook, Teams, Word, Excel, and PowerPoint. Boost your productivity and make every day easier! Imagine starting your day with a clear inbox, joining meetings fully prepared, and creating polished documents in record time with the help of M365 Copilot. Join us to see how Copilot transforms everyday tasks into effortless productivity!

Organized by
OCIO| NIH Library| CIT
Description

Advanced ChatGPT training is part 3 of a three-part series. 

This one-hour online training, led by OpenAI experts, is for those who have completed the ChatGPT 101 and 102 trainings. The training will focus on leveraging two of ChatGPT Enterprise's most powerful features: Custom GPTs and Data Analysis. Attendees will learn how to create specialized GPTs tailored for specific NIH tasks and how to use the Data Analysis feature to upload, interpret, and visualize Read More

Advanced ChatGPT training is part 3 of a three-part series. 

This one-hour online training, led by OpenAI experts, is for those who have completed the ChatGPT 101 and 102 trainings. The training will focus on leveraging two of ChatGPT Enterprise's most powerful features: Custom GPTs and Data Analysis. Attendees will learn how to create specialized GPTs tailored for specific NIH tasks and how to use the Data Analysis feature to upload, interpret, and visualize data sets for deeper insights. This training is designed to provide the skills needed to apply these advanced tools to complex, enterprise-level projects. 

By the end of this training, attendees will be able to: 

  • Build and deploy Custom GPTs tailored to specific NIH workflows. 
  • Use the Data Analysis feature to upload, analyze, and visualize data. 
  • Apply advanced techniques to solve complex problems using ChatGPT Enterprise. 

Attendees are expected to be able to utilize ChatGPT to be successful in this training.  

You can register for the other trainings in this series via the link(s) below:  

 ChatGPT 101

ChatGPT 102

Organized by
Ryan O'Neill (NHLBI)
Description

Artificial Evolution with Artificial Intelligence

Artificial Evolution with Artificial Intelligence

Organized by
ABCS/FNLCR
Description

We will demonstrate how the Frederick Research Compute Environment (FRCE) accelerates and streamlines genomic variant annotation workflows. By leveraging parallelization and high-performance resources, FRCE enables us to efficiently process and manage large-scale genomic datasets. This approach improves throughput, reproducibility, and scalability, making variant analysis pipelines more effective for large-scale datasets.

We will demonstrate how the Frederick Research Compute Environment (FRCE) accelerates and streamlines genomic variant annotation workflows. By leveraging parallelization and high-performance resources, FRCE enables us to efficiently process and manage large-scale genomic datasets. This approach improves throughput, reproducibility, and scalability, making variant analysis pipelines more effective for large-scale datasets.

Organized by
CIT Technology Training Program
Description

Join us for a quick tour of a “day in the life” with Microsoft 365 Copilot. In this 60-minute overview, see how M365 Copilot helps you manage emails, prep for meetings, and create documents effortlessly in Outlook, Teams, Word, Excel, and PowerPoint. Boost your productivity and make every day easier! Imagine starting your day with a clear inbox, joining meetings fully prepared, and creating polished documents in record time with the Read More

Join us for a quick tour of a “day in the life” with Microsoft 365 Copilot. In this 60-minute overview, see how M365 Copilot helps you manage emails, prep for meetings, and create documents effortlessly in Outlook, Teams, Word, Excel, and PowerPoint. Boost your productivity and make every day easier! Imagine starting your day with a clear inbox, joining meetings fully prepared, and creating polished documents in record time with the help of M365 Copilot. Join us to see how Copilot transforms everyday tasks into effortless productivity! 

Organized by
NCI Genomic Data Commons
Description

The Genomic Data Commons is releasing a new Correlation Plot Tool which provides a framework for correlating GDC molecular information (mutation, CNV, gene expression) with clinical and survival data. Using quick access features, researchers can compare mutation or CNV status of a gene with a clinical variable or survival, CNV and mutation for given genes, a gene's CNV with its expression, and gene expression level with survival. The tool assists in identifying statistically meaningful Read More

The Genomic Data Commons is releasing a new Correlation Plot Tool which provides a framework for correlating GDC molecular information (mutation, CNV, gene expression) with clinical and survival data. Using quick access features, researchers can compare mutation or CNV status of a gene with a clinical variable or survival, CNV and mutation for given genes, a gene's CNV with its expression, and gene expression level with survival. The tool assists in identifying statistically meaningful correlations between genomic variants and clinical phenotypes to uncover patterns that assist in enabling diagnostic and treatment discoveries. Join us for an overview and demonstration of the GDC Correlation Plot Tool, and associated data supporting correlative analysis.

May

Organized by
OCIO| NIH Library| CIT
Description

This 90-minute online training led by Google experts will introduce the foundational features of Gemini for Government, tailored specifically to accelerate research and enhance productivity within NIH workflows. This training will focus on immediate, high-impact use cases that solve everyday challenges, from drafting manuscripts to communicating scientific findings more effectively. Attendees will learn how to leverage Gemini for Government's secure, AI-powered tools to streamline tasks and will get a first look at the future Read More

This 90-minute online training led by Google experts will introduce the foundational features of Gemini for Government, tailored specifically to accelerate research and enhance productivity within NIH workflows. This training will focus on immediate, high-impact use cases that solve everyday challenges, from drafting manuscripts to communicating scientific findings more effectively. Attendees will learn how to leverage Gemini for Government's secure, AI-powered tools to streamline tasks and will get a first look at the future of research automation with agents. 

By the end of this training, attendees will be able to: 

  • Utilize Gemini for Government to accelerate daily tasks, including drafting manuscript sections and analyzing meeting notes. 
  • Transform a text-based research summary into a clear and effective visual concept for an infographic. 
  • Perform natural language semantic searches to instantly find and synthesize information from scientific publications. 
  • Describe the potential of agents to automate research workflows. 

Attendees are not expected to have any prior knowledge of the tool to be successful in this training. Gemini for Government can be accessed at: https://go.hhs.gov/gemini  

Organized by
OCIO| NIH Library| CIT
Description

This 90-minute online training led by Google experts will dive into intermediate features, including agent creation and code generation, with Gemini for Government. The training will focus on hands-on applications, including building a simple agent with Agent Designer, using vibe-coding for data analysis, and leveraging NotebookLM as a personal research assistant. Attendees will also learn how to create more sophisticated, data-driven infographics to communicate their findings. 

By the end of this training, Read More

This 90-minute online training led by Google experts will dive into intermediate features, including agent creation and code generation, with Gemini for Government. The training will focus on hands-on applications, including building a simple agent with Agent Designer, using vibe-coding for data analysis, and leveraging NotebookLM as a personal research assistant. Attendees will also learn how to create more sophisticated, data-driven infographics to communicate their findings. 

By the end of this training, attendees will be able to:  

  • Build a simple, custom agent using Agent Designer to automate a research-related task, such as monitoring new publications. 
  • Generate Python scripts using natural language (vibe-coding) to clean and analyze data. 
  • Utilize NotebookLM to upload source materials, ask targeted questions across documents, and organize research insights. 
  • Create a data-driven infographic that transforms raw data into a compelling visual story. 

Attendees are expected to be familiar with the basic functions of Gemini to be successful in this training (gained by attending Gemini for Government 101), attending another relevant training, and/or using Gemini previously).  Gemini for Government can be accessed at: https://go.hhs.gov/gemini  

Organized by
SEER*Stat Tools Series
Description

Joinpoint regression is commonly used to model trends in time-specific estimates derived from aggregate data. These methods were developed primarily for non-survey data, such as cancer registry data, under the assumption that estimates at different time points are uncorrelated or follow an autoregressive AR(1) error structure. However, directly applying existing joinpoint methods to complex survey data-for example, multistage cluster samples from the annual National Health Interview Survey-fails to account for covariance among survey estimates Read More

Joinpoint regression is commonly used to model trends in time-specific estimates derived from aggregate data. These methods were developed primarily for non-survey data, such as cancer registry data, under the assumption that estimates at different time points are uncorrelated or follow an autoregressive AR(1) error structure. However, directly applying existing joinpoint methods to complex survey data-for example, multistage cluster samples from the annual National Health Interview Survey-fails to account for covariance among survey estimates induced by the sample design.

To address this limitation, we extended joinpoint methods for aggregate outcomes to accommodate potentially correlated errors arising from complex survey designs. We also developed unit-level models that account for both this correlation structure and the design-based degrees of freedom required for valid inference. This presentation introduces these methods and presents results from empirical applications and simulation studies.

Organized by
CIT
Description

All problems and concerns are welcome, from scripting problems to node allocation, to strategies for a particular project, to anything that is affecting your use of the HPC systems. The meeting connection details are emailed to all Biowulf users the week of the consult.

Email staff@hpc.nih.gov for the meeting link.

 

All problems and concerns are welcome, from scripting problems to node allocation, to strategies for a particular project, to anything that is affecting your use of the HPC systems. The meeting connection details are emailed to all Biowulf users the week of the consult.

Email staff@hpc.nih.gov for the meeting link.

 

Organized by
NIH Library
Description

In partnership with the NIH Clinical Center's Biostatistics and Clinical Epidemiology Service (BCES), the NIH Library is offering several trainings that cover general concepts behind statistics and epidemiology. These trainings will help participants better understand and prepare data, interpret results and findings, design and prepare studies, and understand the results in published literature. 

This four-hour online training will provide a brief review of Read More

In partnership with the NIH Clinical Center's Biostatistics and Clinical Epidemiology Service (BCES), the NIH Library is offering several trainings that cover general concepts behind statistics and epidemiology. These trainings will help participants better understand and prepare data, interpret results and findings, design and prepare studies, and understand the results in published literature. 

This four-hour online training will provide a brief review of the principles of epidemiology, outbreak investigations, implications in public health, key concepts and terms, and commonly used statistics in epidemiology (e.g., morbidity and mortality rates; incidence and prevalence; relative risk; odds ratio; sensitivity and specificity). Time will be devoted to questions from attendees and references will be provided for in-depth self-study. 

By the end of this training, attendees will be able to:  

  • Define epidemiology and its key principles
  • Share the purpose and function of outbreak investigations
  • Describe methods for measuring risk
  • Be familiar with screening and diagnostic accuracy indices and their differences
  • Describe when to use relative risks and odds ratios
  • Explain differences between confounding and interaction 
Organized by
Ryan O'Neill (NHLBI)
Description

Join us for a day-long symposium exploring AI approaches in biomedical sciences, with the aim of sharing effective AI implementation strategies across NIH. 

Contact Lead Organizer Ryan O’Neill, PhD (oneillrs@nih.gov) for more info.

Sign language interpreting and CART services are available upon request to participate in this event. Individualsneeding either of these services and/or other reasonable accommodations should Read More

Join us for a day-long symposium exploring AI approaches in biomedical sciences, with the aim of sharing effective AI implementation strategies across NIH. 

Contact Lead Organizer Ryan O’Neill, PhD (oneillrs@nih.gov) for more info.

Sign language interpreting and CART services are available upon request to participate in this event. Individualsneeding either of these services and/or other reasonable accommodations should contact Lisa Bossert (lisa.bossert@nih.gov) by May 1.

Organized by
OCIO| NIH Library| CIT
Description

This 90-minute online training led by Google experts is the capstone session for power users who want to push the boundaries of AI in biomedical research. This session showcases advanced agentic workflows and complex comparative analysis. The training will feature a demo on building sophisticated research assistant agents with Agent Designer and will demonstrate additional NotebookLM use cases for research. 

By the end of this training, attendees will be able to:  

<Read More

This 90-minute online training led by Google experts is the capstone session for power users who want to push the boundaries of AI in biomedical research. This session showcases advanced agentic workflows and complex comparative analysis. The training will feature a demo on building sophisticated research assistant agents with Agent Designer and will demonstrate additional NotebookLM use cases for research. 

By the end of this training, attendees will be able to:  

  • Design a complex, multi-agent system in Agent Designer capable of automating a research sub-task, such as finding and comparing experimental protocols. 
  • Apply advanced NotebookLM techniques to perform complex comparative analysis on diverse scientific data sources. 
  • Develop strategies for using AI to analyze a portfolio of grants and publications to identify alignment with NIH strategic priorities. 

Attendees are expected to be able to independently utilize Gemini to be successful in this training.  Gemini for Government can be accessed at: https://go.hhs.gov/gemini  

June

Join Meeting
Organized by
BTEP
Description

Qlucore Omics Explorer is a desktop-based point-and-click software with built-in machine learning capabilities. It enables RNA sequencing (bulk and single cell), proteomics and metabolomics analysis. This software is available for NCI CCR scientists upon submitting a ticket at https://service.cancer.gov/ncisp. In this demonstration-only class, Qlucore scientist will illustrate single cell RNA sequencing analysis workflow starting from data import through performing QC, visualization, clustering (tSNE, UMAP, 3D PCA) and marker-based cell type Read More

Qlucore Omics Explorer is a desktop-based point-and-click software with built-in machine learning capabilities. It enables RNA sequencing (bulk and single cell), proteomics and metabolomics analysis. This software is available for NCI CCR scientists upon submitting a ticket at https://service.cancer.gov/ncisp. In this demonstration-only class, Qlucore scientist will illustrate single cell RNA sequencing analysis workflow starting from data import through performing QC, visualization, clustering (tSNE, UMAP, 3D PCA) and marker-based cell type identification. Experience using or installation of this software is not required for attendance. Participation is restricted to NIH staff.

Organized by
FAES
Description

This series invites Principal Investigators, Senior Scientists, and Senior Clinicians to share cutting-edge research and developments in their fields. Each session includes a 20-30 minute presentation followed by a Q&A or journal club discussion, fostering deeper insights and scholarly exchange. Lunch is provided. Please note this event is only open to members of the NIH community.

Recent advances in large language models (LLMs) have enabled powerful AI agents for biomedical Read More

This series invites Principal Investigators, Senior Scientists, and Senior Clinicians to share cutting-edge research and developments in their fields. Each session includes a 20-30 minute presentation followed by a Q&A or journal club discussion, fostering deeper insights and scholarly exchange. Lunch is provided. Please note this event is only open to members of the NIH community.

Recent advances in large language models (LLMs) have enabled powerful AI agents for biomedical research, yet their adoption in high-stakes settings remains limited by concerns about hallucination, opacity, and reliability. In this talk, I discuss how expert-curated domain knowledge can be used to help mitigate these challenges in general-purpose LLMs. Drawing on real-world systems and case studies such as GeneAgent (Nature Methods 2025), I will highlight design principles for building AI agents that are scientifically sound, interpretable, and suitable for biomedical research and clinical applications.

Organized by
OCIO| NIH Library| CIT
Description

This 90-minute online training led by Google experts will introduce the foundational features of Gemini for Government, tailored specifically to accelerate research and enhance productivity within NIH workflows. This training will focus on immediate, high-impact use cases that solve everyday challenges, from drafting manuscripts to communicating scientific findings more effectively. Attendees will learn how to leverage Gemini for Government's secure, AI-powered tools to streamline tasks and will get a first look at the future Read More

This 90-minute online training led by Google experts will introduce the foundational features of Gemini for Government, tailored specifically to accelerate research and enhance productivity within NIH workflows. This training will focus on immediate, high-impact use cases that solve everyday challenges, from drafting manuscripts to communicating scientific findings more effectively. Attendees will learn how to leverage Gemini for Government's secure, AI-powered tools to streamline tasks and will get a first look at the future of research automation with agents. 

By the end of this training, attendees will be able to: 

  • Utilize Gemini for Government to accelerate daily tasks, including drafting manuscript sections and analyzing meeting notes. 
  • Transform a text-based research summary into a clear and effective visual concept for an infographic. 
  • Perform natural language semantic searches to instantly find and synthesize information from scientific publications. 
  • Describe the potential of agents to automate research workflows. 

Attendees are not expected to have any prior knowledge of the tool to be successful in this training. Gemini for Government can be accessed at: https://go.hhs.gov/gemini  

Organized by
CIT
Description

All problems and concerns are welcome, from scripting problems to node allocation, to strategies for a particular project, to anything that is affecting your use of the HPC systems. The meeting connection details are emailed to all Biowulf users the week of the consult.

Email staff@hpc.nih.gov for the meeting link.

All problems and concerns are welcome, from scripting problems to node allocation, to strategies for a particular project, to anything that is affecting your use of the HPC systems. The meeting connection details are emailed to all Biowulf users the week of the consult.

Email staff@hpc.nih.gov for the meeting link.

Organized by
OCIO| NIH Library| CIT
Description

This 90-minute online training led by Google experts will dive into intermediate features, including agent creation and code generation, with Gemini for Government. The training will focus on hands-on applications, including building a simple agent with Agent Designer, using vibe-coding for data analysis, and leveraging NotebookLM as a personal research assistant. Attendees will also learn how to create more sophisticated, data-driven infographics to communicate their findings. 

By the end of this training, Read More

This 90-minute online training led by Google experts will dive into intermediate features, including agent creation and code generation, with Gemini for Government. The training will focus on hands-on applications, including building a simple agent with Agent Designer, using vibe-coding for data analysis, and leveraging NotebookLM as a personal research assistant. Attendees will also learn how to create more sophisticated, data-driven infographics to communicate their findings. 

By the end of this training, attendees will be able to:  

  • Build a simple, custom agent using Agent Designer to automate a research-related task, such as monitoring new publications. 
  • Generate Python scripts using natural language (vibe-coding) to clean and analyze data. 
  • Utilize NotebookLM to upload source materials, ask targeted questions across documents, and organize research insights. 
  • Create a data-driven infographic that transforms raw data into a compelling visual story. 

Attendees are expected to be familiar with the basic functions of Gemini to be successful in this training (gained by attending Gemini for Government 101), attending another relevant training, and/or using Gemini previously).  Gemini for Government can be accessed at: https://go.hhs.gov/gemini  

Join Meeting
Organized by
BTEP
Description

Partek Flow is a point-and-click platform for building analysis workflows for Next Generation Sequences (NGS), including DNA, bulk and single-cell RNA, spatial transcriptomics, ATAC, and ChIP, helping scientists avoid the steep learning curve of code-based NGS analysis. In this demonstration-only class, Illumina scientist will illustrate how to obtain insights to regulation of gene expression from bulk RNA and ATAC sequencing data. No prior experience or access to Partek Flow is required. Attendance is limited Read More

Partek Flow is a point-and-click platform for building analysis workflows for Next Generation Sequences (NGS), including DNA, bulk and single-cell RNA, spatial transcriptomics, ATAC, and ChIP, helping scientists avoid the steep learning curve of code-based NGS analysis. In this demonstration-only class, Illumina scientist will illustrate how to obtain insights to regulation of gene expression from bulk RNA and ATAC sequencing data. No prior experience or access to Partek Flow is required. Attendance is limited to NIH staff.

Organized by
OCIO| NIH Library| CIT
Description

This 90-minute online training led by Google experts is the capstone session for power users who want to push the boundaries of AI in biomedical research. This session showcases advanced agentic workflows and complex comparative analysis. The training will feature a demo on building sophisticated research assistant agents with Agent Designer and will demonstrate additional NotebookLM use cases for research. 

By the end of this training, attendees will be able to:&Read More

This 90-minute online training led by Google experts is the capstone session for power users who want to push the boundaries of AI in biomedical research. This session showcases advanced agentic workflows and complex comparative analysis. The training will feature a demo on building sophisticated research assistant agents with Agent Designer and will demonstrate additional NotebookLM use cases for research. 

By the end of this training, attendees will be able to:  

  • Design a complex, multi-agent system in Agent Designer capable of automating a research sub-task, such as finding and comparing experimental protocols. 
  • Apply advanced NotebookLM techniques to perform complex comparative analysis on diverse scientific data sources. 
  • Develop strategies for using AI to analyze a portfolio of grants and publications to identify alignment with NIH strategic priorities. 

Attendees are expected to be able to independently utilize Gemini to be successful in this training.  Gemini for Government can be accessed at: https://go.hhs.gov/gemini  

Distinguished Speakers Seminar Series

Join Meeting
Organized by
BTEP
Description

Dr. Bocks' research utilizes a synergistic "READ, LEARN, WRITE" framework that combines multi-omics profiling, deep learning, and high-throughput CRISPR screening to map, model, and program complex cellular functions. By integrating single-cell technologies to read epigenetic states with advanced neural networks to learn their regulatory circuits, he can systematically design and write new biological instructions into human cells. They successfully applied this integrated approach to optimize immunotherapy, using large-scale in vivo CRISPR screens to identify Read More

Dr. Bocks' research utilizes a synergistic "READ, LEARN, WRITE" framework that combines multi-omics profiling, deep learning, and high-throughput CRISPR screening to map, model, and program complex cellular functions. By integrating single-cell technologies to read epigenetic states with advanced neural networks to learn their regulatory circuits, he can systematically design and write new biological instructions into human cells. They successfully applied this integrated approach to optimize immunotherapy, using large-scale in vivo CRISPR screens to identify and validate gene knockouts that significantly boost the performance of CAR T cells against solid tumors.