Featured

(April 16) Sandrine Dudoit PhD, Distinguished Speakers Seminar Series

Learning from Data in Single Cell Transcriptomics

Registration

(March 19) Vince Carey PhD - Bioconductor Decade 3: Evolving an Open Ecosystem for Genomic Data Science

BTEP Distinguished Speakers Seminar Series

More Details

April 9 Webinar: OncoFold - Visualizing Somatic Mutations in 3D Protein Structures (Felix Dietlein MD PhD)

OncoFold enables researchers to interpret mutations in cancer through their structural context, explore significantly mutated regions with ligands and detailed domain annotations, identify spatial clustering indicative of positive selection, and gain mechanistic insights into how specific mutations may alter protein function and contribute to tumorigenesis.

Details

Biowulf New User Orientation

The NIH HPC staff offers biweekly 30-minute orientation sessions for new Biowulf users. If you are a new user unsure about how to get started, please join one of these sessions.

Learn More Here

(March 25) Documenting Analysis with Jupyter Lab

Learn about Jupyter Lab, a tool for maintaining analysis code and output in one place to facilitate reproducible data analysis.

Registration

Upcoming Classes & Events

March

Monday

ChatGPT Learning Sessions: ChatGPT 102

When: Mon, Mar 09, 2026 - 11:00 am - 12:00 pm
Delivery: Online
Presented By: Guest Speaker(s)

Organized by

OCIO | NIH Library | CIT

Description

ChatGPT 102 training is part 2 of a three-part series.

This one-hour online training led by OpenAI experts will dive deeper into intermediate features and strategies for maximizing ChatGPT Enterprise in NIH workflows. Building on the fundamentals from ChatGPT 101, this training will focus on intermediate features including Custom GPTs, Projects, Data Analysis, coding in Canvas, and Deep Research to enable broader value creation and collaboration with ChatGPT. Attendees will also learn how to integrate ChatGPT into specialized tasks and optimize outputs for NIH-specific use cases.

By the end of this training, attendees will be able to:

Create and customize GPTs and projects to serve as tailored assistants for NIH-specific initiatives and domains.
Utilize additional intermediate features including Data Analysis, coding in Canvas, and Deep Research, to handle complex tasks and collaborative workflows.
Implement best practices for integrating ChatGPT into broader NIH processes while maintaining compliance and security standards.

Attendees are expected to be familiar with the basic functions of ChatGPT to be successful in this training (gained by attending ChatGPT 101, attending another relevant training, and/or using ChatGPT previously).

View Details

Monday

AI Club: Investigating the Impact of Silencers on Disease Using Deep Learning

When: Mon, Mar 09, 2026 - 11:00 am - 12:00 pm
Delivery: Hybrid
Location: NIH Library Training Room, Building 10, Clinical Center, South Entrance
Presented By: Di Huang PhD (NCBI)

Organized by

Ryan O'Neill (NHLBI)

Description

Using deep learning to investigate regulatory silencers

View Details

Tuesday

Introduction to Data Wrangling Using Python: Part 1 of 2

When: Tue, Mar 10, 2026 - 10:00 am - 11:00 am
Delivery: Online
Presented By: Cindy Sheffield (NIH Library)

Organized by

NIH Library

Description

This one-hour online training is the first of a two-part series. It introduces participants to cleaning and exploring a patient health dataset using Python and pandas. Attendees will load tabular data, inspect structure and data types, summarize columns, and identify common data quality problems such as missing values, inconsistent formats, and duplicate records. They will then apply practical fixes, including standardizing height and weight units, parsing and normalizing dates of birth, splitting combined fields, ...

By the end of this session students will be able to:

Import CSV data into pandas DataFrames and quickly understand column types, basic statistics, and overall data quality.
Identify duplicate or repeated patient records and decide whether to keep, correct, or remove them.
Detect and handle missing or inconsistent values using methods such as isna, fillna, filtering, and conditional replacement.
Standardize mixed formats (for example, heights with and without units, date strings in different formats, and numeric values embedded in text).
Create derived columns such as systolic and diastolic blood pressure, and use logical conditions to flag questionable or out-of-range values.
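
As a rough illustration of the skills listed above, the following pandas sketch removes duplicate records, standardizes a mixed-format height column, handles missing values with isna and fillna, and derives systolic and diastolic columns. It is not the class material: the column names (`patient_id`, `height`, `bp`), the inline data, and the 70 to 250 plausibility range are hypothetical and chosen only for this example.

```python
import pandas as pd

# Hypothetical patient records; names and values are invented for illustration.
raw = pd.DataFrame(
    {
        "patient_id": [1, 2, 2, 3],
        "height": ["170 cm", "165", "165", None],
        "bp": ["120/80", "135/90", "135/90", "110/70"],
    }
)

# Remove exact duplicate rows (patient 2 appears twice).
clean = raw.drop_duplicates().reset_index(drop=True)

# Standardize heights: strip an optional "cm" suffix and convert to numbers.
clean["height_cm"] = pd.to_numeric(
    clean["height"].str.replace("cm", "", regex=False).str.strip(),
    errors="coerce",
)

# Record which heights were missing, then fill them with the median.
clean["height_missing"] = clean["height_cm"].isna()
clean["height_cm"] = clean["height_cm"].fillna(clean["height_cm"].median())

# Derive systolic and diastolic columns from the combined "bp" field.
parts = clean["bp"].str.split("/", expand=True)
clean["systolic"] = parts[0].astype(int)
clean["diastolic"] = parts[1].astype(int)

# Flag readings outside an illustrative (assumed) plausible range.
clean["bp_questionable"] = ~clean["systolic"].between(70, 250)

print(clean)
```

Filling with the median is only one option; depending on the analysis, dropping the row or leaving the value missing may be more appropriate.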

Attendees are expected to have:

Basic Python coding knowledge
Familiarity with an IDE (Colab, Jupyter Notebooks) and with loading script and data files into it

Requirements:

Participants will receive a script file and data files prior to the training. These should be loaded and ready to use before the training session begins.

You can register for Part 2 in this series via the link below:

https://www.nihlibrary.nih.gov/training/introduction-data-wrangling-using-python-part-2-2

View Details

Tuesday

P-values: What They Are, What They Mean, and How to Use Them

When: Tue, Mar 10, 2026 - 12:00 pm - 1:00 pm
Delivery: Hybrid
Location: Bldg 549, Frederick, Ft. Detrick, Executive Board Room
Presented By: Alexander Y. Mitrophanov PhD (ABCS/FNLCR)

Organized by

ABCS/FNLCR

Description

The P-value is a cornerstone notion in statistics that is often used as the main deciding factor in determining the conclusiveness and reliability of empirical findings. Despite its ubiquity as a data-analysis feature in biological sciences, the rigorous definition, study, and interpretation of P-values can be elusive for practitioners. In this lecture, I will de-mystify P-values and explain their proper use in statistical hypothesis testing and related methodologies. Our main goal will be to ...

This session will be recorded, and all materials will be posted on our training website and shared with attendees a few days after the event. In addition, the Advanced Biomedical Computational Science (ABCS) group at Frederick National Lab (FNL) provides statistical analysis and consultation for NCI and FNL laboratories. ABCS also hosts virtual office hours every Wednesday from 12:00–1:00 p.m. ET on Teams. For more details, please contact Natasha Pacheco.

View Details

Tuesday

Battle of the Bots: ChatGPT and Chirp Explained

When: Tue, Mar 10, 2026 - 1:00 pm - 2:00 pm
Delivery: Online
Presented By: Chris Graves (CIT)

Join Meeting

Organized by

CIT Technology Training Program

Description

This fast-paced, 60-minute class puts two AI heavyweights—Chirp and ChatGPT—head to head. Through live demos, real prompts, and a few surprises, participants will see how each model writes, reasons, and responds when given the same challenges. Along the way, we’ll explore what each tool does best, where they stumble, and how to choose the right AI partner for different tasks. Expect practical takeaways, engaging examples, and a clearer ...

View Details

Tuesday

Introduction to R and RStudio

Part Of: Introductory R for Novices: Getting Started with R Course
When: Tue, Mar 10, 2026 - 2:00 pm - 3:00 pm
Delivery: Online
Presented By: Alex Emmons (BTEP)

Join Meeting

Organized by

BTEP

Description

This lesson will serve as a general introduction to R and RStudio. Attendees will explore the RStudio integrated development environment (IDE) and get started with R programming.

Wednesday

Introduction to Data Wrangling Using Python: Part 2 of 2

When: Wed, Mar 11, 2026 - 10:00 am - 11:00 am
Delivery: Online
Presented By: Cindy Sheffield (NIH Library)

Organized by

NIH Library

Description

This one-hour online training, the second session of the two-part series, focuses on reshaping and enriching the cleaned patient dataset to prepare it for analysis and reporting. Attendees will practice splitting and recombining columns (for example, separating full names into first and last names), converting columns to appropriate data types, and engineering new fields such as outlier indicators and blood pressure status labels. The session also covers merging multiple tables (patient details, contact ...

By the end of this training, attendees will be able to:

Reshape and restructure data by splitting and combining columns, changing data types, and reordering or selecting relevant fields.
Engineer clinically useful features, including z-score–based outlier flags, hypertension indicators, and combined status columns for downstream models or dashboards.
Merge and join DataFrames using common keys (such as patient ID) to bring together core data with supplemental tables like contact information.
Filter and subset records based on multiple conditions (for example, patients with diabetes and abnormal blood pressure) to create analysis-ready datasets.
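
The feature-engineering, joining, and filtering ideas above can be sketched without any library so that the arithmetic stays visible. This is a plain-Python illustration, not the pandas workflow taught in the class; the records, the z-score cutoff of 2.0, and the systolic threshold of 140 are all assumptions made for the example.

```python
from statistics import mean, pstdev

# Hypothetical records; IDs, readings, and thresholds are invented for illustration.
patients = [
    {"patient_id": 1, "systolic": 118, "diabetes": False},
    {"patient_id": 2, "systolic": 122, "diabetes": True},
    {"patient_id": 3, "systolic": 120, "diabetes": False},
    {"patient_id": 4, "systolic": 119, "diabetes": False},
    {"patient_id": 5, "systolic": 121, "diabetes": False},
    {"patient_id": 6, "systolic": 190, "diabetes": True},
]
# Supplemental table keyed by patient_id (only some patients have an entry).
contacts = {1: "a@example.org", 6: "f@example.org"}

values = [p["systolic"] for p in patients]
mu, sigma = mean(values), pstdev(values)

for p in patients:
    # z-score: distance from the mean in units of standard deviation.
    z = (p["systolic"] - mu) / sigma
    p["outlier"] = abs(z) > 2.0            # assumed cutoff
    p["hypertensive"] = p["systolic"] >= 140  # assumed threshold
    # Left-join style lookup: patients without a contact row get None.
    p["email"] = contacts.get(p["patient_id"])

# Subset on multiple conditions: diabetes and elevated blood pressure.
flagged = [p["patient_id"] for p in patients if p["diabetes"] and p["hypertensive"]]
print(flagged)
```

In pandas, the same steps would typically use column arithmetic for the z-score, `merge` on the shared key for the join, and boolean masks for the filter.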

Attendees are expected to have:

Attended Part 1 of this series, Introduction to Data Wrangling Using Python
Basic Python coding knowledge
Familiarity with an IDE (Colab, Jupyter Notebooks) and with loading script and data files into it

Requirements:

Participants will receive a script file and data files prior to the training. These should be loaded and ready to use before the training session begins.

You can register for Part 1 in this series via the link below:

https://www.nihlibrary.nih.gov/training/introduction-data-wrangling-using-python-part-1-2

View Details

Thursday

Prompt Once, Use Everywhere: Build an AI Prompt Library in Microsoft 365

When: Thu, Mar 12, 2026 - 1:00 pm - 3:00 pm
Delivery: Online
Presented By: Abby Herriman (CIT)

Organized by

CIT Technology Training Program

Description

If you use AI tools even occasionally, you’ve probably spent more time than you’d like rewriting prompts, tweaking outputs, or trying to remember “that one prompt that worked.” This live, hands-on class shows you how to stop starting over. You’ll learn how to turn your best prompts into reusable, high-quality assets—stored and shared using the Microsoft 365 tools you already work in every day. In under two hours, you’ll learn practical prompt design techniques that work across tools like ChatGPT, Claude, and CHiRP, and how to organize them in Teams, SharePoint, Word, Excel, and Loop so they’re easy to find, reuse, and improve. The focus is real NIH work, responsible AI use, and immediately applicable skills. You’ll leave with ready-to-use templates, example prompts, and a clear system you can apply the same day to save time, improve results, and make AI a reliable part of your workflow—not an experiment you have to rethink each time.

View Details

Thursday

Statistics and Epidemiology - Part 2: Overview of Study Design

When: Thu, Mar 12, 2026 - 1:00 pm - 4:00 pm
Delivery: Online
Presented By: Ninet Sinaii Ph.D. MPH (Biostatistics and Clinical Epidemiology Branch, NIH Clinical Center)

Organized by

NIH Library

Description

In partnership with the NIH Clinical Center's Biostatistics and Clinical Epidemiology Service (BCES), the NIH Library is offering several trainings that cover general concepts behind statistics and epidemiology. These trainings will help participants better understand and prepare data, interpret results and findings, design and prepare studies, and understand the results in published literature.

This three-hour online training will provide a review of study designs in biomedical research. This training will also cover details related to case studies/series, ecological, cross-sectional, case-control, and cohort studies, clinical trials, and other study designs and considerations. Time will be devoted to questions from attendees and references will be provided for in-depth self-study.

By the end of this training, attendees will be able to: 

Describe two broad categories of study designs

Provide examples of descriptive and analytic studies

Explain the advantages and disadvantages of analytic studies

Understand the differences between observational and experimental studies

List other types of atypical study designs

View Details

Thursday

Basics of R Programming: R Objects and Data Types

Part Of: Introductory R for Novices: Getting Started with R Course
When: Thu, Mar 12, 2026 - 2:00 pm - 3:00 pm
Delivery: Online
Presented By: Alex Emmons (BTEP)

Join Meeting

Organized by

BTEP

Description

In this lesson, attendees will learn the most basic features of the R programming language. The focus will be on R syntax, R objects, and data types.

Friday

Sequence Read Archive: Leveraging this Petabyte-scale Database to Drive Biomedical Discovery

When: Fri, Mar 13, 2026 - 12:00 pm - 1:00 pm
Delivery: Online
Presented By: Derek Caetano-Anolles PhD (NCBI)

Organized by

Data Sharing and Reuse Seminar Series

Description

The Sequence Read Archive (SRA) is the largest publicly available repository of high-throughput sequencing data. With big data come big challenges, and that includes keeping the SRA sustainable while making sure that data is findable, accessible, interoperable and reusable. Following a brief introduction to the SRA and the expanse of data it holds, we will share best practices for accessing SRA data for your analyses and the various formats you may encounter. Finally, we will describe the SRA Lite file format, which is faster to download with the added advantage of shrinking the overall footprint of SRA. We will demonstrate the use of SRA Lite format in NCBI RNA-seq pipelines and related analyses, and offer appropriate NCBI resources to learn more and engage with us.

View Details

Friday

AI Update: What's New in Artificial Intelligence

When: Fri, Mar 13, 2026 - 1:00 pm - 1:45 pm
Delivery: Online
Presented By: Alicia Lillich (NIH Library)

Organized by

NIH Library

Description

This 45-minute online training provides a high-level overview of recent developments in artificial intelligence (AI). Each session highlights emerging trends, tools, and use cases in the evolving AI landscape, with an emphasis on practical relevance and responsible use. Whether you're just getting started or looking to stay current, this training offers timely insights in a concise format.

By the end of this training, attendees will be able to: 

Summarize key trends and developments in AI

Identify new tools, capabilities, or applications relevant to their work

Describe considerations for ethical and responsible use of AI technologies

Attendees are not expected to have any prior knowledge to be successful in this training.

View Details

Monday

ChatGPT Learning Session: Advanced Session - Custom GPTs and Data Analysis

When: Mon, Mar 16, 2026 - 11:00 am - 12:00 pm
Delivery: Online
Presented By: Guest Speaker(s)

Organized by

OCIO | NIH Library | CIT

Description

Advanced ChatGPT training is part 3 of a three-part series.

This one-hour online training, led by OpenAI experts, is for those who have completed the ChatGPT 101 and 102 trainings. The training will focus on leveraging two of ChatGPT Enterprise's most powerful features: Custom GPTs and Data Analysis. Attendees will learn how to create specialized GPTs tailored for specific NIH tasks and how to use the Data Analysis feature to upload, interpret, and visualize ...

By the end of this training, attendees will be able to:

Build and deploy Custom GPTs tailored to specific NIH workflows.
Use the Data Analysis feature to upload, analyze, and visualize data.
Apply advanced techniques to solve complex problems using ChatGPT Enterprise.

Attendees are expected to be able to utilize ChatGPT to be successful in this training. 

You can register for the other trainings in this series via the link(s) below: 

 ChatGPT 101

ChatGPT 102

View Details

Monday

AI Club: The Replication Gap: Moving NIH Beyond Computational Reproducibility

When: Mon, Mar 16, 2026 - 11:00 am - 12:00 pm
Delivery: Hybrid
Location: NIH Library Training Room, Building 10, Clinical Center, South Entrance
Presented By: Sepid Mazrouee PhD (NIAID)

Organized by

Ryan O'Neill (NHLBI)

Description

The Replication Gap: Moving NIH Beyond Computational Reproducibility

View Details

Tuesday

Basics of R Programming: Vectors

Part Of: Introductory R for Novices: Getting Started with R Course
When: Tue, Mar 17, 2026 - 2:00 pm - 3:00 pm
Delivery: Online
Presented By: Alex Emmons (BTEP)

Join Meeting

Description

In this lesson, attendees will continue to learn basic features of the R programming language. The focus of this lesson will be vectors, one of the most common object types in R. You will learn why vectors are useful and how to create, modify, and export vectors.

Wednesday

Multi-modal Modeling in Precision Medicine: From Data Imputation to Synthetic Data

When: Wed, Mar 18, 2026 - 11:00 am - 12:00 pm
Delivery: Online
Presented By: Oliver Gevaert PhD (Stanford Medicine)

Organized by

CBIIT

Description

This seminar covers:

how cross-modal data modeling uses one data type (like imaging) to fill in gaps in another data type (like genomics).
ongoing multi-modal modeling efforts in spatial omics, digital pathology, and radiology.
how multi-modal modeling is anticipated to help us better understand disease biology and improve healthcare practices.

Multi-modal modeling can empower researchers like you to model complex interactions among diverse biomedical data types (including omics and imaging). Attend this seminar and get a better understanding of how one modality influences another, facilitating in-silico exploration of disease mechanisms without the need for extensive and costly real-world data collection.

View Details

Thursday

Bioconductor Decade 3: Evolving an Open Ecosystem for Genomic Data Science

When: Thu, Mar 19, 2026 - 1:00 pm - 2:00 pm
Delivery: Online
Presented By: Vincent J. Carey (Brigham and Women's Hospital, Harvard Medical School)

Distinguished Speakers Seminar Series

Join Meeting

Organized by

BTEP

Description

In this talk, Dr. Carey will describe how Bioconductor approaches new challenges in supporting open method development and reproducible analyses in genomic data science. He will discuss aspects of the project that bear on education in cancer epidemiology and computational cancer genomics, and on emerging topics in software and data engineering for scalable omics analyses.

Thursday

Introduction to R Data Structures: Data Import

Part Of: Introductory R for Novices: Getting Started with R Course
When: Thu, Mar 19, 2026 - 2:00 pm - 3:00 pm
Delivery: Online
Presented By: Alex Emmons (BTEP)

Join Meeting

Organized by

BTEP

Description

This lesson will introduce data structures including data frames and show attendees how to import data into the R environment.

Tuesday

Foundational Models for Cancer: Advancing Diagnosis, Prognosis, and Treatment Response

When: Tue, Mar 24 - Thu, Mar 26, 2026 - 10:00 am - 2:00 pm
Delivery: Online
Presented By: Asif Rizwan (NCI)

Organized by

NCI

Description

Overview
This 3-day virtual workshop will explore how foundation models, a powerful class of advanced AI models, can transform cancer research and clinical care. We will focus on their potential to improve diagnosis, prognosis, and treatment response, with a strong emphasis on clinical translation and technology development.
Key Topics:
Foundation Model Primer: A high-level introduction to foundation models.

Multimodal Data: Combining pathology, radiology, omics, and patient data into unified models.

Prediction: Predicting therapeutic response, resistance, and patient outcomes.

Validation and Reproducibility: Ensuring model results are consistent and reliable for real-world clinical performance and use.

Diagnostic Case Studies: Real-world applications for early detection and automated diagnostics.

Federated Learning: Approaches to training robust models across multiple institutions without sharing sensitive patient data.

Challenges, Risk, and Regulation: Addressing model interpretability and regulatory considerations for clinical adoption.

Agenda (https://events.cancer.gov/dctd/foundationmodel/agenda)

View Details

Tuesday

R Data Structures: Data Frames

Part Of: Introductory R for Novices: Getting Started with R Course

When: Tue, Mar 24, 2026 - 2:00 pm - 3:00 pm

Delivery: Online

Presented By: Alex Emmons (BTEP)

Join Meeting
Organized by
BTEP

Description

This is the last lesson in Part 1 of Introductory R for Novices: Getting Started with R. This lesson will focus exclusively on working with data frames. Attendees will learn how to examine, summarize, and access data in data frames.

Register Now View Details

Wednesday

Documenting Analysis with Jupyter Lab

When: Wed, Mar 25, 2026 - 2:00 pm - 3:00 pm

Delivery: Online

Presented By: Joe Wu (BTEP)

Join Meeting
Organized by
BTEP

Description

This class will introduce beginners, or those looking for a refresher, to Jupyter Lab, a platform used to organize code and analysis steps in one place. Jupyter Lab can be easily installed or run in a web browser, and supports several languages such as R and Python. It provides a way to keep track of all steps in an analysis and a place for collaboration. This class is a demo only and will not be hands-on. Prior experience with Jupyter Lab, or having it installed on a personal computer, is not needed to attend. This class is for NIH audiences only.

Register Now View Details

April

Monday

Proteomics Analysis Using Qlucore

When: Mon, Apr 06, 2026 - 11:00 am - 12:00 pm

Delivery: Online

Presented By: Jan Nilsson (Qlucore), Joe Wu (BTEP)

Join Meeting
Organized by
BTEP

Description

Qlucore Omics Explorer is a desktop-based point-and-click software package with built-in machine learning capabilities. It enables RNA sequencing (bulk and single cell), proteomics, and metabolomics analysis. This software is available to NCI CCR scientists upon submitting a ticket at https://service.cancer.gov/ncisp. In this demonstration-only class, a Qlucore scientist will illustrate a proteomics analysis workflow, from data import through performing QC, constructing visualizations (e.g., PCA, heatmap, volcano, box, and violin plots), and conducting GSEA. Experience with this software, or having it installed, is not required for attendance. Participation is restricted to NIH staff.

Register Now View Details

Tuesday

Python for Data Science: How to Get Started, What to Learn, and Why

When: Tue, Apr 07, 2026 - 10:00 am - 11:00 am

Delivery: Online

Presented By: Cindy Sheffield (NIH Library)

Organized by
NIH Library

Description

This one-hour online training will provide a high-level overview of Python coding concepts, as well as some of the integrated development environments (IDEs, such as Jupyter notebooks) used for Python coding. Python is a programming language widely used in data science, particularly for data analysis, statistical analysis, and visualization of results. The training will feature the following IDEs: Jupyter Notebook in Google Colaboratory, and Spyder, Jupyter Notebook, and JupyterLab in Anaconda. This overview training will demonstrate how these skills can boost productivity, rigor, and transparency in reporting research findings.

By the end of the training, attendees will be able to:

Recognize four freely available IDEs for Python coding

Identify fundamental components of Python code

Understand how and why notebooks support rigor and transparency in analysis

Attendees are not expected to have any prior knowledge of Python coding or the IDEs to be successful in this training.

If you choose to follow along with Google Colab or Jupyter Notebooks, these IDEs should be installed and ready to go. Code will be provided during the training for this option.

View Details

Wednesday

Getting Started with SAS

When: Wed, Apr 08, 2026 - 11:00 am - 12:00 pm

Delivery: Online

Presented By: Instructor (SAS)

Organized by
NIH Library

Description

This one-hour online training, provided by a presenter from SAS, introduces the basics of accessing SAS 9.4 tools and setting up your environment. 

By the end of this training, attendees will be able to:  

Load data using SAS Studio or Enterprise Guide

Run simple programs using SAS Studio or Enterprise Guide

Generate reports using SAS Studio or Enterprise Guide

Describe technical aspects, such as understanding libraries, managing data sets, and using core SAS procedures for analysis

Attendees are not expected to have any prior knowledge of SAS to be successful in this training.

View Details

Thursday

Statistics and Epidemiology - Part 3: Overview of Common Statistical Tests

When: Thu, Apr 09, 2026 - 10:00 am - 5:00 pm

Delivery: Online

Presented By: Ninet Sinaii Ph.D. MPH (Biostatistics and Clinical Epidemiology Branch, NIH Clinical Center)

Organized by
NIH Library

Description

In partnership with the NIH Clinical Center's Biostatistics and Clinical Epidemiology Service (BCES), the NIH Library is offering several trainings that cover general concepts behind statistics and epidemiology. These trainings will help participants better understand and prepare data, interpret results and findings, design and prepare studies, and understand the results in published literature.

This six-hour online training will describe the basic concepts for using common statistical tests such as Chi-square, paired and two-sample t-tests, ANOVA, correlations, simple and multiple regression, logistic regression, and survival analysis. Time will be devoted to questions from attendees and references will be provided for in-depth self-study.

By the end of this training, attendees will be able to: 

Explain the importance of study design and hypothesis

Describe types of data and their distributions

List examples of statistical tests for analyzing continuous data

List examples of statistical tests for analyzing dichotomous or categorical data

Understand differences in regression methods

Identify nonparametric tests and when to use them

The first part of the class will run from 10:00 a.m. to 12:00 p.m. ET, followed by a break from 12:00 to 1:00 p.m. The class resumes at 1:00 p.m. and concludes at 5:00 p.m.

View Details

Thursday

OncoFold: Visualizing Somatic Mutations in 3D Protein Structures

When: Thu, Apr 09, 2026 - 1:00 pm - 2:00 pm

Delivery: Online

Presented By: Do Young Hyeon (Harvard Medical School), Felix Dietlein MD PhD (Harvard Medical School), Yuxiang Zhou (Harvard Medical School)

Join Meeting
Organized by
BTEP

Description

OncoFold is a web resource to visualize somatic mutations in 3D protein structures. It enables researchers to interpret mutations in cancer through their structural context, explore significantly mutated regions with ligands and detailed domain annotations, identify spatial clustering indicative of positive selection, and gain mechanistic insights into how specific mutations may alter protein function and contribute to tumorigenesis.

Register Now View Details

Friday

ChatGPT Learning Sessions: ChatGPT 101

When: Fri, Apr 10, 2026 - 1:00 pm - 2:00 pm

Delivery: Online

Presented By: Guest Speaker(s)

Organized by
OCIO | NIH Library | CIT

Description

ChatGPT 101 training is part 1 of a three-part series.

This one-hour online training led by OpenAI experts will cover the fundamentals of using ChatGPT Enterprise effectively in your daily NIH workflows. Attendees will learn to navigate the ChatGPT interface, implement practices for prompt writing, and utilize key features, such as working with files, search functions, and content drafting in Canvas. The training will also demonstrate real-world use cases for improving productivity and highlight security and compliance features tailored for NIH staff.

By the end of this training, attendees will be able to:

Use ChatGPT Enterprise’s foundational features, including Working with documents, Search, and Canvas.

Apply effective prompt strategies to generate accurate, useful outputs for NIH-specific tasks.

Understand best practices to help ensure responsible use of generative AI tools like ChatGPT.

Attendees are not expected to have any prior knowledge of the tool to be successful in this training.

View Details

Monday

13

AI Club: Denoising for Light Microscopy using Deep Learning

When: Mon, Apr 13, 2026 - 11:00 am - 12:00 pm Add to Calendar

Delivery: Hybrid

Location: NIH Library Training Room, Building 10, Clinical Center South Entrance

Presented By: Sarah Hooper PhD (NHLBI)

Description

Denoising for Light Microscopy using Deep Learning

View Details

Tuesday

14

How to Make Your Data FAIR

When: Tue, Apr 14, 2026 - 1:00 pm - 2:30 pm Add to Calendar

Delivery: Online

Presented By: Raisa Ionin (NIH Library)

Organized by
NIH Library

Description

This one-and-a-half-hour online training covers the basic principles of FAIR (Findable, Accessible, Interoperable, Reusable) data and why it is important to make your data FAIR.

By the end of this training, attendees will be able to: 

Define FAIR data

Explain what purpose FAIR data serves

Apply FAIR data principles to make data findable, accessible, interoperable, and reusable

This is an introductory level training.

View Details

Thursday

16

Learning from Data in Single-Cell Transcriptomics

When: Thu, Apr 16, 2026 - 1:00 pm - 2:00 pm Add to Calendar

Delivery: Online

Presented By: Sandrine Dudoit (UC Berkeley)

Distinguished Speakers Seminar Series

Join Meeting
Organized by
BTEP

Description

The ability to measure gene expression levels for individual cells (vs. pools of cells) and with spatial resolution is crucial to address many important biological and medical questions, such as the study of stem cell differentiation, the discovery of cellular subtypes in the brain, and cancer diagnosis and treatment. Single-cell transcriptome sequencing (RNA-Seq) allows the high-throughput measurement of gene expression levels for entire genomes at the resolution of single cells. Spatially resolved transcriptomics further allows the measurement of gene expression levels along with the location of the RNA molecules within a tissue. Transcriptomics exemplifies the range of issues one encounters in a data science workflow, where the data are complex in a variety of ways, questions are not always clearly formulated, there are multiple analysis steps, and drawing on rigorous statistical principles and methods is essential to derive meaningful and reliable biological results.

In this talk, Dr. Dudoit will provide a survey of statistical questions related to the analysis of single-cell transcriptome sequencing data to investigate the differentiation of stem cells in the brain, including exploratory data analysis, expression quantitation, cluster analysis, and the inference of cellular lineages. She will also address differential expression analysis in spatial transcriptomics.

Register Now View Details

Friday

17

ChatGPT Learning Sessions: ChatGPT 102

When: Fri, Apr 17, 2026 - 1:00 pm - 2:00 pm Add to Calendar

Delivery: Online

Presented By: Guest Speaker(s)

Organized by
OCIO | NIH Library | CIT

Description

ChatGPT 102 training is part 2 of a three-part series.

This one-hour online training led by OpenAI experts will dive deeper into intermediate features and strategies for maximizing ChatGPT Enterprise in NIH workflows. Building on the fundamentals from ChatGPT 101, this training will focus on intermediate features including Custom GPTs, Projects, Data Analysis, coding in Canvas, and Deep Research to enable broader value creation and collaboration with ChatGPT. Attendees will also learn how to integrate ChatGPT into specialized tasks and optimize outputs for NIH-specific use cases.

By the end of this training, attendees will be able to:

Create and customize GPTs and projects to serve as tailored assistants for NIH-specific initiatives and domains.

Utilize additional intermediate features including Data Analysis, coding in Canvas, and Deep Research, to handle complex tasks and collaborative workflows.

Implement best practices for integrating ChatGPT into broader NIH processes while maintaining compliance and security standards.

Attendees are expected to be familiar with the basic functions of ChatGPT to be successful in this training (gained by attending ChatGPT 101, attending another relevant training, and/or using ChatGPT previously).

View Details

Monday

20

AI Club: An Artificial Intelligence-based Pipeline for Drosophila Behavioral Analysis

When: Mon, Apr 20, 2026 - 11:00 am - 12:00 pm Add to Calendar

Delivery: Hybrid

Location: NIH Library Training Room, Building 10, Clinical Center South Entrance

Presented By: Ryan O'Neill PhD (NHLBI)

Organized by
Ryan O'Neill (NHLBI)

Description

An Artificial Intelligence-based Pipeline for Drosophila Behavioral Analysis

View Details

Monday

20

Introduction to Data Wrangling Using Python: Part 1 of 2

When: Mon, Apr 20, 2026 - 1:00 pm - 2:00 pm Add to Calendar

Delivery: Online

Presented By: Cindy Sheffield (NIH Library)

Organized by
NIH Library

Description

This one-hour online training is the first of a two-part series that introduces participants to cleaning and exploring a patient health dataset using Python and pandas. Attendees will load tabular data, inspect structure and data types, summarize columns, and identify common data quality problems such as missing values, inconsistent formats, and duplicate records. They will then apply practical fixes, including standardizing height and weight units, parsing and normalizing dates of birth, splitting combined fields, and using Boolean masks to flag or correct implausible values.

By the end of this session, students will be able to:

Import CSV data into pandas DataFrames and quickly understand column types, basic statistics, and overall data quality.

Identify duplicate or repeated patient records and decide whether to keep, correct, or remove them.

Detect and handle missing or inconsistent values using methods such as isna, fillna, filtering, and conditional replacement.

Standardize mixed formats (for example, heights with and without units, date strings in different formats, and numeric values embedded in text).

Create derived columns such as systolic and diastolic blood pressure, and use logical conditions to flag questionable or out-of-range values.
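As a rough illustration of the objectives above, the following pandas sketch de-duplicates records, standardizes mixed height units, flags missing values, splits a combined blood pressure field, and uses a Boolean mask to flag implausible readings. The toy data, the column names, the helper `height_to_cm`, and the plausibility thresholds are all hypothetical and are not taken from the class materials.

```python
import pandas as pd

# Hypothetical toy records; names and values are for illustration only.
patients = pd.DataFrame({
    "patient_id": [1, 2, 2, 3],
    "height": ["170 cm", "1.65 m", "1.65 m", None],
    "blood_pressure": ["120/80", "150/95", "150/95", "300/80"],
})

# Drop rows that are exact repeats of an earlier row.
patients = patients.drop_duplicates().reset_index(drop=True)


def height_to_cm(value):
    """Convert strings such as '170 cm' or '1.65 m' to centimetres."""
    if pd.isna(value):
        return float("nan")
    number, unit = value.split()
    return float(number) * 100 if unit == "m" else float(number)


# Standardize mixed units into a single numeric column.
patients["height_cm"] = patients["height"].apply(height_to_cm)

# Flag missing heights rather than silently filling them.
patients["height_missing"] = patients["height_cm"].isna()

# Split the combined field into two derived numeric columns.
bp = patients["blood_pressure"].str.split("/", expand=True).astype(int)
patients["systolic"] = bp[0]
patients["diastolic"] = bp[1]

# Boolean mask for out-of-range values (thresholds are assumptions).
patients["bp_implausible"] = (patients["systolic"] > 250) | (patients["diastolic"] > 150)

print(patients)
```

Flagging questionable values in a separate column, instead of overwriting them, keeps the original data available for later review.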

Attendees are expected to have:

Basic Python coding knowledge

Familiarity with an IDE (Colab, Jupyter Notebooks) and with loading script and data files into it

Requirements:

Participants will receive a script file and data files prior to the training. These should be loaded and ready to use before the training session begins.

View Details

Tuesday

21

Introduction to Data Wrangling Using Python: Part 2 of 2

When: Tue, Apr 21, 2026 - 1:00 pm - 2:00 pm Add to Calendar

Delivery: Online

Presented By: Cindy Sheffield (NIH Library)

Organized by
NIH Library

Description

This one-hour online training, the second session of the two-part series, focuses on reshaping and enriching the cleaned patient dataset to prepare it for analysis and reporting. Attendees will practice splitting and recombining columns (for example, separating full names into first and last names), converting columns to appropriate data types, and engineering new fields such as outlier indicators and blood pressure status labels. The session also covers merging multiple tables (patient details, contact information, and subsets of records) and filtering or subsetting data to answer specific analytical questions.

By the end of this training, attendees will be able to:

Reshape and restructure data by splitting and combining columns, changing data types, and reordering or selecting relevant fields.

Engineer clinically useful features, including z-score–based outlier flags, hypertension indicators, and combined status columns for downstream models or dashboards.

Merge and join DataFrames using common keys (such as patient ID) to bring together core data with supplemental tables like contact information.

Filter and subset records based on multiple conditions (for example, patients with diabetes and abnormal blood pressure) to create analysis-ready datasets.
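The objectives above can be sketched in a few lines of pandas. This is a minimal example under stated assumptions: the data, the column names, the z-score cutoff of 2, and the use of the outlier flag as a stand-in for "abnormal blood pressure" are all hypothetical choices made for illustration, not the method taught in the class.

```python
import pandas as pd

# Hypothetical core table and supplemental contact table.
patients = pd.DataFrame({
    "patient_id": [1, 2, 3, 4, 5, 6],
    "full_name": ["Ana Ruiz", "Ben Cole", "Cy Diaz", "Di Evans", "Eli Fox", "Flo Gray"],
    "systolic": [118, 122, 120, 119, 121, 190],
    "has_diabetes": [False, True, False, False, True, True],
})
contacts = pd.DataFrame({
    "patient_id": [1, 2, 6],
    "email": ["ana@example.org", "ben@example.org", "flo@example.org"],
})

# Split one column into two.
names = patients["full_name"].str.split(" ", n=1, expand=True)
patients["first_name"] = names[0]
patients["last_name"] = names[1]

# z-score outlier flag; Series.std() uses the sample standard deviation.
z = (patients["systolic"] - patients["systolic"].mean()) / patients["systolic"].std()
patients["systolic_outlier"] = z.abs() > 2  # cutoff is an assumption

# Left join keeps every patient, even those without contact details.
merged = patients.merge(contacts, on="patient_id", how="left")

# Filter on multiple conditions combined with a Boolean AND.
subset = merged[merged["has_diabetes"] & merged["systolic_outlier"]]

print(subset[["patient_id", "first_name", "last_name", "systolic", "email"]])
```

A left join is used here so that patients missing from the contact table are retained with empty contact fields; an inner join would drop them.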

Attendees are expected to have:

Attended Introduction to Data Wrangling Using Python: Part 1 of 2

Basic Python coding knowledge

Familiarity with an IDE (Colab, Jupyter Notebooks) and with loading script and data files into it

Requirements:

Participants will receive a script file and data files prior to the training. These should be loaded and ready to use before the training session begins.

View Details

Friday

24

ChatGPT Learning Session: Advanced Session - Custom GPTs and Data Analysis

When: Fri, Apr 24, 2026 - 11:00 am - 12:00 pm Add to Calendar

Delivery: Online

Presented By: Guest Speaker(s)

Organized by
OCIO | NIH Library | CIT

Description

Advanced ChatGPT training is part 3 of a three-part series.

This one-hour online training, led by OpenAI experts, is for those who have completed the ChatGPT 101 and 102 trainings. The training will focus on leveraging two of ChatGPT Enterprise's most powerful features: Custom GPTs and Data Analysis. Attendees will learn how to create specialized GPTs tailored for specific NIH tasks and how to use the Data Analysis feature to upload, interpret, and visualize data sets for deeper insights. This training is designed to provide the skills needed to apply these advanced tools to complex, enterprise-level projects.

By the end of this training, attendees will be able to:

Build and deploy Custom GPTs tailored to specific NIH workflows.

Use the Data Analysis feature to upload, analyze, and visualize data.

Apply advanced techniques to solve complex problems using ChatGPT Enterprise.

Attendees are expected to be able to utilize ChatGPT to be successful in this training. 

You can register for the other trainings in this series via the link(s) below: 

ChatGPT 101

ChatGPT 102

View Details

Monday

27

AI Club: Artificial Evolution with Artificial Intelligence

When: Mon, Apr 27, 2026 - 11:00 am - 12:00 pm Add to Calendar

Delivery: Hybrid

Location: NIH Library Training Room, Building 10, Clinical Center South Entrance

Presented By: Harutyun Saakyan PhD (NCBI)

Organized by
Ryan O'Neill (NHLBI)

Description

Artificial Evolution with Artificial Intelligence

View Details

May

Thursday

14

Statistics and Epidemiology - Part 4: A Review of Epidemiology Concepts and Statistics

When: Thu, May 14, 2026 - 1:00 pm - 5:00 pm Add to Calendar

Delivery: Online

Presented By: Ninet Sinaii Ph.D. MPH (Biostatistics and Clinical Epidemiology Branch NIH Clinical Center)

Organized by
NIH Library

Description

In partnership with the NIH Clinical Center's Biostatistics and Clinical Epidemiology Service (BCES), the NIH Library is offering several trainings that cover general concepts behind statistics and epidemiology. These trainings will help participants better understand and prepare data, interpret results and findings, design and prepare studies, and understand the results in published literature.

This four-hour online training will provide a brief review of the principles of epidemiology, outbreak investigations, implications in public health, key concepts and terms, and commonly used statistics in epidemiology (e.g., morbidity and mortality rates; incidence and prevalence; relative risk; odds ratio; sensitivity and specificity). Time will be devoted to questions from attendees and references will be provided for in-depth self-study.

By the end of this training, attendees will be able to: 

Define epidemiology and its key principles

Share the purpose and function of outbreak investigations

Describe methods for measuring risk

Be familiar with screening and diagnostic accuracy indices and their differences

Describe when to use relative risks and odds ratios

Explain differences between confounding and interaction
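For readers who want a concrete picture of the statistics named above, here is a small Python sketch of the standard 2x2-table definitions of relative risk, odds ratio, sensitivity, and specificity. The function names and the counts are hypothetical and chosen only for illustration; they are not drawn from the training.

```python
def relative_risk(a, b, c, d):
    """Risk in the exposed group divided by risk in the unexposed group.

    a: exposed with disease      b: exposed without disease
    c: unexposed with disease    d: unexposed without disease
    """
    return (a / (a + b)) / (c / (c + d))


def odds_ratio(a, b, c, d):
    """Odds of disease among the exposed divided by odds among the unexposed."""
    return (a * d) / (b * c)


def sensitivity(tp, fn):
    """Proportion of people with the disease who test positive."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Proportion of people without the disease who test negative."""
    return tn / (tn + fp)


# Hypothetical cohort: 100 exposed (30 cases) and 100 unexposed (10 cases).
rr = relative_risk(30, 70, 10, 90)
odds = odds_ratio(30, 70, 10, 90)

# Hypothetical screening test evaluated on 100 diseased and 100 healthy people.
sens = sensitivity(tp=90, fn=10)
spec = specificity(tn=80, fp=20)

print(f"RR={rr:.2f} OR={odds:.2f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

In this made-up cohort the odds ratio is larger than the relative risk, which is typical when the outcome is not rare.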

View Details

Friday

15

NIH AI Symposium

When: Fri, May 15, 2026 - 9:00 am - 5:00 pm Add to Calendar

Delivery: In-Person

Location: Building 10, Masur Auditorium (Bethesda)

Presented By: Peter Kraft PhD (NCI), Michael Chiang MD (NEI), Francisco Pereira PhD (NIMH), RADM William Childs MD (NHLBI), Richard Scheuermann PhD (NLM), Brad Bower PhD (NIBIB), Ismail Baris Turkbey MD FSAR (NCI), Arash Afraz MD PhD (NIMH), Alison Motsinger-Reif PhD (NIEHS)

Organized by
Ryan O'Neill (NHLBI)

Description

Join us for a day-long symposium exploring AI approaches in biomedical sciences, with the aim of sharing effective AI implementation strategies across NIH.

Contact Lead Organizer Ryan O’Neill, PhD (oneillrs@nih.gov) for more info.

Sign language interpreting and CART services are available upon request to participate in this event. Individuals needing either of these services and/or other reasonable accommodations should contact Lisa Bossert (lisa.bossert@nih.gov) by May 1.

View Details

Bioinformatics Training and Education Program

Enabling scientists to understand and analyze their own experimental data by providing instruction and training in bioinformatics software, databases, analyses techniques, and emerging technologies.
