ncibtep@nih.gov

Bioinformatics Training and Education Program

Classes & Events

class_id details description start_date Venues learning_levels Topic Tags delivery_method presenters Organizer seminar_series class_title
1971
Organized By:
NIH Library
Description

This one-hour online training will provide a high-level overview of Python coding concepts, as well as some of the integrative development environments (IDEs, such as Jupyter notebooks) used for Python coding. Python is a programming language used for data science, specifically: data analysis, statistical analysis, and visualization of results. The training will feature the following IDEs: Google Colaboratory: Jupyter Notebook; and Anaconda’s: Spyder, Jupyter Notebook, and JupyterLab. This overview training will demonstrate how ...Read More

This one-hour online training will provide a high-level overview of Python coding concepts, as well as some of the integrative development environments (IDEs, such as Jupyter notebooks) used for Python coding. Python is a programming language used for data science, specifically: data analysis, statistical analysis, and visualization of results. The training will feature the following IDEs: Google Colaboratory: Jupyter Notebook; and Anaconda’s: Spyder, Jupyter Notebook, and JupyterLab. This overview training will demonstrate how these skills can boost productivity, rigor, and transparency in reporting research findings.  

By the end of the training, attendees will be able to: 

  • Recognize four freely available IDEs for python coding
  • Identify fundamental components of python code
  • Understand how and why notebooks support rigor and transparency in analysis 

Attendees are not expected to have any prior knowledge of python coding or the IDEs to be successful in this training.  

If you choose to follow along with Google Colab or Jupyter Notebooks, these IDEs should be installed and ready to go. Code will be provided during the training for this option. 

This one-hour online training will provide a high-level overview of Python coding concepts, as well as some of the integrative development environments (IDEs, such as Jupyter notebooks) used for Python coding. Python is a programming language used for data science, specifically: data analysis, statistical analysis, and visualization of results. The training will feature the following IDEs: Google Colaboratory: Jupyter Notebook; and Anaconda’s: Spyder, Jupyter Notebook, and JupyterLab. This overview training will demonstrate how these skills can boost productivity, rigor, and transparency in reporting research findings.   By the end of the training, attendees will be able to:  Recognize four freely available IDEs for python coding Identify fundamental components of python code Understand how and why notebooks support rigor and transparency in analysis  Attendees are not expected to have any prior knowledge of python coding or the IDEs to be successful in this training.   If you choose to follow along with Google Colab or Jupyter Notebooks, these IDEs should be installed and ready to go. Code will be provided during the training for this option.  2025-12-09 10:00:00 Online Beginner Programming Online Cindy Sheffield (NIH Library) NIH Library 0 Python for Data Science. What to Learn, How to Get Started, and Why
1972
Organized By:
NIH Library
Description

This one-hour online training provides an introduction on how to sign up and access complimentary SAS training resources available to NIH and HHS employees. 

By the end of this training, attendees will be able to:  

  • Enroll in recommended SAS 9.4 trainings and courses
  • Navigate complimentary SAS tutorials, programming courses, and eLearning 

Attendees are not expected to have any prior knowledge of SAS ...Read More

This one-hour online training provides an introduction on how to sign up and access complimentary SAS training resources available to NIH and HHS employees. 

By the end of this training, attendees will be able to:  

  • Enroll in recommended SAS 9.4 trainings and courses
  • Navigate complimentary SAS tutorials, programming courses, and eLearning 

Attendees are not expected to have any prior knowledge of SAS to be successful in this training. 

This one-hour online training provides an introduction on how to sign up and access complimentary SAS training resources available to NIH and HHS employees.  By the end of this training, attendees will be able to:   Enroll in recommended SAS 9.4 trainings and courses Navigate complimentary SAS tutorials, programming courses, and eLearning  Attendees are not expected to have any prior knowledge of SAS to be successful in this training.  2025-12-10 11:00:00 Online Beginner Software Online Instructor (SAS) NIH Library 0 Tips for Getting Started with SAS Programming
1976
Organized By:
NCI Office of Data Sharing
Description

Join us as poster presenters from the ODS Symposium showcase innovative work that highlights powerful resources, inspiring examples, and the far-reaching impact of data sharing and reuse.

Moderators: Ying Huang and Mousumi Ghosh

Join us as poster presenters from the ODS Symposium showcase innovative work that highlights powerful resources, inspiring examples, and the far-reaching impact of data sharing and reuse.

Moderators: Ying Huang and Mousumi Ghosh

Join us as poster presenters from the ODS Symposium showcase innovative work that highlights powerful resources, inspiring examples, and the far-reaching impact of data sharing and reuse. Moderators: Ying Huang and Mousumi Ghosh 2025-12-10 13:00:00 Online Any Data Online Dr. Mousumi Ghosh (Office of Data Sharing ODDSS NCI),Dr. Ying Huang (Office of Data Sharing ODDSS NCI) NCI Office of Data Sharing 0 Cancer Data Sharing in Action: Resources, Impact, and Examples
1981
Organized By:
HPC Biowulf
Description

All Biowulf users, and all those interested in using the systems, are invited to call in to our Virtual Walk-in Consult to discuss problems and concerns, from scripting problems to node allocation, to strategies for a particular project, to anything that is affecting your use of the HPC systems. Users will be assigned to a breakout-session with a member of the HPC staff to discuss the problem 1-on-1.  We'll try to address simpler ...Read More

All Biowulf users, and all those interested in using the systems, are invited to call in to our Virtual Walk-in Consult to discuss problems and concerns, from scripting problems to node allocation, to strategies for a particular project, to anything that is affecting your use of the HPC systems. Users will be assigned to a breakout-session with a member of the HPC staff to discuss the problem 1-on-1.  We'll try to address simpler issues on the spot and follow up on more complex questions after the session.

Email staff@hpc.nih.gov for the meeting link. Registration not required. 

At the consult: You will initially join the  main lobby and triage area.  There, you can briefly describe your issue, and then will be invited to join a 1-on-1 breakout room with a staff member.
Once you are finished with your focused consultation you can return to the main meeting room if you have additional questions or topics to discuss.  Please
- mute when not speaking
- refrain from screen sharing until asked to do so in the breakout room
- screen share as you would in a public space with the understanding that other NIH HPC staff may join and view what you are sharing (i.e. look over your shoulder)
- be prepared to wait your turn if staff are already helping other users

All Biowulf users, and all those interested in using the systems, are invited to call in to our Virtual Walk-in Consult to discuss problems and concerns, from scripting problems to node allocation, to strategies for a particular project, to anything that is affecting your use of the HPC systems. Users will be assigned to a breakout-session with a member of the HPC staff to discuss the problem 1-on-1.  We'll try to address simpler issues on the spot and follow up on more complex questions after the session. Email staff@hpc.nih.gov for the meeting link. Registration not required.  At the consult: You will initially join the  main lobby and triage area.  There, you can briefly describe your issue, and then will be invited to join a 1-on-1 breakout room with a staff member.Once you are finished with your focused consultation you can return to the main meeting room if you have additional questions or topics to discuss.  Please- mute when not speaking- refrain from screen sharing until asked to do so in the breakout room- screen share as you would in a public space with the understanding that other NIH HPC staff may join and view what you are sharing (i.e. look over your shoulder)- be prepared to wait your turn if staff are already helping other users 2025-12-10 13:00:00 Any Computing Resources Online Biowulf Staff members HPC Biowulf 0 Virtual Walk-In Consult for Biowulf Users
1951
Join Meeting
Organized By:
BTEP
Description

Partek Flow enables scientists to construct analysis workflows for multi-omics sequencing data including DNA, bulk and single cell RNA, spatial transcriptomics, ATAC and ChIP. It is a point-and-click software suitable for those who wish to avoid the steep learning curve involved with analyzing sequencing data through coding. In this class, taught by Partek scientist, participants will learn about conducting QA/QC, performing cell type classification, obtaining differential analysis results, performing pathway analysis, and creating ...Read More

Partek Flow enables scientists to construct analysis workflows for multi-omics sequencing data including DNA, bulk and single cell RNA, spatial transcriptomics, ATAC and ChIP. It is a point-and-click software suitable for those who wish to avoid the steep learning curve involved with analyzing sequencing data through coding. In this class, taught by Partek scientist, participants will learn about conducting QA/QC, performing cell type classification, obtaining differential analysis results, performing pathway analysis, and creating visualizations for single cell RNA sequencing data. This session is a demonstration and not hands-on. Experience using or access to Partek Flow is not needed for participation. Attendance is restricted to NIH staff.

Partek Flow enables scientists to construct analysis workflows for multi-omics sequencing data including DNA, bulk and single cell RNA, spatial transcriptomics, ATAC and ChIP. It is a point-and-click software suitable for those who wish to avoid the steep learning curve involved with analyzing sequencing data through coding. In this class, taught by Partek scientist, participants will learn about conducting QA/QC, performing cell type classification, obtaining differential analysis results, performing pathway analysis, and creating visualizations for single cell RNA sequencing data. This session is a demonstration and not hands-on. Experience using or access to Partek Flow is not needed for participation. Attendance is restricted to NIH staff. 2025-12-10 14:00:00 Online Webinar Beginner Online Joe Wu (BTEP),Xiaowen Wang (Partek) BTEP 0 Introduction to Single Cell RNA Sequencing Analysis using Partek Flow
1978
Organized By:
CBIIT
Description

If you’re interested in learning how machine learning can help you tap into the rich data within electronic health records, join us to learn more about the “Memorial Sloan Kettering (MSK) Clinicogenomic Harmonized Oncologic Real-World Data set” otherwise known as MSK-CHORD. It includes data for over 25,000 cancer patients to help you identify clinicogenomic relationships that aren’t as obvious in smaller, siloed data sets.

Dr. Nikolaus Schultz, ...Read More

If you’re interested in learning how machine learning can help you tap into the rich data within electronic health records, join us to learn more about the “Memorial Sloan Kettering (MSK) Clinicogenomic Harmonized Oncologic Real-World Data set” otherwise known as MSK-CHORD. It includes data for over 25,000 cancer patients to help you identify clinicogenomic relationships that aren’t as obvious in smaller, siloed data sets.

Dr. Nikolaus Schultz, an NCI grantee and computational oncologist at MSK Cancer Center, will give an overview of the data set and demonstrate:

  • the feasibility of MSK-CHORD’s automated annotation from unstructured notes.
  • how MSK-CHORD can train machine learning models to predict patient outcomes.

Some of the resulting data from studies leveraging MSK-CHORD are available via cBioPortal.

 

If you’re interested in learning how machine learning can help you tap into the rich data within electronic health records, join us to learn more about the “Memorial Sloan Kettering (MSK) Clinicogenomic Harmonized Oncologic Real-World Data set” otherwise known as MSK-CHORD. It includes data for over 25,000 cancer patients to help you identify clinicogenomic relationships that aren’t as obvious in smaller, siloed data sets. Dr. Nikolaus Schultz, an NCI grantee and computational oncologist at MSK Cancer Center, will give an overview of the data set and demonstrate: the feasibility of MSK-CHORD’s automated annotation from unstructured notes. how MSK-CHORD can train machine learning models to predict patient outcomes. Some of the resulting data from studies leveraging MSK-CHORD are available via cBioPortal.   2025-12-11 10:00:00 Online Any Artificial Intelligence (Al),Data Online Nikolaus Schultz PhD (Computational Biology Center Memorial Sloan-Kettering Cancer Center) CBIIT 0 Automated Real-World Data Integration Improves Cancer Outcome Prediction
1977
Organized By:
NIH Pain SIG
Description

This talk will focus on analyses of the Patient Outcomes Repository for Treatment (PORT) which is a large registry of chronic pain treatment outcomes from patients seen in the pain clinics at the University Pittsburgh Medical Center (UPMC). Using methods such as propensity scoring, stratified modeling, and supervised machine learning, we can determine which treatments for chronic pain are or are not effective, the phenotypes most responsive to each treatment, and predict which treatments ...Read More

This talk will focus on analyses of the Patient Outcomes Repository for Treatment (PORT) which is a large registry of chronic pain treatment outcomes from patients seen in the pain clinics at the University Pittsburgh Medical Center (UPMC). Using methods such as propensity scoring, stratified modeling, and supervised machine learning, we can determine which treatments for chronic pain are or are not effective, the phenotypes most responsive to each treatment, and predict which treatments will be most effective in any new patient based on their phenotype (such as medications, injections, physical therapy, or mental health care).

This talk will focus on analyses of the Patient Outcomes Repository for Treatment (PORT) which is a large registry of chronic pain treatment outcomes from patients seen in the pain clinics at the University Pittsburgh Medical Center (UPMC). Using methods such as propensity scoring, stratified modeling, and supervised machine learning, we can determine which treatments for chronic pain are or are not effective, the phenotypes most responsive to each treatment, and predict which treatments will be most effective in any new patient based on their phenotype (such as medications, injections, physical therapy, or mental health care). 2025-12-11 11:00:00 Bldg 40 1201/1203 Any Artificial Intelligence (Al) In-Person Ajay Wasan (University of Pittsburgh) NIH Pain SIG 0 Analysis of Real World Evidence using Machine Learning to Improve Chronic Pain Treatment Outcomes
1973
Organized By:
NIH Library
Description

This one and a half-hour online training covers the basic principles of FAIR (Findable, Accessible, Interoperable, Reusable) data and why it is important to make your data FAIR.  

 By the end of this training, attendees will be able to:  

  • Define FAIR data   

  • ...Read More

This one and a half-hour online training covers the basic principles of FAIR (Findable, Accessible, Interoperable, Reusable) data and why it is important to make your data FAIR.  

 By the end of this training, attendees will be able to:  

  • Define FAIR data   

  • Explain what purpose FAIR data serves 

  • Apply FAIR data principles to make data findable, accessible, interoperable, and reusable 

This is an introductory level training. 

This one and a half-hour online training covers the basic principles of FAIR (Findable, Accessible, Interoperable, Reusable) data and why it is important to make your data FAIR.    By the end of this training, attendees will be able to:   Define FAIR data    Explain what purpose FAIR data serves  Apply FAIR data principles to make data findable, accessible, interoperable, and reusable  This is an introductory level training.  2025-12-16 11:30:00 Online Beginner Data Online Raisa Ionin (NIH Library) NIH Library 0 How to Make Your Data Fair
1979
Join Meeting
Organized By:
Spatial Biology Interest Group
Description

Abstract: Spatial transcriptomics (ST) is evolving rapidly as a pivotal technology in studying the biology of tumors and their associated tumor microenvironments (TME). However, where/how it can impact research remain unclear. In this talk, Dr. Chen will describe their recent study that performs rigorous benchmarking amongst the single cell ST platforms CosMx, MERFISH, and Xenium (uni/multi-modal) platforms (PMID: 41006245) and several computational approaches that performs multimodal ST data fusion, identifies locations and directions ...Read More

Abstract: Spatial transcriptomics (ST) is evolving rapidly as a pivotal technology in studying the biology of tumors and their associated tumor microenvironments (TME). However, where/how it can impact research remain unclear. In this talk, Dr. Chen will describe their recent study that performs rigorous benchmarking amongst the single cell ST platforms CosMx, MERFISH, and Xenium (uni/multi-modal) platforms (PMID: 41006245) and several computational approaches that performs multimodal ST data fusion, identifies locations and directions of spatial transcriptomic gradients (STG)  (PMID: 38562886) and infers metabolic flux (PMID: 37573313) from ST data. Finally, he will describe their experience in studying factors deriving immune checkpoint therapy resistance in HPV+ H&N cancer patients.

Bio: Dr. Chen obtained B. Eng. from Tsinghua University (Beijing) and Ph.D. in Electrical and Computer Engineering from University of Illinois at Urbana-Champaign. He is currently a full professor in the department of Bioinformatics and Computational Biology at the University of Texas MD Anderson cancer center. His primary interest is to develop computational methods to analyze and interpret high-throughput human genetic, functional and clinical data towards understanding the evolution of cancer as a consequence of genetics and environment and identifying molecular targets useful for cancer diagnosis and therapeutics. Among the computational tools he developed, BreakDancer, VarScan and Monovar have been widely used for characterizing genomes and transcriptomes of tumor tissues and single cells.

Abstract: Spatial transcriptomics (ST) is evolving rapidly as a pivotal technology in studying the biology of tumors and their associated tumor microenvironments (TME). However, where/how it can impact research remain unclear. In this talk, Dr. Chen will describe their recent study that performs rigorous benchmarking amongst the single cell ST platforms CosMx, MERFISH, and Xenium (uni/multi-modal) platforms (PMID: 41006245) and several computational approaches that performs multimodal ST data fusion, identifies locations and directions of spatial transcriptomic gradients (STG)  (PMID: 38562886) and infers metabolic flux (PMID: 37573313) from ST data. Finally, he will describe their experience in studying factors deriving immune checkpoint therapy resistance in HPV+ H&N cancer patients.Bio: Dr. Chen obtained B. Eng. from Tsinghua University (Beijing) and Ph.D. in Electrical and Computer Engineering from University of Illinois at Urbana-Champaign. He is currently a full professor in the department of Bioinformatics and Computational Biology at the University of Texas MD Anderson cancer center. His primary interest is to develop computational methods to analyze and interpret high-throughput human genetic, functional and clinical data towards understanding the evolution of cancer as a consequence of genetics and environment and identifying molecular targets useful for cancer diagnosis and therapeutics. Among the computational tools he developed, BreakDancer, VarScan and Monovar have been widely used for characterizing genomes and transcriptomes of tumor tissues and single cells. 2025-12-16 12:00:00 Online Any Next Gen Sequencing (NGS) Methods Online Ken Chen (University of Texas MD Anderson Cancer Center Spatial Biology Interest Group 0 Comparison of Spatial Transcriptomics (ST) Technologies and ST Gradient Analysis
1967
Join Meeting
Organized By:
BTEP
Description

In this lesson, attendees will learn the basics of ggplot2 to create simple, pretty, and effective figures with R. 

In this lesson, attendees will learn the basics of ggplot2 to create simple, pretty, and effective figures with R. 

In this lesson, attendees will learn the basics of ggplot2 to create simple, pretty, and effective figures with R.  2026-01-06 14:00:00 Online Beginner Programming Online Alex Emmons (BTEP) BTEP 0 Introduction to ggplot2 for R Data Visualization
1968
Join Meeting
Organized By:
BTEP
Description

In this lesson, attendees will continue learning how to create publishable figures with ggplot2. Topics will include statistical transformations, coordinate systems, and themes. 

In this lesson, attendees will continue learning how to create publishable figures with ggplot2. Topics will include statistical transformations, coordinate systems, and themes. 

In this lesson, attendees will continue learning how to create publishable figures with ggplot2. Topics will include statistical transformations, coordinate systems, and themes.  2026-01-08 14:00:00 Online Beginner Programming Online Alex Emmons (BTEP) BTEP 0 Plot Customization with ggplot2
1969
Join Meeting
Organized By:
BTEP
Description

In this lesson, attendees and instructor will work together to craft a publishable volcano plot using the skills previously learned. 

In this lesson, attendees and instructor will work together to craft a publishable volcano plot using the skills previously learned. 

In this lesson, attendees and instructor will work together to craft a publishable volcano plot using the skills previously learned.  2026-01-13 14:00:00 Beginner Programming Online Alex Emmons (BTEP) BTEP 0 From Data to Display - Crafting a Publishable Plot with ggplot2
1939
Coding Club Seminar Series

Join Meeting
Organized By:
BTEP
Description
Scikit-learn is a free and open-source Python library for machine learning. It is built on top of other fundamental Python libraries like NumPy, SciPy, and Matplotlib. Users will be introduced to scikit-learn and its usage, followed by the basic Machine Line pipeline and a simple Classification example using scikit-learn on a publicly available Diabetes dataset.
Scikit-learn is a free and open-source Python library for machine learning. It is built on top of other fundamental Python libraries like NumPy, SciPy, and Matplotlib. Users will be introduced to scikit-learn and its usage, followed by the basic Machine Line pipeline and a simple Classification example using scikit-learn on a publicly available Diabetes dataset.
Scikit-learn is a free and open-source Python library for machine learning. It is built on top of other fundamental Python libraries like NumPy, SciPy, and Matplotlib. Users will be introduced to scikit-learn and its usage, followed by the basic Machine Line pipeline and a simple Classification example using scikit-learn on a publicly available Diabetes dataset. 2026-01-14 14:00:00 Online Intermediate Programming,Statistics Online Titli Sarkar (ABCS-CCPM) BTEP 1 Introduction to scikit-Learn: Machine Learning with Python
1980
Join Meeting
Organized By:
CCR HiTIF Core
Description

Gil Kanfer, PhD, of the NCI CCR High-Throughput Imaging Facility (HiTIF), in the Laboratory of Receptor Biology and Gene Expression (LRBGE), will present the spatial biology analysis stack HiTIF is building to support Center for Cancer Research (CCR) researchers, with a focus on high-resolution spatial transcriptomics and multiplex protein imaging platforms existing in CCR Cores (e.g., Visium HD, Xenium-5k, CODEX) and how they can be turned into robust, reusable analysis ...Read More

Gil Kanfer, PhD, of the NCI CCR High-Throughput Imaging Facility (HiTIF), in the Laboratory of Receptor Biology and Gene Expression (LRBGE), will present the spatial biology analysis stack HiTIF is building to support Center for Cancer Research (CCR) researchers, with a focus on high-resolution spatial transcriptomics and multiplex protein imaging platforms existing in CCR Cores (e.g., Visium HD, Xenium-5k, CODEX) and how they can be turned into robust, reusable analysis workflows.

Using recent liver cancer and melanoma projects run by CCR investigators with the NCI CCR Single Cell Analysis Facility (SCAF) and Spatial Imaging Technology Resource (SpITR) core facilities in collaboration with HiTIF as examples, he will show how in-house algorithms—such as a zonation prediction model for mapping periportal vs pericentral regions and custom methods for collagen-based proximity and niche analysis—are combined with open-source tools to align images, integrate RNA and protein data, quantify cell-type composition and spatial organization, and systematically screen ligand–receptor interactions across conditions and time points. The talk will emphasize generalizable, technology-driven pipelines that take core-generated images all the way to quantitative, biologically interpretable spatial-omics readouts for CCR labs.

Attendance at this event is limited to NCI CCR personnel.

Gil Kanfer, PhD, of the NCI CCR High-Throughput Imaging Facility (HiTIF), in the Laboratory of Receptor Biology and Gene Expression (LRBGE), will present the spatial biology analysis stack HiTIF is building to support Center for Cancer Research (CCR) researchers, with a focus on high-resolution spatial transcriptomics and multiplex protein imaging platforms existing in CCR Cores (e.g., Visium HD, Xenium-5k, CODEX) and how they can be turned into robust, reusable analysis workflows. Using recent liver cancer and melanoma projects run by CCR investigators with the NCI CCR Single Cell Analysis Facility (SCAF) and Spatial Imaging Technology Resource (SpITR) core facilities in collaboration with HiTIF as examples, he will show how in-house algorithms—such as a zonation prediction model for mapping periportal vs pericentral regions and custom methods for collagen-based proximity and niche analysis—are combined with open-source tools to align images, integrate RNA and protein data, quantify cell-type composition and spatial organization, and systematically screen ligand–receptor interactions across conditions and time points. The talk will emphasize generalizable, technology-driven pipelines that take core-generated images all the way to quantitative, biologically interpretable spatial-omics readouts for CCR labs. Attendance at this event is limited to NCI CCR personnel. 2026-01-15 13:00:00 Online Any Software Online Gil Kanfer (HiTIF/LRBGE/CCR/NCI) CCR HiTIF Core 0 Custom Spatial Biology Analysis Pipelines for NCI CCR Researchers from the High-Throughput Imaging Facility (HiTIF)
1970
Join Meeting
Organized By:
BTEP
Description

This lesson introduces general recommendations and tips to consider when creating effective and reproducible visualizations. Additional topics to be discussed include multi-figure panels, complementary or related R packages, and the use of ggplot2 in functions. 

This lesson introduces general recommendations and tips to consider when creating effective and reproducible visualizations. Additional topics to be discussed include multi-figure panels, complementary or related R packages, and the use of ggplot2 in functions. 

This lesson introduces general recommendations and tips to consider when creating effective and reproducible visualizations. Additional topics to be discussed include multi-figure panels, complementary or related R packages, and the use of ggplot2 in functions.  2026-01-15 14:00:00 Online Beginner Programming Online Alex Emmons (BTEP) BTEP 0 Recommendations and Tips for Creating Effective Plots with ggplot2