| class_id | details | description | start_date | Venues | learning_levels | Topic | Tags | delivery_method | presenters | Organizer | seminar_series | class_title |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1967 |
DescriptionIn this lesson, attendees will learn the basics of ggplot2 to create simple, pretty, and effective figures with R. In this lesson, attendees will learn the basics of ggplot2 to create simple, pretty, and effective figures with R. |
In this lesson, attendees will learn the basics of ggplot2 to create simple, pretty, and effective figures with R. | 2026-01-06 14:00:00 | Online | Beginner | Programming | Online | Alex Emmons (BTEP) | BTEP | 0 | Introduction to ggplot2 for R Data Visualization | |
| 1986 |
Organized By:CBIITDescriptionAttend this January 8 webinar to tour the immersive workbench of Read More Attend this January 8 webinar to tour the immersive workbench of FlowJoTM: a tool for advanced flow cytometry data analysis.
Dr. Veronica Obregon-Perko, senior scientific advisor at BD Biosciences, will highlight the software’s key features, including:
Whether you’re a new or experienced user of FlowJo, this training session has something for everyone. |
Attend this January 8 webinar to tour the immersive workbench of FlowJoTM: a tool for advanced flow cytometry data analysis. Dr. Veronica Obregon-Perko, senior scientific advisor at BD Biosciences, will highlight the software’s key features, including: its metadata manager. the graph gallery and virtual concatenation. charts and integrated algorithms. Whether you’re a new or experienced user of FlowJo, this training session has something for everyone. | 2026-01-08 11:00:00 | Online | Any | Software | Online | Veronica Obregon-Perko (Informatics BD) | CBIIT | 0 | Unveiling FlowJo Version 11: The Future of Cytometric Data Analysis | |
| 1968 |
DescriptionIn this lesson, attendees will continue learning how to create publishable figures with ggplot2. Topics will include statistical transformations, coordinate systems, and themes. In this lesson, attendees will continue learning how to create publishable figures with ggplot2. Topics will include statistical transformations, coordinate systems, and themes. |
In this lesson, attendees will continue learning how to create publishable figures with ggplot2. Topics will include statistical transformations, coordinate systems, and themes. | 2026-01-08 14:00:00 | Online | Beginner | Programming | Online | Alex Emmons (BTEP) | BTEP | 0 | Plot Customization with ggplot2 | |
| 1969 |
DescriptionIn this lesson, attendees and instructor will work together to craft a publishable volcano plot using the skills previously learned. In this lesson, attendees and instructor will work together to craft a publishable volcano plot using the skills previously learned. |
In this lesson, attendees and instructor will work together to craft a publishable volcano plot using the skills previously learned. | 2026-01-13 14:00:00 | Beginner | Programming | Online | Alex Emmons (BTEP) | BTEP | 0 | From Data to Display - Crafting a Publishable Plot with ggplot2 | ||
| 1987 |
Organized By:NIH LibraryDescriptionThis one-hour online training, provided by SAS, will review multiple ways to combine SAS data sets. By the end of this training, attendees will be able to: Utilize Concatenation on SAS data sets (SET Statement, PROC SQL, PROC APPEND) Use Interleaving on SAS data sets (SET Statement with BY Statement) Merge SAS data sets (MERGE Statement, PROC SQL, etc.) Update SAS data sets (UPDATE, ...Read More This one-hour online training, provided by SAS, will review multiple ways to combine SAS data sets. By the end of this training, attendees will be able to: Utilize Concatenation on SAS data sets (SET Statement, PROC SQL, PROC APPEND) Use Interleaving on SAS data sets (SET Statement with BY Statement) Merge SAS data sets (MERGE Statement, PROC SQL, etc.) Update SAS data sets (UPDATE, MODIFY Statements, etc.) |
This one-hour online training, provided by SAS, will review multiple ways to combine SAS data sets. By the end of this training, attendees will be able to: Utilize Concatenation on SAS data sets (SET Statement, PROC SQL, PROC APPEND) Use Interleaving on SAS data sets (SET Statement with BY Statement) Merge SAS data sets (MERGE Statement, PROC SQL, etc.) Update SAS data sets (UPDATE, MODIFY Statements, etc.) | 2026-01-14 11:00:00 | Online | Intermediate | Software | Online | Instructor (SAS) | NIH Library | 0 | Ways to Combine SAS Data Sets | |
| 1939 |
Coding Club Seminar SeriesDescription
Scikit-learn is a free and open-source Python library for machine learning. It is built on top of other fundamental Python libraries like NumPy, SciPy, and Matplotlib. Users will be introduced to scikit-learn and its usage, followed by the basic Machine Line pipeline and a simple Classification example using scikit-learn on a publicly available Diabetes dataset.
Scikit-learn is a free and open-source Python library for machine learning. It is built on top of other fundamental Python libraries like NumPy, SciPy, and Matplotlib. Users will be introduced to scikit-learn and its usage, followed by the basic Machine Line pipeline and a simple Classification example using scikit-learn on a publicly available Diabetes dataset.
|
Scikit-learn is a free and open-source Python library for machine learning. It is built on top of other fundamental Python libraries like NumPy, SciPy, and Matplotlib. Users will be introduced to scikit-learn and its usage, followed by the basic Machine Line pipeline and a simple Classification example using scikit-learn on a publicly available Diabetes dataset. | 2026-01-14 14:00:00 | Online | Intermediate | Programming,Statistics | Online | Titli Sarkar (ABCS-CCPM) | BTEP | 1 | Introduction to scikit-Learn: Machine Learning with Python | |
| 1980 |
DescriptionGil Kanfer, PhD, of the NCI CCR High-Throughput Imaging Facility (HiTIF), in the Laboratory of Receptor Biology and Gene Expression (LRBGE), will present the spatial biology analysis stack HiTIF is building to support Center for Cancer Research (CCR) researchers, with a focus on high-resolution spatial transcriptomics and multiplex protein imaging platforms existing in CCR Cores (e.g., Visium HD, Xenium-5k, CODEX) and how they can be turned into robust, reusable analysis ...Read More Gil Kanfer, PhD, of the NCI CCR High-Throughput Imaging Facility (HiTIF), in the Laboratory of Receptor Biology and Gene Expression (LRBGE), will present the spatial biology analysis stack HiTIF is building to support Center for Cancer Research (CCR) researchers, with a focus on high-resolution spatial transcriptomics and multiplex protein imaging platforms existing in CCR Cores (e.g., Visium HD, Xenium-5k, CODEX) and how they can be turned into robust, reusable analysis workflows. Using recent liver cancer and melanoma projects run by CCR investigators with the NCI CCR Single Cell Analysis Facility (SCAF) and Spatial Imaging Technology Resource (SpITR) core facilities in collaboration with HiTIF as examples, he will show how in-house algorithms—such as a zonation prediction model for mapping periportal vs pericentral regions and custom methods for collagen-based proximity and niche analysis—are combined with open-source tools to align images, integrate RNA and protein data, quantify cell-type composition and spatial organization, and systematically screen ligand–receptor interactions across conditions and time points. The talk will emphasize generalizable, technology-driven pipelines that take core-generated images all the way to quantitative, biologically interpretable spatial-omics readouts for CCR labs. Attendance at this event is limited to NCI CCR personnel. |
Gil Kanfer, PhD, of the NCI CCR High-Throughput Imaging Facility (HiTIF), in the Laboratory of Receptor Biology and Gene Expression (LRBGE), will present the spatial biology analysis stack HiTIF is building to support Center for Cancer Research (CCR) researchers, with a focus on high-resolution spatial transcriptomics and multiplex protein imaging platforms existing in CCR Cores (e.g., Visium HD, Xenium-5k, CODEX) and how they can be turned into robust, reusable analysis workflows. Using recent liver cancer and melanoma projects run by CCR investigators with the NCI CCR Single Cell Analysis Facility (SCAF) and Spatial Imaging Technology Resource (SpITR) core facilities in collaboration with HiTIF as examples, he will show how in-house algorithms—such as a zonation prediction model for mapping periportal vs pericentral regions and custom methods for collagen-based proximity and niche analysis—are combined with open-source tools to align images, integrate RNA and protein data, quantify cell-type composition and spatial organization, and systematically screen ligand–receptor interactions across conditions and time points. The talk will emphasize generalizable, technology-driven pipelines that take core-generated images all the way to quantitative, biologically interpretable spatial-omics readouts for CCR labs. Attendance at this event is limited to NCI CCR personnel. | 2026-01-15 13:00:00 | Online | Any | Software | Online | Gil Kanfer (HiTIF/LRBGE/CCR/NCI) | CCR HiTIF Core | 0 | Custom Spatial Biology Analysis Pipelines for NCI CCR Researchers from the High-Throughput Imaging Facility (HiTIF) | |
| 1970 |
DescriptionThis lesson introduces general recommendations and tips to consider when creating effective and reproducible visualizations. Additional topics to be discussed include multi-figure panels, complementary or related R packages, and the use of ggplot2 in functions. This lesson introduces general recommendations and tips to consider when creating effective and reproducible visualizations. Additional topics to be discussed include multi-figure panels, complementary or related R packages, and the use of ggplot2 in functions. |
This lesson introduces general recommendations and tips to consider when creating effective and reproducible visualizations. Additional topics to be discussed include multi-figure panels, complementary or related R packages, and the use of ggplot2 in functions. | 2026-01-15 14:00:00 | Online | Beginner | Programming | Online | Alex Emmons (BTEP) | BTEP | 0 | Recommendations and Tips for Creating Effective Plots with ggplot2 | |
| 1988 |
Organized By:NIH LibraryDescriptionThis one-hour online training introduces applying data science and artificial intelligence (AI) techniques to signals and time-series datasets using MATLAB. The training will cover the entire AI pipeline, from signal exploration to deployment. Participants will explore the fundamentals of processing, analyzing, and visualizing signal data, as well as implementing machine learning and AI algorithms tailored for time-series datasets. This training is designed for researchers, engineers, and data scientists who ...Read More This one-hour online training introduces applying data science and artificial intelligence (AI) techniques to signals and time-series datasets using MATLAB. The training will cover the entire AI pipeline, from signal exploration to deployment. Participants will explore the fundamentals of processing, analyzing, and visualizing signal data, as well as implementing machine learning and AI algorithms tailored for time-series datasets. This training is designed for researchers, engineers, and data scientists who work with signals or temporal data and seek to enhance their analytical capabilities through MATLAB's data science and AI functionalities. By the end of this training, attendees will be able to:
Attendees are expected to be familiar with the basic functions of the MATLAB to be successful in this training. |
This one-hour online training introduces applying data science and artificial intelligence (AI) techniques to signals and time-series datasets using MATLAB. The training will cover the entire AI pipeline, from signal exploration to deployment. Participants will explore the fundamentals of processing, analyzing, and visualizing signal data, as well as implementing machine learning and AI algorithms tailored for time-series datasets. This training is designed for researchers, engineers, and data scientists who work with signals or temporal data and seek to enhance their analytical capabilities through MATLAB's data science and AI functionalities. By the end of this training, attendees will be able to: Understand the unique challenges and opportunities in analyzing signals and time-series data. Import, preprocess, and visualize signal and time-series datasets in MATLAB. Apply machine learning techniques, including supervised and unsupervised algorithms, to create predictive models for time-series data. Explore deep learning approaches, such as recurrent neural networks (RNNs) and long short-term memory (LSTM) networks, for advanced time-series analysis. Deploy trained AI models and automate workflows to integrate insights into research or operational pipelines. Utilize MATLAB’s documentation, online resources, and toolboxes to extend their data science and AI capabilities. Attendees are expected to be familiar with the basic functions of the MATLAB to be successful in this training. | 2026-01-16 12:00:00 | Online | Intermediate | Software | Online | Instructor (MATLAB) | NIH Library | 0 | Data Science and Artificial Intelligence: Signals and Time Series Datasets Using MATLAB | |
| 1985 |
Organized By:NCI Childhood Cancer Data Initiative SeriesDescriptionPart of the Childhood Cancer Data Initiative (CCDI) Webinar Series During this webinar, experts from the USC Norris Comprehensive Cancer Center (NCCC) and Children's Hospital Los Angeles (CHLA) will present:
Part of the Childhood Cancer Data Initiative (CCDI) Webinar Series During this webinar, experts from the USC Norris Comprehensive Cancer Center (NCCC) and Children's Hospital Los Angeles (CHLA) will present:
|
Part of the Childhood Cancer Data Initiative (CCDI) Webinar Series During this webinar, experts from the USC Norris Comprehensive Cancer Center (NCCC) and Children's Hospital Los Angeles (CHLA) will present: An overview of the CHLA population, the NCCC, and work done under the prior CCDI P30 supplement award (dbGaP accession: phs002518) Methylation sequencing of pediatric central nervous system tumors (technical approaches and results) Whole-slide imaging in the analysis of pediatric solid tumors AI-driven multimodal analysis and integrated reporting | 2026-01-16 13:00:00 | Online | Any | Cancer,Data | Online | Alexander Markowitz PhD,David Buckley PhD,James Amatruda MD PhD,Jennifer Cotter MD,Timothy Triche MD PhD | NCI Childhood Cancer Data Initiative Series | 0 | AI-Driven Multimodal Data Integration and Analysis to Improve Pediatric Cancer Diagnosis | |
| 1989 |
Organized By:NIH LibraryDescriptionThis one-hour online training offers an overview of the NIH-sponsored Generalist Repository Ecosystem Initiative (GREI) (Dataverse, Dryad, Figshare, Mendeley Data, Open Science Framework, Vivli, and Zenodo), and the role of participating in these repositories in the NIH data repository landscape for intramural researchers. The session will highlight how these repositories support compliance with the NIH Data Management and Sharing Policy. By the end of this training, attendees will be able to: Read More This one-hour online training offers an overview of the NIH-sponsored Generalist Repository Ecosystem Initiative (GREI) (Dataverse, Dryad, Figshare, Mendeley Data, Open Science Framework, Vivli, and Zenodo), and the role of participating in these repositories in the NIH data repository landscape for intramural researchers. The session will highlight how these repositories support compliance with the NIH Data Management and Sharing Policy. By the end of this training, attendees will be able to:
Attendees are not expected to have any prior knowledge of the NIH Data Repository Landscape. |
This one-hour online training offers an overview of the NIH-sponsored Generalist Repository Ecosystem Initiative (GREI) (Dataverse, Dryad, Figshare, Mendeley Data, Open Science Framework, Vivli, and Zenodo), and the role of participating in these repositories in the NIH data repository landscape for intramural researchers. The session will highlight how these repositories support compliance with the NIH Data Management and Sharing Policy. By the end of this training, attendees will be able to: Describe how generalist repositories fit into the NIH data repository landscape for intramural researchers. Understand how these repositories support compliance with the NIH Data Management and Sharing Policy Learn about the resources developed by GREI repositories to support data sharing workflows, including a generalist repository comparison chart, a generalist repository selection flowchart, a data submission checklist, and a data management and sharing plan guide. Gain practical insights from real-world examples, demonstrating how researchers use generalist repositories for data sharing and reuse, and how these efforts contribute to the broader NIH data sharing ecosystem. Attendees are not expected to have any prior knowledge of the NIH Data Repository Landscape. | 2026-01-21 13:00:00 | Online | Beginner | Databases | Online | Ishwar Chandramouliswaran (NIH/OD) | NIH Library | 0 | Data Sharing and Discovery in Generalist Repositories: Resources and Real-World Examples | |
| 1982 |
Organized By:CBIITDescriptionAttend this webinar to learn more about XNAT Scout—a new extension of the XNAT imaging informatics platform that’s designed to close the gap between artificial intelligence (AI) model development and clinical deployment. Washington University’s Dr. Daniel Marcus will introduce XNAT Scout’s architecture, key capabilities, and early deployment experiences. XNAT Scout provides structured tools for assembling training cohorts, managing annotations, benchmarking models, and monitoring performance ...Read More Attend this webinar to learn more about XNAT Scout—a new extension of the XNAT imaging informatics platform that’s designed to close the gap between artificial intelligence (AI) model development and clinical deployment. Washington University’s Dr. Daniel Marcus will introduce XNAT Scout’s architecture, key capabilities, and early deployment experiences. XNAT Scout provides structured tools for assembling training cohorts, managing annotations, benchmarking models, and monitoring performance over time. Integrated with XNAT’s mature imaging workflows and governance frameworks, it enables reproducible validation, multi-site collaboration, and deployment pathways aligned with clinical interoperability and security requirements. By unifying data curation, evaluation, and operationalization in one platform, XNAT Scout accelerates translation and supports health systems in safely adopting AI at scale. XNAT is a a globally used, open-source imaging informatics platform funded by the NCI Informatics Technology for Cancer Research (ITCR) program. |
Attend this webinar to learn more about XNAT Scout—a new extension of the XNAT imaging informatics platform that’s designed to close the gap between artificial intelligence (AI) model development and clinical deployment. Washington University’s Dr. Daniel Marcus will introduce XNAT Scout’s architecture, key capabilities, and early deployment experiences. XNAT Scout provides structured tools for assembling training cohorts, managing annotations, benchmarking models, and monitoring performance over time. Integrated with XNAT’s mature imaging workflows and governance frameworks, it enables reproducible validation, multi-site collaboration, and deployment pathways aligned with clinical interoperability and security requirements. By unifying data curation, evaluation, and operationalization in one platform, XNAT Scout accelerates translation and supports health systems in safely adopting AI at scale. XNAT is a a globally used, open-source imaging informatics platform funded by the NCI Informatics Technology for Cancer Research (ITCR) program. | 2026-01-22 11:00:00 | Online | Any | Artificial Intelligence (Al) | Online | Daniel Marcus (Washington University School of Medicine in St. Louis) | CBIIT | 0 | XNAT Scout: Enabling Translational AI | |
| 1984 |
Organized By:NCIDescriptionJoin the NCI Cohort Consortium for a webinar on innovative approaches to improving data interoperability across cohort studies. The session will highlight efforts to apply the Observational Medical Outcomes Partnership Common Data Model to survey data and introduce tools, including Code Map capability, that enable mapping across data models based on Common Data Element semantic concepts. These approaches are designed to enhance data integration and support collaborative cancer cohort research. Join the NCI Cohort Consortium for a webinar on innovative approaches to improving data interoperability across cohort studies. The session will highlight efforts to apply the Observational Medical Outcomes Partnership Common Data Model to survey data and introduce tools, including Code Map capability, that enable mapping across data models based on Common Data Element semantic concepts. These approaches are designed to enhance data integration and support collaborative cancer cohort research. |
Join the NCI Cohort Consortium for a webinar on innovative approaches to improving data interoperability across cohort studies. The session will highlight efforts to apply the Observational Medical Outcomes Partnership Common Data Model to survey data and introduce tools, including Code Map capability, that enable mapping across data models based on Common Data Element semantic concepts. These approaches are designed to enhance data integration and support collaborative cancer cohort research. | 2026-01-27 12:00:00 | Online | Any | Programming | Online | Denise Warzel (CBIIT),Nicole Gerlanc (NCI/DCEG) | NCI | 0 | Improving Data Interoperability Across Cohort Studies | |
| 1990 |
Organized By:NIH LibraryDescriptionThis hour-and-a-half online training will examine how humans process and encode visual information and how visual attributes can be utilized to create effective visualizations. This will focus on enhancing graphic literacy, exploring methods for making better visualizations, and using stakeholder needs to guide your design choices. By the end of this training, attendees will be able to:
This hour-and-a-half online training will examine how humans process and encode visual information and how visual attributes can be utilized to create effective visualizations. This will focus on enhancing graphic literacy, exploring methods for making better visualizations, and using stakeholder needs to guide your design choices. By the end of this training, attendees will be able to:
|
This hour-and-a-half online training will examine how humans process and encode visual information and how visual attributes can be utilized to create effective visualizations. This will focus on enhancing graphic literacy, exploring methods for making better visualizations, and using stakeholder needs to guide your design choices. By the end of this training, attendees will be able to: Analyze how different visual encodings affect the accuracy of data interpretation. Use Gestalt principles and preattentive attributes to design visualizations that improve clarity, grouping, and rapid perception. Evaluate the appropriateness of color scales. Identify and correct common visualization pitfalls. | 2026-02-09 13:00:00 | Online | Beginner | Data | Online | NIH Library Staff | NIH Library | 0 | Principles of Effective Data Visualization | |
| 1991 |
Organized By:NIH LibraryDescriptionThis one-hour and thirty minute online training is part one of an introductory two-part series for those who want to learn about research data management and sharing, or for those who are interested in a refresher. The series provides detailed information on managing and sharing data from the first data planning stage, through the data life cycle, to data archiving, and finally to selecting an appropriate repository for data preservation. &...Read More This one-hour and thirty minute online training is part one of an introductory two-part series for those who want to learn about research data management and sharing, or for those who are interested in a refresher. The series provides detailed information on managing and sharing data from the first data planning stage, through the data life cycle, to data archiving, and finally to selecting an appropriate repository for data preservation. By the end of part one of this training series, attendees will be able to:
During Part 2, attendees will learn about sharing and archiving data. You must register separately for Part 2 of this training. This training is introductory, no prior knowledge required. |
This one-hour and thirty minute online training is part one of an introductory two-part series for those who want to learn about research data management and sharing, or for those who are interested in a refresher. The series provides detailed information on managing and sharing data from the first data planning stage, through the data life cycle, to data archiving, and finally to selecting an appropriate repository for data preservation. By the end of part one of this training series, attendees will be able to: Understand data management best practices Become familiar with data management tools Have a solid knowledge of the resources, enabling data sharing During Part 2, attendees will learn about sharing and archiving data. You must register separately for Part 2 of this training. This training is introductory, no prior knowledge required. | 2026-02-10 14:00:00 | Online | Beginner | Data | Online | Raisa Ionin (NIH Library) | NIH Library | 0 | Data Management and Sharing, Part 1 of 2 | |
| 1992 |
Organized By:NIH LibraryDescriptionThis hour and half online training is part two of an introductory two-part series for those who want to learn about research data management and sharing, or for those who are interested in a refresher. The series provides detailed information on managing and sharing data from the first data planning stage, through the data life cycle, to data archiving, and finally to selecting an appropriate repository for data preservation. By the ...Read More This hour and half online training is part two of an introductory two-part series for those who want to learn about research data management and sharing, or for those who are interested in a refresher. The series provides detailed information on managing and sharing data from the first data planning stage, through the data life cycle, to data archiving, and finally to selecting an appropriate repository for data preservation. By the end of part two of this training series, attendees will be able to:
|
This hour and half online training is part two of an introductory two-part series for those who want to learn about research data management and sharing, or for those who are interested in a refresher. The series provides detailed information on managing and sharing data from the first data planning stage, through the data life cycle, to data archiving, and finally to selecting an appropriate repository for data preservation. By the end of part two of this training series, attendees will be able to: Have a solid knowledge of the resources, enabling data sharing Understand how data is archived and preserved Part 1 of this training covers understanding research data, how to manage research data, and how to work with data. During Part 2, attendees learn about sharing and archiving data. This training is introductory, no prior knowledge required. | 2026-02-11 14:00:00 | Online | Beginner | Data | Online | Raisa Ionin (NIH Library) | NIH Library | 0 | Data Management and Sharing, Part 2 of 2 | |
| 1993 |
Organized By:NIH LibraryDescriptionIn partnership with the NIH Clinical Center's Biostatistics and Clinical Epidemiology Service (BCES), the NIH Library is offering several trainings that cover general concepts behind statistics and epidemiology. These trainings will help participants better understand and prepare data, interpret results and findings, design and prepare studies, and understand the results in published literature. This four-hour online training will address fundamental statistical concepts including ...Read More In partnership with the NIH Clinical Center's Biostatistics and Clinical Epidemiology Service (BCES), the NIH Library is offering several trainings that cover general concepts behind statistics and epidemiology. These trainings will help participants better understand and prepare data, interpret results and findings, design and prepare studies, and understand the results in published literature. This four-hour online training will address fundamental statistical concepts including hypothesis testing, p-values and confidence intervals, types of data and their distributional importance, and bias and confounding. Time will be devoted to questions from attendees and references will be provided for in-depth self-study. By the end of this training, attendees will be able to:
|
In partnership with the NIH Clinical Center's Biostatistics and Clinical Epidemiology Service (BCES), the NIH Library is offering several trainings that cover general concepts behind statistics and epidemiology. These trainings will help participants better understand and prepare data, interpret results and findings, design and prepare studies, and understand the results in published literature. This four-hour online training will address fundamental statistical concepts including hypothesis testing, p-values and confidence intervals, types of data and their distributional importance, and bias and confounding. Time will be devoted to questions from attendees and references will be provided for in-depth self-study. By the end of this training, attendees will be able to: Describe key concepts in statistical procedures Understand the steps involved in hypothesis testing Define p-values and be familiar with their appropriate uses Describe confidence intervals and their uses Understand differences in types of data and how to summarize them Describe bias and confounding | 2026-02-12 13:00:00 | Online | Intermediate | Statistics | Online | Ninet Sinaii Ph.D. MPH (Biostatistics and Clinical Epidemiology Branch NIH Clinical Center) | NIH Library | 0 | Overview of Statistical Concepts | |
| 1994 |
|
This one-hour online training will cover the fundamentals, applications, and ethical considerations of Artificial Intelligence (AI). Attendees will explore key topics such as machine learning, deep learning, data handling, and real-world AI applications across various industries. The session will also delve into the ethical implications of AI and provide insights on becoming AI literate. Whether you're a seasoned professional or just starting your AI journey, this session will equip you with essential knowledge to navigate the AI landscape effectively and make informed decisions in our data-driven world. By the end of this training, attendees will be able to: Understand the core concepts of AI Recognize the significance of ethical considerations in AI Begin the journey toward AI literacy Attendees are not expected to have any prior knowledge of AI to be successful in this training. | 2026-02-20 13:00:00 | Online | Beginner | Artificial Intelligence (Al) | Online | NIH Library Staff | NIH Library | 0 | AI Literacy: Navigating the World of Artificial Intelligence | |
| 1995 |
Organized By:NIH LibraryDescriptionThis one and a half hour online training equips participants with powerful data wrangling techniques using R and the tidyverse ecosystem. The tidyverse is a cohesive ecosystem of R packages designed to make data science workflows more intuitive and efficient through consistent syntax and design principles. Designed for both beginners and those looking to refine their skills, this training addresses the challenges posed by messy datasets. By ...Read More This one and a half hour online training equips participants with powerful data wrangling techniques using R and the tidyverse ecosystem. The tidyverse is a cohesive ecosystem of R packages designed to make data science workflows more intuitive and efficient through consistent syntax and design principles. Designed for both beginners and those looking to refine their skills, this training addresses the challenges posed by messy datasets. By the end of this training, attendees will be able to
Requirements Attendees are expected to have a basic understanding of R and RStudio. To proceed, attendees should have done the following: |
This one and a half hour online training equips participants with powerful data wrangling techniques using R and the tidyverse ecosystem. The tidyverse is a cohesive ecosystem of R packages designed to make data science workflows more intuitive and efficient through consistent syntax and design principles. Designed for both beginners and those looking to refine their skills, this training addresses the challenges posed by messy datasets. By the end of this training, attendees will be able to Diagnose and address common data quality issues in clinical datasets. Apply systematic approaches to clean and standardize text, dates, and numerical values. Transform messy data and handle missing values using tidyverse functions, including appropriate imputation strategies. Design reproducible, automated data-cleaning workflows with tidyverse tools for transformation and aggregation. Requirements Attendees are expected to have a basic understanding of R and RStudio. To proceed, attendees should have done the following: Installed R and RStudio. Have a basic understanding of R and RStudio. Reviewed our R basics training on the NIH Data Services: On Demand Content YouTube Playlist, if you are new to R | 2026-02-23 13:00:00 | Online | Intermediate | Programming | Online | Doug Joubert (NIH Library) | NIH Library | 0 | Taming Messy Data: Practical R Wrangling with the Tidyverse | |
| 1996 |
Organized By:NIH LibraryDescriptionThis one hour and half hour online training will equip attendees with essential knowledge and skills for effective interactions with Large Language Model (LLM) AI chatbots. Explore the intricacies of prompt engineering and its pivotal role in optimizing the conversational capabilities of LLMs. Emphasizing best practices and practical applications, this training features live demonstrations and provides valuable skills for the effective use of LLMs. Read More This one hour and half hour online training will equip attendees with essential knowledge and skills for effective interactions with Large Language Model (LLM) AI chatbots. Explore the intricacies of prompt engineering and its pivotal role in optimizing the conversational capabilities of LLMs. Emphasizing best practices and practical applications, this training features live demonstrations and provides valuable skills for the effective use of LLMs. By the end of this training, attendees will be able to:
Attendees are not expected to have any prior knowledge of AI chatbots to be successful in this training. |
This one hour and half hour online training will equip attendees with essential knowledge and skills for effective interactions with Large Language Model (LLM) AI chatbots. Explore the intricacies of prompt engineering and its pivotal role in optimizing the conversational capabilities of LLMs. Emphasizing best practices and practical applications, this training features live demonstrations and provides valuable skills for the effective use of LLMs. By the end of this training, attendees will be able to: Define LLMs, prompt patterns, and prompt engineering Identify potential uses and issues to consider when using LLMs in the biomedical research field Use a selection of prompt patterns to improve generated output from LLMs Identify resources for learning more about prompt engineering in LLMs Attendees are not expected to have any prior knowledge of AI chatbots to be successful in this training. | 2026-02-27 13:00:00 | Online | Beginner | Artificial Intelligence (Al) | Online | Alicia Lillich (NIH Library),Joelle Mornini (NIH Library) | NIH Library | 0 | Best Practices for Prompt Generation in AI Chatbots | |
| 1941 |
Distinguished Speakers Seminar SeriesDescriptionIn this talk, Dr. Carey will describe how Bioconductor approaches new challenges in supporting open method development and reproducible In this talk, Dr. Carey will describe how Bioconductor approaches new challenges in supporting open method development and reproducible |
In this talk, Dr. Carey will describe how Bioconductor approaches new challenges in supporting open method development and reproducibleanalyses in genomic data science. He will discuss aspects of the project that bear on education in cancer epidemiology andcomputational cancer genomics, and on emerging topics in software and data engineering for scalable omics analyses. | 2026-03-19 13:00:00 | Online | Any | Software | Online | Vincent J. Carey (Brigham and Women\'s Hospital Harvard Medical School) | BTEP | 1 | Bioconductor Decade 3: Evolving an Open Ecosystem for Genomic Data Science | |
| 1983 |
Organized By:NCIDescription
Overview
This 3-day, virtual workshop will explore how foundation models—a powerful class of advanced AI models —can transform cancer research and clinical care. We will focus on their potential to improve diagnosis, prognosis, and treatment response, with a strong emphasis on clinical translation and technology development. Key Topics:
Overview
This 3-day, virtual workshop will explore how foundation models—a powerful class of advanced AI models —can transform cancer research and clinical care. We will focus on their potential to improve diagnosis, prognosis, and treatment response, with a strong emphasis on clinical translation and technology development. Key Topics:
Agenda (https://events.cancer.gov/dctd/foundationmodel/agenda) |
Overview This 3-day, virtual workshop will explore how foundation models—a powerful class of advanced AI models —can transform cancer research and clinical care. We will focus on their potential to improve diagnosis, prognosis, and treatment response, with a strong emphasis on clinical translation and technology development. Key Topics: Foundation Model Primer: A high-level introduction to foundation models. Multimodal Data: Combining pathology, radiology, omics, and patient data into unified models. Prediction: Predicting therapeutic response, resistance, and patient outcomes. Validation and Reproducibility: Ensuring model results are consistent and reliable for real-world clinical performance and use. Diagnostic Case Studies: Real-world applications for early detection and automated diagnostics. Federated Learning: Approaches to training robust models across multiple institutions—without sharing sensitive patient data Challenges, Risk, and Regulation: Addressing model interpretability and regulatory considerations for clinical adoption. Agenda (https://events.cancer.gov/dctd/foundationmodel/agenda) | 2026-03-24 10:00:00 | Online | Any | Artificial Intelligence (Al) | Online | Asif Rizwan (NCI) | NCI | 0 | Foundational Models for Cancer: Advancing Diagnosis, Prognosis, and Treatment Response | |
| 1920 |
Distinguished Speakers Seminar SeriesDescriptionThe ability to measure gene expression levels for individual cells (vs. pools of cells) and with spatial resolution is crucial to address many important biological and medical questions, such as the study of stem cell differentiation, the discovery of cellular subtypes in the brain, and cancer diagnosis and treatment. Single-cell transcriptome sequencing (RNA-Seq) allows the high-throughput measurement of gene expression levels for entire genomes at the resolution of single cells. Spatially-resolved ...Read More The ability to measure gene expression levels for individual cells (vs. pools of cells) and with spatial resolution is crucial to address many important biological and medical questions, such as the study of stem cell differentiation, the discovery of cellular subtypes in the brain, and cancer diagnosis and treatment. Single-cell transcriptome sequencing (RNA-Seq) allows the high-throughput measurement of gene expression levels for entire genomes at the resolution of single cells. Spatially-resolved transcriptomics further allows the measurement of gene expression levels along with the location of the RNA molecules within a tissue. Transcriptomics exemplifies the range of issues one encounters in a data science workflow, where the data are complex in a variety of ways, questions are not always clearly formulated, there are multiple analysis steps, and drawing on rigorous statistical principles and methods is essential to derive meaningful and reliable biological results. In this talk, Dr. Dudoit will provide a survey of statistical questions related to the analysis of single-cell transcriptome sequencing data to investigate the differentiation of stem cells in the brain, including, exploratory data analysis, expression quantitation, cluster analysis, and the inference of cellular lineages. She will also address differential expression analysis in spatial transcriptomics. |
The ability to measure gene expression levels for individual cells (vs. pools of cells) and with spatial resolution is crucial to address many important biological and medical questions, such as the study of stem cell differentiation, the discovery of cellular subtypes in the brain, and cancer diagnosis and treatment. Single-cell transcriptome sequencing (RNA-Seq) allows the high-throughput measurement of gene expression levels for entire genomes at the resolution of single cells. Spatially-resolved transcriptomics further allows the measurement of gene expression levels along with the location of the RNA molecules within a tissue. Transcriptomics exemplifies the range of issues one encounters in a data science workflow, where the data are complex in a variety of ways, questions are not always clearly formulated, there are multiple analysis steps, and drawing on rigorous statistical principles and methods is essential to derive meaningful and reliable biological results. In this talk, Dr. Dudoit will provide a survey of statistical questions related to the analysis of single-cell transcriptome sequencing data to investigate the differentiation of stem cells in the brain, including, exploratory data analysis, expression quantitation, cluster analysis, and the inference of cellular lineages. She will also address differential expression analysis in spatial transcriptomics. | 2026-04-16 13:00:00 | Online | Any | Omics | Online | Sandrine Dudoit (UC Berkeley) | BTEP | 1 | Learning from Data in Single-Cell Transcriptomics |