ncibtep@nih.gov

Bioinformatics Training and Education Program

Data Wrangling with R

When: November 27, 2023 - December 20, 2023

About this Course

Welcome to the Data Wrangling with R course series! The purpose of this course is to introduce you to essential R packages and functions that will make your life easier when it comes time to explore, clean, transform, and summarize your data. It is often said that around 50-80% of a data scientist's time is devoted to data wrangling, or the act of getting data into a specific format. We can reduce some of this time simply by becoming more familiar with the packages and tools dedicated to tidying, transforming, and summarizing data. In R, one such collection of packages is known as the tidyverse, which will be the focus of this course.

Each lesson will be followed immediately by a one-hour help session. Help sessions will be structured around a set of practice problems for you to test your new skills, though we welcome all questions!

No experience with R is necessary to attend this course. The first few lessons will be focused on getting acquainted with R and RStudio.

You will not need to install R on your computer for this class. Instead, we will be using R through DNAnexus, a cloud platform for bioinformatics analysis. Details will follow upon registration. 

Description
This will be a no-coding introduction to R, RStudio, and the tidyverse. In this lesson, we will review some of the advantages of using R for data analysis and will get you acquainted with the RStudio environment. The end of the lesson will shift focus to getting everyone connected to the course on DNAnexus.
Details
When
Mon, Nov 27, 2023 - 1:00 pm - 2:00 pm
Where
Online Webinar
Description
This lesson will focus on some of the basics of R programming including naming and assigning R objects, recognizing and using R functions, understanding data types and classes, and becoming familiar with the R programming syntax.
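As a small taste of these basics, the sketch below assigns a few objects, calls some base R functions, and checks data types. The object names and values are made up for illustration and are not taken from the course materials.

```r
# Assign values to named objects with the assignment operator <-
sample_count <- 12L        # integer (the L suffix marks a whole number)
mean_depth   <- 30.5       # double, reported by class() as "numeric"
gene_name    <- "TP53"     # character
is_control   <- TRUE       # logical

# Call functions by name, passing arguments inside parentheses
depths        <- c(28.1, 30.5, 33.2)   # c() combines values into a vector
avg_depth     <- mean(depths)          # average of the vector
rounded_depth <- round(avg_depth, digits = 1)

# Inspect the class of each object
class(sample_count)
class(mean_depth)
class(gene_name)
class(is_control)
```

Running `class()` on an object is a quick way to confirm what kind of data you are working with before passing it to a function.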
Details
When
Wed, Nov 29, 2023 - 1:00 pm - 2:00 pm
Where
Online Webinar
Description
In this lesson, we will learn how to import simple and complex data and how to avoid common mistakes. We will also learn how to reshape data, for example, from wide to long format, with tidyr.
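To illustrate what reshaping from wide to long format can look like, here is a minimal sketch using `pivot_longer()` from tidyr. The data frame and its column names are hypothetical, and the lesson itself may use different data or functions.

```r
library(tidyr)

# A small "wide" table: one row per gene, one column per sample
# (gene names and counts are invented for this example)
wide <- data.frame(
  gene    = c("geneA", "geneB"),
  sample1 = c(10, 0),
  sample2 = c(15, 3)
)

# Reshape to "long" format: one row per gene-sample pair
long <- pivot_longer(
  wide,
  cols      = c(sample1, sample2),  # columns to stack
  names_to  = "sample",             # new column holding the old column names
  values_to = "count"               # new column holding the values
)

long
```

The long table has one observation per row, which is the layout that most tidyverse functions expect.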
Details
When
Mon, Dec 04, 2023 - 1:00 pm - 2:00 pm
Where
Online Webinar
Description
This lesson will be a brief reprieve from data wrangling and will instead introduce the basics of plotting with ggplot2.
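For a sense of what a basic ggplot2 plot involves, the sketch below builds a scatter plot from `mtcars`, a data set that ships with R. The choice of data and variables is an assumption made for illustration, not the example used in the lesson.

```r
library(ggplot2)

# A plot is built in layers: data, aesthetic mappings, then a geometry
p <- ggplot(mtcars, aes(x = wt, y = mpg)) +
  geom_point() +                                   # draw one point per car
  labs(x = "Weight (1000 lbs)", y = "Miles per gallon")

# Printing the object draws the plot
print(p)
```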
Details
When
Wed, Dec 06, 2023 - 1:00 pm - 2:00 pm
Where
Online Webinar
Description
In this lesson, we will learn how to improve code interpretability with the pipe (%>%) from the magrittr package. We will also learn how to merge and filter data frames.
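The sketch below shows one way the pipe can make a merge-then-filter step easier to read. It assumes dplyr's `left_join()` and `filter()` as the merging and filtering functions (dplyr re-exports the `%>%` pipe from magrittr), and the two small data frames are invented for the example.

```r
library(dplyr)

# Hypothetical example data: read counts and sample metadata
counts <- data.frame(
  sample = c("s1", "s2", "s3"),
  reads  = c(1200, 800, 1500)
)
metadata <- data.frame(
  sample    = c("s1", "s2", "s3"),
  treatment = c("control", "treated", "treated")
)

# Without the pipe, function calls are nested and read inside-out
nested <- filter(
  left_join(counts, metadata, by = "sample"),
  treatment == "treated"
)

# With the pipe, each step reads top to bottom
piped <- counts %>%
  left_join(metadata, by = "sample") %>%   # merge on the shared column
  filter(treatment == "treated")           # keep only treated samples

piped
```

Both versions return the same result; the piped version simply lists the steps in the order they happen.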
Details
When
Mon, Dec 11, 2023 - 1:00 pm - 2:00 pm
Where
Online Webinar
Description
In this lesson, we will continue to wrangle data using dplyr, focusing on functions such as group_by(), arrange(), summarize(), and mutate().
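As a rough illustration of how these four functions fit together, the sketch below adds a column, summarizes by group, and sorts the result. The data frame and column names are hypothetical.

```r
library(dplyr)

# Hypothetical expression counts for two genes under two conditions
expr <- data.frame(
  gene      = c("geneA", "geneA", "geneB", "geneB"),
  treatment = c("control", "treated", "control", "treated"),
  count     = c(10, 30, 5, 5)
)

summary_tbl <- expr %>%
  mutate(log_count = log2(count + 1)) %>%   # add a new column
  group_by(gene) %>%                        # define groups
  summarize(                                # collapse each group to one row
    mean_count = mean(count),
    mean_log   = mean(log_count),
    n_samples  = n()
  ) %>%
  arrange(desc(mean_count))                 # sort, largest mean first

summary_tbl
```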
Details
When
Wed, Dec 13, 2023 - 1:00 pm - 2:00 pm
Where
Online Webinar
Description

In this lesson, we will learn about specialized data containers / classes that are shared across Bioconductor packages. These classes allow us to store and easily manage multiple -omics types. We will discuss some of the properties of these classes and gain insight into how to access and subset the data stored within. 

Details
When
Mon, Dec 18, 2023 - 1:00 pm - 2:00 pm
Where
Online Webinar
Description
In this lesson, we will review many of the important concepts we learned throughout the course. We will also practice using our skills together on a realistic data set.
Details
When
Wed, Dec 20, 2023 - 1:00 pm - 2:00 pm
Where
Online Webinar