ncibtep@nih.gov

Bioinformatics Training and Education Program

Data Wrangling Workshop

Data Wrangling Workshop

 When: Oct. 9th, 2024 10:00 am - 11:30 am

Learning Level: Any

To Know

Where:
Online Webinar
Organizer:
NIH Library
Presented By:
Doug Joubert (NIH Library)
This class has ended.

About this Class

This 1.5 hour online workshop will focus on data wrangling using tidy data principles. Tidy data describes a standard way of storing data that facilitates analysis and visualization within the tidyverse ecosystem. There will be a discussion of what makes data "tidy," and methods for reshaping your data using dplyr and tidyr functions. 

By the end of this training, attendees will be able to demonstrate how to:

  • Describe the purpose of the dplyr and tidyr packages
  • Select certain columns and rows in a data frame
  • Add new columns to a data frame that are functions of existing columns
  • Use the split-apply-combine concept for data analysis

Requirements

Prior to attending this training, you will need to have:

  1. Installed R and RStudio
  2. Taken the Introduction to R and RStudio training. If not, here are some resources for getting started:
    1. Introduction to R
    2. Introduction to RStudio
    3. Introduction to Scripts in RStudio

Note on Technology

Registrants will receive an email with information and instructions to install and verify access to R and RStudio before the training.  If you register the day before the training, you may not have time to download and properly install the necessary software. If you do not have the software installed, this training will be demo only.