ncibtep@nih.gov

Bioinformatics Training and Education Program

Data Wrangling Workshop

When: Jul. 8th, 2024 1:00 pm - 3:00 pm

Learning Level: Any

To Know

Where:
NIH Library Training Room, Building 10, Clinical Center, South Entrance
Organizer:
NIH Library
Presented By:
Doug Joubert (NIH Library)
This class has ended.

About this Class

This two-hour in-person workshop will focus on data wrangling using tidy data principles. Tidy data is a standard way of structuring data that facilitates analysis and visualization within the tidyverse ecosystem. The workshop will cover what makes data "tidy" and how to reshape data using dplyr and tidyr functions.
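As a rough illustration of reshaping toward a tidy layout, the sketch below converts a small "wide" table into a "long" one with tidyr's pivot_longer(). This is not taken from the workshop materials. The `wide` data frame and its column names are hypothetical, and the example assumes the tidyr package is installed.

```r
library(tidyr)

# Hypothetical wide table: one column per measurement day
wide <- data.frame(
  sample = c("s1", "s2"),
  day1   = c(5, 7),
  day2   = c(6, 9)
)

# Reshape so that each row holds one observation (sample, day, count)
long <- pivot_longer(
  wide,
  cols      = c(day1, day2),
  names_to  = "day",
  values_to = "count"
)

print(long)
```

In the long form, each variable (sample, day, count) has its own column and each measurement has its own row, which is the layout that dplyr verbs tend to work with most directly.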

By the end of this training, attendees will be able to:

  • Describe the purpose of the dplyr and tidyr packages
  • Select certain columns and rows in a data frame
  • Add new columns to a data frame that are functions of existing columns
  • Use the split-apply-combine concept for data analysis
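The skills listed above can be sketched with a few dplyr verbs. The example below is a minimal illustration only, not the workshop's own exercise. The `trials` data frame and all of its columns are made up, and the code assumes the dplyr package is installed.

```r
library(dplyr)

# Hypothetical example data: one row per measurement
trials <- data.frame(
  subject   = c("a", "a", "b", "b", "c", "c"),
  group     = c("control", "control", "treated", "treated", "treated", "treated"),
  dose_mg   = c(0, 0, 10, 20, 10, 20),
  weight_kg = c(70, 70, 80, 80, 60, 60),
  response  = c(1.0, 1.2, 2.5, 3.1, 2.0, 2.8)
)

# Select certain columns and rows
treated <- trials %>%
  select(subject, dose_mg, weight_kg, response, group) %>%
  filter(group == "treated")

# Add a new column that is a function of existing columns
treated <- treated %>%
  mutate(dose_per_kg = dose_mg / weight_kg)

# Split-apply-combine: split rows by group, apply a summary to each
# piece, and combine the results into one table
summary_by_group <- trials %>%
  group_by(group) %>%
  summarise(mean_response = mean(response), n = n())

print(summary_by_group)
```

Here select() and filter() pick columns and rows, mutate() derives a new column, and group_by() followed by summarise() is one common way to express split-apply-combine in dplyr.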

Requirements

Prior to attending this training, you will need to have:

  1. Installed R and RStudio
  2. Taken the Introduction to R and RStudio training. If not, here are some resources for getting started:
    1. Introduction to R
    2. Introduction to RStudio
    3. Introduction to Scripts in RStudio

Note on Technology

Participants are expected to bring their own laptops to this training. NIH staff using an NIH laptop can connect to the staff Wi-Fi. Participants bringing a personal laptop are restricted to the NIH Public Wi-Fi.

Registrants will receive an email with information and instructions to install and verify access to R and RStudio before the training. If you register the day before the training, you may not have time to download and properly install the necessary software. If you do not have the software installed, you will be able to follow this training as a demonstration only.