ncibtep@nih.gov

Bioinformatics Training and Education Program

Data Wrangling Workshop

Data Wrangling Workshop

 When: May. 13th, 2024 10:00 am - 12:00 pm

Learning Level: Any

To Know
  • Where: NIH Library Training Room
  • Organized By: NIH Library
  • Presented By: Doug Joubert (NIH Library), Joelle Mornini (NIH Library)

About this Class

This in-person workshop will focus on data wrangling using tidy data principles. Tidy data describes a standard way of storing data that facilitates analysis and visualization within the tidyverse ecosystem. There will be a discussion of what makes data "tidy," and methods for reshaping your data using dplyr and tidyr functions. Prior to attending this class, you will need to have:

  1. Installed R and RStudio
  2. Taken the Introduction to R and RStudio class. If not, here are some resources for getting started:
    1. Introduction to R
    2. Introduction to RStudio
    3. Introduction to Scripts in RStudio

By the end of this class, attendees will be able to demonstrate how to describe the purpose of the dplyr and tidyr packages, select certain columns in a data frame, select certain rows in a data frame according to filtering conditions, and add new columns to a data frame that are functions of existing columns.

Note on Technology

The NIH Library has 24 pre-configured Windows laptops that you are welcome to use during this training on a first come, first served basis. You are also welcome to bring your own laptop (PC or Mac). NIH Staff bringing their own NIH-laptop can easily connect to the staff Wi-Fi. If participants are bringing a personal laptop, they are restricted to using the NIH-Guest-Network Wi-Fi.

Registrants will receive an email with information and instructions to install and verify access to R and RStudio before the class.  If you register the day before the class, you may not have time to download and properly install the necessary software. If you do not have the software installed, this training will be demo only.