CSC's trainings and events have moved

Find our upcoming trainings and events at

This site is an archive version and is no longer updated.

Go to CSC Customer trainings and Events

null DC-2020
Data Carpentry with R
Date: 14.05.2020 9:00 - 15.05.2020 17:00
Location details: The event is organised at the CSC Training Facilities located in the premises of CSC at Keilaranta 14, Espoo, Finland. The best way to reach us is by public transportation; more detailed travel tips are available.
Language: english-language
lecturers: Lecturers: Jesse Harrison (CSC) and Bishwa Ghimire (University of Helsinki)
Helper: Kerstin Lenk (Tampere University)
  • 120 for-finnish-academics
  • 560 for-others
The fee covers all materials, lunches as well as morning and afternoon coffees.
The seats are filled in the registration order. If a cancellation is received five (5) business days prior to the course, the course fee will be refunded with the exception of a handling fee of 10 €. For no-shows and cancellations after the cut of date no refunds will be made. Registration can be transferred to someone else from the same organization without additional charge.

Payment can be made with electronic invoicing, credit card, or direct bank transfer. Note that for electronic invoicing you need the operator and e-invoicing address (OVT code) of your organization. Please also note that invoice reference is needed for electronic invoicing in your organization, so please have this available when registering.
Additional Information

Data Carpentry workshops are for any researcher who has data they want to analyze, and no prior computational experience is required. This hands-on workshop teaches basic concepts, skills and tools for working more effectively with data. We will cover data organization in spreadsheets and learn how to use the R programming language for data wrangling and visualization.

The course takes place in CSC's Dogmi classroom, which has a computer for every participant. For further information on the course contents and links to individual lessons, please visit the course GitHub page.

Learning outcomes

In this workshop, participants will learn:

  • Good data entry practices (data in spreadsheets)
  • How to navigate RStudio
  • R syntax and how to write R code
  • Importing and exporting data to / from R
  • How to use tidyverse for data wrangling and plotting
  • Using R together with SQL databases


This introductory course is aimed at graduate students and other researchers. You don't need to have any previous knowledge of the tools that will be presented at the workshop.

Note: To participate in this workshop, you will need a Haka account and password. The course will run on RStudio instances available via the CSC Notebooks service ( Please contact the course staff if you do not have a Haka account.


Day 1

Data Organization in Spreadsheets

  • Introduction
  • Formatting Data
  • Common Formatting Problems
  • Dates as Data
  • Quality Control
  • Exporting Data

Data Analysis with R

  • Overview of R and RStudio
  • Introduction to R
  • Starting with Data

Day 2

Data Analysis with R (Continued)

  • Manipulating Data Frames with dplyr
  • Data Visualisation with ggplot2
  • SQL Databases and R