Arun Srinivasan

R's data.table co-developer

Arun is one of the main contributors to the data.table package. He started using R in late 2011 and works as a data analyst at Open Analytics. He has a passion for developing tools and applying algorithms facilitating big-data analyses, and routinely works with data sizes in the order of several GBs.

Prerequisites

Course Description

The R data.table package is rapidly making its name as the number one choice for handling large datasets in R. This online data.table tutorial will bring you from data.table novice to expert in no time. Once you are introduced to the general form of a data.table query, you will learn the techniques to subset your data.table, how to update by reference and how you can use data.table’s set()-family in your workflow. The course finishes with more complex concepts such as indexing, keys and fast ordered joins. Upon completion of the course, you will be able to use data.table in R for a more efficient manipulation and analysis process. Enjoy!

1

Data.table novice

Free

Introduction on what exactly a data.table is, how it differs from the traditional data.frame in R, and understanding the general form of a data.table query.

Resources

Groups

About

DataCamp offers interactive R and Python courses on topics in data science, statistics, and machine learning. Learn from a team of expert teachers in the comfort of your browser with video lessons and fun coding challenges.