On this course you will learn how to write data reports quickly and effectively and communicate the results with your colleagues. We’ll teach you how to automate the reporting process so that you be able to transform your analysis into reproducible reports in HTML, pdf, slideshow or Microsoft Word document.

The tidyverse is essential for any statistician or data scientist who deals with data on a day-to-day basis. By focusing on small key tasks, the tidyverse suite of packages removes the pain of data manipulation.

Python is a powerful, fast programming language that plays well with others, runs everywhere, is friendly and easy to learn. Its a general purpose language in nature, however, it has a large number of packages which allows it to be suitable for a variety of tasks from data analysis to web scraping. Pythons syntax is simple yet elegant making it easy to read and quick to develop in. This course will cover the basics of how the language works looking at the core of components Python language.

This is a one day intensive course on advanced graphics with R. The standard plotting commands in R are known as the Base graphics. In this course, we cover more advanced graphics packages – in particular, ggplot2.

Do you want to create interactive documents? Do you want your reports to automatically update when the data changes? Then this course is for you! This course is based on a workshop run by Garret Grolemund (RStudio) and Colin Gillespie (Jumping Rivers) at Strata 2015.

This is a one-day intensive course on the R package Shiny. The course will be a mixture of lectures and computer practicals. Shiny allows you to create cutting-edge, interactive web-graphics. Regardless of your background, Shiny will enable you to present your data in new and innovative ways.

From the very beginning, R was designed for statistical modelling. Out of the box, R makes standard statistical techniques easy. This course covers standard statistical techniques, such as hypothesis testing, regression and clustering.

Despite the promise of big data, inferences are often limited not by the size of data but rather by its systematic structure. Only by carefully modelling this structure can we take full advantage of the data — big data must be complemented with big models and the algorithms that can fit them. Stan is a

Despite the promise of big data, inferences are often limited not by the size of data but rather by its systematic structure. Stan is a platform for facilitating this modelling, providing an expressive modelling language for specifying bespoke models and implementing state-of-the-art algorithms to draw subsequent Bayesian inferences.

This two-day intensive course will equip you with the knowledge and tools to undertake a variety of tasks in a standard machine learning analytics pipeline. In addition to techniques such as cross-validation, training and testing our models we will examine a range of model building algorithms for both regression and classification supervised learning problems on top of an introduction to unsupervised learning.

Python (along with R) has become the dominant language in machine learning and data science. It is now commonly used to fit complex models to messy datasets. PyTorch is an open-source machine learning library for Python, based on Torch, used for applications such as natural language processing. It is primarily developed by Facebook’s artificial-intelligence research group,

Deep learning is a cutting edge machine learning technique for classification and regression. In the past few years it has produced state-of-the-art results in fields such as image classification, natural language processing, bioinformatics and robotics. This course will cover the main ideas of deep learning, and how to implement it in practice with tensorflow: a software framework for efficient and scalable deep learning. We’ll introduce the high-level keras library, which allows deployment of standard neural networks with just a few lines of code, as well as the details of raw tensorflow, allowing a deeper level of customisation.

The benefit of using a programming language such as R is that we can automate repetitive tasks. This course covers the fundamental techniques such as functions, for loops and conditional expressions. By the end of this course, you will understand what these techniques are and when to use them.

Docker is a popular platform for packaging, deploying, and running applications. These applications run in containers. Crucially, this container can be used on any system: a developer’s laptop, systems on premises, or in the cloud. Containerization is a technology that’s been around for a long time, but it’s gained a new lease of life with

Predicting the future is a tough problem. Some things, like lottery numbers, are inherently unpredictable. Others, like air temperatures and rainfall, are reasonably predictable. Time series analysis makes it possible to assess whether or not predictions are possible and, if they are, build a model which can generate informed predictions for the future with realistic estimates of uncertainty.

This is a 1/2 day session that gives a overview of where and how R is used. Using a combination of lecture based case studies, and hands-on practicals we’ll cover some of the latest developments in the R world.

This course is aimed at statisticians and data scientists already familiar with a dynamic programming language (such as R, Python or Octave) who would like to learn how to use Scala. Scala is a free modern, powerful, strongly-typed, functional programming language, well-suited to statistical computing and data science applications. In particular, it is fast and efficient and is designed to easily exploit modern multi-core and distributed computing architectures.

This is a one-day intensive course on efficient R programming. This course will be a mixture of lectures and computer practicals. This course is aimed at anyone who uses R but wants tips and techniques for speeding up their code.

This is a one day intensive course on building an package in R. This course will be a mixture of lectures and computer practicals. The main focus will be getting a working R package ready for distribution.

As spatial datasets get larger more sophisticated software needs to be harnessed for their analysis. R is now a widely used open source software platform for working with spatial data thanks to its powerful analysis and visualisation packages.

WHAT TRAINING COURSE IS RIGHT FOR YOU?

Moving you from data storage to data insights with our expert training courses. Contact our support team if you have any questions about a specific course or if you need a course creating tailored to your needs.