Category: bioinformatics

Here is the link to my new course at PACKT publishing. I apologize in advance for some of their video editing choices but you will definitely learn a lot and be able to work through a variety of practical examples to meet your bioinformatic needs. I will upload the R code on GitHub and post the links to the files for all the videos in the course section of my website rjbioinformatics.com. So be sure to stay tuned!

Advertisements

Here is the link to my new course at PACKT publishing. I apologize in advance for some of their video editing choices but you will definitely learn a lot and be able to work through a variety of practical examples to meet your bioinformatic needs. I will upload the R code on GitHub and post the links to the files for all the videos in the course section of my website https://rjbioinformatics.com/video-course/. So be sure to stay tuned!

Video Description

The R language is widely used among statisticians and data miners to develop statistical software and data analysis.

In this course, we’ll start by diving into the different types of R data structures and you’ll learn how the R programming language handles data. Then we’ll look in-depth at manipulating different datasets in R. After that, we’ll dive into data visualization with R, using basic plots, heat maps, and networks. We’ll explore the different flow control loops of the R programming language, and you’ll learn how to debug your code.

In the second half of the course, you’ll get hands-on working with the various statistical methods in R programming. You’ll find out how to work with different probability distributions, various types of hypothesis testing, and statistical analysis with the R programming language.

By the end of this video course, you will be well-versed in the basics of R programming and the various concepts of statistical data analysis with R.

Style and Approach

This fast-paced, practical guide is filled with real-world examples that will take you on a journey through the various concepts and phases of statistical analysis using the R programming language.

Sometimes it is helpful to simulate gene expression data to test code or to see how your results look with simulated values from a particular probability distribution. Here I am going to show you how to simulate RNAseq expression data counts from a uniform distribution with a mininum = 0 and maximum = 1200.

Sometimes it is helpful to simulate gene expression data to test code or to see how your results look with simulated values from a particular probability distribution. Here I am going to show you how to simulate RNAseq expression data counts from a uniform distribution with a mininum = 0 and maximum = 1200.

Now it’s your turn! What other probability distributions could we simulate data from to perform a mock RNA seq experiment to determine how many genes could be different by chance? You can even use a bootstrap approach to calculate the p-value after running 1000 permutations of the code. Of course, to circumvent these problems we use adjusted p values but it is always nice to go back to basics and stress the importance of applying statistical methods when looking at differentially expressed genes. I encourage you all to leave your answers in the comment section below to inspire others.