Other sites

Blog Archives

For a long time I tracked a discussion on LinkedIn that consisted of various opinions about using SAS vs R. Some people can take this very personal. Recently there was an interesting post at the DataCamp blog addressing this topic. They also prov...

Descriptive analysis between treatment and control groups can reveal interesting patterns or relationships, but we cannot always take descriptive statistics at face value. Regression and matching methods allow us to make controlled comparisons to reduc...

Embarrassingly I'm stumped on this...I have a program in R for looking at grade distributions in my class. I found something weird recently with my 'ifelse' processing. I noticed that my program seemed to be over counting Cs and under counting...

From: Decomposition: The Statistics Software Signal http://seanjtaylor.com/post/39573264781/the-statistics-software-signal"When you don't have to code your own estimators, you probably won't understand what you're doing. I'm not saying that you defini...

From: http://www.r-bloggers.com/data-driven-science-is-a-failure-of-imagination/I think I like this distinction between Bayesian and Frequentist statistics: "we are nearly always ultimately curious about the Bayesian probability of the hypothesis ...

HT: Revolution Analytics Very good discussion about real applied econometrics and analytics including the use of ARIMA models, decision trees, and genetic algorithms. He also has a very smart approach in his attitude toward big data and data s...

Leo Spizzirri does an excellent job of providing mathematical intuition behind eigenvector centrality. As I was reading through it, I found it easier to just work through the matrix operations he proposes using R. You can find his paper her...

Albert Au Yeung provides a very nice tutorial on non-negative matrix factorization and an implementation in python. This is based very loosely on his approach. Suppose we have the following matrix of users and ratings on movies:If we use the information above to form a matrix R it can be decomposed into two matrices...

In a previous post, I described the basics of social network analysis. I plan to extend that example here with an application in predictive analytics. Let's suppose we have the following network (visualized in R)Suppose we have used the igraph package ...