Other sites

Most Popular Learners in mlr

For the development of mlr as well as for an “machine learning expert” it can be handy to know what are the most popular learners used.
Not necessarily to see, what are the top notch performing methods but to see what is used “out there” in the real world.
Thanks to the nice little package cranlogs from metacran you can at least get a slight estimate as I will show in the following…

First we need to install the cranlogs package using devtools:

devtools::install_github("metacran/cranlogs")

Now let’s load all the packages we will need:

library(mlr)library(stringi)library(cranlogs)library(data.table)

Do obtain a neat table of all available learners in mlr we can call listLearners().
This table also contains a column with the needed packages for each learner separated with a ,.

# obtain used packages for all learners
lrns=as.data.table(listLearners())all.pkgs=stri_split(lrns$package,fixed=",")

Note: You might get some warnings here because you likely did not install all packages that mlr suggests – which is totally fine.

Now we can obtain the download counts from the rstudio cran mirror, i.e. from the last month.
We use data.table to easily sum up the download counts of each day.

Remarks

This is not really representative of how popular each learner is, as some packages have multiple purposes (e.g. multiple learners).
Furthermore it would be great to have access to the trending list.
Also most stars at GitHub gives a better view of what the developers are interested in.
Looking for machine learning packages we see there e.g: xgboost, h2o and tensorflow.