Updates to the Data Science Virtual Machine for Linux

The Data Science Virtual Machine (DSVM) is a virtual machine image on the Azure Marketplace assembled for data scientists. The goal of the DSVM is provide a broad array of popular data-oriented tools in a single environment, and make data scientists and developers highly productive in their work. It's available for both Windows and Linux, and the Linux edition has just received a major update. (The Windows edition was also updated recently.)

This latest update adds local standalone instances of Hadoop, YARN and Spark: ideal for prototyping code in your DSVM instance before deploying to a large cluster for production-level applications. This update also upgrades Microsoft R Server to version 9, adding the MicrosoftML package for machine learning, and the mrsdeploy package to operationalize R code on a production server.

You can find instructions for provisioning the Linux DSVM on Azure here, which also includes a detailed list of its components and how to launch them. For more on the latest updates to the DSVM, follow the link below.