Tag: BigData

In a recent release, Azure Data Lake Analytics (ADLA) takes the capability to process large amounts of files of many different formats to the next level. This blog post is showing you an end to end walk-through of generating many Parquet files from a rowset, and process them at scale with ADLA as well as…

Do you use Data Lake Analytics and wonder how many Analytics Units your jobs should have been assigned? Do you want to see if your job could consume a little less time or money? The recently-announced AU Analyzer tool can help you today! See our recent announcement of the AU Analyzer, available in both Visual…

Customers love to use Azure Data Lake across their organizations by enabling their Data Lake Analytics accounts to use multiple Data Lake Store accounts as data sources. Business can have flexibility with resource usage and allocation, but this creates challenges for the Data Lake account administrators who struggle with understanding what is being used, how…

Got some time to learn Big Data Technologies? How about starting with Hive which is considered the de facto standard for SQL queries in Hadoop We just released HDInsight labs used during the BUILD conference code challenge. You will need 2 things to run these labs 1- HDInsight Cluster – How to create? 2- Step by Step Instructions…

We are excited to announce that with today’s release of Cloudera Enterprise 5.11 you can now run Spark, Hive, and MapReduce workloads in a Cloudera cluster on Azure Data Lake Store (ADLS). Cloudera customers can now take advantage of the many benefits of running clusters on ADLS. And ADLS brings to its customers another valuable…

Working with Hive, I regularly find myself staring at a csv/tsv/json files wondering where to start…. Hive View 2.0 is a new Web Experience in HDInsight 3.6 that greatly simplifies many common Hive Tasks and makes it easy to author and debug hive queries. In this post, we will look into 5 key feature that…

We will keep this page updated with HDInsight HBase/ Phoenix related commonly asked questions. You can leave comments/questions on this blog. Also, official channel to provide HDInsight related feedback and make feature requests is here What is the advantage of using HBase in Azure HDInsight? Azure HDInsight HBase – A NoSql database like no other …

This blog is written by Nitin Verma, Sr. Software Engineer, HDInsight. Do you restart or re-create your HDInsight HBase clusters often? and wished restart/re-create times were faster? if yes, please read on- This blog introduces a new script for HDInsight HBase service through which you can flush the MemStore of all HBase tables conveniently. The script…