MapR 5.0 Documentation : Installing Apache Drill 0.5 on MapR

Created by Bridget Bevens, last modified on Oct 01, 2014

Installation Overview

You can install and run Apache Drill on any number of nodes in your Hadoop cluster. Install the mapr-drill package on each node that you want to run Apache Drill. The mapr-drill package installs the Drillbit daemon and the Drill shell.

The Drillbit daemon is the core Drillbit service that runs on a node. Drill’s processing capacity increases with the number of Drillbit services running in a cluster. Each node running the Drillbit service can receive, plan, and execute queries sent from a client. Warden manages the Drillbit service, which simplifies the installation process and management of Apache Drill. For more information about Warden, refer to Warden in the MapR Architecture Guide.

The Drill shell is the command line interface, a pure-Java console-based utility, for connecting to relational databases and executing SQL commands.

After you install Apache Drill, you can perform any of the following tasks:

MapR distribution version 3.1.1 or 4.0.1. Verify that you have added the MapR repository on your system. You should have the maprtech.repo in the directory /etc/yum.repos.d/ with the following content:

You cannot run Apache Drill 0.5.0 against HBase 0.98.x. If you install HBase packages from MapR's ecosystem-4.x or ecosystem-all repositories, you currently get HBase 0.98.x, by default. You must uninstall HBase 0.98.x and then install HBase version 0.94.17 or 0.94.21.

Installing Apache Drill

Issue the following command to install the mapr-drill package on a node:

RedHat and CentOS

$ sudo yum install mapr-drill

Ubuntu

$ sudo apt-get install mapr-drill

SuSE

$ sudo zypper install mapr-drill

(Optional) If you want to run Apache Drill against MapR-DB, you must include the hbase.scan.sizecalculator.enabled property in the drill.exec block of the /opt/mapr/drill/drill-<version>/conf/drill-override.conf file, and set the property to false.

Do not include the property in drill-override.conf if you run Apache Drill against HBase.

Apache Drill does not require working sets to fit in memory at query execution time. Drill's default memory settings should suffice for most use cases. Depending on your use case, Drill performance may significantly benefit from more memory. Please modify Drill's memory settings, as well as initial and maximum heap sizes appropriately, in /opt/mapr/drill/drill-<version>/conf/drill-env.sh.

Run configure.sh to refresh the node configuration.

Example

/opt/mapr/server/configure.sh -R

Verify that the Drillbit service is running on the node.

You can issue the following command to verify the status of the Drillbit service from the command line:

jps

You should see Drillbit as one of the services listed. If you do not see Drillbit in the list, you can issue the following maprcli command to start the Drillbit service on the node: