Installing the Lily HBase Indexer Service

To query data stored in HBase, you must install the Lily HBase Indexer service. This service indexes the stream of records being added to HBase tables. This process is scalable, fault
tolerant, transactional, and operates at near real-time (NRT). The typical delay is a few seconds between the time data arrives and the time the same data appears in search results.

Choosing where to Deploy the Lily HBase Indexer Service Processes

To accommodate the HBase ingest load, you can run as many Lily HBase Indexer services on different hosts as required. See the HBase replication documentation for details on how to plan
the capacity. You can co-locate Lily HBase Indexer service processes with SolrCloud on the same set of hosts.

By default, this tool is installed when Cloudera Search is installed using parcels, such as in a Cloudera Manager deployment. If you are using a package installation and this tool does
not exist on your system, you can install this tool using the commands described in this topic.

To install the Lily HBase Indexer service on RHEL systems:

$ sudo yum install hbase-solr-indexer hbase-solr-doc

To install the Lily HBase Indexer service on Ubuntu and Debian systems:

$ sudo apt-get install hbase-solr-indexer hbase-solr-doc

To install the Lily HBase Indexer service on SUSE-based systems:

$ sudo zypper install hbase-solr-indexer hbase-solr-doc

Important: For the Lily HBase Indexer to work with CDH 5, you may need to run the following command before issuing Lily HBase MapReduce jobs:

If this documentation includes code, including but not limited to, code examples, Cloudera makes this available to you under the terms of the Apache License, Version 2.0, including any required
notices. A copy of the Apache License Version 2.0 can be found here.