Owner

Current status

Detailed Description

The Apache Accumulo™ sorted, distributed key/value store is a robust, scalable, high performance data storage and retrieval system. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, Zookeeper, and Thrift. Apache Accumulo features a few novel improvements on the BigTable design in the form of cell-based access control and a server-side programming mechanism that can modify key/value pairs at various points in the data management process.

Benefit to Fedora

Apache Accumulo offers a big data management store for Hadoop users. Including it in Fedora increases usefulness of the Apache Hadoop that is already in Fedora, by providing more options for data warehousing and analysis for academic and commercial users.

Scope

Proposal owners: The Accumulo package will provide all the functionality from the upstream release, packaged for Fedora.

Other developers: N/A (not a System Wide Change)

Release engineering: N/A (not a System Wide Change)

Policies and guidelines: N/A (not a System Wide Change)

Upgrade/compatibility impact

N/A (not a System Wide Change)

How To Test

Execute the example code, and test frameworks (continuous ingest, randomwalk) included in the upstream release.

User Experience

Users should be able run a single-node or cluster instance of Accumulo on Fedora, and be able to develop applications which leverage Accumulo as a database.

Dependencies

Currently, no other Fedora packages depend on Accumulo.
Accumulo will depend on other Fedora packages, including Apache Hadoop, Apache ZooKeeper, and Apache Thrift.