RDFPath MapReduce Processor

RDFPath is an expressive, navigational query language for retrieving information from diverse knowledge bases represented in RDF.
RDFPath MapReduce Processor uses Hadoop's MapReduce to distribute the associated workload across a cluster of machines, and is well suited to data-intensive, ETL-like scenarios.

A more recent implementation built on top of Impala and Spark is published on GitHub: RDFPath Engine

Requirements

Hadoop. We recommend Cloudera's Distribution of Hadoop (CDH3); the implementation has been tested with CDH3.