Jumbo: Beyond MapReduce for Workload Balancing

Over the past decade, several frameworks, such as Google's MapReduce, have been developed that allow data processing at unprecedented scale thanks to their high scalability and fault tolerance. However, these systems pose both new and existing challenges for workload balancing that have not yet been fully explored. The MapReduce model in particular has some inherent limitations when it comes to workload balancing. In this paper, the authors introduce Jumbo, a distributed data processing platform that allows them to go beyond MapReduce and work towards solving these load balancing issues.