Answer:a
Explanation:Goal of Crunch is to make pipelines that are composed of many user-defined functions simple to write, easy to test, and efficient to run.

2. Point out the correct statement :
a) Scrunch’s Java API is centered around three interfaces that represent distributed datasets
b) All of the other data transformation operations supported by the Crunch APIs are implemented in terms of three primitives
c) A number of common Aggregator implementations are provided in the Aggregators class
d) All of the mentionedView Answer

Answer:c
Explanation:PGroupedTable provides a combineValues operation that allows a commutative and associative Aggregator to be applied to the values of the PGroupedTable instance on both the map and reduce sides of the shuffle.

3. For Scala users, there is the __________ API, which is built on top of the Java APIs
a) Prunch
b) Scrunch
c) Hivench
d) All of the mentionedView Answer

4. The Crunch APIs are modeled after _________ , which is the library that Google uses for building data pipelines on top of their own implementation of MapReduce.
a) FlagJava
b) FlumeJava
c) FlakeJava
d) All of the mentionedView Answer

Answer:b
Explanation:The Apache Crunch project develops and supports Java APIs that simplify the process of creating data pipelines on top of Apache Hadoop.

5. Point out the wrong statement :
a) Crunch pipeline written by the development team sessionizes a set of user logs generates are then processed by a diverse collection of Pig scripts and Hive queries
b) Crunch pipelines provide a thin veneer on top of MapReduce
c) Developers have access to low-level MapReduce APIs
d) None of the mentionedView Answer

Answer:d
Explanation:Crunch is extremely fast, only slightly slower than a hand-tuned pipeline developed with the MapReduce APIs.