This question arises as a side issue from <a href="https://stackoverflow.com/questions/38296950/apache-spark-timing-foreach-operation-on-javardd" rel="nofollow">Apache Spark timing forEach operation on JavaRDD</a>, where I am still looking for a good answer to the core question of how best to time RDD creation.

Answer1:

dropResultsN is the persisted RDD, that is, the RDD produced by mapping the method standin.call() over dataSetN.
