I want to learn more about scale: not scale in general, but scaling data storage and processing. Nowadays, when you hear about big data, it often implies a Hadoop-based distributed system. As Yahoo is one of the biggest contributors to Hadoop at the moment, the best tutorial is also on their ...