Hashing can be useful in data warehousing as well. It lets you break large problems into smaller, more manageable pieces, or scale out your ETL process ;) PDW even uses a similar method to distribute data internally among all the compute nodes.
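
To make the idea concrete, here is a minimal sketch of hash-based partitioning in Python. It is only an illustration of the general technique, not how PDW implements it: the function names (`bucket_for`, `partition_rows`), the `customer_id` column, and the choice of MD5 are all assumptions made for the example. A stable digest is used rather than Python's built-in `hash()`, because string hashes from `hash()` are randomized per process and would not give the same bucket across separate ETL workers.

```python
import hashlib
from collections import defaultdict


def bucket_for(key: str, num_buckets: int) -> int:
    """Map a key to a bucket number in [0, num_buckets) using a stable hash."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an unsigned integer.
    return int.from_bytes(digest[:8], "big") % num_buckets


def partition_rows(rows, key_column, num_buckets):
    """Group rows (dicts) by the hash bucket of their key column."""
    buckets = defaultdict(list)
    for row in rows:
        bucket = bucket_for(str(row[key_column]), num_buckets)
        buckets[bucket].append(row)
    return buckets


if __name__ == "__main__":
    # Hypothetical sample data; in practice this would be a large extract.
    rows = [{"customer_id": i % 7, "amount": i * 10} for i in range(20)]
    for bucket, chunk in sorted(partition_rows(rows, "customer_id", 4).items()):
        print(bucket, len(chunk))
```

Because every row with the same key lands in the same bucket, each bucket can be handed to a separate worker or processed in its own batch, and per-key work such as aggregations or joins on that key never has to look outside its bucket.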