The result used a single-node (Intel donated quad-socket) and was able to beat clusters and even graph-specialized architectures such as the Cray XMT and the Convey HC-1ex Graph Personality. It is 2.9x more efficient than any other single-node result, and 28x more efficient than any cluster result.