Title: Performance of Kepler GTX Titan GPUs and Xeon Phi System

Abstract: NVIDIA's new Kepler architecture significantly improves GPU performance
with its new streaming multiprocessor, SMX. Along with the performance gains, NVIDIA
has also introduced several new technologies, such as dynamic parallelism, Hyper-Q,
and GPUDirect with RDMA. In addition to its other Kepler GPUs, NVIDIA has released
a Kepler GeForce GPU named GTX Titan. The GeForce GTX Titan is suited not only
to gaming but also to high-performance computing with CUDA.
Moreover, it is considerably cheaper than the Kepler Tesla GPUs. We investigate
the performance of the GTX Titan and determine how to optimize CUDA code
appropriately for it. Meanwhile, Intel has launched its new Many Integrated
Core (MIC) system, Xeon Phi. In theory, a Xeon Phi coprocessor could provide
performance similar to that of NVIDIA Kepler GPUs; in practice, however, we find
that its performance is significantly inferior to that of the GTX Titan.

Comments: 7 pages, 6 figures, 3 tables. Contribution to the proceedings of the 31st International Symposium on Lattice Field Theory (Lattice 2013), July 29 - August 3, 2013