I recorded the times for three stages: copying from host to GPU, running the FFT
on the GPU, and copying from the GPU back to the host. I am not reporting the time
spent reading from stdin or writing to stdout.
The results are shown in the table below: