In the single-GPU case, both versions of the code work correctly. In the multi-GPU case, however, h_Test1 in code one receives the correct data, while h_Test1 in code two receives incorrect data. This happens on every device except device 0. I really don't know why this happens. I suspect it may be a bug. Does anybody know the reason? Thank you for your help!