Google’s New Text-to-Speech AI Is so Good We Bet You Can’t Tell It From a Real Human

Based on the paper, it's highly probable that "gen" indicates speech generated by Tacotron 2, and "gt" is real human speech. ("GT" likely stands for "ground truth," a machine learning term that basically means "the real deal.") Assuming this is correct, here are the answers to the test: "That girl did a video …