To my ears, this sounds more like a word clock issue than pure distortion / modulation.

Basically, you have 2 digital audio devices which are free running with no common 'clock' to syncronise the audio. You PC will make a good estimate of when the data packets in the stream are starting and stopping, but without correct word clock (which says when each 'word' or data starts), it can't get them to line up correctly. And you get nasty artifacts in the audio.

Basically, you've lost the podcast - sorry. I don't know of anything that can fix that.