Yanny or Laurel? You decide!

The latest internet debate, in the vein of the gold-or-blue dress, is about an audio clip that sounds like Yanny to some people and Laurel to others. Various explanations have been offered, such as the audio quality of your headset or speakers and the age of the listener (the hypothesis being that older people have trouble hearing the higher frequencies in the recording and therefore hear something different from what younger people hear).

When I play this on my desktop computer with my headset, I hear Yanny. However, listening to the same audio on my phone speaker I hear Laurel, so the audio characteristics of the rendering device do play a role.

Interestingly enough, if you listen to a1.wav (compressed to stretched), it sounds like Yanny almost to the end, but if you listen to a2.wav (stretched to compressed), it sounds like Laurel almost to the end! The human brain seems to want to cling to what it heard just before, so the context of what you hear matters (a lot)!

Interesting post, Arnoud! It was odd to me that the pitch shift changed which word he seemed to be saying. When I originally listened to the recording, I could hear "laurel" if I focused on the lower frequencies, but "yanny" if I focused on the higher frequencies. I first tried to see if I could get rid of the higher frequency noise and preserve more of the original voice with a low pass filter.
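For readers who want to try the low pass step themselves, here is a minimal sketch of the idea using only the Python standard library. The comment above does not say which tool or cutoff was used, so the one-pole filter design, the 1000 Hz cutoff, the test tones and all function names here are assumptions chosen for illustration, not the original method.

```python
import math

def low_pass(samples, cutoff_hz, sample_rate):
    """One-pole (RC-style) low pass filter: attenuates content above cutoff_hz."""
    dt = 1.0 / sample_rate
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)  # smoothing factor, between 0 and 1
    out = []
    y = 0.0
    for x in samples:
        y += alpha * (x - y)  # move the output a fraction of the way toward the input
        out.append(y)
    return out

def rms(samples):
    """Root-mean-square level, used here as a rough loudness measure."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def tone(freq_hz, sample_rate, seconds=0.5):
    """Pure sine tone standing in for one frequency band of the clip."""
    n = int(sample_rate * seconds)
    return [math.sin(2.0 * math.pi * freq_hz * i / sample_rate) for i in range(n)]

SAMPLE_RATE = 44100   # assumed sample rate
CUTOFF_HZ = 1000.0    # hypothetical cutoff, not taken from the post

low = tone(200, SAMPLE_RATE)    # stands in for the low band
high = tone(5000, SAMPLE_RATE)  # stands in for the high band

# Fraction of each tone's level that survives the filter.
low_kept = rms(low_pass(low, CUTOFF_HZ, SAMPLE_RATE)) / rms(low)
high_kept = rms(low_pass(high, CUTOFF_HZ, SAMPLE_RATE)) / rms(high)
print(f"200 Hz kept: {low_kept:.2f}, 5000 Hz kept: {high_kept:.2f}")
```

A one-pole filter rolls off gently, so the high band is reduced rather than removed; a steeper filter from an audio editor or a signal processing library would separate the two bands more cleanly.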

Then, by trial and error, I tried to isolate the high frequencies instead. That left some faint sounds, so I amplified the result, filtered it one more time to remove a little extra low pitched noise, and used a low pass filter to get rid of the noise from all the amplification and filtering. I can somewhat hear a muffled whisper of "yanny" and less of "laurel"; perhaps others more knowledgeable about signal processing could venture further down this path.
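The chain described above (pull out the highs, amplify, filter again, then low pass away the hiss) could be sketched roughly as follows. This reads the first step as a high pass filter. The cutoffs, the gain, the one-pole filter design and every function name are assumptions for illustration; the actual settings were found by trial and error and are not given.

```python
import math

def low_pass(samples, cutoff_hz, sample_rate):
    """One-pole low pass filter: attenuates content above cutoff_hz."""
    dt = 1.0 / sample_rate
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)
    out, y = [], 0.0
    for x in samples:
        y += alpha * (x - y)
        out.append(y)
    return out

def high_pass(samples, cutoff_hz, sample_rate):
    """One-pole high pass filter: attenuates content below cutoff_hz."""
    dt = 1.0 / sample_rate
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    a = rc / (rc + dt)
    out, prev_x, prev_y = [], 0.0, 0.0
    for x in samples:
        prev_y = a * (prev_y + x - prev_x)  # passes fast changes, blocks slow ones
        prev_x = x
        out.append(prev_y)
    return out

def amplify(samples, gain):
    """Scale the signal and hard-clip it to the range [-1.0, 1.0]."""
    return [max(-1.0, min(1.0, s * gain)) for s in samples]

def isolate_highs(samples, sample_rate, cutoff_hz=2000.0, gain=8.0,
                  hiss_cutoff_hz=8000.0):
    """Hypothetical chain: high pass twice, amplify, then low pass the hiss."""
    stage = high_pass(samples, cutoff_hz, sample_rate)
    stage = high_pass(stage, cutoff_hz, sample_rate)  # second pass for leftover lows
    stage = amplify(stage, gain)
    return low_pass(stage, hiss_cutoff_hz, sample_rate)

def rms(samples):
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def tone(freq_hz, sample_rate, amplitude=1.0, seconds=0.5):
    n = int(sample_rate * seconds)
    return [amplitude * math.sin(2.0 * math.pi * freq_hz * i / sample_rate)
            for i in range(n)]

SAMPLE_RATE = 44100  # assumed sample rate

# Synthetic stand-in for the clip: a loud low tone plus a faint high tone.
low = tone(200, SAMPLE_RATE)
high = tone(5000, SAMPLE_RATE, amplitude=0.05)
mixed = [a + b for a, b in zip(low, high)]

low_gain = rms(high_pass(low, 2000.0, SAMPLE_RATE)) / rms(low)
high_gain = rms(high_pass(high, 2000.0, SAMPLE_RATE)) / rms(high)
recovered = isolate_highs(mixed, SAMPLE_RATE)
print(f"low band kept: {low_gain:.2f}, high band kept: {high_gain:.2f}")
```

Every stage after the high pass also boosts or reshapes whatever noise is left, which may be part of why the result comes out as a muffled whisper rather than a clean word.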

I wasn't satisfied with the high pass filtered result, since it wasn't distinct enough to say that this is certainly where "yanny" is coming from, but I thought the low pass filtered result made it pretty clear the original speaker was saying "laurel". The most interesting result I found was that when I applied the same pitch shift adjustments you made to the low pass filtered recording, it only sounds like "laurel" whether sped up or slowed down, which was convincing enough for me.