Regarding the laurel/yanny discussion, people don’t realize how much of our spoken communication involves us using facial cues, tone and mouth/lip movement to understand what is said. Think of how it is more difficult to follow a phone conversation (no face time). IMO there are more questions regarding clarification of what is being said. Also, think of watching television and when someone is speaking, but the camera is not facing them. Sometimes things are missed.
Yes, some people have hearing loss, but not everyone, and the wiki link provided by
@SimonBurchell explains this idea. The quality of the sound is not controlled, and can’t be because people are hearing it on many different devices. If it were in a controlled setting, we would have a better idea of how different people might hear it.
This is kind of like the misleading example of the ugly dress (which we all agree

). It is a two dimensional photo of a sequinned dress. The camera cannot show the true “colour” of something that is reflecting light.