When someone speaks, there are two things going on: the words, and how they say the words. You can vastly change the meaning of an utterance by changing the tone of voice, rhythm, word emphasis, etc. Imagine removing the words, replacing them with non-significant gibberish, and being left only with tone of voice, rhythm, emotional emphasis. In the absence of words, what is communicated? Huge amounts about the mood, attitude, and personality texture of the speaker. What you'd be hearing, in the absence of understandable words, is that person's personal music.
Ever notice that you just love the sound of a certain person's voice? Math Is Hard once said she loved Morgan Freeman's voice so much she could sit and listen to him read the phone book. The opposite's also true: some people's personal music is quite ugly, and you can't stand the very sound of their voice. There's everything in between and more gradients along all other axes.