As a young man I enjoyed listening to a particular series of French instructional programs. I didn’t understand a word, but was nevertheless enthralled. Was it because the sounds of human speech are thrilling? Not really. Speech sounds alone, stripped of their meaning, don’t inspire. We don’t wake up to alarm clocks blaring German speech. We don’t drive to work listening to native spoken Eskimo, and then switch it to the Bushmen Click station during the commercials. Speech sounds don’t give us the chills, and they don’t make us cry – not even French.
But music does emanate from our alarm clocks in the morning, and fill our cars, and give us chills, and make us cry. According to a recent paper by Nidhya Logeswaran and Joydeep Bhattacharya from the University of London, music even affects how we see visual images. In the experiment, 30 subjects were presented with a series of happy or sad musical excerpts. After listening to the snippets, the subjects were shown a photograph of a face. Some people were shown a happy face – the person was smiling – while others were exposed to a sad or neutral facial expression. The participants were then asked to rate the emotional content of the face on a 7-point scale, where 1 mean extremely sad and 7 extremely happy.
The researchers found that music powerfully influenced the emotional ratings of the faces. Happy music made happy faces seem even happier while sad music exaggerated the melancholy of a frown. A similar effect was also observed with neutral faces. The simple moral is that the emotions of music are “cross-modal,” and can easily spread from sensory system to another. Now I never sit down to my wife’s meals without first putting on a jolly Sousa march.
Although it probably seems obvious that music can evoke emotions, it is to this day not clear why. Why doesn’t music feel like listening to speech sounds, or animal calls, or garbage disposals? Why is music nice to listen to? Why does music get blessed with a multi-billion dollar industry, whereas there is no market for “easy listening” speech sounds?
In an effort to answer, let’s first ask why I was listening to French instructional programs in the first place. The truth is, I wasn’t just listening. I was watching them on public television. What kept my attention was not the meaningless-to-me speech sounds (I was a slow learner), but the young French actress. Her hair, her smile, her mannerisms, her pout… I digress. The show was a pleasure to watch because of the humans it showed, especially the exhibited expressions and behaviors.
The lion share of emotionally evocative stimuli in the lives of our ancestors would have been from the faces and bodies of other people, and if one finds human artifacts that are highly evocative, it is a good hunch that it looks or sounds human in some way.
As evidence that humans are the principal source of emotionality among human artifacts, consider human visual signs. Visual signs, I have argued, have culturally evolved to look like natural objects, and have the kinds of contour combinations found in a three-dimensional world of opaque objects. Three-dimensional world of opaque objects? Nothing particularly human about that, and that’s why most linguistic signs – like the letters and words on this page – are not emotionally evocative to look at.
But visual signs do sometimes have emotional associations. For example, colors are notoriously emotionally evocative, and arguments about what color something should be painted are the source of an alarming number of marital arguments. And “V” stimuli, such as that yield sign on the street, have long been realized (within the human factors literature) to serve as the most evocative geometrical shape for warning symbols. But notice that color and “V” stimuli are plausibly about human expression. In particular, color has recently been argued to be “about” human skin and the exhibited emotions – which is why red grabs our attention, since it’s associated with blushing and blood – and “V” stimuli have been suggested to be “about” angry faces (namely, angry eyebrows).
Which brings us back to music and the Logeswaran paper. Music is exquisitely emotionally evocative, which is why a touch of happy music makes even unrelated pictures seem more pleasant. In light of the above, then, we are led to the conclusion that the artifact of music should contain some distinctly human elements.
The question, of course, is what those elements are. One candidate is our expressive speech – perhaps music is just an abstract form of language. However, most of the emotion of language is in the meaning, which is why foreign languages that we don’t understand rarely make us swoon with pleasure or get angry. That’s also why emotional speech from an unfamiliar language isn’t featured on the radio!
But there is a second auditory expressive behavior we humans carry out – our bodily movements themselves. Human movement has been conjectured to underlie music as far back as the Greeks. As a hypothesis this has the advantage that we have auditory systems capable of making sense of the sounds of people moving in our midst – an angry stomper approaching, a delicate lilter passing, and so on. Some of these movements trigger positive emotions – they conjure up images of pleasant activities – while others might be automatically associated with fear or anxiety. (The sound of running makes us wonder what we’re running from.) If music were speech-driven, then it is missing out on the largest part of speech’s expressiveness – the meaning. But if music sounds like human expressive movements, then it sounds like something that, all by itself, is rich in emotional expressiveness, and can be easily interpreted by the auditory system.
Regardless of whether music is emotional intonation from speech or a summary of expressive movements – or something else altogether – the new research by Logeswaran and Bhattacharya adds yet more fuel to the expectation that music has been culturally selected to sound like an emotionally expressive human. While it is not easy for us to see the human ingredients in the modulations of pitch, intensity, tempo and rhythm that make music, perhaps it is obvious to our auditory homunculus.
ABOUT THE AUTHOR(S)
Mark Changizi is a professor in the Department of Cognitive Science at Rensselaer Polytechnic Institute, and is the author of the recently published book “The Vision Revolution”.