Chapter 1
Context: The Sound of Music
Bruce Johnson
Much is made in popular music studies of the decisive and differentiating role of mediations.1 Yet almost nothing is made of the most fundamental set of differentiating mediations, without which there can be no cultural transaction at all: the sensorium. As a broad introductory simplification, it may be said that in traditional musicology, music is generally categorized as an âartformâ, the bearer of aesthetics. In popular music studies, the emphasis is on music as a âcultureâ, the bearer of meanings. Initially we wish to situate music in terms of its sensory materiality: music as sound. To do so recognizes the material and biological foundations of culture, and joins with recent emerging research fields such as the anthropology of the senses. As long ago as 1749, Diderot observed that the âstate of our organs and of our senses has a great influence on our metaphysics and our ethics, and our most purely intellectual ideas, if I may express it thus, are very much dependent on the structure of our bodyâ.2 In 1891 pioneer radio wave researcher Heinrich Hertz, who, appropriately in this context, would give his name to the frequency in cycles per second of sound, spoke of the ânarrow borderland of the sensesâ between consciousness and the âworld of actual thingsâ. He declared that for a âproper understanding of ourselves and of the world it is of the highest importance that this borderland should be thoroughly exploredâ.3 The connection was largely overlooked in twentieth-century cultural theory. Studies of gendered identity and power relations, for example, have been marked by âa preference for encountering embodiment via social, representational, or symbolic analysis at the expense of biological dataâ.4 Emerging work in such fields as the anthropology of the senses, however, is gradually repositioning physiology into the study of culture.5
To locate music a priori in the sensorium in this way opens the door to the distinctive phenomenology and physiology of sounding and hearing. Before a musical experience can be analysed as either an aesthetic or cultural transaction, it must be, by this categorization, a sensory episode experienced primarily through the ear. Of course a rich repertoire of reception and interpretation is also deployed. Music is also a tactile phenomenon experienced through vibration. It is usually accompanied also by visual impressions which might be a supplementary mediation of the acoustic event, as in the sight of a band performing or the view from a car or from a sofa as one listens to radio or recordings. And there are also gustatory and olfactory sensations that form part of the totality, as in the obvious example of dining and drinking in a restaurant with live or piped music. The experience is also entangled with memory, emotion, dynamics of identity and taste, relations of power or conflict. But all this is activated by a sound entering the ear. It is the special properties of sound in itself which invest music with certain forms of power that ally it to noise in a way that is not the case with other non-acoustic expressive forms like painting.
We do not wish to enter into the debates about the superiority or otherwise of acoustic vis-Ă -vis scopic modalities.6 Our point is simply to underscore differences between them. Sound is capable of certain effects which are not available to the other major sensory partner in our public social transactions, vision. âSound has always been a privileged tool to âcreate an effectâ, to astonish. ⊠Sound undeniably has an immediate emotional power that has been used by every cultureâ.7 Of special relevance here is the distinctive power of sound to arouse and also to produce organic damage. Sound in and of itself can produce profound distress. Among the most powerful of sounds, and the most often deployed in social intercourse, is the human voice. It has power as an acoustic presence, prior to whatever the specific utterance might generate semiotically. In 1662, a condemned prisoner in Newgate reported that the condemned hold had âneither bench, stool, nor stick for any person there. They lie like swine upon the ground, one upon another, howling and roaring â it was more terrible to me than deathâ.8 This is an example of the sonic effect appropriately called âperditionâ.9 The well known example of the anguished final sounds made by Hamlet in the First Folio version of the play exemplifies the enormous resonant power of this sonic effect.10
There is an enormous range of non-lexical vocal effects apart from perdition, and popular music scholars are likely to understand this better than lexicon-bound legal and literary studies. And the more the voice is inflected by musicality, by factors over and above verbal denotation, the more arresting it becomes. It has been found that six-month-old infants âshowed more attention to their mothersâ singing episodes than to their speaking episodesâ. They were âhypnotizedâ by video of their mothersâ singing, âgluedâ to the image âfor extended periods. Mothersâ speaking was not as engaging as their singingâ.11 The scatting of a jazz singer, the pentecostal shrieks of gospel music, James Brown and Little Richard, denote nothing, yet declare everything about emancipation, abandonment, ecstasy. A non-German speaker can feel the power of Hitlerâs speeches. The oratorical manipulation of rhythm, repetition, disruption, timbre and volume produces sonic effects that can transcend deficiencies in linguistic sophistication. Transcribed on the page, a Baptist sermon by, for example, the Reverend Leo Daniels looks mawkish, naĂŻve and emotionally negligible. Heard, it can transfix even the most sophisticated audience of literature students.12 A ghoulishly corporealized illustration of the âpresenceâ of the voice, its organic origins and its ubiquity, was the reported case of one Edmund Kemper, who killed his mother because of her nagging. He decapitated her, excised the larynx and stuffed it in the garbage disposal unit. But when he switched it on, it spewed the remains back at him: âEven when she was dead, she was still bitching at me. I couldnât get her to shut up.â13
The grotesquerie of this report is a trope for the physically intrusive aspect of sound. Unlike vision, sound enters the body with extremely intensity, as experienced in even relatively everyday situations such as a voice whispering in your ear at a dinner party, which immediately sets up an intimate secondary level of social engagement. Human sound emerges directly from inside the body, while sight plays over the surface.14 The voice is the body, irreducible site of our social being and proclamation of life. The dead may be touched, smelled, seen and tasted â but not heard (again, a point made throughout Hamlet). It is to the characteristic of silence that we attached the word âdeathlyâ. The voice is the living body projected directly into the social space, a kind of nakedness, one reason our culture is discomfited at public vocalization such as yelling and singing, and especially involuntary vocalization such as sobbing. Unlike the projection of the body through its visual aspect, vocal utterance enters the body of the receiver, and is the sound of the body of the emitter, the sound of breath over complex internal interactions between tissue, bone, muscle, nerves, mediating identity in ways that are physiologically related to the cortex very differently from the visual faculty.
Sound and the voice are also able to instantly modify the radius of their impact. By shouting, whispering, changing timbre and register, the human being can control and transform the perceived character of her/his identity to a degree, range and rapidity unavailable to any form of identity projection. It is our most versatile faculty for acceptable public negotiation. Of all our organs the larynx has the highest ratio of nerve to muscle fibre, making it capable of an immeasurable range of expressive nuances which can circumvent semantics. Standing alone, the written words, âI am going homeâ cannot achieve a fraction of the expressive possibilities available to their spoken/sung form, which may range from irony to disgust, sadness, fear, regret, elation, disappointment, triumph. At the extremes of experience, words fail us, but sound does not. The scream, the howl, the sob and sigh, all are both unintelligible to verbal analysis, yet ultimate modes of expressiveness.
The springs of this expressiveness are not semantic, but somatic. What is it about sound that invests it with such unique power in social interaction? There is a further deeply ambiguous characteristic which provides a springboard to some of the answers. Sound floods the space it enters. It is an obvious enough point, but its ramifications for the power of sound qua sound are complex and far-reaching. Within the conventionally understood sensorium, only smell has the same property, but with none of the finely articulated and directed possibilities of sound. It is a property with ambiguous potential. On the one hand, sound becomes a unifying envelope, immersing all those present in more or less the same expressive flood.15 Hearing internalizes, and sound projects, a shared experience. It constructs and mobilizes collective identity. âThe sonic effect produces a common sense because it gathers together into unified and harmonious listening what other disciplinary knowledge divides.â16
On the other hand, relative to the specular, sound is inescapable because ubiquitous. The CRESSON group (Centre de recherchĂ© sur lâespace sonore et lâenvironment urbain) in Grenoble, identifies the âubiquity effectâ, a conscious searching for the source of a sound, and at least a disorienting momentary failure to find it. âThe uncertainty produced by a sound about its origin establishes a power relationship between an invisible emitter and the worried receptor. The ubiquity effect is an effect of power.â17 The ubiquity effect generally produces discomfort, ranging from mild anxiety, a feeling of faintness, to âthe most uncontrollable panicâ.18 â[N]ot to know where a sound comes from is almost to believe in the manifestation of a superior force or a transcendental power: God, the State, Nature, the Father.â19 Thus, while the semantic content of utterance may be pivotal in directing its energies on behalf of or in opposition to particular interests, in the first instance the inchoate power thus harnessed derives from the phenomenology and physiology of sonic effects.
There is evidence that auditory stimuli â and therefore music â bypass âconscious awarenessâ in eliciting emotions, suggesting that in some cases âaffect precedes inferenceâ.20 Furthermore, affect is involuntary, inescapable and difficult to revoke.21 Zajonc argues that the affective system does not process, for example, the lexical elements of a message, but something more primary. Even when the âcontent of recorded utterances is nearly completely obliterated by means of electronic masking, filtering or random splicing of the tape, subjects can still encode the emotions expressed in these utterances quite reliablyâ.22 That is, âmusicalityâ and intonation, rather than verbal content, have priority in the formation of emotional responses to sonic stimuli. Zajoncâs arguments have been experimentally reinforced in the work of neuroscientist Joseph LeDoux, who found that the first stage of arousal is an involuntary reaction to sounds.23 There are at least two pathways between aural stimulus and response, of different orders of complexity. The first is purely physiological; the second involves cultural processing, and is supplemented by the hippocampus, a processor that contextualizes the stimulus with memory.24 For present purposes, the work of Zajonc and LeDoux suggests a âfoundation/scaffoldingâ model of sonic (and therefore musical) affect. That is, that there is a primary reaction to a threatening stimulus. It is physiological and involuntary, and creates a matrix within which a secondary and cognitively mediated response draws on cultural memory to articulate the precise nature of that threat. The implication is that certain emotional responses to sound are so-to-speak hard-wired, constrained biologically, rather in the manner of a foundation to a building that wonât determine what is constructed on it, but which limits what may be.25 Sound is power, unharnessed. Specific physical characteristics of sound and its relationship to the audient can alter physiological states.
The point can be illustrated through the connection between sound localization, its source in relation to the listener, and pitch. The body employs several systems of sound localization. âInteraural intensity differenceâ relies on the reduced level of intensity experienced in the ear farther from a laterally directed sound source, and is less effective for lower pitched sounds, because the long wavelengths â as long as 40 feet for the lowest musical notes26 â are unaffected by the presence of the head. It is thus most effective in localizing high-pitched sounds with a wavelength shorter than the width of the head â above around 200 Hz. Apart from the organic damage that low register sound might produce (see below), the difficulty of identifying its source can induce profound disorientation and anxiety. A second system...