Europe PMC

This website requires cookies, and the limited processing of your personal data in order to function. By using the site you are agreeing to this as outlined in our privacy notice and cookie policy.

Abstract 


Animal and human studies have suggested that posterior temporal, parietal, and frontal regions are specifically involved in auditory spatial (location and motion) processing, forming a putative dorsal "where" pathway. We used scalp EEG and current density mapping to investigate the dynamics of this network in human subjects presented with a varying acoustic stream in a two-factor paradigm: spatial versus pitch variations, focused versus diverted attention. The main findings were: (i) a temporo-parieto-frontal network was activated during the whole duration of the stream in all conditions and modulated by attention; (ii) the left superior temporal cortex was the only region showing different activations for pitch and spatial variations. Therefore, parietal and frontal regions would be involved in task-related processes (attention and motor preparation), whereas the differential processing of acoustic spatial and object-related features seems to take place at the temporal level.

Free full text 


Logo of halLink to Publisher's site
J Cogn Neurosci. 2005 Nov; 17(11): 1691–1703.
PMCID: PMC4764672
PMID: 16269106

Dynamics of a temporo-fronto-parietal network during sustained spatial or spectral auditory processing

Abstract

Animal and human studies have suggested that posterior temporal, parietal, and frontal regions are specifically involved in auditory spatial (location and motion) processing, forming a putative dorsal “where” pathway. We used scalp EEG and current density mapping to investigate the dynamics of this network in human subjects presented with a varying acoustic stream in a two-factor paradigm: spatial versus pitch variations, focused versus diverted attention. The main findings were: (i) a temporo-parieto-frontal network was activated during the whole duration of the stream in all conditions and modulated by attention; (ii) the left superior temporal cortex was the only region showing different activations for pitch and spatial variations. Therefore, parietal and frontal regions would be involved in task-related processes (attention and motor preparation), whereas the differential processing of acoustic spatial and object-related features seems to take place at the temporal level.

Keywords: Acoustic Stimulation, Attention, Auditory Perception, Brain Mapping, Cerebral Cortex, Functional Laterality, Humans, Nerve Net, Nonlinear Dynamics, Pitch Discrimination

INTRODUCTION

Both neurophysiological works in animals and human brain imaging studies have suggested separate auditory processing pathways for object-related and spatial features, similarly to the visual system (Rauschecker, 1998). Within the lateral belt of the auditory cortex of anaesthetized macaque monkeys, neurons with narrow spatial tuning were situated in the caudal region, whereas a better selectivity to monkey vocalizations was found in the rostral region (Tian, Reser, Durham, Kustov, & Rauschecker, 2001; Rauschecker & Tian, 2000). Two recent functional MRI studies reported the same functional dissociation within the temporal auditory cortex in humans: anterior lateral auditory areas (lateral Heschl’s gyrus, anterior planum temporale, and planum polare) are specifically activated by pitch changes, whereas posterior medial auditory regions (posteromedial planum temporale) are activated by spatial location changes (Hart, Palmer, & Hall, 2004; Warren & Griffiths, 2003). Connectivity studies in macaque have identified (i) a ventral route connecting rostral auditory belt and parabelt areas, rostral superior temporal gyrus and ventral prefrontal cortex, and (ii) a dorsal route connecting caudal auditory belt and parabelt areas, and parietal and dorsolateral prefrontal cortices (Kaas & Hackett, 2000; Rauschecker & Tian, 2000; Hackett, Stepniewska, & Kaas, 1999; Kaas, Hackett, & Tramo, 1999; Romanski, Bates, & Goldman-Rakic, 1999; Romanski, Tian, et al., 1999). By analogy with the visual system, these ventral/ dorsal routes were suggested to support two functionally distinct “what” and “where” auditory pathways (Kaas & Hackett, 1999).

In humans, many lesion (Clarke et al., 2002; Griffiths, Rees, Witton, et al., 1996), functional neuroimaging (Hart et al., 2004; Rama et al., 2004; Pavani, Macaluso, Warren, Driver, & Griffiths, 2002; Warren, Zielinski, Green, Rauschecker, & Griffiths, 2002; Zatorre, Bouffard, Ahad, & Belin, 2002; Alain, Arnott, Hevenor, Graham, & Grady, 2001; Maeder et al., 2001; Griffiths, Green, Rees, & Rees, 2000; Lewis, Beauchamp, & DeYoe, 2000; Baumgart, Gaschler-Markefski, Woldorff, Heinze, & Scheich, 1999; Bushara et al., 1999; Griffiths & Green, 1999; Weeks et al., 1999; Griffiths, Rees, Rees, et al., 1998), and electrophysiological (Kaiser & Bertrand, 2003; Kaiser, Ripper, Birbaumer, & Lutzenberger, 2003; Ducommun et al., 2002; Lutzenberger, Ripper, Busse, Birbaumer, & Kaiser, 2002; Xiang et al., 2002; Alain et al., 2001; Anourova et al., 2001) data are consistent with the hypothesis of “what” and “where” separate auditory streams: anterior temporal and inferior frontal regions would process auditory “object” information and posterior temporal areas, parietal and frontal cortices are involved in the spatial processing of both stationary and moving sounds. This dual pathway was recently supported in a neuroimaging meta-analysis by Arnott et al. (2004).

However, the extent and the functional role of these two pathways remain unclear and controversial (Hall, 2003; Middlebrooks, 2002; Belin & Zatorre, 2000; Cohen & Wessinger, 1999). Indeed, the parietal areas do not seem to be specifically involved in auditory spatial processing and could also be activated during pitch-oriented tasks (Maeder et al., 2001; Rama et al., 2000; Weeks et al., 1999; Zatorre, Mondor, & Evans, 1999; Zatorre, Evans, & Meyer, 1994).

In a recent neuroimaging study (Zatorre, Bouffard, et al., 2002), a clear parietal activation could be found only when transforming spatial auditory information into an oriented joystick movement. Moreover, the activation of the parieto-frontal network, during audio-spatial processing, could be due to goal-directed selective attention in active paradigms, as well as stimulus-driven attentional capture (Behrmann, Geng, & Shomstein, 2004). In passive listening paradigms, an involuntary attentional switch towards more attractive stimuli, such as moving sounds for instance, could account for parietal activities (Pavani et al., 2002; Warren et al., 2002; Xiang et al., 2002; Griffiths, Green, et al., 2000; Griffiths & Green, 1999; Griffiths, Rees, Rees, et al., 1998). Thus, the dorsal auditory pathway, usually considered as specifically processing spatial auditory information, could rather be involved in nonstimulus-specific cognitive processes.

The present study aims at clarifying the role of the different components of the temporo-parieto-frontal network during the processing of pitch-related or spatial acoustic features. We used ERP mapping in human subjects processing varying acoustic streams in situations of controlled attention. By means of a two-factor paradigm (spatial vs. pitch variations, focused vs. diverted attention), we could characterize brain regions specifically involved in the processing of pitch and/or spatial acoustic features and investigate how their activity could be modulated by attention. Furthermore, by analyzing the dynamics of the temporo-parieto-frontal network, it became possible to dissociate components related to the sensory integration of specific acoustic features from those components corresponding to higher-level cognitive processes, such as attention for instance.

RESULTS

Two participants had to be excluded because of excessive eye blinks or muscular activity. Thus, 14 subjects were included in the analysis considering four conditions: attention oriented (A) to the streams with spatial (AS condition) or pitch (AP) variations, and attention diverted (control, C) from acoustic feature variations (CS and CP, respectively).

Behavioral Data

Correct response rates did not differ between attention and control tasks (mean value, 93.1% and 92.2%, respectively, p > .05), or within the attention task, between spatial and pitch variation processing (93.2% and 92.9%, respectively, p > .05). All subjects reported that they were mentally following spatial or pitch variations of the auditory streams in the AS and AP conditions while they were actively ignoring the auditory stream trajectories in the control task. Moreover, the EOG signal was drifting during stimulus presentation and no attention or acoustic variation effect was noticed in the EOG time courses (p > .05) until 700 msec after variation onset.

Topographical Analysis

The main component elicited by the auditory stimulus onset, N1, showed a classical ERP topography with a fronto-central maximum and a polarity reversal across the sylvian fissure, in both A and C conditions (Figure 1). Computing scalp current densities (SCDs) by surface laplacian has proved to be efficient in separating temporal, frontal (Giard et al., 1994), and parietal (Kaiser & Bertrand, 2003) components of auditory responses. Applied to the present data, SCD maps showed, in both tasks, bilateral current sink/source patterns over temporal regions and a current sink over fronto-central areas (Figure 1). These components were maintained throughout the stimulus duration.

An external file that holds a picture, illustration, etc.
Object name is halms172386f1.jpg

Stream-onset N1 topographies. Left hemisphere ERP and SCD maps (mean value between 80 and 130 msec, 14 subjects) in attention and control conditions. SCD allows to better disclose temporal and frontal components. Black ovals indicate electrode groups chosen for the analysis: fronto-central current sink (F), left superior temporal current sink (LST), and left inferior temporal current source (LIT). Very similar topographies were obtained over the right hemisphere and symmetrical electrode groups were chosen for the right temporal components.

After this N1 component, we found a parieto-occipital component largely prominent in the attention condition (Figure 2). SCD analysis permitted to dissociate two subcomponents: a current source over superior parietal cortices and bilateral sources over temporo-parieto-occipital (TPO) junctions, clearly visible on the attention-minus-control maps (Figure 2).

An external file that holds a picture, illustration, etc.
Object name is halms172386f2.jpg

Topographies during the varying period. ERP and SCD maps (left, back, and right views, mean value between 385 and 595 msec after variation onset, 14 subjects) in attention and control conditions, and difference maps between conditions. In ERP maps, a positive component is present in the attention condition and is topographically distributed over parietal and occipital areas. SCD maps allow to disclose distinct and focal current sources over the superior parietal cortex for both tasks and over TPO junctions in the attention condition only, clearly visible on the attention-minus-control maps. Ovals indicate the electrode groups chosen for the analysis: superior parietal current source (P), right and left TPO current sources (RTPO and LTPO, respectively), in addition to the frontal and temporal components already defined in the N1 topography (Figure 1).

In order to estimate the time courses of those components, SCD maps were averaged across subjects and subsets of electrodes were selected to cover eight regions. Fronto-central (F: Fz/Cz), right/left superior temporal (RST: C4 and LST: C3), and right/left inferior temporal (RIT: T8/P8 and LIT: T7/P7) electrode groups were defined from the N1 topography (Figure 1), whereas superior parietal (P: CP1/Pz/CP2) and right/left temporo-parieto-occipital (RTPO: PO4 and LTPO: PO3) electrode groups were from the topographies of the varying period (Figure 2). SCD curves were averaged within each group for subsequent statistical analysis (Figures 3 and and44).

An external file that holds a picture, illustration, etc.
Object name is halms172386f3.jpg

Temporal and frontal SCD time courses. Mean SCD curves (14 subjects) are drawn over the three analysis periods (defined in Figure 6C) for the frontal (F), left superior and inferior temporal (LST and LIT, respectively), and right superior and inferior temporal (RST and RIT, respectively) electrode groups. The time structure of the stimulus is shown at the top of the figure (gray ovals represent additional noise-bursts used in the control task). During the stationary part, only mean attention (A) and control (C) curves are drawn, whereas in the varying part all conditions are depicted with different line symbols. The evoked N1/P2 complex to auditory stream onset is clearly visible at all electrode groups. N1/P2 to acoustic variations onset can also be noticed, although less pronounced. Note the polarity reversal throughout the stimulus duration between superior and inferior temporal components bilaterally. Significant differences between the attention and the control conditions are indicated by gray bars (ANOVA: *p < .05, **p < .01, ***p < .001). During the varying part, the left temporal components show a significantly more pronounced sustained response for pitch (AP and CP) than for spatial variations (AS and CS) with stars symbolizing the level of significance (ANOVA: *p < .05, ***p < .001). The black horizontal bars at the LST site indicate interaction effects between attention and acoustic variation (p < .05).

An external file that holds a picture, illustration, etc.
Object name is halms172386f4.jpg

Parietal SCD time courses. Mean SCD curves (14 subjects) are presented for the three analysis periods (defined in Figure 6C) for the LTPO and RTPO, respectively, and the superior parietal (P) electrode groups. The time structure of the stimulus is shown at the top of the figure (gray ovals represent additional noise-bursts used in the control task). During the stationary part, only mean attention (A) and control (C) curves are drawn, whereas in the varying part all conditions are depicted with different line symbols. Note the large TPO activities developing in the attention condition only and a sustained parietal activity enhanced by attention. Significant differences between the attention and the control conditions are indicated by gray bars (ANOVA: *p < .05, **p < .01, ***p < .001).

Fronto-Central Component

The frontal component (F in Figure 3) started with a transient N1/P2 complex followed by a sustained negative deflection throughout the stimulus duration (emerging at 210 msec for all conditions, p < .001), on which a small N1/P2 complex was superimposed at variation onset. During the stationary period, both stream-onset N1 mean amplitude (80–130 msec) and the sustained negative deflection (210–420 msec, p < .05, and 525–735 msec, p < .05) were more negative for the attention condition (p < .001). No effect was detected for the P2 component. During the varying part, no attention, acoustic variation, or interaction effect was found significant for the sustained component.

Temporal Components

The temporal N1/P2 complexes were followed by a sustained bilateral deflection with polarity reversal, maintained during the entire stream duration (emerging at 210 msec for all conditions, p < .01 at RST, p < .05 at LST, and p < .01 at LIT).

During the stationary period, N1 mean amplitude (80–130 msec) to auditory stream onset was significantly more negative for the attention than the control condition at LST (p < .05), RST (p < .05), and LIT (p < .001) sites. The sustained waves were partially enhanced in the A condition during the stationary period as indicated by gray bars in Figure 3. No effect was detected for the P2 component.

At stream variation onset, a small N1/P2 complex was superimposed on the sustained waves. During the varying period, pitch variations induced a larger sustained activity than spatial variations over the left auditory cortex, irrespective of attention (LST: all 210-msec windows from 385 to 910 msec, p < .01 and LIT: all 210-msec windows from 595 to 910 msec, p < .05). Additionally, an interaction effect (Attention × Acoustic variations) was found at the LST site (all 210-msec windows from 490 to 910 msec, p < .05). Wilcoxon tests showed that the negativity was significantly larger for the AP condition than for AS and CP (p < .01), and for CP than CS (p < .05), whereas no significant difference was found between the AS and CS conditions. The same effects were found during the final period as indicated in Figure 3.

The apparent “noise” on these temporal waveforms actually corresponded to the periodic steady-state response (35 msec) elicited by the amplitude modulation of the auditory stream.

Temporo-Parieto-Occipital Components

Bilateral positive sustained components (LTPO and RTPO in Figure 4) were significantly emerging, over the TPO junctions, in the A condition only, from 250 msec to the end of the stationary period (p < .05) and from 350 msec after the variation onset until the end of the stream (p < .05). In the A condition, irrespective of the type of acoustic variations, these components were significantly more positive during the stationary (all 210-msec windows from 210 to 735 msec, p < .05), varying (all 210-msec windows from 280 to 910 msec, p < .01), and final (all 210-msec windows from −840 to −420 msec, p < .01) periods. No difference between pitch and spatial variations, no interaction effect, and no interhemispheric difference were found significant for these components.

Superior Parietal Component

A positive sustained component (P in Figure 4) was significantly emerging, over superior parietal cortices, in all conditions, from 100 msec to the end of the stream (p < .05). This component was significantly more positive in the A condition during the stationary (all 210-msec windows from 210 to 735 msec, p < .01), varying (all 210-msec windows from 280 to 910 msec, p < .001), and final (all 210-msec windows from −840 to −420 msec, p < .001) periods. No difference between pitch and spatial variations and no interaction effect was found significant for this component. There was no significant interhemispheric difference (Wilcoxon test, p > .05) when comparing SCD activities at the left and right parietal electrodes (Cp1 and Cp2).

DISCUSSION

ERP correlates of spatial or pitch variation processing were investigated during a highly demanding auditory task requiring the continuous processing of stream trajectory, contrasted to a similarly difficult control condition with diverted attention. Using similarly complex pitch and spatial trajectories enabled to characterize the brain regions involved in such feature integration. SCD mapping permitted to both disentangle several active brain structures and estimate their dynamics. It then became possible to dissociate components related (i) to basic auditory integration (sustained during the whole stream duration), (ii) to sensory integration of specific acoustic features (during the varying period only), (iii) to cognitive processes required by the tasks and common to both features (sustained during the whole stream duration). Furthermore, by comparing two active tasks with identical stimuli and similar demands on arousal and motor response preparation, we could investigate how attention modulated these activities. The most important findings were: (i) a fronto-parietal network activated during the whole duration of the stream in all conditions and modulated by attention, irrespective of acoustic features; (ii) a sustained activity over the TPO junctions during both the stationary and the varying parts in the attention condition only; and (iii) a left temporal component differentially activated by spatial and pitch acoustic variations.

Attention Network

As parietal and frontal activities were maintained during the whole duration of the stream and were only modulated by the orientation of attention and not by the acoustic feature to be integrated, they are likely to be related to high-level task-related processes.

Fronto-Central Component

The sustained fronto-central negative component was enhanced in the attention task during the stationary part. This could reflect the subject’s expectation of the next critical event (i.e., the acoustic variation onset). During the varying part, this component did not differ between conditions and could be associated with common processes required by both tasks: the anticipatory attention to the imperative acoustic event (end of the stream or noise bursts) and the preparation of the motor response. It was similar, in topography and time course, to the frontal component reported in a previous working memory study during the expectation of the crucial stimuli (Kaiser & Bertrand, 2003), and could be considered as a contingent negative variation component (Brunia & van Boxtel, 2001).

Superior Parietal Component

The sustained superior parietal activity was emerging in both tasks and enhanced in the attention condition during the whole duration of the stream. This activity could be due to eye movement planning (Andersen, 1995), because we found slow EOG drifts during stimulus presentation with no difference between tasks during the varying part.

Parietal subregions have also been shown to be involved in “multisensory integration and coordinate transformation required to convert sensory input to motor output” (Andersen & Buneo, 2002). In the present study, subjects had to achieve a complex transformation converting the acoustic input into an oriented joystick movement (right–left or front–back). That could explain why both tasks activate the parietal cortex. This interpretation is consistent with recent neuroimaging results (Zatorre, Bouffard, et al., 2002) showing a clear parietal activation only when transforming spatial auditory information into a directed joystick movement.

Finally, in the present study, the superior parietal activity could simply reflect supramodal attentional processes (Andersen & Buneo, 2002). In the attention condition, the parietal activity enhancement could be due to selective attention, including the voluntary allocation of attention to spatial or pitch features of the auditory stream (Behrmann et al., 2004; Zatorre, Mondor, et al., 1999; Pugh et al., 1996). Interestingly, in a recent study, a superior parietal cortex activation has been found for attention to sound motion irrespective of the stimulus properties (spectro-temporal or spatial) (Hart et al., 2004). In addition, Warren and Griffiths (2003), in a passive paradigm of spatial versus pitch auditory perception, did not find any parietal activity and explained it by “the lack of an output task.” Thus, the parietal areas, which were proposed as contributing to auditory spatial processing by several authors (Hart et al., 2004; Zatorre, Bouffard, et al., 2002; Alain et al., 2001; Maeder et al., 2001; Lewis et al., 2000; Bushara et al., 1999; Weeks et al., 1999), may as well be involved in task-related processes such as auditory attention and/ or motor preparation. Besides, in passive listening paradigm (Pavani et al., 2002; Warren et al., 2002; Griffiths, Green, et al., 2000; Griffiths & Green, 1999; Griffiths, Rees, Rees, et al., 1998), these parietal activities could be due to stimulus-driven attentional capture. These interpretations permit to explain why parietal activities could also be found during pitch-related tasks (Maeder et al., 2001; Rama, Paavilainen, et al., 2000; Weeks et al., 1999; Zatorre, Mondor, et al., 1999; Zatorre, Evans, et al., 1994).

Thus, the parietal activity could be related to high-level task-related cognitive processes that would be common to both features. In the present study, the parietal and frontal components seem to be part of the network usually found to be involved in attentional processes and motor preparation (Jancke, Specht, Shah, & Hugdahl, 2003; Brunia & van Boxtel, 2001; Posner & Dehaene, 1994).

Temporo-Parieto-Occipital Components

By means of SCD mapping, it has also been possible to detect large and very focal bilateral TPO activities in the attention condition. To our knowledge, no similar activity has ever been found in neuroimaging studies on sound motion, which were generally based on passive listening or simple stimuli. In contrast, we used more complex varying stimuli in a demanding task. In the attention condition, to be able to follow both types of acoustic variations, most of the subjects (11 among 14) asserted that they had mentally generated a visual representation of the sound. It should be noted that TPO electrode sites are close to V5 visual areas (Figure 5) known to be activated during visual motion perception and mental imagery (Goebel, Khorram-Sefat, Muckli, Hacker, & Singer, 1998). The bilateral TPO activation could thus correspond to visual mental imagery induced by both types of acoustic variation processing. The existence of these same activities with small amplitude before the acoustic variation onset could be associated with a preparatory attentional process selectively involving the brain areas which will be later engaged in visual mental imagery.

An external file that holds a picture, illustration, etc.
Object name is halms172386f5.jpg

Surface electrode positions on a normalized brain with neuroimaging findings. The Talairach coordinates of activations induced by visual motion in area V5 (diamonds) were collected in several neuroimaging studies (Dumoulin et al., 2000; Lewis et al., 2000; Sunaert, Van Hecke, Marchal, & Orban, 1999, 2000; Chawla et al., 1999; Goebel et al., 1998). These activations were superimposed on a 3-D rendered magnetic resonance image template from the Montreal Neurological Institute (MNI), back view. Dashed ovals cover V5 activities. The EEG electrodes of the present experiment (10–20 and 10–10 electrode systems and additional intermediate electrodes) are positioned on a realistic scalp surface (Oostenveld & Praamstra, 2001), and head and brain contours are coregistered (rotation and scaling) to the MNI template. Electrodes corresponding to RTPO and LTPO activities are indicated in black.

Temporal Components

At last, sustained bilateral temporal components were characterized by SCD polarity reversals throughout the stimulus duration. These activities most likely originate from superior temporal auditory areas (Gutschalk, Patterson, Rupp, Uppenkamp, & Scherg, 2002) and reflect auditory integration. We found these bilateral temporal activities for spatial and pitch processing in both tasks. Previous studies have suggested bilateral (Hart et al., 2004; Warren & Griffiths, 2003; Pavani et al., 2002; Warren et al., 2002; Zatorre, Bouffard, et al., 2002; Lewis et al., 2000) or right-lateralized (Baumgart et al., 1999) activation of the planum temporale when contrasting moving or spatially distributed sounds to stationary sounds. For pitch processing, right-lateralized (Patterson, Uppenkamp, Johnsrude, & Griffiths, 2002; Zatorre et al., 1994) or bilateral (Hart et al., 2004; Warren & Griffiths, 2003; Thivard, Belin, Zilbovicius, Poline, & Samson, 2000) activation in superior temporal areas has been found when contrasting sequences with changing pitch to sequences with fixed pitch.

During the varying part, left temporal activities were particularly more important for pitch than for spatial variations. This result suggests a differential processing of spatial and pitch variations in the left temporal auditory cortex. This dissociation could correspond either to an increased neuronal activity for pitch processing or to different configurations of neural sources for spatial and pitch processing. To our knowledge, few studies have been interested in the difference between pitch and spatial processing within the temporal cortex. By contrasting sound sequences with changing pitch to sequences with changing location, Hart et al. (2004) and Warren and Griffiths (2003) have shown a dissociation between pitch and spatial processing in the right and left superior temporal planes. Nevertheless, our left-lateralized effect is consistent with Hart et al.’s additional activity for pitch processing in the left planum polare. According to these studies, a posteromedial (spatial)–anterolateral (pitch) dissociation would exist within the superior temporal cortex of both hemispheres. Close configurations of neural sources for spatial and pitch processing that could not be disentangled with the spatial resolution of SCD computation could explain why we could not find any difference in the right hemisphere.

As an interaction effect between attention and acoustic variations has been found during the same period for the left superior component, pitch and spatial feature integration within the auditory cortex could correspond to automatic processes that are differentially modulated by attention. In addition, there was a small attention effect on the left temporal activity during the stationary part. This effect prior to stream variation could be due to an attention-dependent preactivation of the auditory areas which will be subsequently involved in differential feature processing.

All these modulations of the temporal activities are consistent with a current hypothesis (Griffiths & Warren, 2002) suggesting a role of the planum temporale in differentially processing auditory spectro-temporal information.

Conclusion

Combining the use of long-lasting auditory streams and a precise spatio-temporal analysis of electrophysiological signals allows to clarify the role of the temporal, parietal, and frontal regions in auditory processing. The frontal component would be involved in unspecific task-related processes, such as crucial stimulus expectation and motor preparation. The parietal activity would reflect task-specific processes, such as selective attention and preparation for auditory–motor transformation, as suggested in recent reviews (Hall, 2003; Griffiths & Warren, 2002; Middlebrooks, 2002). The temporal cortices would be more specifically involved in auditory integration, acoustic spatial and pitch features being differentially processed within the left temporal auditory areas.

The present study provides further experimental evidence for a differential processing of “what” and “where” acoustic features within the temporal auditory areas.

METHODS

Subjects

Sixteen paid subjects (15 right-handers, 6 men, mean age 24 years) participated in the study and were selected during a selection session on the basis of their performance in both tasks. All were free of neurological diseases and had normal hearing. Written informed consent was obtained from each subject.

Task

Subjects had to perform two different tasks in separate blocks. In each block, the stimuli consisted of pitch-varying streams randomly alternating with spatially varying streams with equal probabilities. In the stream-oriented attention task (called attention task), subjects were asked to follow sound variations and report its final direction (high/low pitch or left/right) with a four-direction joystick (up/down or left/right, respectively). In the diverted attention task (called control task), subjects were asked to compare two successive noise-bursts superimposed to the end of the stream, thus orienting their attention away from the varying stream. These two bursts randomly differed in location or pitch, independently of the underlying stream variations. Subjects had to report with the same joystick whether the second burst was to the right or to the left, or lower- or higher-pitched than the first one. Performances were balanced between tasks and also between spatial and pitch feature processing with difficulty levels adapted to each subject (see Procedure section). Each new stimulus started between 3.3 and 4.1 sec after the subject’s response and no specific speed instructions were given.

Stimuli

Auditory streams consisted of 35-msec band-passed noise-bursts [5-semitone (st) wide, starting from a frequency F, and cosine tapered] and were presented through headphones (Sennheiser PX30, Wedemark, Germany). In both tasks, they were composed of two successive parts, a stationary part of random duration (735, 805, 875 msec with equal probabilities) followed by an acoustically varying part (duration ranging from 980 to 2205 msec).

Stream trajectories were piecewise-linear functions, identical for pitch and spatial variations (Figure 6A). Trajectories of pitch-varying streams were following piecewise-linear functions ΔF(t) defined in semitones (st). These functions were equal to zero during the stationary part and were increasing or decreasing between −ΔFm and +ΔFm during the varying part. Trajectories of spatially varying streams were following piecewise-linear functions ΔA(t) defined in degrees. These functions were equal to 0° (ahead) during the stationary part and were increasing or decreasing between −ΔAm and +ΔAm during the varying part. Depending on the difficulty level in the attention task, ΔAm was ranging between 30° and 60°, and ΔFm between 6 and 14 st. Nevertheless, the speed of variation remained about the same across difficulty levels (around 30 st/sec and 160°/sec). To minimize predictability, there were, for each difficulty level, at least six different trajectories varying by the number of segments of the functions ΔF(t) and ΔA(t) (randomized between 4 and 7). Moreover, the trajectories were interrupted at variable times, either when the variations were reaching an extremum or four bursts before. This led to about 12 different time courses, identical for spatial and pitch variations, consisting of 3 to 6 direction changes for each difficulty level.

An external file that holds a picture, illustration, etc.
Object name is halms172386f6.jpg

Stimulus structure. (A) Stream trajectories. Auditory streams were composed of two successive parts, a stationary and a varying part. Streams varied in location or pitch, following zigzag trajectories. Each trajectory of pitch-varying stream was following a piecewise-linear function ΔF(t) (gray line) defined in semitones. In the case of spatial variations, the functions ΔA(t) were varying between −ΔAm and +ΔAm. These functions were identical for pitch and spatial variations. They consisted of 4 to 7 segments and could be interrupted at variable times, either when the variations were reaching an extremum (black diamonds) or four bursts before (black circles). (B) Stream frequency content. Gray bars represent the 35-msec, band-passed and 5-st wide noise-bursts which composed of auditory streams. In the case of pitch-varying streams, the lower frequency F of each burst was defined in semitones by the function: F(t) = ΔF(t) + F0 ± 1 with ΔF(t) as described in B. The black dashed line represents the trajectory ΔF(t)+F0, and the black dashes represent the actual lower frequency F(t) of each burst. (C) Analysis periods. Three periods were chosen for signals analysis: stationary ([−150, 735 msec], time 0 being the onset of the stream), varying ([−150, 910 msec], time 0 being the onset of the varying part), and final ([−840, −420 msec], time 0 being the offset of the stream). They correspond to the periods having the shortest duration for each part and common to both tasks, that is, before noise-burst (gray ovals) occurrence in the control condition. Freq = frequency; loc = location; st = semitones.

To minimize habituation effects for both types of variation, the lower frequency F of each burst was randomly varying by ±1 st around F0, with F0 equiprobably chosen between 1318, 1480, and 1661 Hz for each stimulus. In the case of pitch-varying streams, the lower frequency of each burst was thus defined in semitones by the function: F(t) = ΔF(t) + F0 ± 1 with ΔF(t) as described above (see Figure 6B). In the case of spatially varying streams, F(t) is only varying around F0, F(t) = F0 ± 1, throughout stimulus duration. Pitch-varying streams were localized ahead at 0° azimuth.

To give the impression of a continuously moving sound source, each burst was convoluted with an average-listener, head-related transfer function (see http://sound.media.mit.edu/KEMAR.html) with a 5°-step in azimuth. Compared to other studies using only interaural time differences to position sounds in space with earphones, this procedure creates a better percept of sound objects located in the external space by modulating phase, amplitude, and frequency content of the sounds.

In the control condition, two additional noise-bursts (5-st wide, starting from a frequency F, and lasting 100 msec) appeared successively 420 and 140 msec before the end of the stream (Figure 6C). They differed either in location [azimuths: ±ΔAb; frequency: F] or in pitch [azimuth: 0°; frequency: F ± ΔFb]. Depending on the difficulty level, ΔAb was ranging from 5° to 35° and ΔFb from 1 to 8 st. Because the trajectories were randomized and interrupted at variable times, subjects could not predict the end of the stream, and thus, the occurrence of the noise-bursts.

More generally, to minimize habituation and predictability, many acoustic parameters were randomized: the central pitch of the stationary part (F0), the pitch of each burst (F), and the stream trajectory. Furthermore, by randomizing stream duration, the subject’s attention was maintained focused throughout the stimulus in both tasks, and sustained processing of the stream trajectories was ensured in the attention task. With such a randomization strategy, it seems unlikely that subjects could memorize and recognize all these trajectory patterns during a recording session.

Procedure

In the first session, subjects were selected on the basis of their ability to perceive spatial variations, to perform the different tasks, and to minimize blinks and eye movements assessed by electrooculogram. First, they were familiarized with sound sequences and tasks in a stepwise learning procedure using visual feedback. Then, they performed blocks of each task with different difficulty levels, until we could determine the difficulty level corresponding to an 80% correct response rate for each kind of stimuli in each task (spatially or pitch-varying streams in the attention task and location- or pitch-differing noise-bursts in the control task). Only 1 subject out of 17 had to be excluded because of excessive eye blinks.

In the second session, subjects’ EEG were recorded during 12 blocks of 30 trials in both attention and control tasks, yielding a total of 360 trials per task (180 with spatial and 180 with pitch variations). Attention (A) and control (C) blocks were presented in an AA–CC–AA–CC–. . . design. For each subject, the difficulty level was determined during the selection procedure as mentioned above. In some cases, because of subjects’ performance improvement or tiredness, the difficulty level was adapted after each block to maintain a constant performance (around 80% correct response rate) throughout the session. Subjects were instructed to keep their eyes open while fixating a point, and to refrain from blinking during stimulus presentation.

EEG Recordings

The subjects were seated in an armchair in a sound-attenuated and electrically shielded chamber. EEG was recorded from 36 scalp electrodes (Easy Cap, FMS, Herrsching, Germany) referenced to the nose (ground electrode at forehead and impedances below 5 kΩ) and EOG from two electrodes were placed centrally above the right eye and at its outer canthus. The electrode positions are shown schematically in Figure 7. Signals were amplified, filtered (0.05–200 Hz bandwidth), and sampled at 1000 Hz (Synamps, Neuroscan Labs, Sterling, VA).

An external file that holds a picture, illustration, etc.
Object name is halms172386f7.jpg

EEG recording sites. Electrodes were placed according to the following scheme: 19 locations of the 10–20 electrode system, 13 locations of the 10–10 electrode system, and 4 additional intermediate locations. One should note the high density of electrodes over parietal areas required for precise SCD estimates.

Data Analysis

Trials contaminated by eye blinks or excessive muscular activity, or corresponding to incorrect responses, were rejected from further analysis. ERPs were averaged and digitally filtered (low-pass 35 Hz) in three time periods, called “stationary,” “varying,” and “final” periods and locked to stationary stream onset, varying stream onset, and stream offset, respectively (Figure 6C). Due to the randomization in duration of the stationary and varying parts, analysis was only performed on the periods having the shortest duration for each part (latencies defined in Figure 6C). As the analysis mainly focused on the slow waves extending over the three periods, a common amplitude baseline was defined between −150 and 0 msec before stream onset. For each subject, the ERPs were computed separately in the four conditions corresponding to the 2 × 2 factorial design: Attention (stream-oriented or -diverted) × Acoustic variation (spatial or pitch). The mean number of correct and nonartifacted trials per condition ranged from 127 to 143 out of 180.

Topographical analysis of the evoked responses was performed not only by scalp potential mapping, but also by SCD estimates (surface laplacian) using spherical spline interpolation (Perrin, Pernier, Bertrand, & Echallier, 1989). By providing sharper spatial peaks and troughs, SCDs allow to spatio-temporally disentangle multiple overlapping sources. SCD time courses were reconstructed at each electrode site.

Statistical analyses were performed on SCD estimates, averaged across electrode subsets corresponding to the main topographical components. To test for the emergence of slow waves, Wilcoxon tests were performed on 105-msec moving windows compared to the prestimulus baseline. Furthermore, to estimate the dynamics of attention and acoustic variation effects, two-way ANOVAs were computed on mean amplitudes over a 50-msec window for the transient responses (stationary period: 80–130 msec for N1 and 165–215 msec for P2) and over 210-msec moving windows with a 105-msec step for the slow waves (from 210 to 735 msec in the stationary period, from −210 to 0 msec and from 280 to 910 msec in the varying period, and from −840 to −420 msec in the final period).

All signal analyses were performed with ELAN-Pack software developed at INSERM U280.

Acknowledgments

We thank Jochen Kaiser for helpful discussions, and Pierre-Emmanuel Aguera and Jean-François Echallier for technical support.

References

  • Alain C, Arnott SR, Hevenor S, Graham S, Grady CL. “What” and “where” in the human auditory system. Proceedings of the National Academy of Sciences, USA. 2001;98:12301–12306. [Europe PMC free article] [Abstract] [Google Scholar]
  • Andersen RA. Encoding of intention and spatial location in the posterior parietal cortex. Cerebral Cortex. 1995;5:457–469. [Abstract] [Google Scholar]
  • Andersen RA, Buneo CA. Intentional maps in posterior parietal cortex. Annual Review of Neuroscience. 2002;25:189–220. [Abstract] [Google Scholar]
  • Anourova I, Nikouline VV, Ilmoniemi RJ, Hotta J, Aronen HJ, Carlson S. Evidence for dissociation of spatial and nonspatial auditory information processing. Neuroimage. 2001;14:1268–1277. [Abstract] [Google Scholar]
  • Arnott SR, Binns MA, Grady CL, Alain C. Assessing the auditory dual-pathway model in humans. Neuroimage. 2004;22:401–408. [Abstract] [Google Scholar]
  • Baumgart F, Gaschler-Markefski B, Woldorff MG, Heinze HJ, Scheich H. A movement-sensitive area in auditory cortex. Nature. 1999;400:724–726. [Abstract] [Google Scholar]
  • Behrmann M, Geng JJ, Shomstein S. Parietal cortex and attention. Current Opinion in Neurobiology. 2004;14:212–217. [Abstract] [Google Scholar]
  • Belin P, Zatorre RJ. “What,” “where” and “how” in auditory cortex. Nature Neuroscience. 2000;3:965–966. [Abstract] [Google Scholar]
  • Brunia CH, van Boxtel GJ. Wait and see. International Journal of Psychophysiology. 2001;43:59–75. [Abstract] [Google Scholar]
  • Bushara KO, Weeks RA, Ishii K, Catalan MJ, Tian B, Rauschecker JP, Hallett M. Modality-specific frontal and parietal areas for auditory and visual spatial localization in humans. Nature Neuroscience. 1999;2:759–766. [Abstract] [Google Scholar]
  • Chawla D, Buechel C, Edwards R, Howseman A, Josephs O, Ashburner J, Friston KJ. Speed-dependent responses in V5: A replication study. Neuroimage. 1999;9:508–515. [Abstract] [Google Scholar]
  • Clarke S, Bellmann Thiran A, Maeder P, Adriani M, Vernet O, Regli L, Cuisenaire O, Thiran JP. What and Where in human audition: Selective deficits following focal hemispheric lesions. Experimental Brain Research. 2002;147:8–15. [Abstract] [Google Scholar]
  • Cohen YE, Wessinger CM. Who goes there? Neuron. 1999;24:769–771. [Abstract] [Google Scholar]
  • Ducommun CY, Murray MM, Thut G, Bellmann A, Viaud-Delmon I, Clarke S, Michel CM. Segregated processing of auditory motion and auditory location: An ERP mapping study. Neuroimage. 2002;16:76–88. [Abstract] [Google Scholar]
  • Dumoulin SO, Bittar RG, Kabani NJ, Baker CL, Jr, Le Goualher G, Bruce Pike G, Evans AC. A new anatomical landmark for reliable identification of human area V5/MT: A quantitative analysis of sulcal patterning. Cerebral Cortex. 2000;10:454–463. [Abstract] [Google Scholar]
  • Giard MH, Perrin F, Echallier JF, Thevenet M, Froment JC, Pernier J. Dissociation of temporal and frontal components in the human auditory N1 wave: A scalp current density and dipole model analysis. Electroencephalography and Clinical Neurophysiology. 1994;92:238–252. [Abstract] [Google Scholar]
  • Goebel R, Khorram-Sefat D, Muckli L, Hacker H, Singer W. The constructive nature of vision: Direct evidence from functional magnetic resonance imaging studies of apparent motion and motion imagery. European Journal of Neuroscience. 1998;10:1563–1573. [Abstract] [Google Scholar]
  • Griffiths TD, Green GG. Cortical activation during perception of a rotating wide-field acoustic stimulus. Neuroimage. 1999;10:84–90. [Abstract] [Google Scholar]
  • Griffiths TD, Green GG, Rees A, Rees G. Human brain areas involved in the analysis of auditory movement. Human Brain Mapping. 2000;9:72–80. [Europe PMC free article] [Abstract] [Google Scholar]
  • Griffiths TD, Rees A, Witton C, Shakir RA, Henning GB, Green GG. Evidence for a sound movement area in the human cerebral cortex. Nature. 1996;383:425–427. [Abstract] [Google Scholar]
  • Griffiths TD, Rees G, Rees A, Green GG, Witton C, Rowe D, Buchel C, Turner R, Frackowiak RS. Right parietal cortex is involved in the perception of sound movement in humans. Nature Neuroscience. 1998;1:74–79. [Abstract] [Google Scholar]
  • Griffiths TD, Warren JD. The planum temporale as a computational hub. Trends in Neurosciences. 2002;25:348–353. [Abstract] [Google Scholar]
  • Gutschalk A, Patterson RD, Rupp A, Uppenkamp S, Scherg M. Sustained magnetic fields reveal separate sites for sound level and temporal regularity in human auditory cortex. Neuroimage. 2002;15:207–216. [Abstract] [Google Scholar]
  • Hackett TA, Stepniewska I, Kaas JH. Prefrontal connections of the parabelt auditory cortex in macaque monkeys. Brain Research. 1999;817:45–58. [Abstract] [Google Scholar]
  • Hall DA. Auditory pathways: Are “What” and “Where” appropriate? Current Biology. 2003;13:R406–R408. [Abstract] [Google Scholar]
  • Hart HC, Palmer AR, Hall DA. Different areas of human non-primary auditory cortex are activated by sounds with spatial and nonspatial properties. Human Brain Mapping. 2004;21:178–190. [Europe PMC free article] [Abstract] [Google Scholar]
  • Jancke L, Specht K, Shah JN, Hugdahl K. Focused attention in a simple dichotic listening task: An fMRI experiment. Brain Research, Cognitive Brain Research. 2003;16:257–266. [Abstract] [Google Scholar]
  • Kaas JH, Hackett TA. “What” and “where” processing in auditory cortex. Nature Neuroscience. 1999;2:1045–1047. [Abstract] [Google Scholar]
  • Kaas JH, Hackett TA. Subdivisions of auditory cortex and processing streams in primates. Proceedings of the National Academy of Sciences, USA. 2000;97:11793–11799. [Europe PMC free article] [Abstract] [Google Scholar]
  • Kaas JH, Hackett TA, Tramo MJ. Auditory processing in primate cerebral cortex. Current Opinion in Neurobiology. 1999;9:164–170. [Abstract] [Google Scholar]
  • Kaiser J, Bertrand O. Dynamics of working memory for moving sounds: An event-related potential and scalp current density study. Neuroimage. 2003;19:1427–1438. [Abstract] [Google Scholar]
  • Kaiser J, Ripper B, Birbaumer N, Lutzenberger W. Dynamics of gamma-band activity in human magnetoencephalogram during auditory pattern working memory. Neuroimage. 2003;20:816–827. [Abstract] [Google Scholar]
  • Lewis JW, Beauchamp MS, DeYoe EA. A comparison of visual and auditory motion processing in human cerebral cortex. Cerebral Cortex. 2000;10:873–888. [Abstract] [Google Scholar]
  • Lutzenberger W, Ripper B, Busse L, Birbaumer N, Kaiser J. Dynamics of gamma-band activity during an audiospatial working memory task in humans. Journal of Neuroscience. 2002;22:5630–5638. [Europe PMC free article] [Abstract] [Google Scholar]
  • Maeder PP, Meuli RA, Adriani M, Bellmann A, Fornari E, Thiran JP, Pittet A, Clarke S. Distinct pathways involved in sound recognition and localization: A human fMRI study. Neuroimage. 2001;14:802–816. [Abstract] [Google Scholar]
  • Middlebrooks JC. Auditory space processing: Here, there or everywhere? Nature Neuroscience. 2002;5:824–826. [Abstract] [Google Scholar]
  • Oostenveld R, Praamstra P. The five percent electrode system for high-resolution EEG and ERP measurements. Clinical Neurophysiology. 2001;112:713–719. [Abstract] [Google Scholar]
  • Patterson RD, Uppenkamp S, Johnsrude IS, Griffiths TD. The processing of temporal pitch and melody information in auditory cortex. Neuron. 2002;36:767–776. [Abstract] [Google Scholar]
  • Pavani F, Macaluso E, Warren J, Driver J, Griffiths T. A common cortical substrate activated by horizontal and vertical sound movement in the human brain. Current Biology. 2002;12:1584. [Abstract] [Google Scholar]
  • Perrin F, Pernier J, Bertrand O, Echallier JF. Spherical splines for scalp potential and current density mapping. Electroencephalography and Clinical Neurophysiology. 1989;72:184–187. [Abstract] [Google Scholar]
  • Posner MI, Dehaene S. Attentional networks. Trends in Neurosciences. 1994;17:75–79. [Abstract] [Google Scholar]
  • Pugh KR, Offywitz BA, Shaywitz SE, Fulbright RK, Byrd D, Skudlarski P, Shankweiler DP, Katz L, Constable RT, Fletcher J, Lacadie C, Marchione K, Gore JC. Auditory selective attention: An fMRI investigation. Neuroimage. 1996;4:159–173. [Abstract] [Google Scholar]
  • Rama P, Paavilainen L, Anourova I, Alho K, Reinikainen K, Sipila S, Carlson S. Modulation of slow brain potentials by working memory load in spatial and nonspatial auditory tasks. Neuropsychologia. 2000;38:913–922. [Abstract] [Google Scholar]
  • Rama P, Poremba A, Sala JB, Yee L, Malloy M, Mishkin M, Courtney SM. Dissociable functional cortical topographies for working memory maintenance of voice identity and location. Cerebral Cortex. 2004;14:768–780. [Abstract] [Google Scholar]
  • Rauschecker JP. Cortical processing of complex sounds. Current Opinion in Neurobiology. 1998;8:516–521. [Abstract] [Google Scholar]
  • Rauschecker JP, Tian B. Mechanisms and streams for processing of “what” and “where” in auditory cortex. Proceedings of the National Academy of Sciences, USA. 2000;97:11800–11806. [Europe PMC free article] [Abstract] [Google Scholar]
  • Romanski LM, Bates JF, Goldman-Rakic PS. Auditory belt and parabelt projections to the prefrontal cortex in the rhesus monkey. Journal of Comparative Neurology. 1999;403:141–157. [Abstract] [Google Scholar]
  • Romanski LM, Tian B, Fritz J, Mishkin M, Goldman-Rakic PS, Rauschecker JP. Dual streams of auditory afferents target multiple domains in the primate prefrontal cortex. Nature Neuroscience. 1999;2:1131–1136. [Europe PMC free article] [Abstract] [Google Scholar]
  • Sunaert S, Van Hecke P, Marchal G, Orban GA. Motion-responsive regions of the human brain. Experimental Brain Research. 1999;127:355–370. [Abstract] [Google Scholar]
  • Sunaert S, Van Hecke P, Marchal G, Orban GA. Attention to speed of motion, speed discrimination, and task difficulty: An fMRI study. Neuroimage. 2000;11:612–623. [Abstract] [Google Scholar]
  • Thivard L, Belin P, Zilbovicius M, Poline JB, Samson Y. A cortical region sensitive to auditory spectral motion. NeuroReport. 2000;11:2969–2972. [Abstract] [Google Scholar]
  • Tian B, Reser D, Durham A, Kustov A, Rauschecker JP. Functional specialization in rhesus monkey auditory cortex. Science. 2001;292:290–293. [Abstract] [Google Scholar]
  • Warren JD, Griffiths TD. Distinct mechanisms for processing spatial sequences and pitch sequences in the human auditory brain. Journal of Neuroscience. 2003;23:5799–5804. [Europe PMC free article] [Abstract] [Google Scholar]
  • Warren JD, Zielinski BA, Green GG, Rauschecker JP, Griffiths TD. Perception of sound-source motion by the human brain. Neuron. 2002;34:139–148. [Abstract] [Google Scholar]
  • Weeks RA, Aziz-Sultan A, Bushara KO, Tian B, Wessinger CM, Dang N, Rauschecker JP, Hallett M. A PET study of human auditory spatial processing. Neuroscience Letters. 1999;262:155–158. [Abstract] [Google Scholar]
  • Xiang J, Chuang S, Wilson D, Otsubo H, Pang E, Holowka S, Sharma R, Ochi A, Chitoku S. Sound motion evoked magnetic fields. Clinical Neurophysiology. 2002;113:1–9. [Abstract] [Google Scholar]
  • Zatorre RJ, Bouffard M, Ahad P, Belin P. Where is “where” in the human auditory cortex? Nature Neuroscience. 2002;5:905–909. [Abstract] [Google Scholar]
  • Zatorre RJ, Evans AC, Meyer E. Neural mechanisms underlying melodic perception and memory for pitch. Journal of Neuroscience. 1994;14:1908–1919. [Europe PMC free article] [Abstract] [Google Scholar]
  • Zatorre RJ, Mondor TA, Evans AC. Auditory attention to space and frequency activates similar cerebral systems. Neuroimage. 1999;10:544–554. [Abstract] [Google Scholar]

Citations & impact 


Impact metrics

Jump to Citations

Citations of article over time

Alternative metrics

Altmetric item for https://www.altmetric.com/details/155401962
Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/155401962

Smart citations by scite.ai
Smart citations by scite.ai include citation statements extracted from the full text of the citing article. The number of the statements may be higher than the number of citations provided by EuropePMC if one paper cites another multiple times or lower if scite has not yet processed some of the citing articles.
Explore citation contexts and check if this article has been supported or disputed.
https://scite.ai/reports/10.1162/089892905774589244

Supporting
Mentioning
Contrasting
5
13
0

Article citations


Go to all (21) article citations