Professional Documents
Culture Documents
Edited by Dale Purves, Duke University, Durham, NC, and approved November 6, 2018 (received for review July 9, 2018)
Musical training is associated with a myriad of neuroplastic changes event-related potentials (ERPs) and fMRI show differential
in the brain, including more robust and efficient neural processing of speech activity in musicians at cortical levels of the nervous system
clean and degraded speech signals at brainstem and cortical levels. (13–16). Collectively, an overwhelming number of studies have
These assumptions stem largely from cross-sectional studies be- implied that musical training shapes auditory brain function at
tween musicians and nonmusicians which cannot address whether multiple stages of subcortical and cortical processing and, in turn,
training itself is sufficient to induce physiological changes or whether bolsters the perceptual organization of speech.
preexisting superiority in auditory function before training predis- Problematically, innate differences in auditory system function
poses individuals to pursue musical interests and appear to have could easily masquerade as plasticity in cross-sectional studies on
similar neuroplastic benefits as musicians. Here, we recorded neuro- auditory learning (17) and music-related plasticity (18–20). This
electric brain activity to clear and noise-degraded speech sounds in concern is reinforced by the fact that musical skills such as pitch
individuals without formal music training but who differed in their and timing perception develop very early in infancy (i.e., 6 mo of
receptive musical perceptual abilities as assessed objectively via the age; ref. 21) and may even be linked to certain genetic markers
Profile of Music Perception Skills. We found that listeners with (22–25). Unfortunately, the majority of studies linking musi-
naturally more adept listening skills (“musical sleepers”) had en- cianship to speech–language plasticity are cross-sectional and
hanced frequency-following responses to speech that were also include only self-reports of musicians’ experience (e.g., refs. 10,
more resilient to the detrimental effects of noise, consistent with 14, and 18). It remains possible that certain individuals have
the increased fidelity of speech encoding and speech-in-noise naturally enriched auditory systems that predispose them to
benefits observed previously in highly trained musicians. Further
pursue musical interests (e.g., ref. 26). Consequently, widely
comparisons between these musical sleepers and actual trained
reported musician advantages in auditory perceptual tasks may
musicians suggested that experience provides an additional
be due to intrinsic differences unrelated to formal training.
Current conceptions of musicality define it as an innate, natural,
boost to the neural encoding and perception of speech. Collec-
and spontaneous development of widely shared traits, constrained
tively, our findings suggest that the auditory neuroplasticity of
by our cognitive abilities and biology, that underlie the capacity for
music engagement likely involves a layering of both preexisting
music (27). While being “musical” no doubt encompasses more
(nature) and experience-driven (nurture) factors in complex sound
than simple hearing or perceptual abilities (e.g., instrumental
processing. In the absence of formal training, individuals with intrin-
production creativity), for this investigation we focus on the re-
sically proficient auditory systems can exhibit musician-like auditory ceptive aspects of musicality (i.e., auditory perceptual skills), fol-
function that can be further shaped in an experience-dependent lowing the long tradition of assessing formal music abilities
manner. through strictly perceptual measures (28–32). Among the normal
| |
EEG experience-dependent plasticity auditory event-related brain potentials | Significance
|
frequency-following responses nature vs. nurture
Mankel and Bidelman PNAS | December 18, 2018 | vol. 115 | no. 51 | 13131
experience-dependent “boost” on top of preexisting differences in our scalp responses are generated, we can still conclude that
auditory brain function. among people without formal musical training, certain individ-
uals’ brains produce phase-locked neural responses that better
Discussion capture the acoustic information in speech.
By recording neuroelectric brain responses in highly skilled lis- Despite parallels in ERPs between high-scoring PROMS lis-
teners who lack formal musical training, we provide strong evi- teners and trained musicians, it is possible that cortical and/or
dence that inherent auditory system function, in the absence of behavioral differences in speech processing emerge only (i) after
experience, is associated with enhanced neural encoding of speech strong experiential plasticity rather than subtle innate function or
and auditory perceptual advantages. Our study explicitly shows that (ii) online during tasks requiring top-down processing and/or at-
certain individuals have musician-like auditory neural function, as tention. Indeed, we found behavioral QuickSIN enhancements in
has been conventionally indexed via speech FFRs. More broadly, musicians but not in musical sleepers (SI Appendix, Fig. S2). This
these findings challenge assumptions that the neuroplasticity asso- suggests that stronger, more protracted experiences might be
ciated with musical training and speech processing is solely needed to observe plasticity at later stages of the auditory system
experience-driven (cf. refs. 2, 3, and 18). Importantly, we do not and to transfer the effects to speech perception. Structural and
claim that our study negates the possibility that actual musical functional neuroimaging studies have revealed striking differences
training can confer experience-dependent auditory plasticity (e.g., in musicians at the cortical level which are predictive of musical
refs. 16 and 43–46). Rather, our data argue that preexisting factors and language abilities (e.g., refs. 2, 43, and 51). Thus, musicianship
may play a larger role in putative links between musical experience might tune language-related networks of the brain more broadly,
and enhanced speech processing than conventionally thought. beyond those core auditory sensory responses indexed by our
Neurological differences among intrinsically skilled listeners were FFRs and ERPs. Full-brain imaging of musical sleepers would be
particularly evident in speech FFRs, which showed individuals with a logical next step to further explore the relationship between
higher musicality scores had faster and more robust neural re- innate listening abilities and cortical function. Presumably, musi-
sponses to the voice pitch (F0) and timbre (harmonics) cues of cal sleepers might show more pervasive cortical differences out-
speech, even amid interfering noise. In fact, the advantages we find side the auditory system as evaluated here. In any case, our data
here in nonmusician musical sleepers are remarkably similar to reinforce the fact that neuroplasticity is not an all-or-nothing
those reported in trained musicians, who similarly show enhanced phenomenon and that it manifests in different gradations at
neural encoding and behavioral recognition of clean and noise- lower (brainstem) vs. higher (cortical/behavioral) levels of the
degraded speech (10, 12, 19, 41). Additionally, we found greater auditory system (14, 40) in accordance with a listener’s experience.
neural noise was associated with poorer auditory skills (i.e., lower It is well established that cortical responses are heavily modulated
musicality scores). Neural noise has been interpreted as reflecting by attention, and the neuroplastic benefits of musical training are
the variability in how sensory information is translated across the consistently stronger under active than under passive listening tasks
brain (47). Insomuch as lesser noise reflects a greater efficiency in (52, 53). Because our study utilized a passive listening paradigm, it is
auditory processing and perception (34, 35), the higher-quality perhaps unsurprising that we failed to find group differences in
neural representations we find among high PROMS scorers may neural N1–P2 amplitudes among high- and low-musicality individ-
allow more veridical readout of signal identity and thus account for uals if cortical ERP enhancements emerge mainly under states of
their superior behavioral abilities. Our data are also consistent with goal-directed attention (13, 14). Paralleling the ERP data, we sim-
recent fMRI findings demonstrating that the strength of (passively ilarly found that PROMS groups did not differ in behavioral
measured) resting-state connectivity between auditory and motor QuickSIN scores, in contrast to the SIN benefits observed in trained
brain regions before training is related to better musical proficiency musicians (SI Appendix, Fig. S2) (10, 39, 42). One interpretation of
in short-term instrumental learning (43). These findings, along with these data is that the QuickSIN and ERPs are not sensitive enough
current EEG data, suggest that intrinsic differences in neural to detect the finer individual differences in speech processing among
function may predict outcomes in a variety of auditory contexts, nonmusicians as revealed by FFRs, a putative marker of subcortical
from understanding someone at the cocktail party, where pitch and processing (7). Indeed, direct comparisons between each class of
timbre cues are vital for understanding a noise-degraded talker (10, speech-evoked responses show not only that they are functionally
39, 48), to success in music-training programs (43). distinct (14, 40) but also that FFRs are more stable than cortical
One interesting finding was that variations in the neural encoding ERPs both within and between listeners (54). The lower variability
of speech were better explained by certain perceptual domains (i.e., of brainstem FFRs may offer a better reflection of innate, hardwired
PROMS subtest scores). Among perceptual subtests, FFR spectral processes of audition (55) than the cortex (e.g., ERPs, behavior),
measures were best predicted by tuning scores, whereas accent which are more malleable and heavily influenced by subject state,
perception was best predicted by FFR latency measures. The tuning attention, and task demands.
subtest requires detection of a subtle pitch manipulation (<1/2 While our data provide evidence that natural propensities in
semitone) within a musical chord (30). Given that FFRs reflect the auditory skills might account for certain de novo enhancements in
neural integrity of stimulus properties, higher-fidelity FFR re- the brain’s speech processing, provocatively, they also show that
sponses (i.e., increased F0 amplitudes, decreased noise, faster la- formally trained musicians (40) have an additional boost in neuro-
tencies) may allow finer discrimination of acoustic details at the behavioral function, even beyond those auditory systems deemed
behavioral level and account for the superior auditory perceptual inherently superior. These results are consistent with randomized
skills we find in our individuals with high PROMS scores. In this control studies of music-enrichment programs that report treatment
regard, our data here in musical sleepers aligns closely with recent effects following 1–2 y of music training (16, 20, 46). Thus, longi-
findings by Nan et al. (16), who showed that short-term musical tudinal studies still provide compelling evidence for brain plasticity
training enhances the neural processing of pitch and improves associated with musical training (16, 43–46). However, an aspect
speech perception in children following short-term music lessons. that remains relatively uncontrolled in previous studies is possible
Neurophysiological measures revealed that group differences placebo effects. Reminiscent of recent revelations in the cognitive
in speech processing were more apparent in FFRs than in ERP brain-training literature (56), individuals may enter into (music)
responses. This opposite pattern of effects across neural mea- training expecting to receive benefits, in which case perceptual–
sures (i.e., larger FFRs and reduced ERPs; compare Fig. 2E with cognitive gains may be partially epiphenomenal. Additionally, indi-
Fig. 3C) is reminiscent of both animal (49) and human (14, 40, viduals with undetected superiorities in listening abilities may re-
50) electrophysiological studies which show that, even for the main more engaged or motivated in music-training programs, an
same task, neuroplastic changes in brainstem (e.g., FFR) re- aspect that may confound outcomes of longitudinal studies.
ceptive fields tend in the opposite direction of changes in audi- Still, it is possible that our groups differed on some other envi-
tory cortex (e.g., ERPs). For a discussion on the neural generators ronmental factor rather than inherent auditory perceptual skills per
of the FFR, see SI Appendix, SI Discussion. Regardless of where se. For example, high-scoring PROMS listeners may differ in their
experience and were identified as right-handed according to the Edinburgh spectively (34, 35, 47, 50) (see Fig. 2A). FFR latency was measured as the
Handedness Survey (59). All participants were screened for a history of neu- maximum cross-correlation between the stimulus and FFR waveforms be-
ropsychiatric disorders. Formal musical training is thought to enhance the tween 6–12 ms, the expected onset latency of brainstem responses (7, 8, 14).
neural encoding of speech (14, 18–20, 40, 41), particularly in noise (10, 42). ERP analysis. We measured the overall magnitude of the N1–P2 complex,
Hence, all participants were required to have minimal formal musical training reflecting the early registration of sound in cerebral cortex, as the voltage dif-
(i.e., <3 y) and no musical training within the past 5 y. Critically, groups did not ference between the two individual waves (14). Individual latencies were mea-
differ in their years of formal musical training [high-scoring group = 0.57 ± 0.63 y, sured as the N1 between 80 and 120 ms and P2 between 130 and 180 ms (40).
Mankel and Bidelman PNAS | December 18, 2018 | vol. 115 | no. 51 | 13133
Statistics. Individual t tests assessed group differences across various de- Both FFR neural noise and F0 amplitudes were evaluated as neural pre-
mographic data, PROMS total and subtest scores, and QuickSIN scores. dictors of listeners’ auditory perceptual abilities (PROMS scores). In addi-
Identical conclusions were obtained with parametric and nonparametric tion to total scores, we assessed relations between subtest scores (i.e.,
tests for variables that were nonnormally distributed. Unless otherwise melody, tuning, accent, tempo) and speech FFRs to determine which as-
noted, we used two-way, mixed-model ANOVAs (group × noise level with pects of auditory perception (i.e., timing, spectral discrimination, and
subjects as a random factor; SAS9.4, GLIMMIX) with gender as a covariate to others) were best predicted by physiological responses. Best predictors
analyze all neural data. However, gender was not a significant covariate in were determined as the models minimizing AIC. GLME regressions and
any of our analyses. Effect sizes for omnibus ANOVAs are reported as visualization were achieved using the fitglme and fitlm functions in
Cohen’s d. Spearman’s rho assessed brain–brain correlations between FFR MATLAB, respectively.
and ERP measures. Conditional studentized residuals confirmed the absence
of influential outliers. ACKNOWLEDGMENTS. We thank Caitlin Price for comments on previous
GLMEs evaluated relations between behavioral (PROMS musicality scores) versions of this manuscript and Jessica Yoo for assistance with data collection.
and neural responses (FFRs). Subjects were modeled as a random factor This work was supported by a grant from the University of Memphis Research
nested within group in the regression to model both random intercepts Investment Fund and by National Institute on Deafness and Other Commu-
(per subject) and slopes (per group) [e.g., PROMS ∼ FFRFO + (subjgroup)]. nication Disorders of the NIH Grant R01DC016267 (to G.M.B.).
1. Kraus N, Chandrasekaran B (2010) Music training for the development of auditory 32. Wallentin M, et al. (2010) The Musical Ear Test, a new reliable test for measuring
skills. Nat Rev Neurosci 11:599–605. musical competence. Learn Individ Differ 20:188–196.
2. Herholz SC, Zatorre RJ (2012) Musical training as a framework for brain plasticity: 33. Killion MC, Niquette PA, Gudmundsen GI, Revit LJ, Banerjee S (2004) Development of
Behavior, function, and structure. Neuron 76:486–502. a quick speech-in-noise test for measuring signal-to-noise ratio loss in normal-hearing
3. Münte TF, Altenmüller E, Jäncke L (2002) The musician’s brain as a model of neuro- and hearing-impaired listeners. J Acoust Soc Am 116:2395–2405.
plasticity. Nat Rev Neurosci 3:473–478. 34. Skoe E, Krizman J, Kraus N (2013) The impoverished brain: Disparities in maternal
4. Moreno S, Bidelman GM (2014) Examining neural plasticity and cognitive benefit education affect the neural response to sound. J Neurosci 33:17221–17231.
through the unique lens of musical training. Hear Res 308:84–97. 35. Anderson S, Parbery-Clark A, White-Schwoch T, Kraus N (2012) Aging affects neural
5. Coffey EBJ, Mogilever NB, Zatorre RJ (2017) Speech-in-noise perception in musicians: precision of speech encoding. J Neurosci 32:14156–14164.
A review. Hear Res 352:49–69. 36. Bidelman GM, Krishnan A, Gandour JT (2011) Enhanced brainstem encoding predicts
6. Sohmer H, Pratt H, Kinarti R (1977) Sources of frequency following responses (FFR) in musicians’ perceptual advantages with pitch. Eur J Neurosci 33:530–538.
man. Electroencephalogr Clin Neurophysiol 42:656–664. 37. Parbery-Clark A, Marmel F, Bair J, Kraus N (2011) What subcortical-cortical relation-
7. Bidelman GM (2018) Subcortical sources dominate the neuroelectric auditory ships tell us about processing speech in noise. Eur J Neurosci 33:549–557.
frequency-following response to speech. Neuroimage 175:56–69. 38. Bidelman GM, Davis MK, Pridgen MH (2018) Brainstem-cortical functional connec-
8. Coffey EB, Herholz SC, Chepesiuk AM, Baillet S, Zatorre RJ (2016) Cortical contribu- tivity for speech is differentially challenged by noise and reverberation. Hear Res 367:
tions to the auditory frequency-following response revealed by MEG. Nat Commun 7: 149–160.
11070. 39. Parbery-Clark A, Skoe E, Lam C, Kraus N (2009) Musician enhancement for speech-in-
9. Weiss MW, Bidelman GM (2015) Listening to the brainstem: Musicianship enhances noise. Ear Hear 30:653–661.
intelligibility of subcortical representations for speech. J Neurosci 35:1687–1691. 40. Bidelman GM, Weiss MW, Moreno S, Alain C (2014) Coordinated plasticity in brain-
10. Parbery-Clark A, Skoe E, Kraus N (2009) Musical experience limits the degradative stem and auditory cortex contributes to enhanced categorical speech perception in
effects of background noise on the neural processing of sound. J Neurosci 29: musicians. Eur J Neurosci 40:2662–2673.
14100–14107. 41. Musacchia G, Strait D, Kraus N (2008) Relationships between behavior, brainstem and
11. Bidelman GM, Krishnan A (2010) Effects of reverberation on brainstem representa- cortical encoding of seen and heard speech in musicians and non-musicians. Hear Res
241:34–42.
tion of speech in musicians and non-musicians. Brain Res 1355:112–125.
42. Zendel BR, Alain C (2012) Musicians experience less age-related decline in central
12. Kraus N, Skoe E, Parbery-Clark A, Ashley R (2009) Experience-induced malleability in
auditory processing. Psychol Aging 27:410–417.
neural encoding of pitch, timbre, and timing. Ann N Y Acad Sci 1169:543–557.
43. Wollman I, Penhune V, Segado M, Carpentier T, Zatorre RJ (2018) Neural network
13. Marie C, Delogu F, Lampis G, Belardinelli MO, Besson M (2011) Influence of musical
retuning and neural predictors of learning success associated with cello training. Proc
expertise on segmental and tonal processing in Mandarin Chinese. J Cogn Neurosci
Natl Acad Sci USA 115:E6056–E6064.
23:2701–2715.
44. Hyde KL, et al. (2009) The effects of musical training on structural brain development:
14. Bidelman GM, Alain C (2015) Musical training orchestrates coordinated neuro-
A longitudinal study. Ann N Y Acad Sci 1169:182–186.
plasticity in auditory brainstem and cortex to counteract age-related declines in
45. Jaschke AC, Honing H, Scherder EJA (2018) Longitudinal analysis of music education
categorical vowel perception. J Neurosci 35:1240–1249.
on executive functions in primary school children. Front Neurosci 12:103.
15. Shahin A, Bosnyak DJ, Trainor LJ, Roberts LE (2003) Enhancement of neuroplastic P2
46. Kraus N, et al. (2014) Music enrichment programs improve the neural encoding of
and N1c auditory evoked potentials in musicians. J Neurosci 23:5545–5552.
speech in at-risk children. J Neurosci 34:11913–11918.
16. Nan Y, et al. (2018) Piano training enhances the neural processing of pitch and im-
47. Skoe E, Krizman J, Anderson S, Kraus N (2015) Stability and plasticity of auditory
proves speech perception in Mandarin-speaking children. Proc Natl Acad Sci USA 115:
brainstem function across the lifespan. Cereb Cortex 25:1415–1426.
E6630–E6639.
48. Zendel BR, Alain C (2009) Concurrent sound segregation is enhanced in musicians.
17. Jääskeläinen IP, Ahveninen J, Belliveau JW, Raij T, Sams M (2007) Short-term plasticity
J Cogn Neurosci 21:1488–1498.
in auditory cognition. Trends Neurosci 30:653–661. 49. Slee SJ, David SV (2015) Rapid task-related plasticity of spectrotemporal receptive
18. Wong PC, Skoe E, Russo NM, Dees T, Kraus N (2007) Musical experience shapes human fields in the auditory midbrain. J Neurosci 35:13090–13102.
brainstem encoding of linguistic pitch patterns. Nat Neurosci 10:420–422. 50. Bidelman GM, Villafuerte JW, Moreno S, Alain C (2014) Age-related changes in the
19. Musacchia G, Sams M, Skoe E, Kraus N (2007) Musicians have enhanced subcortical subcortical-cortical encoding and categorical perception of speech. Neurobiol Aging
auditory and audiovisual processing of speech and music. Proc Natl Acad Sci USA 104: 35:2526–2540.
15894–15898. 51. Du Y, Zatorre RJ (2017) Musical training sharpens and bonds ears and tongue to hear
20. Tierney AT, Krizman J, Kraus N (2015) Music training alters the course of adolescent speech better. Proc Natl Acad Sci USA 114:13579–13584.
auditory development. Proc Natl Acad Sci USA 112:10062–10067. 52. Chobert J, Marie C, François C, Schön D, Besson M (2011) Enhanced passive and active
21. Trehub SE (2003) The developmental origins of musicality. Nat Neurosci 6:669–673. processing of syllables in musician children. J Cogn Neurosci 23:3874–3887.
22. Tan YT, McPherson GE, Peretz I, Berkovic SF, Wilson SJ (2014) The genetic basis of 53. Zendel BR, Alain C (2013) The influence of lifelong musicianship on neurophysio-
music ability. Front Psychol 5:658. logical measures of concurrent sound segregation. J Cogn Neurosci 25:503–516.
23. Pulli K, et al. (2008) Genome-wide linkage scan for loci of musical aptitude in Finnish 54. Bidelman GM, Pousson M, Dugas C, Fehrenbach A (2018) Test-retest reliability of
families: Evidence for a major locus at 4q22. J Med Genet 45:451–456. dual-recorded brainstem versus cortical auditory-evoked potentials to speech. J Am
24. Ukkola LT, Onkamo P, Raijas P, Karma K, Järvelä I (2009) Musical aptitude is associated Acad Audiol 29:164–174.
with AVPR1A-haplotypes. PLoS One 4:e5534. 55. Ehret G (1997) The auditory midbrain, a “shunting yard” of acoustical information
25. Park H, et al. (2012) Comprehensive genomic analyses associate UGT8 variants with processing. The Central Auditory System, eds Ehret G, Romand R (Oxford Univ Press,
musical ability in a Mongolian population. J Med Genet 49:747–752. New York), pp 259–316.
26. Norton A, et al. (2005) Are there pre-existing neural, cognitive, or motoric markers for 56. Foroughi CK, Monfort SS, Paczynski M, McKnight PE, Greenwood PM (2016) Placebo
musical ability? Brain Cogn 59:124–134. effects in cognitive training. Proc Natl Acad Sci USA 113:7470–7474.
27. Honing H (2018) The Origins of Musicality (The MIT Press, Cambridge, MA). 57. Fritz J, Elhilali M, Shamma S (2005) Active listening: Task-dependent plasticity of
28. Barros CG, et al. (2017) Assessing music perception in young children: Evidence for spectrotemporal receptive fields in primary auditory cortex. Hear Res 206:159–176.
and psychometric features of the M-factor. Front Neurosci 11:18. 58. Kunert R, Willems RM, Hagoort P (2016) An independent psychometric evaluation of
29. Gordon EE (1989) Advanced Measures of Music Audiation (GIA, Chicago). the PROMS measure of music perception skills. PLoS One 11:e0159103.
30. Law LNC, Zentner M (2012) Assessing musical abilities objectively: Construction and 59. Oldfield RC (1971) The assessment and analysis of handedness: The Edinburgh in-
validation of the profile of music perception skills. PLoS One 7:e52508. ventory. Neuropsychologia 9:97–113.
31. Seashore CE, Lewis L, Saetveit JG (1960) Seashore Measures of Musical Talents (The 60. Bidelman GM (2015) Towards an optimal paradigm for simultaneously recording
Psychological Corporation, New York). cortical and brainstem auditory evoked potentials. J Neurosci Methods 241:94–100.