Skip to main content

Tapping into the vocal learning and rhythmic synchronization hypothesis

The Original Article was published on 06 November 2024

Abstract

In this article, I present three main points that could benefit the “vocal learning and rhythmic synchronization hypothesis”, encompassing neurogenetic mechanisms of gene expression transmission and single motor neuron function, classification of different behavioral motor phenotypes (e.g., spontaneous vs. voluntary), and other evolutionary considerations (i.e., the involvement of reward mechanisms).

Peer Review reports

The “vocal learning and rhythmic synchronization hypothesis” (VLRSH) [1], formulated by Ani Patel, aims to explain how the evolution of advanced vocal learning in humans and parrots may have resulted in their ability to synchronize nonvocal movements to a rhythmic beat. I believe this hypothesis provides one of the most promising avenues for linking the evolution of two core components (i.e., vocal learning and rhythmic synchronization) that are necessary for complex sensorimotor behaviors such as speech, song, and dance. Earlier versions of this hypothesis [2, 3] have been influential in shaping my research agenda, where I currently focus on studying the brain pathways involved in speech and dance production and learning in humans. In this version of the VLRSH [1], Patel delves deeper into the neurobiological mechanisms, proposing that the evolution of a strong integration between auditory regions and vocal dorsal premotor regions in ancestral humans (via the laryngeal pitch control pathway) involved gene regulation changes which fortuitously enhanced the strength of neural connections between auditory and nonvocal dorsal premotor regions near the vocal dorsal premotor regions.

Ani Patel has invested significant effort in synthesizing existing literature to construct a hypothesis that is both robust and testable. While acknowledging the solidity of Patel’s work that has uniquely inspired my research, I identify areas within the hypothesis that merit further refinement and exploration. In this commentary, I intend to address three specific points derived from Patel’s hypothesis. Firstly, I will dig into the neural mechanisms and suggest an alternative or complementary scenario for the gene regulation changes in neighboring brain regions that, according to Patel [1], could have led to the enhancement of the brain pathways needed for rhythmic synchronization. Secondly, I advocate for a more profound understanding of what constitutes “voluntary”, “involuntary”, “reflexive” or “spontaneous” movements, whether vocal or nonvocal, as a beneficial addition to the hypothesis. Thirdly, I will present my perspective on an additional parameter that Patel suggests could have contributed to the human and parrot ability to dance: their “craving for social interaction and a strong sensitivity to social reward [4]”.

Adjacent and/or overlapping motor circuits for vocal learning and rhythm synchronization

One of the fundamental tenets of the VLRSH is the notion that gene regulation changes in specific neurons, particularly those engaged in the integration between auditory and vocal dorsal premotor regions, triggered alterations in the gene regulation of adjacent or nearby neurons involved in the integration between auditory and nonvocal dorsal premotor regions. However, the mechanistic details of how neurons can directly impact the gene expression of neighboring neurons remain unclear. While certain mechanisms exist, such as neurotransmitter signaling influencing the electrical activity of the receiving neuron, the release of neurotrophic factors from one neuron to another, or activity-dependent plasticity, none of these mechanisms explain the proposed “infection” of gene expression to adjacent neurons posited by the VLRSH.

While formulating my working hypothesis for studying in tandem the speech and dance brain pathways in humans, I was able to find only limited instances of research demonstrating the feasibility of gene expression transmission from one neuron to another. For instance, experiments involving Xenopus tectal neurons revealed that overexpressing Candidate Plasticity Gene 15 (CPG15) not only enhanced dendritic outgrowth and synapse maturation in the directly overexpressed neurons but also influenced these characteristics in adjacent neurons, potentially through intercellular interactions [5, 6]. Another set of experiments in transgenic animal models of neurodegenerative diseases also identified the “spread” of gene expression properties in neighboring neurons [7]: in these models, the accumulation of Tau in neurons expressing a transgene resulted in tau aggregates developing in adjacent neurons lacking the transgene but receiving projections from transgene-expressing neurons, possibly through a trans-synaptic prion-like mechanism [7]. Additionally, in neuropathic pain mouse models, experimental injury to a set of neurons in the dorsal root ganglion led to differential gene regulation in the injured neurons also affecting gene regulation in nearby intact neurons, possibly through the activation of intracellular second messenger systems inducing immediate early genes (IEGs) controlling expression changes in other genes [8]. While these studies offer insights into potential mechanisms for studying the VLRSH, the direct transmission of changes in gene expression from one neuron to adjacent neurons remains a concept not firmly established.

Testing this tenet of the VLRSH hypothesis is challenging, even with advanced spatial transcriptomic tools like single-nucleus RNA sequencing that enable us to identify gene expression patterns in single nuclei of individual neurons. One approach to test the idea of direct transmission of gene expression changes from one neuron to adjacent neurons would involve manipulating (e.g., knocking down) the expression of a gene in one or more neurons and conducting single-nucleus RNA-sequencing in both manipulated and adjacent neurons. Confirmation of the VLRSH tenet would require observing, for example, a significant downregulation trend for the knocked-down gene in adjacent neurons of mutant animals, along with significant similarities in the regulation patterns of pathway-related genes, compared to equivalent comparisons of analogous neurons in wild-type animals. Similar approaches are anticipated to unveil the dynamic and intricate mechanisms of neuron-to-neuron interactions.

An alternative or complementary hypothesis I wish to put forward is that it might be not the adjacent, but the very same neurons being responsible for both the abilities of vocal learning and rhythmic entrainment. In other words, I hypothesize that some of the same neurons in the primary motor cortex (M1) that are responsible for the precise and rhythmic control of vocal muscles might also project to other nonvocal movement muscles, such as hand or leg muscles. Is there any evidence supporting the idea of M1 neurons projecting to distinct motor muscles? Recent strong evidence reveals that injecting a pseudorabies virus with different fluorophores into both the cricothyroid (laryngeal) muscle and the extensor carpi radialis (forelimb) muscles stained overlapping single neurons in the mouse M1 [9]. Remarkably, this phenomenon was not confined to a small subset of neurons; instead, the proportions of co-labeled cortical neurons spanned from 5.3 to 25% across M1.This suggests that, even in a species like the mouse with limited vocal flexibility [10], the same neurons controlling laryngeal muscles for vocal communication also project to forelimb muscles for various hand movements. In another study [11], which utilized orthograde axonal transport tracing methods in rats, injections focused on the motor face cortex (jaw, lip, and tongue areas) resulted in the same terminal fields of the basilar pontine nuclei of the brainstem where limb sensorimotor cortices’ injections also terminated. A more recent study [12] in monkeys identified that M1 cells were not specifically tuned to individual muscles but exhibited a more “functional” activity, e.g., specific to various functions of different muscles, like synergist, fixator, or antagonist functions (see also [13]). These collective studies challenge the notion of a one-to-one relationship between a muscle and a motor neuron. If this holds true, it suggests that the same motor neurons that evolved the ability to finely and rhythmically control laryngeal muscles may have also employed this ability in their projections to other muscles involved in different body movements.

Turning our attention to humans, is there any evidence indicating that the same motor neurons project to different muscles? Contrary to the traditional view presented by Penfield’s motor homunculus [14], where each muscle type corresponds to a specific part of M1, recent evidence [15, 16] challenges the notion of a “one muscle, one motor neuron” principle and suggests that motor representations in human M1 are organized based on different principles, such as the “like attracts like” principle, where effectors for movements that frequently occur together or serve similar functions tend to cluster or be spatially organized together. Arguably the most compelling evidence for an overlap of vocal and nonvocal projecting neurons arises from a series of studies [17, 18] where intracortical electrodes were implanted in the conventionally identified “hand” area of the dorsal M1 and showed significant firing rate changes during speech. Notably, these studies not only demonstrated that the “hand” area encodes specific spoken phonemes, but also achieved high-accuracy decoding of these phonemes using both intracortical multiunit spikes and local field potential power. Recent functional Magnetic Resonance Imaging (fMRI) studies by Gordon et al. [19] showed that human M1 comprises motor-specific regions intermingled with so-called “inter-effector” integrating areas lacking movement specificity. In our ongoing work in preparation, we are leveraging this evidence by comparing fMRI activation maps of various speech-related laryngeal movements with rhythmic hand, leg, hip, and other movements, while excluding relevant activity from control tasks, such as non-speech laryngeal movements (e.g., coughing or laughing) and non-rhythmic body movements. Our data suggest the involvement of both overlapping and adjacent brain regions in speech (vocal) laryngeal movements and other rhythmic (dance) body movements. Further exploration will unfold the neuronal-level processes in humans, shedding light on whether mechanisms within the same and/or adjacent neurons contribute to fine rhythm entrainment abilities in body muscles beyond the larynx. Future investigations into the auditory-to-motor integration pathways proposed by Patel [1] in humans are also expected to provide insights into the connectivity patterns supporting the motor cortex-based scenarios I proposed here.

Given the temporal or spatial limitations of most neuroimaging techniques, such as fMRI and electroencephalography (EEG), in testing my hypothesis regarding the overlap of neurons in the M1 responsible for both vocal and nonvocal muscle control, I suggest using advanced neural recording technology, neuropixels, to analyze the response of neurons across M1. Neuropixels, with nearly 1,000 electrode sensors in an area about the size of a human eyelash, allow recording individual neuron activity in awake participants. Recently, Leonard et al. [20] recorded from 685 neurons in the superior temporal gyrus, a critical region for speech perception, and found single neurons encoding various speech sound cues, such as features of consonants and vowels, relative vocal pitch, onsets, and amplitude envelope. To test my hypothesis, an experiment would require recording single neuron activity in different M1 layers, specifically the laryngeal and hand motor areas, while participants speak or tap their fingers rhythmically to an internal or external beat. Confirming the hypothesis would indicate overlap in neurons activated during speech and finger tapping cues, such as pitch and tempo, respectively, while refuting it would suggest different neurons for each activity. Intracortical implant arrays detecting single-unit activity could also be applied to similar protocols for further investigation.

Towards a framework for classifying different types of movements

Shifting focus, I propose that the VLRSH could benefit from a clearer framework outlining the specific types of vocal and nonvocal movements suitable for studying the hypothesis’s confirmation or falsification. This could be achieved with a precise definition and categorization of movements referred to, in Patel’s manuscript, as “spontaneous”, “involuntary”, “voluntary”, or “reflexive”. For example, Patel refers to a study [21] on rats passively exposed to music, where small head movements around the times of beats are interpreted by him as “involuntary” or “reflexive”. In motor neuroscience, the term “spontaneous” is often used interchangeably with these terms. However, Patel specifies in the VLRSH manuscript [1] that “spontaneous” refers instead to rhythmic movements occurring “without reliance on formal training”. I interpret the latter as referring mostly to whether these rhythmic movements are “innate” or “learned”, rather than “spontaneous” (sensu “reflexive”), “reflexive”, “involuntary” or “voluntary”. This interpretation gains support from Patel’s subsequent passages [1], where he discusses a possible developmental period in which children acquire the ability to synchronize predictively and sporadically to a tempo. This suggests to me that, for Patel, “spontaneous”, in the context of rhythmic synchronization, primarily denotes a form of natural (not formal) learning, akin to first language acquisition, a perspective with which I agree. While recent evidence aligns with this perspective (e.g., [22]), the term “spontaneous” might not accurately convey the intended message, particularly given its usage as synonymous with “involuntary” or “reflexive” in the broader neuroscience field.

Interestingly, we recently shared a preprint [23] introducing a relevant framework that delves into a more refined classification of human vocalizations (i.e., laryngeal muscle movement) that is built upon the terms “innate”, “learned”, “spontaneous” and “volitional”. Traditionally, scientists have categorized human vocalizations into “innate” and “learned”, with innate encompassing vocalizations like coughing, laughter, sneezing, and yawning, and learned including, basically, speech. The general understanding has been that innate vocalizations are predominantly produced “spontaneously”, without any volitional vocal control [24]. However, a caveat arises from findings indicating that these so-called “innate” vocalizations often require intentional modulation of vocal output [25], which is evident in the differences of these vocalizations across different cultures and languages (e.g., laughter), with these differences being readily discernible by the listeners [26]. Adding complexity, so-called “innate” vocalizations can be either “spontaneous” (e.g., laughter evoked by tickling) or “volitional” (e.g., fake laughter). Relevant accounts [27] attempting to categorize honest vs. fake “Duchenne” smiles (not laughter) further highlight the learnability component of fake smiles, making one wonder if they should be categorized as innate in the first place. Similarly, for laughter, there could be types of laughter that are full products of learning and not volitional imitations of one’s innate laughter. Moreover, vocalizations typically considered “learned” may be emitted in contexts that are more “spontaneous” than “volitional” (e.g., exclaiming “Ouch!” in response to pain).

Based on these insights, we proposed a continuum for human vocalizations, ranging from innate to learned on one axis and from spontaneous to volitional on the other [23]. Within this continuum, one can find innate vocalizations produced spontaneously, such as exclaiming “Ah!” in response to pain, or innate vocalizations produced volitionally, as seen in “ironic” or “fake” laughter (“Ha, ha!”). Similarly, there are learned vocalizations in both (volitional and spontaneous) categories, with examples like volitional expressions such as “Thanks!” in a conversation, or learned vocalizations produced spontaneously, like exclaiming “Ouch!” in response to pain. While extending this framework to classify other body movements (e.g., hand or leg) would require a separate discussion, I am optimistic that this continuum can serve as a valuable foundation for a deeper understanding of where rhythmic movements may fall within this spectrum.

Vocal learning, rhythm synchronization and social reward

According to Patel [1], another contributing factor to the ability for rhythmic movement observed in humans and parrots could be attributed to their “craving for social interaction and a strong sensitivity to social reward [4]”. I wholeheartedly agree with Patel and believe that this factor may play an even more pivotal role in the ability of humans and parrots to synchronize their body movements to a beat. Here, I would like to explore further the framework through which we examine concepts like “social interaction” and “social reward”. Typically, these terms are often interpreted within a Pavlovian framework, where the reward is perceived as external. For instance, as Patel [1] notes, “Snowball and other pet parrots that move rhythmically to music often get attention and praise from their owners for this behavior”. However, Schmidt and Kaplan [28]raise a significant question: “Does Snowball dance by himself, even when no one is watching?”. In other words, does Snowball engage in dance without any external reward? Is the act of dancing internally rewarding for Snowball?

The field of neuroscience has yet to establish a clear distinction between external and internal rewards, along with the corresponding brain pathways underlying them. It is still unknown whether this lack of clarity in their distinction is due to an absence of evidence or a genuine absence of differences between these two types of reward pathways. Examining evidence from songbirds, it is noteworthy that dopamine is implicated in both what we typically label as external and internal rewards. For instance, in male zebra finches, dopamine levels in Area X of the striatum are higher when they sing to attract females (external reward) compared to when they sing in isolation (internal reward) [29]. This increase may be attributed to differential activity of the reuptake transporter (a noradrenaline transporter in birds) in the Ventral Tegmental Area (VTA) axons projecting to Area X. Blocking this transporter pharmacologically equalizes dopamine levels during undirected singing to those during directed singing [29]. However, the question remains whether these distinctions are sufficient to support distinct external (social, sexual) and internal reward brain pathways. The picture is not entirely clear, especially considering that VTA dopamine neurons projecting to Area X were also found to encode error-and-reward during undirected singing (i.e., internal reward) [30]. Altogether, these studies underscore the necessity for a more nuanced understanding of reward brain pathways as integral components of relevant pathways responsible for auditory perception and motor (vocal or nonvocal) production, as observed in the domains of speech/song and dance.

In a series of studies [31,32,33,34,35], I have presented evidence and proposed a role for oxytocin and dopamine within the vocal learning circuitry. Could these neurotransmitters also contribute to our understanding of the relevant reward mechanisms (external and/or internal) involved in the rhythmic synchronization abilities of humans and parrots? Studies involving administration of intranasal oxytocin (OT) suggest that OT, compared to placebo, increases synchronized interpersonal movement during dance [36] and during an interactive finger-tapping paradigm [37]. While not directly related to dance but pertinent to a crucial component of dancing, music perception has been linked to increases in both oxytocin [38, 39] and dopamine [40]. It would be intriguing to test whether beat synchronization would increase in parrots after OT administration. Further research is essential to unravel the entire complexity of the reward neurocircuitry and explore potential differences in external versus internal reward mechanisms during dancing, especially considering instances when individuals dance alone, without an audience.

In conclusion, I believe that all the above remarks can only highlight how promising the VLRSH is. Moving forward, conducting behavioral studies that simultaneously explore different species’ abilities for vocal learning and rhythm entrainment is crucial to either validate or challenge the VLRSH. Complementary neural tracing, single-unit recordings, and neuroimaging studies are essential to elucidate the underlying brain pathways. Spatial transcriptomic analyses can provide more targeted insights into the regulation profiles of neurons involved in vocal learning, rhythmic movement, or both. Beyond the confines of this hypothesis, gaining a deeper understanding of how gene expression may propagate from one neuron to another is imperative for the field. Additionally, there is much more to discover regarding the organization of regions like the primary motor cortex and whether motor neurons exhibit a more muscle-specific, “function”-specific, or other organization. Lastly, developing clearer frameworks and working hypotheses to define terms used for specific behaviors, such as distinguishing between spontaneous and volitional movements or internal and external rewards, is a necessary step forward for the field.

Data availability

No datasets were generated or analysed during the current study.

References

  1. Patel AD. Beat-based dancing to music has evolutionary foundations in advanced vocal learning. BMC Neurosci, 25, 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12868-024-00843-6.

  2. Patel AD, Iversen JR, Bregman MR, Schulz I. Experimental evidence for synchronization to a musical beat in a Nonhuman Animal. Curr Biol. May 2009;19(10):827–30. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.cub.2009.03.038.

  3. Jao Keehn RJ, Iversen JR, Schulz I, Patel AD. Spontaneity and diversity of movement to music are not uniquely human. Curr Biol. Jul. 2019;29. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.cub.2019.05.035. 13, pp. R621–R622.

  4. Wilson M, Cook PF. Rhythmic entrainment: why humans want to, fireflies can’t help it, pet birds try, and sea lions have to be bribed. Psychon Bull Rev. Dec. 2016;23(6):1647–59. https://doiorg.publicaciones.saludcastillayleon.es/10.3758/s13423-016-1013-x.

  5. Cantallops I, Haas K, Cline HT. Postsynaptic CPG15 promotes synaptic maturation and presynaptic axon arbor elaboration in vivo, Nat Neurosci, vol. 3, no. 10, pp. 1004–1011, Oct. 2000, https://doiorg.publicaciones.saludcastillayleon.es/10.1038/79823.

  6. Nedivi E, Wu G-Y, Cline HT. Promotion of Dendritic Growth by CPG15, an Activity-Induced Signaling Molecule, Science (1979), vol. 281, no. 5384, pp. 1863–1866, Sep. 1998, https://doiorg.publicaciones.saludcastillayleon.es/10.1126/science.281.5384.1863.

  7. Perez-Rando M, Dujardin S, Bennett RE, Commins C, Nibhanupudy T, Hyman BT. Synaptic and metabolic gene expression alterations in neurons that are recipients of proteopathic tau seeds, Acta Neuropathol Commun, vol. 8, no. 1, pp. 1–10, Dec. 2020, https://doiorg.publicaciones.saludcastillayleon.es/10.1186/S40478-020-01049-7/FIGURES/5.

  8. Wang H et al. Oct., Chronic neuropathic pain is accompanied by global changes in gene expression and shares pathobiology with neurodegenerative diseases, Neuroscience, vol. 114, no. 3, pp. 529–546, 2002, https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S0306-4522(02)00341-X.

  9. Vargas CDM et al. Jan., A Functional and Non-Homuncular Representation of the Larynx in the Primary Motor Cortex of Mice, a Vocal Non-Learner, bioRxiv, p. 2024.02.05.579004, 2024, https://doiorg.publicaciones.saludcastillayleon.es/10.1101/2024.02.05.579004.

  10. Chabout J, Sarkar A, Dunson DB, Jarvis ED. Male mice song syntax depends on social contexts and influences female preferences, Front Behav Neurosci, vol. 9, Apr. 2015, https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fnbeh.2015.00076.

  11. Mihailoff GA, Lee H, Watt CB, Yates R. Projections to the basilar pontine nuclei from face sensory and motor regions of the cerebral cortex in the rat, Journal of Comparative Neurology, vol. 237, no. 2, pp. 251–263, Jul. 1985, https://doiorg.publicaciones.saludcastillayleon.es/10.1002/cne.902370209.

  12. Griffin DM, Hoffman DS, Strick PL. Corticomotoneuronal cells are ‘functionally tuned,’ Science (1979), vol. 350, no. 6261, pp. 667–670, Nov. 2015, https://doiorg.publicaciones.saludcastillayleon.es/10.1126/science.aaa8035.

  13. Graziano MSA, Aflalo TN. Mapping Behavioral Repertoire onto the Cortex, Neuron, vol. 56, no. 2, pp. 239–251, Oct. 2007, https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neuron.2007.09.013.

  14. Penfield WRT. The cerebral cortex of man; a clinical study of localization of function. Macmillan; 1950.

  15. Ejaz N, Hamada M, Diedrichsen J. Hand use predicts the structure of representations in sensorimotor cortex, Nat Neurosci, vol. 18, no. 7, pp. 1034–1040, Jul. 2015, https://doiorg.publicaciones.saludcastillayleon.es/10.1038/nn.4038.

  16. Huber L, et al. Sub-millimeter fMRI reveals multiple topographical digit representations that form action maps in human motor cortex. NeuroImage. Mar. 2020;208:116463. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neuroimage.2019.116463.

  17. Stavisky SD, Rezaii P, Willett FR, Hochberg LR, Shenoy KV, Henderson JM, Decoding Speech from Intracortical Multielectrode Arrays in Dorsal ‘Arm/Hand Areas’ of Human Motor Cortex, in. 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), IEEE, Jul. 2018, pp. 93–97. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/EMBC.2018.8512199.

  18. Wilson GH, et al. Decoding spoken English from intracortical electrode arrays in dorsal precentral gyrus. J Neural Eng. Nov. 2020;17(6):066007. https://doiorg.publicaciones.saludcastillayleon.es/10.1088/1741-2552/abbfef.

  19. Gordon EM, et al. A somato-cognitive action network alternates with effector regions in motor cortex. Nature. May 2023;617(7960):351–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41586-023-05964-2.

  20. Leonard MK, et al. Large-scale single-neuron speech sound encoding across the depth of human cortex. Nature. Feb. 2024;626(7999):593–602. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41586-023-06839-2.

  21. Ito Y, Shiramatsu TI, Ishida N, Oshima K, Magami K, Takahashi H. Spontaneous beat synchronization in rats: Neural dynamics and motor entrainment, Sci Adv, vol. 8, no. 45, Nov. 2022, https://doiorg.publicaciones.saludcastillayleon.es/10.1126/sciadv.abo7019.

  22. Nave KM, Snyder JS, Hannon EE. Sustained musical beat perception develops into late childhood and predicts phonological abilities. Dev Psychol, 2023.

  23. Theofanopoulou C, Jarvis ED. Converging evidence for a human spoken language pathway. OSF Preprints, 2023.

  24. Gervais M, Wilson DS. The evolution and functions of laughter and humor: a synthetic approach., Q Rev Biol, vol. 80, no. 4, pp. 395–430, Dec. 2005, https://doiorg.publicaciones.saludcastillayleon.es/10.1086/498281.

  25. Pisanski K, Cartei V, McGettigan C, Raine J, Reby D. Voice Modulation: a window into the origins of Human Vocal Control? Trends Cogn Sci. Apr. 2016;20(4):304–18. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.tics.2016.01.002.

  26. Kamiloğlu RG, Tanaka A, Scott SK, Sauter DA. Perception of group membership from spontaneous and volitional laughter, Philosophical Transactions of the Royal Society B: Biological Sciences, vol. 377, no. 1841, Jan. 2022, https://doiorg.publicaciones.saludcastillayleon.es/10.1098/rstb.2020.0404.

  27. Krumhuber EG, Kappas A. More What Duchenne Smiles Do, Less What They Express, Perspectives on Psychological Science, vol. 17, no. 6, pp. 1566–1575, Nov. 2022, https://doiorg.publicaciones.saludcastillayleon.es/10.1177/17456916211071083.

  28. Schmidt MF, Kaplan M. Toward a comparative understanding of beat perception and synchronization. BMC Neurosci, 25, 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12868-024-00852-5.

  29. Sasaki ED, Sotnikova A, Gainetdinov TD, R. R., Jarvis. Social context-dependent singing-regulated dopamine. J Neurosci. 2006;26(35):9010–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Gadagkar V, Puzerey PA, Chen R, Baird-Daniel E, Farhang AR, Goldberg JH. Dopamine neurons encode performance error in singing birds, Science (1979), vol. 354, no. 6317, pp. 1278–1282, Dec. 2016, https://doiorg.publicaciones.saludcastillayleon.es/10.1126/science.aah6837.

  31. Theofanopoulou C, Gedman G, Cahill JA, Boeckx C, Jarvis ED. Universal nomenclature for oxytocin–vasotocin ligand and receptor families, Nature, vol. 592, no. 7856, pp. 747–755, Apr. 2021, https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41586-020-03040-7.

  32. Theofanopoulou C. A journey from speech to dance through the field of oxytocin. Compr Psychoneuroendocrinol. Nov. 2023;16:100193. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.cpnec.2023.100193.

  33. Theofanopoulou C. Implications of oxytocin in human linguistic cognition: from genome to Phenome. Front Neurosci. Jun. 2016;10. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fnins.2016.00271.

  34. Theofanopoulou C, Boeckx C, Jarvis ED. A hypothesis on a role of oxytocin in the social mechanisms of speech and vocal learning, Proceedings of the Royal Society B: Biological Sciences, vol. 284, no. 1861, 2017, https://doiorg.publicaciones.saludcastillayleon.es/10.1098/rspb.2017.0988.

  35. Tobari Y, et al. Oxytocin variation and brain region-specific gene expression in a domesticated avian species. Genes Brain Behav. Feb. 2022;21(2). https://doiorg.publicaciones.saludcastillayleon.es/10.1111/gbb.12780.

  36. Josef L, Goldstein P, Mayseless N, Ayalon L, Shamay-Tsoory SG. The oxytocinergic system mediates synchronized interpersonal movement during dance. Sci Rep. Feb. 2019;9(1):1894. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41598-018-37141-1.

  37. Gebauer L, Witek MAG, Hansen NC, Thomas J, Konvalinka I, Vuust P. Oxytocin improves synchronisation in leader-follower interaction. Sci Rep. Dec. 2016;6(1):38416. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/srep38416.

  38. Nilsson U. Soothing music can increase oxytocin levels during bed rest after open-heart surgery: a randomised control trial. J Clin Nurs. Aug. 2009;18(15):2153–61. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/j.1365-2702.2008.02718.x.

  39. Ooishi Y, Mukai H, Watanabe K, Kawato S, Kashino M. Increase in salivary oxytocin and decrease in salivary cortisol after listening to relaxing slow-tempo and exciting fast-tempo music. PLoS ONE. Dec. 2017;12(12):e0189075. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pone.0189075.

  40. Salimpoor VN, Benovoy M, Larcher K, Dagher A, Zatorre RJ. Anatomically distinct dopamine release during anticipation and experience of peak emotion to music, Nat Neurosci, vol. 14, no. 2, pp. 257–262, Feb. 2011, https://doiorg.publicaciones.saludcastillayleon.es/10.1038/nn.2726.

Download references

Acknowledgements

Not applicable.

Funding

CT wishes to thank The Rockefeller University and the Center for the Ballet and the Arts (New York University).

Author information

Authors and Affiliations

Authors

Contributions

CT wrote the paper.

Corresponding author

Correspondence to Constantina Theofanopoulou.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The author declares that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This comment refers to the article available online at https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12868-024-00843-6.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Theofanopoulou, C. Tapping into the vocal learning and rhythmic synchronization hypothesis. BMC Neurosci 25, 63 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12868-024-00863-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12868-024-00863-2

Keywords