

The human brain tracks speech more closely in time than other sounds

The way that speech processing differs from the processing of other sounds has long been a major open question in human neuroscience. Researchers at Aalto University have endeavoured to answer this by investigating brain representations for natural spoken words using machine learning models and comparing them with representations of environmental sounds that refer to the same concepts, such as the word 'cat' and a cat meowing.
Image caption: The cerebral cortex tracks the features of sound very precisely in order to understand speech. Image: Aalto University

Humans can effortlessly recognize and react to natural sounds and are especially attuned to speech. Several studies have aimed to localize and understand the speech-specific parts of the brain, but because the same brain areas are largely active for all sounds, it has remained unclear whether the brain has unique processes for speech and how it performs them. One of the main challenges has been to describe how the brain matches highly variable acoustic signals to linguistic representations when there is no one-to-one correspondence between the two, e.g. how it identifies words spoken by very different speakers and in different dialects as the same words.

For this latest study, the researchers, led by Professor Riitta Salmelin, decoded and reconstructed spoken words from millisecond-scale brain recordings in 16 healthy Finnish volunteers. They adopted the novel approach of using the natural acoustic variability of a large variety of sounds (words spoken by different speakers, environmental sounds from many categories) and mapping them to magnetoencephalography (MEG) data using physiologically inspired machine-learning models. Such models, with time-resolved and time-averaged representations of the sounds, have been used in brain research before; the novel, scalable formulation by co-lead author Ali Faisal allowed them to be applied to whole-brain recordings, and this study is the first to compare the same models for speech and other sounds.
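
To make the two sound representations concrete, the sketch below contrasts a time-resolved (time-locked) and a time-averaged linear encoding model on simulated data. Everything in it is illustrative: the spectrogram-like features, the dimensions, and the plain ridge regression are assumptions for this toy example, not the study's actual stimuli, MEG recordings, or model.

```python
import numpy as np

# Toy sketch (not the study's pipeline): predict simulated "MEG" responses
# from either the full spectro-temporal pattern of a sound or its
# time-averaged spectrum, and compare held-out prediction accuracy.
rng = np.random.default_rng(0)
n_train, n_test = 200, 50
n_freqs, n_times, n_sensors = 8, 20, 30   # invented dimensions

S_train = rng.normal(size=(n_train, n_freqs, n_times))  # toy spectrograms
S_test = rng.normal(size=(n_test, n_freqs, n_times))

def time_locked(S):    # keep the full time course, flattened
    return S.reshape(len(S), -1)

def time_averaged(S):  # collapse the time axis
    return S.mean(axis=2)

# Regime where timing matters, as the speech results suggest: responses
# are generated from the full time course of the features.
W_true = rng.normal(size=(n_freqs * n_times, n_sensors))
def responses(S):
    return time_locked(S) @ W_true + rng.normal(scale=4.0, size=(len(S), n_sensors))

Y_train, Y_test = responses(S_train), responses(S_test)

def ridge_score(X_train, X_test, lam=10.0):
    """Fit a ridge regression on the training set and return the mean
    correlation between predicted and actual test responses per sensor."""
    d = X_train.shape[1]
    W = np.linalg.solve(X_train.T @ X_train + lam * np.eye(d), X_train.T @ Y_train)
    pred = X_test @ W
    return np.mean([np.corrcoef(pred[:, i], Y_test[:, i])[0, 1]
                    for i in range(n_sensors)])

print("time-locked model  :", round(ridge_score(time_locked(S_train), time_locked(S_test)), 2))
print("time-averaged model:", round(ridge_score(time_averaged(S_train), time_averaged(S_test)), 2))
```

Because the simulated responses here depend on the full time course, the time-locked model predicts held-out responses far better; in the study, this kind of advantage appeared for speech but not for environmental sounds.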

Aalto researcher and lead author Anni Nora says, 'We discovered that time-locking of the cortical activation to the unfolding speech input is crucial for the encoding of speech. When we hear a word, e.g. "cat", our brain has to follow it very accurately in time to be able to understand its meaning'.

In contrast, time-locking was not highlighted in the cortical processing of non-speech environmental sounds that conveyed the same meanings as the spoken words, such as music or laughter. Instead, a time-averaged analysis is sufficient to access their meanings. 'This means that the same representations (what a cat looks like, what it does, how it feels, etc.) are also accessed by the brain when you hear a cat meowing, but the sound itself is analyzed as a whole, without the need for similar time-locking of brain activation', Nora explains.

Time-locked encoding was also observed for meaningless new words. However, even responses to human-made non-speech sounds such as laughter didn't show improved decoding with the dynamic time-locked mechanism and were better reconstructed using a time-averaged model, suggesting that time-locked encoding is specific to sounds identified as speech.
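
The opposite regime can be sketched in the decoding direction (again purely illustrative, with invented dimensions and a plain linear decoder rather than the study's reconstruction method): when simulated responses carry only time-averaged information about a sound, its time-averaged features are reconstructed well from the responses, while its full time course is not.

```python
import numpy as np

# Toy decoding sketch (not the study's method): responses are driven only
# by the time-averaged spectrum, as the environmental-sound results suggest.
rng = np.random.default_rng(1)
n_train, n_test = 200, 50
n_freqs, n_times, n_sensors = 8, 20, 30   # invented dimensions

S_train = rng.normal(size=(n_train, n_freqs, n_times))
S_test = rng.normal(size=(n_test, n_freqs, n_times))

W_true = rng.normal(size=(n_freqs, n_sensors))
def responses(S):  # responses "see" only the time-averaged spectrum
    return S.mean(axis=2) @ W_true + rng.normal(scale=0.3, size=(len(S), n_sensors))

Y_train, Y_test = responses(S_train), responses(S_test)

def decode_score(F_train, F_test, lam=10.0):
    """Ridge decoder from responses back to stimulus features; returns the
    mean correlation between reconstructed and actual test features."""
    W = np.linalg.solve(Y_train.T @ Y_train + lam * np.eye(n_sensors),
                        Y_train.T @ F_train)
    pred = Y_test @ W
    return np.mean([np.corrcoef(pred[:, i], F_test[:, i])[0, 1]
                    for i in range(F_test.shape[1])])

avg_train, avg_test = S_train.mean(axis=2), S_test.mean(axis=2)
full_train, full_test = S_train.reshape(n_train, -1), S_test.reshape(n_test, -1)

print("time-averaged reconstruction:", round(decode_score(avg_train, avg_test), 2))
print("time-locked reconstruction  :", round(decode_score(full_train, full_test), 2))
```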

Results indicate that brain responses follow speech with especially high temporal fidelity

The current results suggest that, in humans, a special time-locked encoding mechanism might have evolved for speech. Based on other studies, this processing mechanism seems to become tuned to the native language through extensive exposure to the language environment during early development.

The present finding of time-locked encoding specifically for speech deepens our understanding of the computations required for mapping between acoustic and linguistic representations (from sounds to words). It also raises the question of which specific aspects of sounds cue the brain to use this special mode of encoding. To investigate this further, the researchers next aim to use naturalistic auditory environments, such as overlapping environmental sounds and speech.

'Future studies should also determine whether similar time-locking might be observed with specialization in processing other sounds through experience, e.g. for instrumental sounds in musicians', Nora says.

Future work could investigate the contribution of different properties within speech acoustics and the possible effect of an experimental task in boosting the use of the time-locked or time-averaged mode in sound processing. These machine learning models could also be very useful when applied to clinical groups, for example to investigate individuals with impaired speech processing.

Read More:

Dynamic time-locking mechanism in the cortical representation of spoken words

Contact information:

Dr Anni Nora, Aalto University
anni.nora@aalto.fi

Professor Riitta Salmelin, Aalto University
Phone: +358 50 344 2745
riitta.salmelin@aalto.fi

