the challenge of understanding the human auditory brain
The physical variability of speech, combined with its perceptual constancy, makes speech recognition a challenging task. The human auditory brain, however, performs speech recognition effortlessly.
How does the auditory brain learn to robustly recognise auditory objects, such as naturally spoken words, in a transform-invariant manner despite the huge variability of the raw auditory wave inputs? Which areas within the extensive auditory brain hierarchy are important for this task? What is the simplest neural code sufficient to represent the learnt auditory objects in the output layers of the hierarchy? Does the brain use rate or temporal encoding to represent auditory objects? These are some of the questions we are trying to address.
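To make the rate-versus-temporal distinction concrete, the short sketch below contrasts the two readouts on the same toy spike train: a rate code summarises the response as a spike count over a window, whereas a temporal code uses the timing of individual spikes, here the first-spike latency. The spike times and window length are purely illustrative assumptions, not data from the model or from recordings.

```python
import numpy as np

# Hypothetical spike times (seconds) for one neuron responding to a spoken word.
spike_times = np.array([0.012, 0.019, 0.034, 0.051, 0.080, 0.140, 0.210])
window = 0.250  # analysis window in seconds

# Rate code: the stimulus is represented by how many spikes fall in the window.
firing_rate = len(spike_times) / window  # spikes per second

# Temporal code: the stimulus is represented by when spikes occur,
# e.g. the latency of the first spike relative to stimulus onset.
first_spike_latency = spike_times[0]

print(f"rate code:     {firing_rate:.1f} Hz")
print(f"temporal code: first spike at {first_spike_latency * 1000:.0f} ms")
```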
making sense of sound
Neurophysiological studies have provided insights into the architecture and response properties of different areas within the auditory brain hierarchy; however, the precise computational mechanisms used to learn stimulus-specific, transform-invariant representations of auditory objects, such as phonemes or words, remain unknown. To understand these mechanisms, we have developed an unsupervised spiking neural network model grounded in the known neurophysiology of the auditory brain. The model can be used to generate neurophysiologically testable hypotheses about how the brain performs auditory object recognition, and we are working closely with the Oxford Auditory Neuroscience Group to test the hypotheses generated by the model. Furthermore, the model can serve as a prototype for a novel approach to automatic speech recognition (ASR) which, thanks to its grounding in the neurophysiology of the auditory brain, should be able to cope with robustness problems that current ASR systems struggle with, such as speaker variability and recognition in noise.
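The equations and parameters of the model itself are not given here; as a rough illustration of the ingredients such an unsupervised spiking network typically combines, the sketch below drives a single layer of leaky integrate-and-fire neurons with Poisson input spikes (standing in for preprocessed sound) and adapts the feedforward weights with a simplified spike-timing-dependent plasticity (STDP) rule. All layer sizes, parameter values, and the choice of STDP form are illustrative assumptions, not the actual model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: input spike channels (e.g. cochlea-like frequency
# channels) feeding one layer of output neurons.
n_in, n_out = 40, 10
dt, t_steps = 0.001, 500                 # 1 ms steps, 500 ms of input

# Leaky integrate-and-fire parameters (illustrative values only).
tau_m, v_thresh, v_reset = 0.020, 1.0, 0.0

# STDP parameters: pre-before-post potentiates, post-before-pre depresses.
tau_trace, a_plus, a_minus = 0.020, 0.01, 0.012

w = rng.uniform(0.0, 0.1, size=(n_out, n_in))   # feedforward weights
v = np.zeros(n_out)                              # membrane potentials
pre_trace = np.zeros(n_in)                       # presynaptic spike traces
post_trace = np.zeros(n_out)                     # postsynaptic spike traces

for t in range(t_steps):
    # Hypothetical input: Poisson spikes standing in for a preprocessed sound.
    in_spikes = (rng.random(n_in) < 0.02).astype(float)

    # Leaky integrate-and-fire dynamics: leak towards zero, add weighted input.
    v += dt / tau_m * (-v) + w @ in_spikes
    out_spikes = (v >= v_thresh).astype(float)
    v[out_spikes == 1] = v_reset

    # Exponentially decaying spike traces used by the STDP rule.
    pre_trace += -dt / tau_trace * pre_trace + in_spikes
    post_trace += -dt / tau_trace * post_trace + out_spikes

    # Unsupervised STDP weight update, applied locally at each synapse.
    w += a_plus * np.outer(out_spikes, pre_trace)    # potentiation at post-spike
    w -= a_minus * np.outer(post_trace, in_spikes)   # depression at pre-spike
    np.clip(w, 0.0, 1.0, out=w)
```

With repeated presentations of structured input, a rule of this kind lets output neurons become selective to recurring spatio-temporal input patterns without any labels, which is the sense in which the learning in the model is unsupervised.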