Skip to main navigation Skip to search Skip to main content

Deep learning in auditory attention decoding: A systematic review

  • Xi'an Jiaotong-Liverpool University
  • University of Edinburgh

Research output: Contribution to journalReview articlepeer-review

Abstract

Auditory attention decoding (AAD) utilizes electroencephalography (EEG) to detect an individual's attentional focus in noisy, multi-speaker environments, serving as a cornerstone for next-generation neuro-steered hearing aids. Although deep learning has recently surpassed traditional linear models in decoding accuracy, a holistic overview connecting preprocessing pipelines with advanced model architectures is lacking. This systematic review addresses this deficiency by analyzing 82 studies selected via PRISMA guidelines. We comprehensively dissect the AAD framework, from data processing to the evolution of architectures like convolutional neural networks (CNNs), graph neural networks (GNNs), and Transformers. Beyond reviewing trends, we synthesize practical design guidance, recommending the alignment of preprocessing complexity with model capacity and the use of hybrid architectures to capture spatiotemporal dynamics. Crucially, this study highlights persistent challenges impeding real-world transferability, including the reliance on high-density EEG montages incompatible with wearables, the computational latency on edge devices, and the lack of realistic acoustic scenarios. By identifying these bottlenecks and synthesizing effective design choices, this review offers a structured reference for developing robust, low-latency, and subject-independent auditory attention detection systems suitable for practical brain-computer interface applications.

Original languageEnglish
Article number2640250
JournalSystems Science and Control Engineering
Volume14
Issue number1
DOIs
Publication statusPublished - 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Auditory attention decoding
  • deep learning
  • electroencephalography

Cite this