EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Toward Large-Scale EEG Model

Chengxuan Qin, Rui Yang*, Wenlong You, Zhige Chen, Longsheng Zhu, Mengjie Huang, Zidong Wang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The increasing number of dispersed EEG dataset publications and the advancement of large-scale Electroencephalogram (EEG) models have increased the demand for practical tools to manage diverse EEG datasets. However, the inherent complexity of EEG data, characterized by variability in content data, metadata, and data formats, poses challenges for integrating multiple datasets and conducting large-scale EEG model research. To tackle the challenges, this paper introduces EEGUnity, an open-source tool that incorporates modules of "EEG Parser", "Correction", "Batch Processing", and "Large Language Model Boost". Leveraging the functionality of such modules, EEGUnity facilitates the efficient management of multiple EEG datasets, such as intelligent data structure inference, data cleaning, and data unification. In addition, the capabilities of EEGUnity ensure high data quality and consistency, providing a reliable foundation for large-scale EEG data research. EEGUnity is evaluated across 25 EEG datasets from different sources, offering several typical batch processing workflows. The results demonstrate the high performance and flexibility of EEGUnity in parsing and data processing.

Original languageEnglish
Pages (from-to)1653-1663
Number of pages11
JournalIEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING
Volume33
DOIs
Publication statusPublished - 2025

Keywords

  • Brain-computer-interface
  • electroencephalogram data integration
  • large-scale model
  • open-source software

Fingerprint

Dive into the research topics of 'EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Toward Large-Scale EEG Model'. Together they form a unique fingerprint.

Cite this