Mutual Information-Driven Subject-Invariant and Class-Relevant Deep Representation Learning in BCI
- Authors
- Jeon, Eunjin; Ko, Wonjun; Yoon, Jee Seok; Suk, Heung-Il
- Issue Date
- 2021
- Publisher
- IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- Keywords
- Electroencephalography; Feature extraction; Training; Mutual information; Brain modeling; Transfer learning; Decoding; Brain-computer interface (BCI); deep learning; domain adaptation; electroencephalogram; motor imagery; mutual information; subject-independent; transfer learning
- Citation
- IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/135803
- DOI
- 10.1109/TNNLS.2021.3100583
- ISSN
- 2162-237X
- Abstract
- In recent years, deep learning-based feature representation methods have shown a promising impact on electroencephalography (EEG)-based brain-computer interface (BCI). Nonetheless, owing to high intra- and inter-subject variabilities, many studies on decoding EEG were designed in a subject-specific manner by using calibration samples, with no concern of its practical use, hampered by time-consuming steps and a large data requirement. To this end, recent studies adopted a transfer learning strategy, especially domain adaptation techniques. Among those, we have witnessed the potential of adversarial learning-based transfer learning in BCIs. In the meantime, it is known that adversarial learning-based domain adaptation methods are prone to negative transfer that disrupts learning generalized feature representations, applicable to diverse domains, for example, subjects or sessions in BCIs. In this article, we propose a novel framework that learns class-relevant and subject-invariant feature representations in an information-theoretic manner, without using adversarial learning. To be specific, we devise two operational components in a deep network that explicitly estimate mutual information between feature representations: 1) to decompose features in an intermediate layer into class-relevant and class-irrelevant ones and 2) to enrich class-discriminative feature representation. On two large EEG datasets, we validated the effectiveness of our proposed framework by comparing with several comparative methods in performance. Furthermore, we conducted rigorous analyses by performing an ablation study in regard to the components in our network, explaining our model's decision on input EEG signals via layer-wise relevance propagation, and visualizing the distribution of learned features via t-SNE.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Graduate School > Department of Artificial Intelligence > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.