Introduction
Moter imagery (MI) is one of the classic brain computer interface (BCI) paradigms, which has been widely used in neural rehabilitation. It refers to the utilization of imagined motor movements, rather than actual physical actions [1]. This endogenous spontaneous EEG pattern differs from the evoked EEG activity, since it does not require any external stimulus, but relies solely on the imagined movements generated by the user. The EEG data collected from scalp of users can be used to accurately identify the intended motor tasks. Among various types of EEG signals, MI has received considerable attention and is regarded as a flexible measure of brain activities. As a result, MI-BCI has broad applications in assisting patients with object control and self-care, as well as serving as a tool for rehabilitation and physiotherapy to help patients recover their motor abilities to the fullest extent possible [2], [3]. The high temporal resolution of EEG data is a desirable merit that enables the investigation and diagnosis of various brain disorders or mechanisms [4], [5]. MI is a response to the cognitive task of imagining hand or leg movements and has been widely examined for BCI applications. As such, automated MI classification using machine learning [6], [7] and deep learning techniques [8], [9] has been extensively explored. However, the single-brain MI-BCI paradigm faces several technical challenges, such as low recognition accuracy and weak stability since the decision is made by a single person, and there may be a lack of cross-checking EEG data [10].
The emergence of EEG hyperscanning technology aims to overcome the limitations of traditional single-brain BCI systems. Hyperscanning refers to the technique that synchronously records the brain activity from two or more users who engage in a specific cognitive task. Therefore, the neural mechanisms of social interaction between brains are revealed. Multi-brain BCI can be implemented using such techniques. The advent of hyperscanning technology has allowed the study of social interaction neural mechanisms to advance from a “stand-alone paradigm” to a “two-brain/multi-brain neuroscience” by involving multiple individuals and improved the ecological validity beyond observation of single individual [11]. Though different brains might have different underlying mechanisms in encoding the external world, the mental distance among these encoded stimuli is expected to remain constant to a great extent. Therefore, it is rational to presume that inter-brain coupling relationships exist in a representation space of a higher level [12]. This is the reason why multi-brain synergistic BCI can be utilized in multi-brain interaction experiments, such as MI. EEG-based hyperscanning is a straightforward and cost-effective method for recording brain activities, which is widely utilized in multi-brain BCI systems.
The hyperscanning technique is often utilized to study social interaction, including the joint decision and initiator-receiver interactions [13]. In joint decision hyperscanning research, it is indicated that an increasing number of subjects improves the classification performance of multi-brain joint operation hyperscanning systems through the fusion of multi-brain EEG signals [14]. The application of multi-brain BCI in group rehabilitation offers benefits such as peer social support, increased accessibility and opportunity for treatment [15].
To address the challenges mentioned above, we proposed a novel coupling feature-based few-shot learning decoding method for multi-brain MI-BCI. This approach achieved the improved classification accuracy with a limited amount of EEG data. In summary, the present study comprises the following contributions.
A multi-brain MI experimental paradigm is designed to incorporate idle state detection, with the aim of considering the interactions and coupling features among multi-brain. Therefore, more comprehensive information such as the inter-brain connectivity and synchronization patterns is captured, which was not typically provided in traditional single-brain studies. This could potentially be taken as a new paradigm for rehabilitation.
The proposed method consists of a feature extraction module for mining cross-brain coupling information, as well as a few-shot learning module to conduct feature classification. Specifically, this method leverages hypergraph learning to extract interpretable representations of brain-to-brain coupling relationships, which can describe higher-order relationships between multiple nodes. Furthermore, it exploits the property that tensor decomposition can extract discriminative features of high-dimensional data. We employ a few-shot learning MI-BCI module to deal with the common problem that only limited amount of data is usually available in MI-BCI field.
Experiments evaluated the effectiveness of the coupling features as well as the recognizing MI task based on EEG only using a small amount of data, which could facilitate the development of stable MI-BCI with limited training data. In addition, A new evaluation metric, to quantify the impact of inter-brain coupling coordination relationships on the joint classification decision among multi-brain is proposed. The results show that there is a positive correlation between the cross-brain coupling coordination degree and the classification accuracy.
We structure the remainder of this paper as follows. Section II presents the relevant works especially on the related methods and Section III indicates the implementation of our proposed method. In Section IV, we describe the multi-brain experimental paradigm. Section V reveals the experimental results. The cross-brain coupling coordination degree is investigated in Section VI, followed by the conclusion in Section VII.
Related Works
In what follows, we present a description of the relevant techniques that are involved in our proposed method. Specifically, these techniques include the hypergraph learning, tensor decomposition, and few-shot learning.
A. Hypergraph Learning
Hypergraph structures are extensively employed in various fields for modeling higher-order data correlations. It not only preserves valuable information, but also models relationships among multiple nodes. This idea was first introduced in [16], which proposed a propagation process over hypergraph structures. The goal of transition inference on hypergraphs is to minimize the discrepancy of labels among strongly connected vertices. Zhou et al. employed the hypergraph theory to partition EEG samples into a predetermined number of clusters, where each cluster corresponds to an emotional category [17]. Vertices within the same cluster share similar emotional characteristics. In [18], Gao et al. proposed a novel seizure detection approach by integrating hypergraph features with machine learning.
B. Tensor Decomposition
Similar to the matrix factorization, tensor decomposition can be applied to tensor-structured data, which inherently takes advantage of the interactions between multiple modes of the tensor [19]. EEG signals are commonly represented as matrices, and analyzed using methods like time series and spectral analysis, as well as matrix decomposition. Typically, EEG signals exhibit more than the two temporal and spatial patterns and necessitate the tensors representations [20]. Recently, tensor decomposition has proven highly effective in extracting and analyzing features of EEG signals, leading to remarkable outcomes [21], [22]. In this paper, we primarily employ the Tucker decomposition to extract the core feature tensor of a hypergraph generated by EEG data from two subjects.
C. Few-Shot Learning
Few-shot learning is another important machine learning paradigm, it achieves fast adaptation and transfer abilities of the model by utilizing a limited amount of data to learn new target categories. For example, in [23], a universal few-shot learning framework was proposed, wherein the classifier is demanded to discern novel classes that are absent in the training set, and only sparse examples are accessible for each emerging class. In [24], the authors proposed to compute the distance between the input data and the prototype representation of each class in the metric space, and subsequently performed classification based on the proximity to each prototype. In [25], a Siamese neural network was proposed for single-shot image recognition, which achieved comparable performance to human recognition abilities. Few-shot learning methods can be applied to EEG signal decoding. In [26], an end-to-end trainable learning paradigm, MLCL, was proposed for decoding emotion recognition from EEG signals. Reference [27] proposed a meta-learning strategy to search for optimal parameters for BCI decoders, which resulted in an increased motor imagery classification accuracy for EEG-based MI-BCI decoding. A novel two-way few-shot network was designed in [28], which was capable of effectively learning representative features for unseen target categories and classifying them with limited MI EEG data.
Method
This paper proposes a coupling feature-based few-shot learning model for robust EEG decoding in MI-BCI, given only limited data. In Fig. 1, we present the overall framework of our proposed method consisting of a coupling feature extraction module and a few-shot learning module, which will be applied to the three-class decoding task in multi-brain MI. In the first module, we employ the hypergraph learning and Tucker decomposition to extract the coupling relationship features among multi-brain in performing the same MI task. To handle limited sample size, we adopt the relation network as a few-shot learning module for the subsequent three-class decoding task.
The overall framework of our proposed method applied to EEG multi-brain MI scenario of three-classification task.
A. Hypergraph Learning-Based Coupling Feature Extraction
Specifically, hypergraphs are defined on a finite set
A hypergraph is defined as \begin{align*} \mathbf {H}=h(v,e) = \begin{cases} \displaystyle 1, & \text {if} \quad v \in e;\\ \displaystyle 0, & \text {if} \quad v \notin e. \end{cases} \tag{1}\end{align*}
Let
By drawing inspiration from recent studies [29], [30], [31], we utilize the sparse coding model (also named Lasso regression in statistics), to construct \begin{equation*} \underset {\alpha _{i}}{\min } \frac {1}{2}\Vert x_{i} - \mathbf {A}_{i}\alpha _{i}\Vert ^{2}_{2}+\lambda \Vert \alpha _{i}\Vert _{1}, \tag{2}\end{equation*}
Through selecting an appropriate value for
We assume that the coupling relationships among channels should exhibit non-negative correlations. Accordingly, we can obtain the coupling relationships (interactions) between the selected central channel and other channels within the same hyperedge, thereby filtering out any insignificant or spurious connections. The methodology employed in this study enables the effective representation of coupling relationships among several channels by leveraging the local information within each hyperedge in the hypergraph. After optimizing the
Based on the above analysis, hypergraph learning automatically adjusts of the influence of different hyperedges by learning their respective weight values, consequently reducing redundancy within the hyperedges and generating distinctive FCNs. Obviously, when
B. Tucker Decomposition-Based Feature Compression
Considering two matrices
In this section, we will delve into the mode-\begin{align*} \chi &\approx \mathbf {G}\times _{1}\mathbf {A}\times _{2}\mathbf {B}\times _{3}\mathbf {C} = \sum \limits _{p=1}^{P}\sum \limits _{q=1}^{Q}\sum \limits _{r=1}^{R} g_{pqr}a_{p}\circ b_{q}\circ c_{r} \\ & =[\mathbf {G};\mathbf {A},\mathbf {B},\mathbf {C}], \tag{3}\end{align*}
Below, we utilize the core tensor derived from the Tucker decomposition for EEG decoding task. The element-wise representation of tensor \begin{align*} x_{def} &\approx \sum \limits _{p=1}^{P}\sum \limits _{q=1}^{Q}\sum \limits _{r=1}^{R} g_{pqr}a_{dp}b_{eq}c_{fr}, \\ d&=1, {\dots },D, e=1, {\dots },E, f=1, {\dots },F. \tag{4}\end{align*}
Following the application of hypergraph learning on the EEG data, two hypergraphs were obtained that depict the intricate brain coupling relationships of paired-group subjects during performing cognitive tasks. Both hypergraphs are concatenated to produce a three-dimensional tensor, which is subjected to Tucker decomposition across three dimensions, i.e., subject, channel and channel. As a result, the core tensor is extracted to capture the coupling relationships between the two subjects, which will be employed in the subsequent classification module.
C. Relation Network-Based Few-Shot Learning
The Relation Network is comprised of two modules, i.e., the embedding module and the relation module. To be precise, the embedding module is trained by the core tensors obtained by hypergraph learning and Tucker decomposition, whereas the relation module is trained on the merged feature map of the train and the test core tensors. In particular, the training set is
The selected loss function for the model is the mean square error loss \begin{equation*} \mathrm{argmin} \sum _{i=1}^{m} \sum _{j=1}^{n} (r_{i,j}-1(y_{i}==y_{j}))^{2}, \tag{5}\end{equation*}
The embedding module consists of four convolutional blocks, each among which comprises a 64-filter
The structure of the relation module consists of two convolutional blocks and two fully-connected layers. Each convolutional block employs a 64-filter
Experiments
This Section describes the multi-brain MI paradigm for EEG data acquisition and the setup for the EEG decoding method.
A. The Multi-Brain MI Experiment Design
16 healthy participants were recruited and grouped into eight pairs for the purpose of EEG data collection. All participants were inexperienced with the BCI system, and had received detailed instructions regarding the experimental protocol prior to the commencement of this study. To optimize the experimental experience for participants, adjustments were made to the distance between the chair and LCD monitor based on their feedback, as illustrated in Fig. 2(a). In the experiment, two participants performed the MI task simultaneously and two connected Neuroscan amplifier were used to record the EEG signals according to international 10–20 system. The sampling rate of the signal is 1000 Hz. It was down-sampled to 100 Hz in this study. Signal impedance was maintained at or below 15K
Multi-brain MI experimental environment and design. (a) Environmental configuration of the multi-brain MI EEG data acquisition. (b) Experiment protocol in each trial including the instruction for 1.5 s, the MI execution for 4 s, and resting state for 2 s.
The purpose of this multi-brain MI experiment was to decipher distinct user intentions within a three-class MI task, including left hand, right hand motor imagery and idle state. There were five sessions in the experiment and each session consisted of 75 trials. Each trial lasted 7.5s, comprising video clip cue for 1.5 seconds, the MI task for 4s, and the rest duration for 2s. Fig. 2(b) showed the protocol of our experiment. A short training phase was used to participants to familiarize them with the experiment before the formal sessions.
B. Model Setup of the Few-Shot Learning Framework
Initially, we present the
In the experimental setup, we implemented leave-one-subject-out cross-validation, where among 8 groups of participants, signals from 7 groups of subjects were used as training dataset, and the rest was used for testing. During the training process, K samples are randomly selected for training, while all remaining samples were used for validation.
To assess the effectiveness of our proposed approach comprehensively, we validated the approach on situations with different numbers of few shots (i.e.,
Results
In this section, we show the results of model evaluations and performance comparisons. Specifically, the error-bar value used in Fig.3– Fig.7 is the standard deviation.
Accuracy comparisons between the single-brain mode and the dual-brain mode for the three-class MI decoding task. Asterisks represent statistical significance levels (*
Accuracy comparisons for each group between the single-brain and the dual-brain modes.
Investigation on the impacts of hypergraph learning and Tucker-based tensor decomposition.
Investigation on the impacts of Tucker decomposition-based feature compression in each of the eight groups.
A. Performance of Single-Brain and Dual-Brain Modes
We first compare the experimental performance between the multi-brain MI paradigm and the single-brain. Specifically, the multi-brain paradigm used here is the dual-brain paradigm. The Fig. 3 shows that the average classification accuracy of the multi-brain mode is nearly 10% higher than that of the single-brain mode in the 1-shot case. More prominently, around 15% performance improvement was achieved by the multi-brain mode in the 5-shot, the 10-shot, and 15-shot cases. As shown in Fig. 4, the dual-brain outperforms the single-brain for all paired groups in all cases of 1-shot, 5-shot, 10-shot and 15-shot. The dual brain advantages of the fifth, seventh and eighth groups gradually emerged as the number of shots increased, with the improved accuracy of dual-brain than single-brain modes ranging from 1.57% to 15.99%. This obviously demonstrates the capability of capturing the inter-brain coupling relationships in the multi-brain mode, which is beneficial for improving the EEG decoding accuracy and leads to superior performance compared to the popular single-brain mode in motor imagery. The underlying rationale is also intuitive. The multi-brain MI involves brain-to-brain coupling relationships and enhances traditional MI by promoting the mutual recognition, collaboration, and synchronization between users. As a result, our study demonstrates that multi-brain MI has greater classification accuracy than single-brain mode.
B. Comparison to the State-of-the-Art Methods
We conduct experiments by comparing our proposed EEG decoding method with state-of-the-art methods, including the EEG Convolutional Neural Network (EEGNet) [8], the Filter Bank Common Spatial Patterns (FBCSP) [7], the Graph Convolutional Network (GCN) [34], the Prototypical Networks (Proto. Nets) [24], the Siamese Neural Network (Siamese Nets) [25], the Model-Agnostic Meta-Learning for EEG Motor Imagery Decoding in Brain-Computer-Interfacing (MAML-BCI) [27] and the two-way few-shot network (DA-RelationNet) [28].
Generally, EEGNet is a compact convolutional neural network model, which was originally proposed for MI EEG decoding. FBCSP is a common spatial pattern-based spatial filtering technique, which is implemented by feature selection combined with frequency band segmentation. GCN basically incorporates graphs into convolutional neural network to better model the data correlations, which was originally proposed for solving semi-supervised learning. Prototypical Networks and Siamese Neural Network are few-shot learning methods. MAML-BCI is a meta-learning deep architecture with three processing stages, designed to optimize the parameters of BCI decoders so that they can quickly generalize to different subjects. DA-RelationNet is a dual attention relation network that can generalize on unseen subjects by using few-shot learning and a FT strategy for EEG-based MI classification.
The parameters of the above compared methods were set according to the original papers.
Table I presents the results for the three-class MI classification task (i.e., left hand, right hand, and idle state). EEGNet exhibited the lowest performance in small-sample tasks. For example, it achieved an accuracy of about 40% only when
C. The Impacts of Frequency Bands and Time Windows
This Section performs experiments with EEG data corresponding to different frequency bands and time windows, to identify the best one. Since the motor imagery task mainly correlated to the
Besides the different frequency bands, we tested the experimental data with different time window periods, that is, we fix the entire frequency band and then respectively divide the MI task time windows (i.e., the 4 seconds from 1.5s to 5.5s) into four segments, i.e., [0, 1], 1, 2, 2, 3], and [3], [4]. The classification accuracy results corresponding to these different time segments are presented in Table III. Obviously, the highest accuracy was achieved when the time window is set as [1], [2], i.e., the time window from the first second to the second second after the onset of MI and its accuracies corresponding to different numbers of training samples are shown in the second row. These findings indicate that there might be a startup time required for the synchronization between the two participants, which occurs after the second second of the execution.
Furthermore, in order to explore the proper duration for motor imagery task analysis, we conducted the experiment on the decoding accuracy between data with 0-1s, 0-2s, 0-3s, and 0-4s time window cases. Results shown in the Table IV, it can be observed that the 0-2s exhibits the better decoding accuracy through different number of few-shots, which indicates that the proper duration for the MI task lies within the first two seconds.
D. Ablation Experiments
In this section, we verify the effectiveness of each component of our proposed EEG decoding method by ablation experiments. We evaluate its performance by reserving or deleting a sub-module. Fig. 5(a) shows us the classification accuracies with and without the hypergraph learning, and in Fig. 5(b), a comparison of recognition accuracies with and without the Tucker decomposition is provided. From both experiments, we can see that there are only slight differences between using and not using a certain technique, in the case of
Furthermore, we have performed ablation experiments to demonstrate the effectiveness of each component in our proposed methods, i.e., the hypergraph learning and the Tucker decomposition. It is obvious that the combination of both components leads to performance improvements of 8.12% (i.e., 68.49% vs. 60.37%) and 5.92% (i.e., 68.49% vs. 62.57%), respectively.
To more intuitively show the results on each paired group, we provide the ablation experimental results in Fig. 6 and Fig. 7, which demonstrate that our proposed method was consistently effective for all the eight groups. In more detail, the results of the second, third and fourth groups are superior to those of the remaining groups. As shown in Fig. 6, the performance of these three groups increased by 15 percentage points with the help of the hypergraph learning technique. Additionally, when
To provide more insights into the learned hypergraph between subjects, in Fig. 8, we visualize the inter-brain and intra-brain hypergraph connections of subjects during motor imagery and idle states. Notably, a greater number of inter-brain connections were observed during the task. In contrast, the idle state is characterized by almost non-existent inter-brain connections and has primarily the intra-brain connections. The more connections mean that there are more information flow and synchronization. During the idle state trials, the paired participants can imagine any different motor imagery task independent content therefore the synchronization becomes lower. This disparity allows for better differentiating the task and idle states, leading to improved decoding performance. This f1gure 1 ntuitively explains why the hypergraph learning techniques can bring higher classification accuracy in multi-brain MI decoding.
The visualization of brain hypergraph connections during task (left subfigure) and idle (right subfigure) states.
In Fig. 9, we visualize the low-dimensional data representations with and without the Tucker decomposition. In Fig. 9(a), where Tucker decomposition techniques were not utilized, it is evident that there was inadequate differentiation between the features of the left and right hands. Nonetheless, in Fig. 9(b), the features respectively corresponding to the left and right hands became more separable after utilizing the Tucker decomposition techniques.
Illustration to the impact of Tucker decomposition by low-dimensional data visualization.
Discussion
In this section, we first introduce a quantitative metric for measuring the cross-brain coupling coordination degree, which is then evaluated on our collected multi-brain MI EEG data.
A. Cross-Brain Coupling Coordination Degree
To further investigate the impact on decoding accuracy, we adopt a new metric in this paper inspired from the field of economics, the cross-brain Coupling Coordination Degree (CCD). This metric serves as an analytical tool for assessing the level of coordinated development of phenomena.
Through a scholarly inquiry into the definitions and models of coupling in other disciplines such as economics and engineering, we have the following definition of the coupling degree between two brains:\begin{equation*} C_{i} =2\left \{{\frac {u_{i}(1) u_{i}(2) }{(u_{i}(1) +u_{i}(2))^{2} } }\right \}^{1/2}, \tag{6}\end{equation*}
Inspired by the graph theory, the shortest path length is employed to determine the most efficient route between two vertices. However, in cases where two nodes in the network are not connected, the calculation yields an infinite shortest path length, which lacks practical significance. To mitigate this issue, the concepts of global efficiency \begin{align*} E_{g} &=\frac {1}{N(N-1)} \sum _{i\ne j=1}^{N} \frac {1}{l_{ij} }, \tag{7}\\ u_{i}&=E_{l} =\frac {1}{N} \sum _{i=1}^{N} E_{g} (i), \tag{8}\end{align*}
The \begin{equation*} T_{i} =a \times u_{i}(1) +b \times u_{i}(2), \tag{9}\end{equation*}
To comprehensively present the coupling degree, intra and inter the single-brain 1 and the single-brain 2, this study integrates the coupling degree \begin{equation*} D_{i}=\sqrt {C_{i}\times T_{i}}, \tag{10}\end{equation*}
Consequently, CCD objectively reflects the level of coordination development intra and inter each single brain in the paired group, effectively avoiding abnormal situations where the information transmission efficiency level is low, but their coordination is high. The cross-brain coupling coordination degree model is characterized by simplicity, comprehensiveness, operability, and visual analysis.
B. Evaluation of the CCD Metric
Table V provides an analysis to the cross-brain CCD metric across the eight groups in our multi-brain MI experiments. The results demonstrate that the second, third and fourth groups (i.e,, the bold numbers) display the most effective cross-brain coupling coordination degree among the eight groups. This outcome is in line with our findings in Section V-D, indicating that the information flow for intra single brain as well as the inter paired brains have positive impact on the decoding accuracy.
C. Performance of Multi-Brain Brain Computer Interface From the View of Cognitive Neuroscience
Brain computer interface research has made tremendous progress in recent years. However, it remains challenging to transfer its results from the lab to the marketplace. Herein, one of the big challenges is underdeveloped paradigm which is invented about 30 years ago such as motor imagery [35]. To break the shortcomings of traditional BCI such as low stability, poor performance, researchers also try to propose a new scheme as hybrid BCI (hBCI), which is implemented by combining two or more kinds of EEG signals (i.e. motor imagery and P300), another combining EEG and other signals (i.e. Electromyography (EMG) signal) [36]. In this paper, we propose a new multi-brain experimental design and the corresponding EEG decoding method for motor imagery. For such multi-brain computer interface, it can be taken as a kind of hBCI paradigm, which combines the various kind of EEG signals as well as the interactions from multi-brains.
Our study demonstrated that the better classification performance was obtained when the dual-brain mode was compared to the single-brain mode. Previous studies on social interactions based on hyperscanning reveal that the synchronization between multiple brains is enhanced in the scenario of collaborative tasks since the participants share the same strategy and objective [11], [37]. An intuitive reason is that inter-brain synchrony is positively correlated with performance metrics. Moreover, alpha band was relevant to the social interactions [38] and effectively in motor imagery task, which might explain why the decoding accuracy obtained from
Conclusion
In this paper, we proposed an EEG-based multi-brain MI decoding method embedding the coupling relation feature extraction and few-shot learning to efficiently learn representative multi-brain features and classify them with limited amount of EEG data. A comprehensive multi-brain MI-BCI decoding study was conducted by employing the hypergraph learning, Tucker decomposition and relation network. Our experimental results validated the superiority of the multi-brain experimental paradigm over the single-brain paradigm. Notably, the results confirmed the effectiveness and importance of the coupling relationships between participants to the classification of MI. We also proposed a cross-brain CDC metric to quantify the relationships between brains. The results reiterate the contribution of coupling relationship to the classification performance, showing a positive correlation between the coupling strength and classification performance. Finally, the results demonstrate that our proposed method still works well even in the case of the limited samples (i.e., few shots). This study could provide a solution for high-performance BCI involving multiple participants.