Introduction

Control of a computer or any other electronic device using only brainwaves is a long-hoped-for next-generation interaction tool (Leeb et al., 2006). Direct brain–computer interfaces (BCIs) can provide severely disabled people with new augmentative communication and control technology (Wolpaw et al., 2002). In the field of health care and life sciences, a new nonmuscular channel could help restore voluntary motor control or improve the effectiveness of the rehabilitation process (Chaudhary et al., 2016). Beyond medical applications, recent experimental research has explored the usability of BCIs in the fields of multimedia and gaming (van Erp et al., 2012). Healthy customers could use this new input modality as part of an enhanced game experience (Bos et al., 2010).

The detection of cortical generators may help control external devices such as robots (Spataro et al., 2017), virtual environments (Leeb et al., 2006), or spelling devices (Kaplan et al., 2013) with various degrees of accuracy. Different strategies have been implemented to provide a convenient and straightforward way to command such devices. For instance, a user could steer a wheelchair by thinking about his or her right or left hand (Tsui et al., 2011). Functional connectivity changes within associated regions of the cortex would then allow the two classes to be distinguished (Pfurtscheller & Neuper, 2001). A motor imagery (MI) task is often preceded by the study of motor execution (ME). Sensorimotor rhythms are spontaneously generated in both tasks, and analyzing the latter should help establish a more profound understanding of the former. The specific mental states resulting from these tasks can be produced in an external stimulus-paced mode (synchronous BCI) or an internally paced mode (asynchronous BCI). The aforementioned brain signals are representative of endogenous BCI because they do not depend on any external stimuli (Torres et al., 2020).

Steady-state visual evoked potentials (SSVEP) (González-Mendoza et al., 2015) and P300 visual evoked potential (VEP) (Nicolas-Alonso & Gomez-Gil, 2012) represent the most commonly used exogenous BCIs. SSVEPs are elicited by the repetitive presentation of a visual stimulus that flickers or reverses at a fixed frequency. When a user focuses his or her gaze on a specific flickering image, an induced periodic brain electrical response is produced and can be associated with a unique command. Similarly, P300 is generated by a sequence of visual stimuli. When an infrequent event appears, it elicits an electrophysiological response that can be correlated with a specific choice. Other BCIs may use different types of brain activity, for instance emotion (Torres et al., 2020), speech (Wang et al., 2013) or hearing (Nijboer et al., 2008).

Many methods have been employed over the past seven decades to measure electrical or magnetic brain activity through invasive or noninvasive means (Waldert et al., 2009). Invasive approaches, such as electrocorticography (ECoG) (Schalk & Leuthardt, 2011), are believed to support more complex applications, because high spatial and temporal resolution may be achieved. However, because of several technical issues and the inherent risk of surgery, only a few severely disabled patients have been implanted with electrodes so far (Oxley et al., 2020). Noninvasive approaches, such as functional magnetic resonance imaging (fMRI) (Sitaram et al., 2007), positron emission tomography (PET) (Aine, 1995), magnetoencephalography (MEG) (da Silva, 2013) and electroencephalography (EEG) (Curran & Stokes, 2003), have all been extensively used as neuroimaging techniques (Vallabhaneni et al., 2005).

EEG represents the most prevalent method of signal acquisition for noninvasive BCI. Electrical brain oscillatory activity, generated by feedback loops in complex networks of neurons populations, is recorded by scalp electrodes (Vallabhaneni et al., 2005). The advantages of practicality, ease of use, high temporal resolution, and high coverage are counterbalanced by the limited topographical resolution and frequency range (Hill & Wolpaw, 2016). However, consistent activation patterns associated with specific cognitive tasks have been highlighted during fMRI or PET studies (Cabeza & Nyberg, 2000). Assuming that the strength and accuracy of each modality would be preserved, the complementarity of information can be exploited by multimodal imaging techniques. The integrated knowledge gained from high spatial resolution measurements should help improve the accuracy of EEG-based BCI applications (Blinowska et al., 2009).

Current EEG-based BCIs are often recorded from multiple channels. A higher number of electrodes ensures a greater coverage of the brain and a more precise solution to the inverse problem. For uniformly distributed sensors over the full surface of the head, source localization accuracy has been shown to increase drastically from 31 to 63 sensors but much less from 63 to 123 electrodes (Michel, 2004). However, a greater number of channels can reduce the accuracy of the classification because while some sensors carry significant information, others introduce noise and thus deteriorate the results (Müller et al., 2000). This so-called curse of dimensionality can be alleviated by selecting an optimized subset of electrodes. The lower computational complexity also leads to easier real-time calculation and detection of predefined cognitive tasks.

Channel selection is used to identify this optimized subset of EEG electrodes. Most reviews and studies focus on the evaluation and efficiency of nonparametric algorithms to establish the optimal number of channels involved in a specific application (Alotaiby et al., 2015; Ghaemi et al., 2017; He et al., 2009; Barachant & Bonnet, 2011; Li et al., 2011; Arvaneh et al., 2010; Schröder et al., 2005; Qi et al., 2021; Gurve et al., 2020; Lal et al., 2004). To the best of our knowledge, no reviews have been carried out using prior findings from diverse recording modalities (fMRI, PET, EEG). For this purpose, the current paper uses a selection of past studies for four commonly used paradigms—MI, ME, SSVEP, and P300. For each task, the most recurring electrodes across studies are defined as the core channel selection (CCS) subset. The CCS represents the lowest common denominator, meaning that the CCS electrodes are supposed to be always activated and could be used either directly or through a specific combination to determine the paradigm characteristics.

Method

One of the primary purposes of a meta-analysis is to draw conclusions from past studies often reporting disparate results. Pooling independent effect size estimates can inform about the direction and magnitude of an effect even more accurately than any of the individual estimates. Non-systematic biases that may arise, for instance from preprocessing steps, are assumed to become insignificant using this approach (Sacchet & Knutson, 2013).

Literature search

The review is based on the PRISMA (preferred reporting items for systematic reviews and meta-analyses) statement and the recommendations of the Cochrane Collaboration (Liberati et al., 2009; Eccleston & Morley, 2009). Fully published articles were drawn from searches on Google Scholar in September 2020. English-language studies in which the subjects were asked to perform either motor imagery (MI), motor execution (ME), or to focus on SSVEP or P300, were included. This literature search yielded 117 articles.

All neuroimaging techniques were incorporated as long as the subjects were healthy human participants performing the correct task (e.g., only hand movement was considered for MI and ME; foot motion was rejected). Ninety-seven articles met these further requirements. The quality of the data was the most exclusive criteria. Studies that did not specify the exact location (3D coordinates from any atlas or EEG electrodes) of the brain structures involved in the task or did not provide enough data to compute the mean and standard error or deviation of these locations were excluded from further analysis (see Fig. 1). After applying these criteria, 21 articles remained. Figure 1a reproduces the standard PRISMA flowchart diagram, while Fig. 1b highlights the precise number of papers excluded for each exclusion criterion.

Fig. 1
figure 1

a Standard PRISMA flowchart diagram; b details of the exclusion criteria used with the associated number of studies, n.

The main purpose of this review is to create a knowledge-based channel selection framework with sufficient statistical power and practical significance. The latter can be established by quantifying the effect size. To compute all these measures, either the Z-score, the t-test, the Cohen’s r, or Cohen’s q should be available along with the corresponding standard error and deviation when necessary. The lack of reporting of such measures (or access to the original data) is a well-known issue in neuroscience (Poldrack et al., 2017) and might explain the generally small numbers of included studies in meta-analyses ((n = 9) (Snyder & Hall, 2006), (n = 30) (Mullin et al., 2016), (n = 9)) (Arns et al., 2013).

The metadata for the 21 articles (n = 21) included in this meta-analysis is detailed in Table 1. In total, 311 healthy subjects participated in these studies, with a highly variable sample size ranging from 5 to 50 participants (mean = 14.8 ± 11.1). Most experiments used fMRI as a single recording technique (62%) or EEG (24%). The preprocessing details for cleaning, normalization and spatial filtering are mostly linked to the technical modality employed, and only three studies use channel re-referencing.

Table 1 Main features of included studies; analyzed data are from * (only for multiple recordings)

These studies compare brain activations according to three main themes: (a) task dissimilarities, (b) stimuli characteristics, and (c) individual differences. Among the first group, four studies (Gerardin et al., 2000; Stippich et al., 2002; Kraeutner et al., 2014; Burianová et al., 2013) compared MI to ME, while four others (Menon et al., 1997; Ramirez-Quintana et al., 2020; Ardekani et al., 2002; Mulert et al., 2004) investigated the target vs. no target detection in P300, and two (Clark et al., 2000; Bledowski et al., 2004) the differences between target and distractor. One paper (Grafton et al., 1996) specifically investigated the difference between MI and viewing, whereas one (Begliomini et al., 2018) was dedicated to the differences between the left and the right hand in ME, and another one (Perlstein et al., 2003) to the dissimilarities between memory and control.

Fewer studies examined the impact of stimuli depending on their frequency (three articles (Srinivasan et al., 2007; Srinivasan et al., 2006; Martinez et al., 2007)), color (only one (Ikegami et al., 2012)), or location (also only one (Russo et al., 2007)). Differences between left- and right-handed individuals were examined in two papers (Willems et al., 2009; Begliomini et al., 2008), and a final one (Mokienko et al., 2013) was interested in the impact of BCI training on neural source activation.

EEG channel determination

Multichannel EEG measures signals using either the 10-20, 10-10, or 10-5 international systems for standard electrode placement (Jurcak et al., 2007). The underlying neural generators are recorded as a mixture of signals emitted from several potential time-varying sources inside the brain (von Bünau et al., 2010). The goal of the BCI paradigms is to produce distinct patterns of cortical activation to discriminate between various classes (Halder et al., 2011). For commonly used paradigms, knowledge of task-dependent cortical generators can inform the relevant recording positions for channel selection. In this paper, we review various studies that use source localization to identify significant cortical generators. The closest of the 62 selected EEG electrodes are then estimated for each of the results obtained from fMRI or PET. Our goal is to provide a knowledge-based channel selection framework that could be used either as a verification step or as part of the classification procedures.

The Koessler 3D anatomical atlas (Koessler et al., 2009) was used to correlate the 3D Talairach coordinates of the EEG electrode of the 10-10 system with their cortical projection points. For convenience, the Koessler 3D anatomical atlas is available as a CSV file at https://github.com/QinXinlan/review-effect-size. This 10-10 resolution combines the advantages of higher precision (yet still providing background compatibility to the 10-20) and reproducibility (by avoiding overlapping that may arise with the 10-5). The likely Brodmann area (BA) (respectively macro-anatomical structure) beneath each sensor is then calculated as the most frequently found in the population. Using the mean 3D Talairach coordinates and standard deviations of each sensor, it is possible to compute the Euclidian distance to each cortical generator reported in the reviewed articles. Standard deviations in x, y, and z play asymmetric roles, so the most likely electrode is not necessarily the closest one. When two EEG electrodes are nearly equidistant from a cortical source, both are recorded in the following tables with the electrode at the minimal Euclidian distance in the first position.

Power analysis

Statistical power analysis reveals whether a finding actually reflects a true effect. By definition, the power of a statistical test is the probability that it will yield significant results. Its complement β = 1 − power represents the error rate of failing to reject a false null hypothesis (type II). The statistical power is linked to three parameters: the significance criterion (type I error rate), the sample size, and the effect size. This last variable represents the degree to which the phenomenon can be detected. The larger the effect size is, the more easily the phenomenon can be exhibited (Cohen, 1977).

Every corresponding author of the 21 studies has been contacted and solicited to provide necessary information for computing the effect size. Most effects are reported as group differences; thus, Cohen’s d has been favored in this review (Cohen, 1977). Heuristically, this number measures the statistical level of distinction between two variables. Given two sample groups, the greater the distance between their means, the more easily they can be separated. The Cohen’s d index measures this distance in terms of standard deviations units and is considered dimensionless: \({\mathrm{Cohen}}^{\hbox{'}}\mathrm{s}\ d=\frac{M_1-{M}_2}{SD}\) (where M1 is the mean of group 1, M2 the mean of group 2, and SD, the standard deviations, are assumed equal in this equation, as they are similar in reality). A more-specific Cohen’s d formula can be used, or other metrics can allow the characterization of the phenomenon. For instance, in a P300 experiment, group 1 could be defined as the target stimuli and group 2 as the nontarget stimuli.

For all studies, the mean across subjects (computed from Z-score, t-test, percentage of signal change, or raw data) was calculated on every selected electrode. For each task and each electrode, the mean effect size was computed, and weighted by the number of subjects in individual studies. To assess the statistical significance of the results, a 95% confidence interval was also outlined. In this review, most of the confidence intervals exclude 0, meaning that the mean effect size is statistically significant at α = 0.05 (Ellis, 2010). If the mean effect size, the lower bound of the confidence interval, and the weights used in the corresponding computation are large enough, the selected EEG channel will be validated. The R code used to calculate the effect sizes per study (Ben-Shachar et al., 2020) and the weighted mean effect size is available online at https://github.com/QinXinlan/review-effect-size.

Sample size estimation

How many studies would be necessary to obtain a sufficient statistical power? Conventionally, in meta-analysis methods, fixed effects and random effects models are used. In the former case, the included studies are assumed to represent the entire universe of studies of interest and hence, there is no possibility of sampling error (Hunter & Schmidt, 2000). In view of the high variability of the phenomena under study, the included studies are considered to only represent a small portion of all the available data and thus, the slightly more sophisticated random effects model is preferred.

To evaluate the sample size, i.e., the number of studies, choices and hypotheses must be made on some parameters. Typically, the type I error rate is set at α = 0.05 and the relative seriousness of type I to type II error is 4 to 1 ( β = 0.2 and thus power = 0.8). The standard normal cumulative distribution function for β = 0.2 yields a value of −0.842, and a mean of the Z statistic of 1.64 − (−0.842) = 2.482 for a one-tailed test.

Under the random effects hypothesis, the Z statistic has a normal distribution with a mean equal to \({\lambda}^{\ast }=\frac{\overline{d}-0}{\sqrt{v^{\ast }/k}}\), where k is the number of studies (random effects estimates are usually noted with an asterisk (Hedges & Pigott, 2001)). The “typical” sampling variance of the random effects estimate is calculated (Valentine et al., 2010) with \({v}^{\ast }=v+{\tau}^2=\frac{2}{n}+\frac{{\overline{d}}^2}{4}\ast n+{\tau}^2\). Given a random effects analysis, the number of studies should then be \(k=\frac{v^{\ast }}{{\left(\overline{d}/{\lambda}^{\ast}\right)}^2}\), where λ = 2.482.

A scoping review on different electrodes led us to believe that the estimated population effect size may vary from \(\overline{d}=1.5\) to \(\overline{d}=3\) with a variance τ2 = 1 and for an average number of subjects per study n = 15. For an estimated population effect size of \(\overline{d}=1.5\), approximately four studies (k = 3.2) are needed under the random effects assumptions to obtain an approximate power of 0.80. For a greater estimated effect size of \(\overline{d}=3\), only one study (k = 0.88) is needed to obtain a sufficient power, with all other hypotheses remaining equal.

Hypothesis

This review and its potential applications rely on three assumptions: (1) despite the substantial inherent variability in the exact source localization of neural generators, the sensorimotor and visual networks are postulated to be consistent across subjects and sessions, and the other cortical networks relevant to our BCI paradigms should probably be considered more subject-dependent; (2) the estimated EEG electrodes or their nearest neighbors are likely to reflect most of the relevant cortical information; (3) the classification accuracy is presumed to be optimized by the choice of an adequate subset of electrodes.

Results

Motor imagery (MI)

MI-based BCI is a popular interaction paradigm because it relies on voluntary movement and can be used, in theory, by healthy and impaired subjects alike in synchronous or asynchronous paradigms. In particular, the movement of the right and left hands is often selected because of its easily distinguishable activity in the contralateral cortical regions responsible for the movement of the limbs. MI is usually defined as the mental rehearsal of a motor act (Crammond, 1997). This cognitive process appears to correspond to the activation of the neural correlates of motor representations (Rizzolatti & Craighero, 2004). It includes the planning and preparation of movements but not the motor output or somatosensory feedback. The neural network involved during MI activates several cortical areas depending on the task and on the participant’s handedness and familiarity with BCI and the task. Several studies (Gerardin et al., 2000; Stippich et al., 2002; Kraeutner et al., 2014; Burianová et al., 2013; Grafton et al., 1996; Willems et al., 2009; Mokienko et al., 2013) have described the mapping of cerebral networks by localizing the main sources in each functional area across participants. Table 2 compiles these results with similar inclusion criteria (i.e., for healthy subjects).

Table 2 3D Talairach coordinates of significant cluster maxima for motor imagery of left and right hand movements with corresponding international 10-10 EEG system closest electrode(s); BA: Brodmann area; Hem: hemisphere; L: left; R: right; NP: not pertinent; * for BCI-trained subjects, ** for BCI-naïve subjects; ++ specific to MI; left underline for left-handed subject’s dominant hand; right underline for right-handed subject’s dominant hand; and Cohen’s d mean effect size with a 95% confidence interval and the number of relevant studies in superscript

The estimated EEG electrodes are then classified into three main clusters as shown in Fig. 2. The first one gathers all electrodes that are not contralateral (i.e., either bilateral or unilateral). The second cluster represents all contralateral sources that depend on the user experience (familiarity with BCI). The third one corresponds to all contralateral sources consistently present among subjects. Ipsilateral clusters can also be activated alongside contralateral sources to a lesser extent (Porro et al., 2000). If an electrode may be classified as both contralateral and unilateral, the bilaterality may be more influential and the electrode is labelled as such. Finally, the effects of handedness for the dominant hand on electrode activation is emphasized with an upper black point. For instance, FC2 is mostly activated for left-handed participants when moving their left dominant hand.

Fig. 2
figure 2

Motor imagery paradigm: Activated electrodes with their Brodmann areas and handedness-linked brain laterality

For the MI task, the CCS subset consists of electrodes {FC4, FC2, FC1, C4, C6, C5, C3, CP3, CP1}.

Motor execution (ME)

A voluntary movement is comprised of three phases: planning, execution, and recovery. During the first phase, similar functional circuits, located in the frontoparietal, subcortical, and cerebellar areas, are activated in both ME and MI, suggesting they share a common pattern in the planning and preparation tasks. However, MI also activates distinct regions that can be predominantly found in the left hemisphere, namely the middle temporal gyrus and the fusiform gyrus (BA 21 and 37, electrodes TP7, P7, and T7). The predicted visual consequences of an action are believed to be represented in the former gyrus (Schippers & Keysers, 2011), whereas lesions in the latter impair pantomime recognition (Varney & Damasio, 1987). Thus, these regions could be hypothesized to participate in the planning of an imaginary movement. Furthermore, lesions in the superior and inferior parietal cortex (BA 7 and 40; electrodes CP4, P1, and P2) lead to impairment in the ability to imagine movements. Therefore, these cortices could be responsible for ensuring that the action being executed matches the intended action (Danckert et al., 2002).

During ME, the secondary sensory area (S2, BA 43; electrode CP6) seems to display heightened activation and is assumed to be related to the processing of proprioceptive input (Dresel et al., 2005). More specifically, action planning has been associated with an increase of activity in the left-lateralized supramarginal gyrus (Króliczak et al., 2016) (BA 40; electrode CP5). A possible explanation of these differences between MI and ME relies on the evolutionary theory of motor task learning. Mentally rehearsing the action would allow for multiple attempts without the risk of causing any harm and, hence, would help find the best strategy for a difficult motor task (Gerardin et al., 2000). Some studies (Gerardin et al., 2000; Stippich et al., 2002; Kraeutner et al., 2014; Burianová et al., 2013) compared the somatotopic mapping during MI and ME, while others (Begliomini et al., 2008, 2018) examined the effects of handedness. Their results are summarized in Table 3.

Table 3 3D Talairach coordinates of significant cluster maxima for motor execution of left and right hand movement; ++ specific to ME; NP: not pertinent; BA: Brodmann area; Cohen’s d mean effect size with a 95% confidence interval and the number of relevant studies in superscript

For the ME task, only two clusters have been identified, as illustrated in Fig. 3—either non-contralateral (bilateral or unilateral) or contralateral. The ME topographical distribution, as expected, largely overlaps the MI one. However, the assumption that MI is a more complex cognitive task might explain why the number of cortical generators activated with ME is much smaller for a simple task (Hardwick et al., 2017). Interestingly, mental chronometry studies have shown that the time course of MI positively correlates with ME, illustrating the parallelism between these two tasks (Vargas et al., 2004).

Fig. 3
figure 3

Motor execution paradigm: activated electrodes with their Brodmann areas and handedness-linked brain laterality

For right-handed subjects (an estimated 90% of the worldwide population), right hand MI and ME activates a restricted cluster compared to left hand MI and ME (Burianová et al., 2013). Right-handed subjects also show a predominant activation of motor and visuomotor control in the contralateral (i.e., left) hemisphere, while left-handed subjects have been characterized by a more bi-hemispheric recruitment of neuronal circuits when performing a precision grasping task. This might indicate a left hemispheric location of visuomotor control (Begliomini et al., 2018). In general, the more repetitive the task is, the more specialized and smaller the neural circuit would be. Therefore, the nondominant hand, being used less often, would require the additional control provided by a more widespread neural activation. Finally, it should be noted that the complexity of the task, implying other functions, influences the lateralization and the level of activation in some regions is age-dependent (Ward & Frackowiak, 2003).

For the ME task, the CCS subset consists of electrodes {FC4, FC2, FC1, FC3, C2, C1, C4, C3}. A common CCS for all motor-related tasks (MI and ME) could also be defined with the electrodes {FC1, FC2, FC4, C3, C4}.

Steady-state visual evoked potentials (SSVEP)

Changes in the visual field are known to impact EEG activity. In particular, the periodic contrast or luminance modulation of a fixed frequency (usually at a range of 6–40 Hz) elicits an SSVEP that can be detected at the same fundamental frequency as the flickering stimulus. The cortical oscillations are phase locked to the periodic stimulus and appear to be predominantly present in posterior occipital areas as well as the lateral geniculate nucleus and optic radiation (Krolak-Salmon et al., 2003). The two major neural generators seem to be localized in the primary visual cortex (V1/V2) and the motion-sensitive (MT/V5) areas, respectively. However, the two minor contributors seem to be located in the mid-occipital (V3A) and ventral occipital (V4/V8) areas (Russo et al., 2007). In addition to the occipital areas, steady-state responses may also flicker over frontal and prefrontal areas depending on the stimulus frequency (Srinivasan et al., 2007). Furthermore, the narrow frequency bands surrounding each flicker frequency appear to synchronize different patterns of cortical functional networks. Table 4 shows the EEG electrodes calculated from the topographical maps of the potential SSVEP power at the frequencies relevant to one of our previous experiments (Srinivasan et al., 2006; Luu & Ferree, 2005).

Table 4 Mainly activated EEG electrodes according to the stimulus frequency

SSVEP-based BCIs often have a high temporal and spectral resolution (usually less than 0.1 Hz) as well as high accuracy and a high information transfer rate (ITR)(Nunez et al., 2006), but can produce visual fatigue or discomfort (Zhu et al., 2010). The temporal frequency, spatial frequency, contrast, luminance, color, and hue of the driving flickering stimulus all influence the amplitude and phase of the SSVEP (Zhu et al., 2010; Regan, 1989). This dependency on the input frequency partly reflects the delay between the retina and primary visual cortex as well as the delays between areas of the visual system (Schmolesky et al., 1998). Moreover, the area of on-screen stimulus also has an impact on the cortical modulation. For instance, a 6 Hz frequency sinusoidal waveform displayed in the upper-left quadrant of the screen will elicit a different SSVEP waveform than a similar stimulation displayed in the lower left quadrant (see Table 5)(Nicolas-Alonso & Gomez-Gil, 2012; Russo et al., 2007).

Table 5 3D Talairach coordinates of the two significant cluster maxima for steady-state visual potentials depending on the visual stimulus location on the screen. The available data did not allow calculations of the Cohen’s d effect size per electrode.

The review of five previous studies (Perlstein et al., 2003; Srinivasan et al., 2006, 2007; Martinez et al., 2007; Russo et al., 2007) is summarized in Fig. 4. Three different types of information are illustrated using distinct systems. The color clusters illustrate the differences between contralateral and unilateral or bilateral sources. The corner symbols depict the activations that depend on the on-screen quadrant location (upper or lower, left or right). Finally, the text colors highlight whether the EEG electrodes are activated for some or all frequencies relevant to our experiment.

Fig. 4
figure 4

Steady-state visual evoked potential paradigm: Activated electrodes with their Brodmann areas and brain laterality linked to quadrant location or full-screen frequency

For the SSVEP task, the CCS subset consists of electrodes {P1, PZ, P2, PO3, POZ, PO4, O1, OZ, O2}.

P300

Attention to a change in the environment elicits a burst of activity, peaking at about 300 ms after the sensory stimulus. This P300 event-related potential (ERP) is an umbrella term encompassing two separate attentional processes. The P3a, or Novelty P3, occurs in response to all rare sounds or images—designated as deviant stimuli—regardless of whether they are targets (Spencer et al., 2001; Gaeta et al., 2003). The P3b appears after the appearance of a low-probability target item embedded in a train of high-probability nontarget (or standard) items. Both the amplitudes of the P3a and P3b, as well as their peak latencies, can be characterized as functions increasing (respectively decreasing) with age (Fjell & Walhovd, 2004). Additionally, the topographical distribution of the two P3 components seems to also be age dependent (West et al., 2010).

Typically elicited between 250 ms and 500 ms, the P3b is believed to reflect the process of directed attention leading to conscious awareness of salient stimuli (Menon et al., 1997). Detecting this cognitive component can reflect user intention and provide useful biomarkers for normal aging or several brain or mental diseases (Friedman, 2003; Rossini et al., 2007). Table 6 summarizes the locations and time-courses (when available) of its neural generators from seven studies (Menon et al., 1997; Ramirez-Quintana et al., 2020; Ardekani et al., 2002; Mulert et al., 2004; Clark et al., 2000; Bledowski et al., 2004; Ikegami et al., 2012). Furthermore, the source contributions calculated from the functional magnetic resonance imaging (fMRI) activation clusters are also indicated. However, as the authors warn, the intrinsic technical and theoretical differences between ERP and fMRI might cause some discrepancies. For instance, the temporally dispersed activity reflected by fMRI (which is typically integrated over 1 second and then averaged) might get lost when compared to ERP (which is only averaged) (Bledowski et al., 2004).

Table 6 3D Talairach coordinates of significant cluster maxima for the P3b component of the P300 event-related potential; NP: not pertinent; BA: Brodmann area; Cohen’s d mean effect size with a 95% confidence interval and the number of relevant studies in superscript

Two different types of information are depicted in Fig. 5. One highlights the contribution to the generation of the P300 for each electrode (low or high), while the other indicates the time-dependent activation for several EEG electrodes, using five 50-ms time intervals (Mulert et al., 2004). The chrono stimulation is illustrated on two consecutive images for the sake of simplicity.

Fig. 5
figure 5

P3b component of the P300 paradigm: Electrodes’ level and chronology of activation with corresponding Brodmann areas

For the P300 task, the CCS subset consists of electrodes {CP3, CP4, P1, P3, P2, P4, P6}.

Discussion

In this paper, we first reviewed the source localization of cortical generators induced by four widely used BCI paradigms. The closest EEG electrodes were then computed for each of these task-dependent neuronal sources. Our objective is to create a paradigm-based channel selection framework that could be used as a reference by any EEG researcher.

Inter-subject variability

Brain atlases are becoming more ubiquitous in the field of neuroscience because they provide a reference framework for the analysis and visualization of neuroimaging data. By nature, the human cerebral cortex is deeply convoluted, and its folding pattern presents dramatic inter-subject variability (Van Essen & Dierker, 2007). Brain size and shape, as well as the dimension of cortical areas, also differ drastically across individuals (Andrews et al., 1997). Quantification of the inter-subject variability of functional connectivity has demonstrated a nonuniform distribution across brain regions. The prefrontal and temporoparietal regions exhibit the highest level of variability, whereas the sensorimotor and visual networks show the lowest level (Mueller et al., 2013). The higher variability in the former regions is hypothesized to indicate the larger influence of environmental or epigenetic over purely genetic factors. Conversely, the latter regions seem to represent the hallmarks of common mammalian brain evolution and development (Zilles & Amunts, 2013).

The estimated EEG electrodes corresponding to known cortical generators do not hold identical significance probability. The lower inter-subject variability of sensorimotor and visual networks should be more constant across subjects and result in a higher average accuracy for BCI classification. Assuming the repeatability of the motor or visual task performed by the subject, the estimated EEG electrodes for an individual can be hypothesized to remain mostly unchanged over time and across sessions if they belong to motor (BA 4, 6, 8), sensory (BA 1, 2, 3, 5, 40), or visual (BA 17, 18, 19) regions. For electrodes that are part of other brain areas, more variability should be expected between subjects.

Equivalent current dipole

Scalp EEG electrodes are generally believed to record macroscopic postsynaptic potentials created by assemblies of pyramidal cells of the neocortex (mostly located in cortical layers III, V, and VI) (Olejniczak, 2006). To measure this generated electric field at a distance from the sources, the underlying neuronal currents must be well organized in space and time. The estimation of cortical generators corresponding to a certain distribution of electrical potentials recorded at the scalp is an ill-posed problem, known as the inverse problem. Different models of the neuronal sources and of the volume conductor have been investigated to estimate approximate solutions. Most neurophysiological findings are based on the simplest source model, namely the equivalent current dipole. Because of the columnar organization of the cortex, the resulting EEG potential is assumed to behave as if it were produced by normally oriented current dipoles over the entire cortical sheet (da Silva, 2004; Attal et al., 2009).

An equivalent dipole approximates the barycenter of the corresponding active cortical area at a given time with no appreciable delay in the scalp sensor measurement. Assuming that the signal propagates linearly and instantaneously inside a homogeneous and isotropic medium, both the position and orientation of the dipole affect the potential scalp spatial pattern. If the dipole is radial to the scalp surface or located in superficial cortical areas, the closest EEG electrode should record the highest potential activity regardless of its orientation. For a deeper tangential dipole, the maximum amplitude would be recorded at a neighboring electrode (Congedo, 2013). Hence, under the linear conduction model assumption, the closest EEG electrode or its neighbors should receive most of the signal generated by the underlying cortical generator.

Signal-to-noise ratio

Four main types of noise affect the recorded EEG signals. Instrumental noise might arise from equipment but is typically low and is considered to be uncorrelated with the signals. Environmental noise is routinely avoided by acquiring the data in a sound-attenuated and electromagnetically shielded EEG chamber. Biological noise emerging from extra-cerebral artifacts, such as blinks or muscle movements, may be removed using preprocessing algorithms, such as ICA (Shen et al., 2002) or ABC (Guttmann-Flury et al., 2019). We presume that the channel selection procedure is applied to cleaned data, free from these three types of noise. However, the underlying cerebral background noise is still present and accounts for most of the EEG signals.

Conventionally, background noise summarizes all spontaneous neural activity of various magnitudes and frequencies unrelated to the task at hand. With the hypothesis that this noise is generated from randomly distributed and statistically independent stationary dipole sources, averaging single events should yield the reproducible part of the signals and cancel the noise. Thus, time-locked averaging to a stimulus (visual or auditory) can be used to determine the locations and time courses of the relevant sources with a high signal-to-noise ratio (SNR) (de Munck et al., 2002). Latency adjusted averaging can also be used to enhance the SNR of evoked responses (e.g., by applying an adequate narrow-band Gaussian filter) (de Munck et al., 2002; Burghoff et al., 2005). Furthermore, SNR can be improved by channel selection, which may also increase classification accuracy and reduce computational complexity. Assuming that close electrodes record similar background noise, subtracting a low-SNR neighboring electrode from a selected one might meliorate the SNR, and, thus, the accuracy (Wang et al., 2004).

Limitations

This review summarizes the results from 21 studies. Each paradigm uses the same type of task; however, different conditions or experimental setups might alter the homogeneity of the results. For example, in the MI paradigm, subjects were asked to imagine flexion or extension of single fingers, specific finger movements, or precision grasping of objects.

Additionally, the participants’ demographics are biased towards right-handedness; more specifically, in the visual-related experiments (SSVEP or P300), no article mentions left-handed participants. For motor-related experiments (MI or ME), specifying right-handedness as an inclusion criterion accounts for 62% of the reviewed papers but 24% do not report on it, 9% mention both, and 5% refer to left-handedness only. Nevertheless, the effects of handedness have been well established in the motor areas (Zapała et al., 2020) and, to a lesser degree, in the visual areas (Willems et al., 2010). Hence, the results derived from these studies should be applied carefully, and the linked variability must be considered.

For motor-related tasks, participants were instructed to use their dominant, nondominant, or both hands depending on the paradigm. These conditions yield a hand-dependent heterogeneity. Figure 6 illustrates this issue for four of the reviewed studies on electrode FC1 during an MI task. For this electrode, data were only available for right-handed subjects. The individuals’ Cohen’s d effect sizes were clearly greater when using the dominant hand compared to the left hand. As a consequence, the right-weighted mean effect size differed from the left one. However, for most electrodes, data were insufficient to generalize the observations. To harmonize the results, the weighted mean effect size for both hands were only reported in Tables 2 and 3.

Fig. 6
figure 6

Cohen's d effect size per study for MI task on electrode FC1 and weighted mean effect size according to the subject handedness, hand used, and sample size N (number of subjects)

Moreover, the data acquisition was usually executed during a single session. Inter-session variability has been highlighted for time-varying brain functions (resting state networks) (Meyer et al., 2013). Considering the dynamic aspect of task learning involved in most BCI paradigms, the average cortical locations probably vary for a single subject across time and across sessions (Chervyakov et al., 2016). The procedure of averaging over trials is arguably suboptimal but is still the most commonly used. Intra-subject variability, as a result, is mostly unaccounted for in this review.

This variability issue is also reflected in the Cohen’s d effect size. For one study (Ramirez-Quintana et al., 2020), the corresponding author provided the raw data from eight subjects who performed 34 trials each. Thus, it was possible to calculate the effect size by averaging all trials regardless of which subject performed the action (N = 272). To compare, the ERP was computed for each subject, and the resulting values were used to determine the effect size across subjects (N = 8). The obtained subject-dependent effect size (5.18 ± 1.1) proved to be much larger than the trial-dependent effect size (1.39 ± 0.05), in which the means were obtained from electrode O1 and displayed with their standard error. This highlights the greater intra-subject variability compared to the inter-subject one.

When averaging over all trials for a specific subject, the signal of interest stands out while the noise decreases. Theoretically, a high number of trials would completely cancel the noise. A researcher might favor a type of calculation depending on the application the study is aimed at. To characterize an ERP, one might prefer to average per subject to remove intra-subject variability and to get a noise-free signal. When designing a new study, the weighted mean effect size could be used to predict the minimum number of subjects to include in the experiment. However, it should be noted that, because nearly all reviewed studies compute subject-dependent averages, the results shown in the tables are only valid for similar computations.

Conclusion

The current review summarizes the subsets of EEG electrodes corresponding to the most significant cortical source activations. Sample sizes were estimated depending on the effect sizes values to obtain a statistical power of 80%. For every electrode, the weighted mean Cohen’s d provides the magnitude of the corresponding effect size. It should be noted that each element in a subset must have a large Cohen’s d with a high lower bound confidence interval, confirming its significance.

For each of the four commonly used BCI paradigms—MI, ME, SSVEP, and P300—the core channel selection (CCS) was extracted from the defined groups. These CCS allow implementing semiparametric algorithms that could adapt to various modalities of these paradigms. A meta-analysis with a greater number of studies would lead to more specifically adapted and extended subsets. Ideally, this meta-analysis should be followed by a sufficiently powered experiment that would assess the validity of the different CCS. It could also be interesting to compare the CCS to other data-driven channel selection methods.

A relevant definition of subsets would replace the nonparametric channel selection procedure while reducing the computational complexity and optimizing the BCI accuracy. This knowledge-based channel selection framework would have the advantages of practicality and rapidity, allowing for an easy implementation.