Multi-objective optimization for EEG channel selection and accurate intruder detection in an EEG-based subject identification system

Moctezuma, Luis Alfredo; Molinas, Marta

doi:10.1038/s41598-020-62712-6

Download PDF

Article
Open access
Published: 03 April 2020

Multi-objective optimization for EEG channel selection and accurate intruder detection in an EEG-based subject identification system

Luis Alfredo Moctezuma¹ &
Marta Molinas¹Â

Scientific Reports volumeÂ 10, ArticleÂ number:Â 5850 (2020) Cite this article

4156 Accesses
25 Citations
2 Altmetric
Metrics details

Subjects

Abstract

We present a four-objective optimization method for optimal electroencephalographic (EEG) channel selection to provide access to subjects with permission in a system by detecting intruders and identifying the subject. Each instance was represented by four features computed from two sub-bands, extracted using empirical mode decomposition (EMD) for each channel, and the feature vectors were used as input for one-class/multi-class support vector machines (SVMs). We tested the method on data from the event-related potentials (ERPs) of 26 subjects and 56 channels. The optimization process was performed by the non-dominated sorting genetic algorithm (NSGA), which found a three-channel combination that achieved an accuracy of 0.83, with both a true acceptance rate (TAR) and a true rejection rate (TRR) of 1.00. In the best case, we obtained an accuracy of up to 0.98 for subject identification with a TAR of 0.95 and a TRR 0.93, all using seven EEG channels found by NSGA-III in a subset of subjects manually created. The findings were also validated using 10 different subdivisions of subjects randomly created, obtaining up to 0.97Â Â±Â 0.02 of accuracy, a TAR of 0.81Â Â±Â 0.12 and TRR of 0.85Â Â±Â 0.10 using eight channels found by NSGA-III. These results support further studies on larger datasets for potential applications of EEG in identification and authentication systems.

Towards a minimal EEG channel array for a biometric system using resting-state and a genetic algorithm for channel selection

Article Open access 10 September 2020

Selecting EEG channels and features using multi-objective optimization for accurate MCI detection: validation using leave-one-subject-out strategy

Article Open access 30 May 2024

Automated methodology for optimal selection of minimum electrode subsets for accurate EEG source estimation based on Genetic Algorithm optimization

Article Open access 02 July 2022

Introduction

An authentication system includes a stage in which the data is used in a multi-class model with all the subjects in the dataset to identify a specific subject. It also includes a verification step to compare the data from the claimed subject with that of the true subject alone in the dataset to detect whether the subject is an intruder or not. The order of these stages may differ depending on the approach.

Subject identification and authentication based on electroencephalography (EEG) has been presented as a potential candidate for the creation of new biometric systems. However, certain aspects need to be analyzed and improved before reaching an industrial level application. One is intruder detection, which is an essential security layer in any secure system. This is also important for the development of portable low-density EEG devices that retain similar accuracy as high-density EEG. Here, we consider both problems, authentication, and verification.

A number of different paradigms are used to stimulate and record EEG signals in various subject identification approaches: imagined speech^1,2,3, resting-state potentials^4,5, and event-related potentials (ERPs)⁶. ERPs are very low-voltage signals emitted by the human brain that appears on the scalp in response to specific events or stimuli. They produce several well-known patterns, of which one of the most studied is the P300 peak, which occurs approximately 300 ms after the onset of the stimulus^7,8.

In general, resting-state potentials and ERPs have been shown to be good candidates for a new biometric system for which there are several different state-of-the-art approaches^6,9,10,11,12, with the localization of the relevant channels differing, depending on the paradigm. Here, we used the ERP approach to stimulate the EEG signals.

Four main objectives need to be considered to ensure an optimal global configuration: 1) select the minimum number of relevant EEG channels, 2) reject all intruders, 3) accept all subjects that are already in the system, and 4) identify the subject by multi-class classification. Consideration of these four objectives ensures high-quality results.

In addition to classification accuracy, the evaluation of biometric systems may also include consideration of the true acceptance rate (TAR), false acceptance rate (FAR), and true rejection rate (TRR). The TAR is the percentage of times a system correctly verifies a true claim of identity, FAR the percentage of intruders that are verified as non-intruders, and TRR the percentage of times it correctly rejects an intruder.

An approach with one-second EEG signals from the FP1 and FP2 channels and a 256-Hz sample rate during the resting state has been proposed for a biometric system, extracting features directly from the raw data and using Fisherâs discriminant analysis¹⁰, obtaining an up to 0.966 TAR and a 0.034 FAR. Another approach used two-second EEG signals from the FP1 and FP2 channels, with a 2048-Hz sample rate, and authors used a set of classifiers to perform multi-class classification¹¹. They obtained an accuracy of 0.93 and a false positive identification rate of 0.165. A third approach presented the results of a study using the Cz EEG channel, which was selected manually, on 20 subjects during the resting-state¹², obtaining a TAR of 1.0 and TRR of over 0.8. None of these studies attempted to systematically select the minimal number of optimal channels to perform the task.

An important element is dimensionality reduction, which can be tackled through channel selection and feature extraction. Several approaches can be used to accomplish this task, including those based on methods such as principal component analysis (PCA)¹³, discrete wavelet transform (DWT)³, empirical mode decomposition (EMD)^2,3,6, and even approaches using raw data as input for different configurations of neural networks (NN)^14,15. Based on the current state-of-the-art and the results of our previous studies, we used EMD for sub-band extraction and then extracted four features for each: instantaneous and Teager energy and Higuchi and Petrosian fractal dimensions.

Here, we used a one-class support vector machine (SVM) classifier with a radial basis function (RBF) kernel to create an intruder detection layer. Then, we used the linear SVM classifier for multi-class classification, which was evaluated for accuracy by 10-fold cross-validation. Considering a common configuration for all tasks, one-class classification, multi-class classification, and channel selection presents an optimization problem that can limit the results and be computationally expensive. We tested performing searches in the limited space using the non-dominated sorting genetic algorithm (NSGA), which uses the Pareto-optimal solutions and a niche method to maintain stable sub-populations of good points¹⁶. The NSGA-II and NSGA-III methods were tested and the results compared, given their use of the elitism approach and predefined reference points.

Results

General configuration for intruder detection using all EEG channels

As described below, we used a two-stage approach for the entire process, illustrated in Fig.Â 1. Briefly, we created a one-class SVM model in which we aimed to train the model to recognize subjects that are already in the system and to reject those who are not (Intruders). In the first part of this experiment, we thus trained the model using subjects with IDs 1â13 (non-intruder) and only EEG signals from session one, using 30 instances and all EEG channels (56 channels). We then used the EEG signals from all the subjects of session two, considering subjects 14â26 as intruders, to validate the model (see Fig.Â 1). We evaluated the results using the TAR, TRR, and accuracy of multi-class classification (TableÂ 1).

Table 1 TAR, TRR, and accuracy for subject identification and authentication with EEG data from all channels using different nu and gamma values for one-class SVM.

Full size table

TableÂ 1Â presents an example of the results using subjects 1â13 as non-intruders and subjects 14â26 as intruders. They show that approximately 90% of the subjects were correctly accepted, but that also that only approximately 8% of the intruders were correctly rejected. However, changing the nu and gamma parameters for the SVM RBF changed the TAR and TRR to approximately 50% in both cases.

Given that all subjects with access (subjects 1â13) passed the first layer, we created a multi-class classifier for subject identification. We experimentally defined and used a SVM with a linear kernel to create this model, because of the results obtained in previous studies and also because it was found to be the best solution experimentally. The flowchart of the complete method is presented in Fig.Â 1. The accuracy we obtained following 10-fold cross-validation was 0.98, with a standard deviation of 0.02 (TableÂ 1).

We used this approach because our aim was to find the best configuration for the entire process. Creating a model using only the subjects with correct permission who passed the first layer would have affected the results and therefore not be valid.

Solving the four-objective optimization problem using NSGA-II with subjects 1â13 as non-intruders and 14â26 as intruders

From here on, we present experiments that simultaneously considered all the problems to investigate whether there is a particular combination that can solve them. The experiment consisted of solving the optimization problem defined in section methods using NSGA-II. It consists of finding the best nu and gamma for the SVM with the RBF kernel to increase the TAR, TRR, and accuracy of subject identification or maintain them as high as possible for previous configurations, while using the smallest number of possible EEG channels. In short, we performed NSGA-II for the channel selection method using the first 56 genes in a chromosome to represent the EEG channels and then four genes each to select the best nu and gamma parameters, obtaining thus a chromosome of 64 genes.

Several plots of the results obtained considering the four objectives are presented in Fig.Â 2 to illustrate the importance of the optimization process, as only 11.11% of the possible channel combinations resulted in a TAR and TRR between 0.9 and 1.0 (Sub-fig.Â 2e). The classification accuracy in relationship to the number of channels used and the Pareto-front are shown in Sub-fig.Â 2d,f.

The results for the Pareto-front for all objectives are presented in TableÂ 2. NSGA-II found a two-channel combination for which we obtained a TAR of 0.91, TRR of 0.88, and an accuracy of 0.78 for subject identification. NSGA-II also found a 12-channel combination for which the accuracy of subject identification was 0.93, the TAR 0.93, and the TRR 0.95. This result shows that it is possible to reduce the number of channels from 23, 24, etc (which gave similar accuracies) to almost half using this approach.

Table 2 TAR, TRR, and accuracies obtained in the Pareto-front for four objectives solved with NSGA-II using subjects 1â13 as non-intruders.

Full size table

Solving the four-objective optimization problem using NSGA-II with subjects 14â26 as non-intruders and subjects 1â13 as intruders

With the aim of providing a more global result, we repeated the previous experiment using the same configuration but now considering subjects 14â26 as non-intruders and subjects 1â13 as intruders. The results obtained for the four objectives are presented in TableÂ 3.

Table 3 TAR, TRR, and accuracies obtained for the first 30 EEG channels in the Pareto-front for four objectives solved with NSGA-II using subjects 14â26 as non-intruders.

Full size table

As in the previous experiment, we obtained an accuracy of up to 0.83 for subject identification, with both a TAR and TRR of 1.00, using just a three-channel combination (see TableÂ 3). Increasing the classification accuracy for subject identification, while maintaining the same TAR and TRR, required 16 EEG channels, in contrast to the previous experiment for which the optimal number of EEG channels was 12.

TableÂ 3presents the results obtained in the Pareto-front for the first 30 EEG channels, indicating the accuracies obtained, TAR and TRR as well as the nu and gamma values used for creating the one-class classifiers for obtaining thus TAR and TRR results. We marked in gray the most relevant accuracies, TAR and TRR and the corresponding no. channels used, for those results we also added the nu and gamma values used in each case, in order to observe if there exist similarities between those cases.

The channel combinations for this and the previous experiments were independent and likely differed. We generated Venn diagrams to compare the channels in the Pareto-front between this and the previous experiment to detect a possible pattern or a more relevant area (Fig.Â 3). The EEG channels used to obtain the results marked in gray in TableÂ 2 and the channel localization in Sub-fig.Â 3c are presented in Sub-fig.Â 3a. The results marked in gray in TableÂ 3 are shown in Sub-fig.Â 3b and EEG channel localization in Sub-fig.Â 3d.

FigureÂ 3 shows some channels in a black circle if that channel is in an intersection with one or more subsets. For instance, sub-fig.Â 3c shows CPZ channel in a black circle which mean that it was in one or more subsets, as it is shown in sub-fig.Â 3a. It is important to highlight these for the discussion of the results and for comparison purposes with the following experiments in this document.

NSGA-III for solving the four-objective optimization problem

We repeated the previous two experiments for solving the four-objective optimization problem with the same configuration, but now using NSGA-III. A comparison between the results obtained in the Pareto-front in the two experiments, using subjects 1â13 for the training (subjects 1â13 as non-intruders and 14â26 as intruders) and subjects 14â26 for training (subjects 14â26 as non-intruders and 1â13 as intruders), is shown in TableÂ 4.

Table 4 TAR, TRR, and accuracies obtained in the Pareto-front when using 7â15 EEG channels with four objectives solved with NSGA-III using subjects 1â13 as non-intrudes and 14â26 as non-intruders and vice-versa.

Full size table

In this experiment, we found subsets with 9, 10, and 14 optimal EEG channels using subjects 1â13 as non-intruders and subsets with 7, 11, and 14 EEG channels using subjects 14â26 as non-intruders. As in the previous experiments, a comparison of several relevant subsets presented in TableÂ 4 is presented in Fig.Â 4 for both cases, either using subjects 1â13 as non-intruders (Sub-figs.Â 4a,c) or 14â26 as non-intruders (Sub-figs.Â 4b,d).

FigureÂ 4 presents a comparison between different subsets found by NSGA-III when using Subjects 1â13 as non-intruders and also using subjects 1â13 as intruders. This fig. shows a lower number of channels in the interceptions, but it also shows that most of the EEG channels used for obtaining the best results presented in TableÂ 4 were obtained using channels around the parietal and occipital areas.

Testing the proposal in 10 random subdivisions of subjects using NSGA-II and NSGA-III

In previous experiments, we presented the results obtained using different subsets manually selected with 50% of the subjects as non-intruders and 50% as intruders (I.e subjects 1â13 as non-intruders and 14â26 as intruders, and vice-versa.). We also presented the differences found when using NSGA-II or NSGA-III. However to provide a more general validation of our proposal, we created random subsets with 50% of the subjects as non-intruders and 50% as intruders, then we solved the optimization problem by considering the four objectives at the same time. This process is repeated 10 times, obtaining thus, 10-fold cross-validation of the proposed method. We also repeated the experiment using both algorithms, NSGA-II and NSGA-III, the mean results and the standard deviation are presented in TableÂ 5.

Table 5 Mean TAR, TRR, and accuracies obtained in the Pareto-front when using 7â15 EEG channels validated in 10 random subdivisions of all the subjects, using 50% as intruders and 50% as non-intruder.

Full size table

The results presented in TableÂ 5, shows that when considering 10 random partitions of the subjects as non-intruders or intruder, the mean accuracy decreases in both cases when using NSGA-II and NSGA-III. Also the standard deviation when using a lower number of channels than 10, is higher than 10%. This is because in each partition randomly created, the number of the best channel array as well as the best channels, are not the same. For instance, in previous experiment presented in TableÂ 4 it was clearly presented than using subjects 1â13 as non-intruders the best results are obtained with nine EEG channels, but considering subjects 14â26, the best results are obtained using seven channels.

As an example, TableÂ 5 shows that when using eight EEG channels the accuracies, TAR and TRR are similar in both cases, using NSGA-II and NSGA-III. However, the standard deviation is higher than 10% for TAR and TRR, which means that for some subsets of subjects, the best results were not obtained with eight channels, i.e sometimes with seven and sometimes with nine channels as in the previous experiments. In summary, this new experiment has shown that the accuracy for subject identification is consistently high (I.e higher than 0.96 in all the cases, as in the previous experiments exposed), but depending on which subset of subjects is used as intruders or non-intruders, TAR and TRR may vary.

Discussion

This study shows the promise of a new biometric system using a novel data source. Here, we present an EEG-based biometric system as a good candidate for use in authentication systems. In our previous works, we have studied and compared various paradigms, i.e. resting-state potentials and ERPs, using various types of electrodes, a various number of channels, and channel localization^2,3,4,6. Several parameters are yet to be optimized. There are thus no currently available industrial-level EEG-based biometric systems. In the context of designing a portable EEG headset, applications for multi-task purposes and scenarios are being widely studied. We propose using NSGA-based algorithms for the optimization process, with the final objective of reducing the necessary number of EEG channels for subject identification. These algorithms depend upon several parameters, which influence the performance and results. Additionally, these machine-learning algorithms also require the definition of several parameters, which we defined using eight genes of a created chromosome.

Here, we introduced a new scheme for subject identification and authentication, showing that we can identify subjects by their EEG brain signals and distinguish between subjects who were part of the trained dataset from those that are intruders. Using NSGA-II in our first experiments, we found channel subset combinations consisting of only two EEG channels with which we obtained an accuracy of 0.78, with a TAR of 0.91 and a TRR of 0.88. However, eight, nine, or twelve channels were required to increase the value of the results for the objectives when they were applied simultaneously. NSGA-III found subsets with seven, nine, ten, or eleven EEG channels with an accuracy of up to 0.99 and both a TAR and TRR of 1.00. Initially, we aimed at creating a new fixed headset with a limited number of EEG channels, but as the results of this work shows, it is not possible to argue that certain âgoodâ subset works better than other, since there exist different factors that are critical when choosing whether it is better to use a lower number of EEG channels or propose improvements in the classification stage. Our proposal has shown that there exist different channel subsets with which we can obtain high accuracies, TAR and TRR. However, deeper analysis and more experiments are required to perform within a larger population.

P300 from ERPs have shown to be good candidates but they are not the gold standard for this application since there is not yet sufficient research evidence to support it. They have been proposed in this work as candidates since it has shown to exhibit strong signatures unique to the subject and the process does not require any training, which will be essential in a real-life application. In a real scenario, the biometric system can display something on a screen (an image, a weak flashlight beam directly to the eyes, etc.), record the brain activity corresponding to the response to the presentation, and use it for the identification and authentication process. The internal state of the subject, such as the resting state, could also be used as an alternative obtaining specific information on the subject, as discussed in our previous investigation⁴. The EEG channel selection process is in itself interesting because it can provide information about the most relevant areas in the brain for a certain neural task, for a certain subject or group of subjects. This can be analyzed using a-priori information related to the paradigm, which can limit the search space and therefore the results.

The results presented in the first experiments show that most of the common channels in the subsets providing the highest accuracy, TAR, and TRR, come from occipital and parietal areas, but there were also some important channels in the frontal area (FC2, FC3, FC6, FC8, F6, AF7, AF8, and Fp1). In short, a final conclusion about the minimum number of necessary EEG channels for Subject identification taking into account the classification accuracy, TAR, and TRR, cannot be proposed solely based on the results of this work since depending on different factors (I.e number of subjects, trials, sessions, feature extraction method, channel selection approach and their parameters, etc.), the minimum necessary number of channels will be different. Additionally, the channel localization for the subsets found differ between them and also they differ if we use NSGA-II or NSGA-III methods, which is clearly presented in Figs.Â 3 and 4. When considering 10 random subdivisions of the subjects, the mean TAR and TRR decrease, and the standard deviation increases, additionally the nu and gamma values used are different in each subdivision, but the classification accuracies are maintained similar to our first presented experiments.

The complexity of the analysis can be as high as that required. In our first experiments, we trained a model with EEG signals from session 1 and the authentication and verification process was constructed using EEG signals from session 2. However, an analysis of different sessions from different days/weeks/months is also necessary before a proof of concept, due to brain plasticity and how it can affect this biometric approach. Another important aspect that requires further study is the scalability, to verify the number of subjects that can be added to this system while maintaining similar performance to that when using a small number of subjects. Here, we created a first-layer using the EEG data from all the subjects to search for a method to increase the TAR and TRR. However, we will compare this approach against a model for each subject, with a possible combination of both, to obtain better results for the TAR and TRR. We will focus on all these relevant aspects in future studies involving the optimization of multiple parameters related to the feature extraction and machine learning methods by using discrete values for representing the chromosomes and not only as a binary sequence. Another important aspect to be further investigated is the use of larger datasets with a kÂ âÂ fold validation, and verify if a possible modification to our proposal can find a single array of best EEG channels subset for different subjects subdivisions randomly created while solving consistently all the defined objectives and the necessary parameters by optimization as in our experiments presented and discussed in this document.

Our research has been focused towards a portable (non-invasive) wireless dry single-channel or low-density EEG system for different applications, which can help the subjects identification process by providing EEG information from different channel combinations by using a movable sensor⁴. Following the results found in this work and other applications proposed previously, we will analyze the possibility of a fixed or movable electrode version of a new EEG headset which can incorporate the best results obtained in this work for subject identification and authentication.

Methods

Although a research-grade EEG device guarantees a controlled environment and high-quality multi-channel recording, this is offset by the high computational cost, high density, non-portability of the equipment, and the use of inconvenient conductive gels. Depending on the paradigm and the task, certain EEG channels provide only redundant or poor information.

Recent technology development in dry EEG sensors has created new possibilities for the development of new types of portable EEG systems. An important step towards this goal is a reduction in the number of EEG channels while increasing or at least maintaining the same performance as high-density EEG by using multi-objective optimization algorithms.

Dataset

We used EEG signals from 26 subjects (24 right-handed and 2 left-handed), with an average age of 29.2 Â± 5.5 years, from 56 passive Ag/AgCl EEG electrodes (VSM-CTF compatible system) which were placed following the extended 10â20 international system (illustrated in Fig.Â 5). The EEG signals were all referenced to the nose and the ground electrode was placed on the shoulder, the impedances were kept below 10 kÎ©. The EEG data was collected in five sessions and 60 instances per each session with a sample rate of 600Hz but they were down-sampled at 200 Hz¹⁷.

The protocol used to record the EEG signals used the P300-speller paradigm (as is illustrated in Fig.Â 6) and introduced in¹⁷. Briefly, the target letter (the letter to be presented) is indicated by a green circle for 1 s. Then, letters and numbers (6 X 6 items, 36 possible items displayed on a matrix) are flashed in groups of six characters. Next, the display remains blank for a period of resting-state from 2.5 to 4 s. During this random period, the subjects are requested to remember the letter displayed. Then, the letter chosen by the implemented P300 classifier is displayed for 1.3 s. If the presented letter is the one that was previously presented, the subject sends a positive response, otherwise the subject sends a negative response.

An example of a positive feedback-related response corresponding to the target letter i is shown in Fig.Â 6. For the experiments presented here, we used only the positive-feedback responses, therefore, the number of positive-feedback trials can be different between subjects and sessions, and we selected the minimum number of positive-feedback related responses, which is 25 instances per session and per subject.

Pre-processing

The common average reference (CAR) method was used to improve the signal-to-noise ratio from the EEG signal by removing the common information from all electrodes that was simultaneously recorded. CAR can be computed for an EEG channel ${V}_{i}^{CAR}$, where i is the number of the channel, as follows:

$${V}_{i}^{CAR}={V}_{i}^{ER}-\frac{1}{n}{\sum }_{j=1}^{n}{V}_{j}^{ER}$$

(1)

where ${V}_{i}^{ER}$ is the potential between the i-th electrode and the reference, and n the number of electrodes.

Feature extraction

We used EMD to decompose the EEG signals into a set of intrinsic mode functions (IMFs)¹⁸. Certain redundant IMFs with shape and frequency content different from those of the original signal may appear during the sifting process. These signals show maximum Minkowski distances with respect to the original signal¹⁹.

We used the closest two IMFs, based on the Minkowski distance, and each IMF was characterized by extracting a set of four features²: Instantaneous energy, which provides the energy distribution²⁰, Teager energy, which reflects variations in both the amplitude and frequency of the signal^20,21, Higuchi fractal dimension, which approximates the mean length of the curve using segments of k samples and estimates the dimension of a time-varying signal directly in the time domain²², and Petrosian fractal dimension to provide rapid computation of the fractal dimension of an EEG signal by translating the time series into a binary sequence²³.

The process for extracting four features for each selected IMF returns eight features per channel and it is repeated for each channel used to then concatenate them to obtain a unique feature vector that represents the EEG signal for each instance.

Classification

We used a two-layer classification system, which is briefly introduced below.

One-class SVM with RBF

The SVM consisted of an unsupervised algorithm that learns a decision function for outlier detection, classifying new data as similar to or different from that of the training set. The radial basis function (RBF) was used as the kernel and certain important parameters required fitting. We found nu and gamma to be two important basic parameters for our proposed system. The nu parameters were an upper bound on the fraction of training errors and a lower bound of the fraction of support vectors that should be in the interval [0, 1]. Gamma defines how much influence a single training example has. The larger the gamma, the closer other examples must be to be affected and the interval must be greater than 0; normally it is 1âno_features.

A grid search can be used to adjust SVM parameters by cross-validation, which has been shown to be powerful and able to significantly improve accuracy. However, it is a very slow process²⁴. These parameters differ depending on the size of the feature vector and it is necessary to re-compute them each time. In an optimization problem, this process must be executed multiple times and it is computationally expensive. As explained below, We used eight genes of a bigger chromosome to fit these parameters (four for nu and four for gamma) using a genetic algorithm (GA).

Multi-class linear SVM

We tested a multi-class SVM, which provides a global solution. The classification complexity does not depend on the dimensionality of the feature space and the sensitivity to the number of features is relatively low²⁵, as the necessary time to create a model is ${\mathscr{O}}({N}^{3})$, where N is the length of the feature vector, and ${\mathscr{O}}(1)+{\mathscr{O}}(N)$ is required to predict the class of a new instance using the created model²⁶.

We used accuracy as the metric to evaluate the performance of the process using 10-fold cross-validation.

EEG channel selection

The EEG channel-selection process is critical for the development of a portable low-cost headset and makes it possible to focus on the EEG channels that contain the most information.

We used a genetic algorithm, as they are used to solve complex optimization and exhaustive search problems²⁷. We attempted to solve the multi-objective optimization problem using the non-dominated sorting genetic algorithms (NSGA)¹⁶, which uses a non-dominated sorting ranking selection method to emphasize good candidates and a niche method to maintain stable sub-populations of good points (Pareto-front).

NSGA-II solved certain problems related to the computational complexity, non-elitist approach, and the need to specify a sharing parameter to ensure diversity in a population presented in the first version. NSGA-II reduced the computational cost from O(MN³) to O(MN²), where M is the number of objectives and N the population size. Additionally, the elitist approach was introduced by comparing the current population with the previously found best non-dominated solutions²⁸.

Later, NSGA-III was introduced, which follows the NSGA-II framework, but uses a set of supplied or predefined reference points that emphasizes population members that are non-dominated, yet close to the supplied set (^29,30). The predefined set of reference points are used to ensure diversity in the obtained solutions. We used a systematic approach that places points on a normalized hyper-plane that is equally inclined to all objective axes and has an intersection with each axis. For example, in a three-objective optimization problem, the reference points are created on a triangle with apexes at (1,Â 0,Â 0),Â (0,Â 1,Â 0), and (0,Â 0,Â 1)^30,31. NSGA-III has shown its efficiency in solving two- to 15-objective optimization problems (²⁹).

Definition of the problem to optimize

Once the non-intruders and intruders subsets were defined, the signals were pre-processed and the features extracted. They can be used as input to the authentication system, which can be distributed in two different ways, as presented in Figs.Â 1 and 7. As mentioned previously, here, we used only the approach presented in Fig.Â 1. However, the use of a more complex system is required to fit certain important parameters and select the most relevant EEG channels, which in this case was analyzed as an optimization problem.

The problem to be optimized is defined by four unconstrained objectives: decrease the number of EEG channels, maximize the accuracy of the multi-class classification, maximize the number of accepted subjects with access, and maximize the number of intruders rejected. Each population size in each iteration is defined as 30, which was selected experimentally. The termination criterion for the optimization process is defined by the objective space tolerance, which is defined as 0.0001. This criterion is calculated every 10th generations. If optimization is not achieved, the process stops after a maximum of 500 generations.

The chromosome created to represent the search space in the scalp is presented in Fig.Â 5, where genes 1â56 represent the EEG channels and the nu parameter is calculated using genes 57â60 and the gamma parameter calculated using genes 61â64. When calculating the nu and gamma parameters, the binary representation is converted to a decimal, which represents the position in a vector with the possible values for the parameter. Thus possible values were defined experimentally, which in a key-value array are {0:Â 0.000001,Â 1:Â 0.0001,Â 2:Â 0.0005,Â 3:Â 0.001,Â 4:Â 0.005,Â 5:Â 0.01,Â 6:Â 0.1,Â 7:Â 0.2,Â 8:Â 0.3,Â 9:Â 0.4,Â 10:Â 0.5,Â 11:Â 0.6,Â 12:Â 0.7,Â 13:Â 0.8,Â 14:Â 0.9,Â 15:Â 1.0}, for both nu and gamma. The complete process is illustrated in Fig.Â 8.

We extracted eight features per EEG channel from all subjects and for each instance following the previously explained method, in which the results are organized and stored for iterative use, as shown in Fig.Â 8. The entire process is then handled by NSGA-II or NSGA-III, which starts creating all possible candidates using a binary chromosome representation, for which the corresponding subset of features for the channels is obtained, represented as 1 for genes 1â56 of the chromosome, the nu parameter calculated using genes 57â60, and the gamma parameter calculated using genes 61â64.

Then, the obtained classification accuracy, number of accepted subjects with access, number of rejected subjects, and number of EEG channels used are returned to NSGA-II or NSGA-III to evaluate each chromosome in the current population. The process is repeated, creating different populations by the NSGA until the termination criterion is reached.

Data availability

The dataset used for this study can be found at BCI Challenge @ NER 2015.

References

Brigham, K.Â & Kumar, B. V. Subject identification from electroencephalogram (eeg) signals during imagined speech. In 2010 Fourth IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS),Â 1â8Â (IEEE, 2010).
Moctezuma, L. A.Â & Molinas, M. Eeg-based subjects identification based on biometrics of imagined speech using emd. In International Conference on Brain Informatics,Â 458â467Â (Springer, 2018).
Moctezuma, L. A., Torres-GarcÃa, A. A., VillaseÃ±or-Pineda, L. & Carrillo, M. Subjects identification using eeg-recorded imagined speech. Expert Systems with Applications 118, 201â208 (2019).
ArticleÂ Google ScholarÂ
Moctezuma, L. A.Â & Molinas, M. Subject identification from low-density eeg-recordings of resting-states: A study of feature extraction and classification. In Future of Information and Communication Conference, 830â846 (Springer, 2019).
Di, Y. et al. Robustness analysis of identification using resting-state eeg signals. IEEE Access 7, 42113â42122 (2019).
ArticleÂ Google ScholarÂ
Moctezuma, L. A.Â & Molinas, M. Event-related potential from eeg for a two-step identity authentication system. In IEEE 17th International Conference on Industrial Informatics (INDIN) (IEEE, 2019).
Farwell, L. A. & Donchin, E. Talking off the top of your head: toward a mental prosthesis utilizing event-related brain potentials. Electroencephalography and clinical Neurophysiology 70, 510â523 (1988).
ArticleÂ CASÂ Google ScholarÂ
Fabiani, M. et al. Definition, identification, and reliability of measurement of the p300 component of the event-related brain potential. Advances in psychophysiology 2, 78 (1987).
Google ScholarÂ
Gui, Q., Ruiz-Blondet, M. V., Laszlo, S. & Jin, Z. A survey on brain biometrics. ACM Comput. Surv. 51, 112:1â112:38 (2019).
ArticleÂ Google ScholarÂ
Riera, A., Soria-Frisch, A., Caparrini, M., Grau, C. & Ruffini, G. Unobtrusive biometric system based on electroencephalogram analysis. EURASIP Journal on Advances in Signal Processing 2008, 18 (2008).
MATHÂ Google ScholarÂ
Safont, G., Salazar, A., Soriano, A.Â & Vergara, L. Combination of multiple detectors for eeg based biometric identification/authentication. In 2012 IEEE International Carnahan Conference on Security Technology (ICCST),Â 230â236Â (IEEE, 2012).
Hu, B., Liu, Q., Zhao, Q., Qi, Y.Â & Peng, H. A real-time electroencephalogram (eeg) based individual identification interface for mobile security in ubiquitous environment. In 2011 IEEE Asia-Pacific Services Computing Conference,Â 436â441Â (IEEE, 2011).
Davis, P., Creusere, C. D.Â & Kroger, J. Subject identification based on eeg responses to video stimuli. In 2015 IEEE International Conference on Image Processing (ICIP), 1523â1527 (IEEE, 2015).
Chen, J., Mao, Z., Yao, W.Â & Huang, Y. Eeg-based biometric identification with convolutional neural network. Multimedia Tools and ApplicationsÂ 1â21Â (2019).
Sun, Y., Lo, F. P.-W. & Lo, B. Eeg-based user identification system using 1d-convolutional long short-term memory neural networks. Expert Systems with Applications 125, 259â267 (2019).
ArticleÂ Google ScholarÂ
Srinivas, N. & Deb, K. Muiltiobjective optimization using nondominated sorting in genetic algorithms. Evolutionary computation 2, 221â248 (1994).
ArticleÂ Google ScholarÂ
Margaux, P., Emmanuel, M., SÃ©bastien, D., Olivier, B. & JÃ©rÃ©mie, M. Objective and subjective evaluation of online error correction during p300-based spelling. Advances in Human-Computer Interaction 2012, 4 (2012).
ArticleÂ Google ScholarÂ
Huang, N. E. et al. The empirical mode decomposition and the hilbert spectrum for nonlinear and non-stationary time series analysis. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences 454, 903â995 (1998).
ArticleÂ ADSÂ MathSciNetÂ Google ScholarÂ
Boutana, D., Benidir, M.Â & Barkat, B. On the selection of intrinsic mode function in emd method: application on heart sound signal. In 2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010), 1â5 (IEEE, 2010).
Didiot, E., Illina, I., Fohr, D. & Mella, O. A wavelet-based parameterization for speech/music discrimination. Computer Speech & Language 24, 341â357 (2010).
ArticleÂ Google ScholarÂ
Jabloun, F.Â & Cetin, A. E. The teager energy based feature parameters for robust speech recognition in car noise. In 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No. 99CH36258), vol. 1, 273â276 (IEEE, 1999).
Higuchi, T. Approach to an irregular time series on the basis of the fractal theory. Physica D: Nonlinear Phenomena 31, 277â283 (1988).
ArticleÂ ADSÂ MathSciNetÂ Google ScholarÂ
Petrosian, A. Kolmogorov complexity of finite sequences and recognition of different preictal eeg patterns. In Proceedings Eighth IEEE Symposium on Computer-Based Medical Systems,Â 212â217Â (IEEE, 1995).
Syarif, I., Prugel-Bennett, A. & Wills, G. Svm parameter optimization using grid search and genetic algorithm to improve classification performance. Telkomnika 14, 1502 (2016).
ArticleÂ Google ScholarÂ
Joachims, T. Making large-scale svm learning practical. Tech. Rep., Technical report, SFB 475: KomplexitÃ¤tsreduktion in Multivariaten (1998).
Abdiansah, A.Â & Wardoyo, R. Time complexity analysis of support vector machines (svm) in libsvm. International journal computer and application (2015).
Chugh, T., Sindhya, K., Hakanen, J. & Miettinen, K. A survey on handling computationally expensive multiobjective optimization problems with evolutionary algorithms. Soft Computing 23, 3137â3166 (2019).
ArticleÂ Google ScholarÂ
Deb, K., Pratap, A., Agarwal, S. & Meyarivan, T. A fast and elitist multiobjective genetic algorithm: Nsga-ii. IEEE transactions on evolutionary computation 6, 182â197 (2002).
ArticleÂ Google ScholarÂ
Deb, K. & Jain, H. An evolutionary many-objective optimization algorithm using reference-point-based nondominated sorting approach, part i: solving problems with box constraints. IEEE Transactions on Evolutionary Computation 18, 577â601 (2013).
ArticleÂ Google ScholarÂ
Jain, H. & Deb, K. An evolutionary many-objective optimization algorithm using reference-point based nondominated sorting approach, part ii: handling constraints and extending to an adaptive approach. IEEE Transactions on Evolutionary Computation 18, 602â622 (2013).
ArticleÂ Google ScholarÂ
Das, I. & Dennis, J. E. Normal-boundary intersection: A new method for generating the pareto surface in nonlinear multicriteria optimization problems. SIAM journal on optimization 8, 631â657 (1998).
ArticleÂ MathSciNetÂ Google ScholarÂ

Download references

Acknowledgements

This study received funding from Enabling Technologies - NTNU, under the project âDavid versus Goliath: single-channel EEG unravels its power through adaptive signal analysis - FlexEEGâ. The funder was not involved in the study design, collection, analysis, or interpretation of data, writing of this article, or the decision to submit it for publication.

Author information

Authors and Affiliations

Department of Engineering Cybernetics, Norwegian University of Science and Technology, 7491, Trondheim, Norway
Luis Alfredo MoctezumaÂ &Â Marta Molinas

Authors

Luis Alfredo Moctezuma
View author publications
You can also search for this author in PubMedÂ Google Scholar
Marta Molinas
View author publications
You can also search for this author in PubMedÂ Google Scholar

Contributions

L.A.M. devised the methods, performed the experiments, analyzed the results, and wrote the manuscript. M.M. defined the problem of the minimal number of EEG channels, discussed the results, and wrote the manuscript.

Corresponding author

Correspondence to Luis Alfredo Moctezuma.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Moctezuma, L.A., Molinas, M. Multi-objective optimization for EEG channel selection and accurate intruder detection in an EEG-based subject identification system. Sci Rep 10, 5850 (2020). https://doi.org/10.1038/s41598-020-62712-6

Download citation

Received: 09 December 2019
Accepted: 18 March 2020
Published: 03 April 2020
DOI: https://doi.org/10.1038/s41598-020-62712-6

This article is cited by

GRU-powered sleep stage classification with permutation-based EEG channel selection
- Luis Alfredo Moctezuma
- Yoko Suzuki
- Takashi Abe
Scientific Reports (2024)
EEG-Driven Biometric Authentication for Investigation of Fourier Synchrosqueezed Transform-ICA Robust Framework
- Kutlucan Gorur
- Emre Olmez
- Onursal Cetin
Arabian Journal for Science and Engineering (2023)
EEG temporalâspatial transformer for person identification
- Yang Du
- Yongling Xu
- Pengcheng Ma
Scientific Reports (2022)
Automated methodology for optimal selection of minimum electrode subsets for accurate EEG source estimation based on Genetic Algorithm optimization
- Andres Soler
- Luis Alfredo Moctezuma
- Marta Molinas
Scientific Reports (2022)
A Multi-objective Optimization Approach for the Synthesis of Granular Computing-Based Classification Systems in the Graph Domain
- Luca Baldini
- Alessio Martino
- Antonello Rizzi
SN Computer Science (2022)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

General configuration for intruder detection using all EEG channels

Solving the four-objective optimization problem using NSGA-II with subjects 1â13 as non-intruders and 14â26 as intruders

Solving the four-objective optimization problem using NSGA-II with subjects 14â26 as non-intruders and subjects 1â13 as intruders

NSGA-III for solving the four-objective optimization problem

Testing the proposal in 10 random subdivisions of subjects using NSGA-II and NSGA-III

Discussion

Methods

Dataset

Pre-processing

Feature extraction

Classification

One-class SVM with RBF

Multi-class linear SVM

EEG channel selection

Definition of the problem to optimize

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links

Solving the four-objective optimization problem using NSGA-II with subjects 1â13 as non-intruders and 14â26 as intruders

Solving the four-objective optimization problem using NSGA-II with subjects 14â26 as non-intruders and subjects 1â13 as intruders