We classify the driver distraction level (neutral, low, medium, and high) based on different wavelets and classifiers using wireless electroencephalogram (EEG) signals. 50 subjects were used for data collection using 14 electrodes. We considered for this research 4 distraction stimuli such as Global Position Systems (GPS), music player, short message service (SMS), and mental tasks. Deriving the amplitude spectrum of three different frequency bands theta, alpha, and beta of EEG signals was based on fusion of discrete wavelet packet transform (DWPT) and FFT. Comparing the results of three different classifiers (subtractive fuzzy clustering probabilistic neural network, K-nearest neighbor) was based on spectral centroid, and power spectral features extracted by different wavelets (db4, db8, sym8, and coif5). The results of this study indicate that the best average accuracy achieved by subtractive fuzzy inference system classifier is 79.21% based on power spectral density feature extracted by sym8 wavelet which gave a good class discrimination under ANOVA test.
1. Introduction
In many countries distraction is responsible for many car accidents. The National Highway Traffic Safety administration (NHTSA) estimates that 100,000 of police-reported crashes because of driver fatigue happened each year [1]. Thereby, it is important to develop automatic detectors of this state. Most of the automatic detection methods are based on analyzing the driver behavior to detect abnormal actions [2] or using image processing technique to monitor and evaluate his head position and eye movement or blinking [3, 4]. Drowsiness can also be identified through electroencephalographic (EEG) signals, which contain alertness information [5]. The EEG plays fatal role into measuring the electrical activity of the brain [6]. Different signal processing techniques, like wavelet transform (WT) [5], means comparison test [7], independent component analysis [8] with different classifiers such as neural networks (NNs) [8–10], and Fussy Logic [11], have been applied to detect drowsiness in EEG signals. Driving is a complex task in which different skills and functions are combined simultaneously; therefore monitoring drivers’ attention regarding brain resources is a strong challenge for researchers and analytics in the field of cognitive brain research and brain interface computer. The level of interference from task-irrelevant stimulus information (conflict), reflected in slowed responses and decreased accuracy for incompatible relative to compatible stimuli, is found to be reduced after the processing of an incompatible as compared with a compatible stimulus [12]. Recently, soft computing had been seen as an attractive alternative, and several methods were developed for trajectory design and robot motion control using neurofuzzy techniques [13]. Data-driven approaches have been widely applied to solve industrial problems encountered in the real life, including control engineering, instrumentation and measurement, computer security, intelligent transportation systems, and vehicles [14]. Causes of distractions during driving were quite widespread, including eating, drinking, talking with passengers, use of cell phones, reading, fatigue, problem solving, and using in-car equipment such as GPS, media player, and in-vehicle entertainment, thus making it likely that the problem of driver is inattention [15–18]. Many researchers have proposed a lot of methods to detect attention change using physiological changes such as eye blinking, heart rate, pulse rate or skin electric potential, and especially brain wave [19]. EEG based methods mainly focus on the monitoring of the alertness variation of driver fatigue due to drowsiness, whereas the detecting approaches of alertness change in tasks requiring sustained attention have been seldomly explored [20]. This work has two objectives: (1) to select the optimal wavelet function for getting the better classification accuracy from the alpha, theta, and beta band features and (2) to determine the classifier which gives better average and individual classification rate. In our work, we have used audiovisual stimuli for evoking four different levels such as neutral, low, medium, and high distraction. Two features, spectral centroid and power spectral density (PSD), are derived using wavelet transform on theta, alpha, and beta band. These numerical features are classified using three different classifiers, namely, K-nearest neighbor (KNN), probabilistic neural network (PNN), and fuzzy inference system. In our last recent work, we used PNN classifier to classify driver drowsiness level (sleepy state) and achieved 61% based on db4, and we expect that this accuracy would be more, if fuzzy classifier had been used [21]. This paper adds on significant solution for driver distraction level related to EEG bands and their position in the packet of the wavelet transform explored mathematically by their designated equations. In this work, a set of four distraction stimuli, namely, media player, GPS, mental task, and SMS message are induced by using audio-visual stimuli. The rest of this paper is organized as follows. In Section 2, we summarize the research methodology by elucidating the data acquisition process. Sections 3 and 4 explain feature extraction using wavelet transform and classification of distraction level by different classifiers, respectively. Section 5 illustrates the overview of the results and discussion of this present work, and conclusions are given in Section 6.
2. Data Acquisition
Mobile phone is considered as the main reason of driver distraction compared to other distraction reasons such as GPS, music and video players, and mental thinking. Therefore, we applied these four distractions to develop suitable database for this work using EEG signals. Figure 1 shows a simulated environment of real driving in one of our university laboratories based on simulation driving software. Infrared camera had been used to capture the driver face image for data validation after finishing the experiment.
Protocol flow. (a) Subject distracted by mobile (mental task). (b) Driving car environment. (c) Subject distracted by GPS. (d) Electors position and signals.
Before start driving, the subject was asked to initially keep eyes closed for 2 min duration followed by another two minutes for open eyes. After this neutral initialization, the driver was asked to drive for 30 minutes containing different tasks of distraction, each 2 minutes duration, such as media player, GPS, mental thinking by answering few mental questions through mobile phone, and finally he or she should type and send SMS messages. Through this protocol and according to the continuous performance test (CPT), we can determine whether the subject is in low, medium, or high level distraction according to his/her time response through relooking to the screen and controlling the steering wheel continuously. For the first 30 subjects, we first determined visually the 1 sec. duration of distraction (like typing in GPS or SMS messages) and considered as low level. Secondly, for the medium level, the continuous 2 sec. distraction time was extracted, whereas the continuous 3 sec. distraction time is assumed to be as a high level.
In this work, 50 subjects (43 Males and 7 Females) in the age range of 24 years to 34 years have participated. Emotive EEG system is used to acquire the EEG signals over the complete scalp through 14 electrodes (FP1, FP2, F7, F8, F3, F4, T7, T8, P7, P8, O1, O2, A1, and A2). All the electrodes are placed over the subject scalp based on international 10–20 system of electrode placement. EEG signals are acquired at a sampling frequency of 128 Hz and band pass was filtered between 0.05 Hz and 60 Hz. The reference electrode and ground electrode are placed on right and left ear lobes. The impedance of the electrodes is kept below 5 KΩ.
3. Feature Extraction
Brain electrical signals are time-varying and nonstationary signals, which have different frequency elements at different times. Indeed, the EEG signals cannot be considered as stationary even under short duration, since it can exhibit considerable short term nonstationary [22]. Therefore, DWT is a more suitable method to decompose the EEG signal into its different frequency bands and retain the signal information in both time and frequency domain unlike FFT or STFT [22, 23]. In this work, the spectrum features from the EEG signals for different distraction levels are derived from three frequency bands, namely, theta, alpha, and beta, by applying four different wavelets (db4, db8, sym8, and coif5). These wavelet functions have been chosen due to their near optimal time frequency localization properties. Moreover, the waveforms of these wavelets are similar to the waveforms to be detected in the EEG signal, the orthogonal property, and optimal number of filter coefficients for reducing the computational complexity. Therefore, extraction of EEG signals features is more likely to be successful [23]. Due to the nonstationary nature of EEG signals, we need to analyze them onto basis functions created by dilation and shifting the mother wavelet function. In general, the characteristic nature of mother wavelet function should be similar in shape to the original signal under processing. The extracted wavelet coefficients provide a compact representation that shows the energy distribution of the EEG signal in time and frequency [24].
The researchers are utilizing discrete wavelet packet transform (DWPT) for efficient frequency band localization. DWPT decomposes both high and low frequency component of the input signal into any level of decomposition as shown in Figure 2, unlike normal wavelet transform which decomposes only the approximation coefficients in the subsequent levels. In this work, DWPT is used to obtain three frequency bands, namely, theta (4–8 Hz), alpha (8–12 Hz), and beta (14–32 Hz) frequency bands, for distraction detection. PSD estimates of noise signals from a finite number of its samples are based on three fundamentally different approaches, namely, parametric, nonparametric, and subspace method. Though the computation complexity is higher during the PSD computation using DWPT and FFT approach, it gives good classification accuracy on efficiently distinguishing the distraction levels. As a beginning of this research, we computed the PSD feature through DWPT and FFT. In future, we aim to analyze the significance of PSD through DWPT alone for distraction levels classification.
Five level EEG signal decomposition using discrete wavelet packet transform (DWPT).
The mathematical derivation of the approximation coefficients (CA0, CA1, and CAK) is by taking the N samples of the input signal S and extend it to N*=N+2(M-2)+C, as C is a constant which is equal to 0 for even N or 1 for odd N [25]. This extension is highly needed to make matching between the numbers of input samples with the wavelet filter coefficients, and this thing should be applied on each input to any level. Therefore the new extended signal S is as follows:
(1)S=[S0,S1,S2,…,SN*-1].
And by applying wavelet decomposition on this S signal by performing convolution of the input samples with low pass filter coefficients of M coefficients as shown in Figure 3 to produce (N*-M)/2 approximation coefficients, we have
(2)CA0=S0*h0+S1*h1+⋯+SM-1*hM-1,CA1=S2*h0+S3*h1+⋯+SM+1*hM-1,⋮CA((N*-M)/2)=S(N*-M)*h0+S(N*-M+1)*h1+⋯+S(N*-1)*hM-1.
Convolution of the input signal samples with high pass filter coefficients produces the first level detail coefficients (CD_{0}, CD_{1}, and CDK) as follows:
(3)CD0=S0*g0+S1*g1+⋯+SM-1*gM-1,CD1=S2*g0+S3*g1+⋯+SM+1*gM-1,⋮CD((N*-M)/2)=S(N*-M)*g0+S(N*-M+1)*g1+⋯+S(N*-1)*gM-1.
The generalized equation for deriving approximation coefficients and detail coefficients for wavelet decomposition is given as
(4)CAK=h0S2*K+h1S2*K+1+⋯+hM-1S2*K+M-1=∑i=0M-1hi*Si+2*K,CDK=g0S2*K+g1S2*K+1+⋯+gM-1S2*K+M-1=∑i=0M-1gi*Si+2*K.
The basic relation between the input samples and filter coefficients (low pass and high pass) for generating approximation and detail coefficients for any level “b” can be stated as
(5)CAb,K=∑i=0M-1hi*CAb-1,i+2*K,CDb,K=∑i=0M-1gi*CDb-1,i+2*K.
The general wavelet packet transform equations for deriving theta band (level 4, part 1), alpha band (level 4, part 2), beta 1 band (level 5, part 7) and beta 2 band (level 2, part 1) as shown in Figure 2 are given in (6) to (9), respectively, based on db4 (M=8) as follows:
(6)CD4(1),K=∑i=07gi*CD3(0),i+2*K,(7)CA4(2),K=∑i=07hi*CD3(1),i+2*K,(8)CD5(7),K=∑i=07gi*CD4(3),i+2*K,(9)CD2(1),K=∑i=07gi*CA1(0),i+2*K.
WPT of two levels.
3.1. Amplitude Spectrum
Amplitude spectrum is defined as the magnitude of the Fourier transform of a time-domain signal. Every signal can be written as a sum of sinusoids with different amplitudes and frequencies. It can have other names like spectral density, voltage spectrum, power spectrum, and spectral intensity which describes how the power of a signal or time series is distributed over the different frequencies. The frequency spectrum of a time-domain signal is a representation of that signal in the frequency domain. The frequency spectrum can be generated via a Fourier transform of the signal, and the resulting values are usually presented as amplitude and phase, both plotted versus frequency. A signal can be broken into short segments (sometimes called frames), and spectrum analysis may be applied to these individual segments. In this work, the average amplitude of the FFT output of EEG bands wavelet transformed is used to derive two different features, namely, spectral centroid and PSD.
3.1.1. Power Spectral Density (PSD)
Spectral analysis is the distribution of power over frequency. Spectral analysis finds applications in many fields such as speech analysis, monitoring vibration, economics, and sonar systems. In medicine, spectral analysis of various signals measured from a patient, such as electrocardiogram (ECG) or electroencephalogram (EEG) signals, can provide useful material for diagnosis. A random signal usually has finite average power and, therefore, can be characterized by an average power spectral density as
(10)PSD(i)=∑i=0N|X(k)|2,
where X(k) represent the out of FFT and k is the position of the FFT components.
3.1.2. Spectral Centroid Frequency
Spectral centroid frequency is commonly known as subband spectral centroid [7, 10]. The spectral centroid is used to find the center value of the groups for each frequency band. Spectral centroids feature extraction technique was widely used in audio recognition because of its robustness to recognize the dominant frequency and to extract EEG features for stress identification [12, 13]. In this work, the author tried to use this feature for EEG classification. The spectral centroid (C) is calculated using the following formula:
(11)C=∫kX(k)dk∫X(k)dk.
3.1.3. Features Extraction Algorithm
Load the input EEG signal from 14 channels.
Apply 4th order Butterworth IIR band-pass filter and followed by notch filter to remove the effects of noises and artifacts.
Perform the framing on the preprocessed signal with duration of 1 second.
Decompose the EEG signal into five levels using the chosen wavelet function (db4, db8, sym8, and coif5) to extract the wavelet coefficients for theta, alpha, beta 1, and beta 2 frequency bands through DWPT.
Perform FFT for each frequency band to get the frequency spectrum
(12)X(k)=∑n=0N-1x(n)wkn,wherew=e-j(2π/N),
where k = position of sample after FFT, x(n) is the input wavelet coefficients corresponding to any of the four frequency bands, n is the number of input sample positions, and N is the maximum length of the input wavelet coefficients.
Determine the absolute value of FFT to get the PSD and C of the spectrum of each band.
Add the amount of this PSD and C of each band in this specified channel to the total mean of the said values of each band over the 14 channels
Take the average of PSD and C of each band by dividing by 14.
Repeat the above steps from 4 to 8 for the next 1 sec. EEG and continue to perform the analysis for all the active EEG channel.
4. The Classifiers
A standard classification problem generally follows a two-step procedure which consists of training and testing phases. During the training phase, a classifier is trained to achieve the optimal separation for the training data set. Then, in the testing phase, the trained classifier is used to discriminate new samples with unknown class information. As the predictability of the features may vary, an exhaustive method was used to select the best combination of features. That is, try all possible combinations of features and pick up those with best performances. In this paper, three different classifiers have been used to compare the results and choose the most suitable classifier for this distraction level classification purpose.
4.1. PNN Classifier
In this work, PNN architecture is constructed using newpnn () function in MATLAB 7.0. The PNN model is one among the supervised learning networks and has many features different from those of other networks in the learning processes. The data training set was used to train designed PNN. The PNN is tested with testing data set to show the impact on classification rate. The spread value (σ) of the radial basis function (RBF) was used as a smoothing factor, and classifier accuracy was examined with different values of σ. The first step of training the PNN network is by selecting the optimal spread values which control the spread of the RBF functions. If the spread value is too large, then the model will not be able to closely fit the function if the spread value is too small, the model will over fit the data because each training point will have too much influence. In this work smoothing factor of 0.1 value has been used to classify the hypovigilance level.
The algorithm of classification of new test feature vector is determined by the class of its K-nearest neighbors. This classifier memorizes all vectors in the tanning sets and then compares the test vector with them. Therefore this classifier is called memory based learning. KNN algorithm is based on Euclidian distance metrics to locate the nearest neighbors. The Euclidian distance between the two points X and Y is explained as in (13)
(13)Dist(X,Y)=∑i=1D(Xi-Yi)2,
where D is the number of coordinates. In this work the K-nearest neighbor value is varied from 2 to 9. The optimal value of K is selected based on the higher classification rate.
4.3. Fuzzy Subtractive Clustering
Fuzzy subtractive (FS) clustering is a fast, one-pass algorithm for estimating the number of clusters and the cluster centers in a set of data. This technique depends upon the measure of the density of data points in the feature space. The aim is to find areas in the feature space with high densities of data points. The point with the highest number of neighbors is considered as the center of a specific cluster. The algorithm will remove the data points within a prespecified fuzzy radius. This process will check all the data points. The radii variable is a vector of entries between 0 and 1 that specifies a cluster center’s range. Small radii values will generate few large clusters. Recommended values for radii should be between 0.2 and 0.5. In this work, a value of 0.5 for all the radii was chosen because this leads to fewer membership functions and less computation time, without losing accuracy. Once the inputs for hypovigilance classification are selected, input membership functions must be determined. The Gaussian membership function shown in Figure 4 is selected since it has continuous derivability. The function is given by μ(x)=e-(x-m)2/2σ2. This function is based on two factors, m and σ, as they represent the center and the width of the Gaussian function, respectively. MATLAB fuzzy logic toolbox provides an important function to generate FS system. This function constructs a set of rules to model the data organization based on the subtractive clustering centers and clusters to allocate antecedent membership functions.
Gaussian member ship function with m=5 and σ=2.
4.4. Data Preparation for Classification
The requirement of generating classifier system is to divide the training data into two data sets. Firstly, an input data set which has 6 values of two features F1 and F2 over three bands (θ, α, and β) [F1,θ, F1,α, F1,β, F2,θ, F2,α, and F2,β], where F1 and F2 represent centroid frequency and power spectral density features, respectively. Hence each vector of the overall 200 vectors contains 6 values. Therefore the overall data inputs are 1200 values over 50 subjects for four levels (50*4*6). Secondly, an output data set (1, 2, 3, or 4) is used for one output. The output is either 1 for neutral or 2 for low level or 3 for medium level, or 4 for high level. These points were placed into a single output data set with 200 values, each 50 values for one class, where 60% of the vectors are used as training (120) and 40% as testing (80).
5. Results and Discussion
This research work is intended to investigate the effects of distraction due to cognitive, visual, and auditory distraction using different stimuli. In this work, we utilized the potential of localizing the frequency bands in EEG signals through DWPT and fusion with FFT for efficient feature extraction to get efficient distraction classification. The significance of these two features, spectral centroid and PSD, are checked based on Analysis of Variance (ANOVA) test over each wavelet (db4, db8, sym8, and coif5) as shown in Table 1.
ANOVA test of centroid and PSD features over db4, db8, sym8, and coif5 for each distraction level.
Neutral
Low
Medium
High
P
db4
Centroid
5.3±50.8
15.5±217.8
10.1±239.8
9.3±286.4
<0.001
PSD
0.41±7.5
1.12±18.8
0.013±0.0007
0.019±0.0039
<0.001
db8
Centroid
1.29±5.12
9.19±113.9
4.6±64.3
4.15±53.8
<0.001
PSD
0.0008±5.4E-06
6.78±213.9
0.005±0.0001
0.005±0.0002
<0.001
sym8
Centroid
1.26±3.27
5.6±49.6
5.4±78.7
4.2±61.7
<0.001
PSD
0.0008±4.46E-06
2.3±60.5
0.004±8.6E-5
0.005±0.0002
<0.001
coif5
Centroid
2.02±5.19
15.6±395.4
10.7±372.7
7.66±199.9
<0.001
PSD
0.002±0.00003
0.03±0.003
0.048±0.024
0.04±0.02
<0.001
All the results are presented as mean ± SD with P values. The ANOVA test with P values generally less than 0.005 suggests that these features measures can be used as classification features. We extracted PSD and centroid frequency features from the amplitude spectrum and performed ANOVA test on four classes of distraction (neutral, low, medium, and high). These two features give excellent P values under ANOVA test as shown in Table 1. Features are computed from 3-second window of the 14 EEG channels, and ANOVA test is used to check if the mean values are different for the different classes. Table 1 shows the results of the amplitude spectrum parameters for different wavelets over the four levels of distraction. The mean centroid frequency magnitude after neutral state seems to decrease from low to medium to high distraction EEG based on db4. Therefore, both said parameters are suitable for differentiating and classification. For db8 these two features cannot differentiate the medium from the high distraction. When sym8 is applied, the mean centroid frequency magnitude starts decreasing from low to medium to high distraction EEG, and the two features are very weak in medium distraction state. Therefore, it is easy to distinguish this state from low and high distraction. It is obvious under coif5 that the centroid is decreasing from low to medium to high state, while PSD almost shows no significant changes. Finally, we concluded that sym8 wavelet is the most suitable wavelet for distraction classification, therefore it gives maximum classification achievement of 79.10% as shown in Tables 2 and 3 using PSD feature for fuzzy classifier which its input vectors distribution is shown in Figure 5 and its structure is shown in Figure 6. Therefore, we considered this wavelet for subsequent analysis.
Classification accuracy of different classifiers for different wavelets over 4 distraction levels for both two features.
Neutral
Low
Medium
High
Average
Centroid
PSD
Centroid
PSD
Centroid
PSD
Centroid
PSD
Centroid
PSD
Fuzzy
db4
71.17
77.73
67.08
79.21
79.32
84.80
64.87
70.99
70.61
78.18
db8
64.17
73.24
77.29
89.90
75.99
71.73
63.61
79.41
70.27
78.57
sym8
62.51
72.79
82.61
91.99
81.01
70.60
63.65
79.21
72.45
79.21
coif5
65.36
74.94
63.19
66.30
76.81
78.22
62.68
63.55
67.01
70.75
PNN
db4
62.39
58.13
58.14
73.56
65.88
83.20
31.19
63.57
54.40
69.62
db8
52.64
50.31
72.92
90.79
57.56
64.26
25.92
75.48
52.26
70.21
sym8
25.00
49.29
69.22
91.29
77.50
63.31
53.91
79.22
56.41
70.78
coif5
55.42
68.52
50.93
34.01
58.50
60.84
24.80
25.42
47.41
47.20
KNN
db4
44.29
72.33
34.93
61.69
75.79
77.75
55.26
45.49
52.57
64.32
db8
25.31
66.34
60.25
90.13
71.54
46.35
52.98
65.04
52.52
66.97
sym8
50.70
65.53
78.55
91.29
69.00
44.66
28.20
70.91
56.61
68.10
coif5
31.29
52.45
24.74
56.91
70.57
73.49
50.74
53.00
44.34
58.97
Bold values refers to maximum average values.
Classification results of KNN, PNN, and fuzzy classifiers over 4 distraction levels based on sym8 using centroid and PSD features.
Distraction based on sym8
% CR
SEN.
SPEC.
TPR
FNR
Neutral
Centroid
KNN
51.29
53.50
46.86
48.85
44.64
PNN
62.39
65.50
56.15
58.95
53.03
Fuzzy
71.17
74.73
64.05
67.26
60.49
PSD
KNN
72.33
75.95
65.10
68.35
61.48
PNN
58.13
61.04
52.32
54.94
49.41
Fuzzy
77.73
81.61
69.96
73.45
66.07
Low
Centroid
KNN
52.93
54.68
49.44
51.01
47.69
PNN
58.14
61.05
52.33
54.94
49.42
Fuzzy
67.08
70.43
60.37
63.39
57.01
PSD
KNN
61.69
64.77
55.52
58.30
52.44
PNN
73.56
77.24
66.20
69.51
62.53
Fuzzy
79.21
83.17
71.29
74.85
67.33
Medium
Centroid
KNN
75.79
79.58
68.21
71.62
64.42
PNN
65.88
69.18
59.30
62.26
56.00
Fuzzy
79.32
83.29
71.39
74.96
67.43
PSD
KNN
77.75
81.64
69.98
73.48
66.09
PNN
83.20
87.37
74.88
78.63
70.72
Fuzzy
84.80
89.04
76.32
80.14
72.08
High
Centroid
KNN
55.26
58.02
49.73
52.22
46.97
PNN
51.19
52.75
48.07
49.47
46.51
Fuzzy
64.87
68.11
58.38
61.30
55.14
PSD
KNN
45.49
47.76
40.94
42.99
38.67
PNN
63.57
66.75
57.21
60.07
54.04
Fuzzy
70.99
74.54
63.89
67.08
60.34
Average
Centroid
KNN
52.57
55.19
47.31
49.67
44.68
PNN
54.40
57.12
48.96
51.41
46.24
Fuzzy
70.61
74.14
63.55
66.73
60.02
PSD
KNN
64.32
67.53
57.88
60.78
54.67
PNN
69.62
73.10
62.66
65.79
59.17
Fuzzy
79.21
82.09
70.36
73.88
66.45
The distribution of the input vectors to the fuzzy classifier over 4 distraction levels: (a) 120 training and (b) 80 testing vectors.
Fuzzy system structure for two features of 14 EEG channels.
Sensitivity and specificity are commonly used performance measures of binary classification tests. Sensitivity is defined as the proportion of actual positives which are correctly identified as positive, and specificity is the proportion of negatives which are correctly identified as negative. These parameters, namely, accuracy, sensitivity, specificity, true positive rate (TPR), and false negative rate (FNR) can be calculated as follows:
(14)Accuracy=TP+TNTP+TN+FP+FN,Sensitivity=TPTP+FN,Specificity=TNFP+FN,TPR=TPTP+FP,FNR=FNFN+FP,
where TP is the true positive, TN is the true negative, FP is the false positive, and FN is the false negative.
Table 3 summarizes the classification accuracy (% CR), sensitivity, specificity, TPR, FNR of KNN, PNN, and fuzzy classifiers for the two features (centroid and PSD) under db8. The best performance of classification of 79.21% was achieved by fuzzy using PSD feature based on sym8 as shown in Table 3 with an average sensitivity of 82.09%, specificity of 70.36%, TPR of 73.88%, and FNR of 66.45%. The KNN and PNN classifiers produce maximum classification accuracy of 64.32% and 69.72%, respectively, both based on same wavelet (sym8) and same feature (PSD) as shown in Table 3. Therefore, sym8 wavelet can be considered as the dominant wavelet type to get good accuracy of classification of different levels of distraction based on PSD feature.
Table 4 shows the comparison between the maximum mean distraction classification rate of the previous researchers work and the present work. From this table, the maximum mean classification rate of 92% is achieved on classifying two classes [28]. The maximum classification rate of 89.4% is achieved on classifying two classes based on Fisher linear discrimination method [26]. Junya et al. [27] got maximum classification rate of 75.9% on classifying three classes based on hybrid of physical and performance methods mentioned in Section 1. However the present recognition system used 50 subjects and achieved the average maximum mean rate of 98.7% and 79.21% on classifying two and four different levels of distraction, respectively.
Comparison of maximum mean distraction classification rate of present work with that of earlier works.
Reference
Physiological signal
Database
Feature extraction
Classifier
% Accuracy
[26]
EEG
5 Subjects 15 Ch
FFT
Fisher linear discriminate 2 Class
89.4
[27]
EEG, EOG
5 Subjects 30 Ch
FFT
SVM 3 Classes
75.9
[28]
EEG, vehicle behaviour
4 Subjects 15 Ch
WT
SVM 2 Class
92
[29]
EEG
13 Subjs 2 Channels
WT
Self orgnizing map 3 Classes
60
Present work
EEG
50 Subjects 14 Ch
DWPT + FFT
Fuzzy 2 Classes
98.7
Fuzzy 4 Classes
79.21
6. Conclusion
Most of the research works have discussed the classification of driver distraction into two levels based on EEG frequency bands (distracted or nondistracted). In addition, many of the researchers have not attempted to investigate different types of distraction stimuli in the literature. This paper present amplitude spectrum of the three bands (theta, alpha, and beta) of the EEG signal which has been proposed along with the hybrid scheme based on DWT and FFT. Fusions of the above two methods give more significant results on extraction of centroid and PSD features under ANOVA analysis. The proposed methodology has been tested on 50 subjects and provides maximum accuracy of 79.21% using sym8 and subtractive fuzzy inference system for PSD feature with an average sensitivity of 82.09% and of 70.36%. However, we focus on strengthening this present database with more number of subjects for developing a generalized driver distraction detection system using the proposed methodology.
NHTSADrowsly driver detection and warning system for commercial vehicle drivers: field proportional test design, analysis, and progressWashington, DC, USA, National Highway Traffic Safety Administration, http://www.nhtsa.dot.gov/UenoH.KanedaM.TsukinoM.Development of drowsiness detection systemProceedings of the Vehicle Navigation and Information Systems Conference19941520HayamiT.MatsunagaK.ShidojiK.MatsukiY.Detecting drowsiness while driving by measuring eye movement: a pilot studyProceedings of the 5th Internation Conference on Intelligent Transportation Systems2002Singapore156161SmithP.ShahM.da Vitoria LoboN.Determining driver visual attention with one cameraSubasiA.Automatic recognition of alertness level from EEG by using neural network and wavelet coefficientsCrespelA.GélisseP.BureauM.GentonP.PicotA.CharbonnierS.CaplierA.On-line automatic detection of driver drowsiness using a single electroencephalographic channelProceedings of the 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS '08)August 2008386438672-s2.0-61849168100MakeigS.BellA. J.JungT. P.SejnowskiT. J.Independent component analysis of electroencephalographic dataEskandarianA.MortazaviA.Evaluation of a smart algorithm for commercial vehicle driver drowsiness detectionProceedings of the IEEE Intelligent Vehicles Symposium (IV '07)June 2007Istanbul, Turkey5535592-s2.0-47849090182YuanP.WeichihH.KuoT. B. J.Liang-YuS.A portable device for real time drowsiness detection using novel active dry electrode systemProceedings of the 31st Annual International Conference of the IEEE Engineering in Medicine and Biology Society: Engineering the Future of Biomedicine (EMBC '09)September 2009377537782-s2.0-7795101829010.1109/IEMBS.2009.5334491LinC.-T.LiangS.-F.ChenY.-C.HsuY.-C.KoL.-W.Driver's drowsiness estimation by combining EEG signal analysis and ICA-based fuzzy neural networksProceedings of the 2006 IEEE International Symposium on Circuits and Systems (ISCAS '06)May 2006212521282-s2.0-34547296519EgnerT.Multiple conflict-driven control mechanisms in the human brainKhoukhiA.Data-driven multi-stage motion planning of parallel kinematic machinesXuJ.-X.HouZ.-S.Notes on data-driven system approachesHorberryT.AndersonJ.ReganM. A.TriggsT. J.BrownJ.Driver distraction: the effects of concurrent in-vehicle tasks, road environment complexity and age on driving performanceDukicT.HansonL.FalkmerT.Effect of drivers' age and push button locations on visual time off road, steering wheel deviation and safety perceptionHancockP. A.LeschM.SimmonsL.The distraction effects of phone use during a crucial driving maneuverCrundallD.Van LoonE.UnderwoodG.Attraction and distraction of attention with roadside advertisementsFrenchJ.A model to predict fatigue degraded performanceProceedings of the 7th Conference on Human Factors and Power PlantsSeptember 20024164192-s2.0-0036439970BilL.ZhangR.ChenZ.Study on real-time detection of alertness based on EEGProceedings of the IEEE/ICME International Conference on Complex Medical Engineering (CME '07)May 2007149014932-s2.0-4814911163510.1109/ICCME.2007.4381994MousaM. K.MurugappanM.BadlishahA. R.PNN based driver drowsiness level classification using EEGAcharyaU. R.FaustO.SreeS. V.AlvinA. P. C.KrishnamurthiG.SeabraJ. C. R.SanchesJ.SuriJ. S.Atheromatic symptomatic vs. asymptomatic classification of carotid ultrasound plaque using a combination of HOS, DWT and textureProceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS '11)September 2011Boston, Mass, USA448944922-s2.0-8405519390210.1109/IEMBS.2011.6091113WaliK. M.MurugappanM.BadlishahA. R.ZhengB.Development of discrete wavelet transform (DWT) toolbox for signal processing applicationsProceedings of the International Conference on Biomedical Engineering (ICoBE '12)February 2012Penang, Malaysia2728RizonM.MurugappanM.NagarajanR.YaacobS.Asymmetric ratio and FCM based salient channel selection for human emotion detection using EEGWaliK. M.MurugappanM.BadlishahA. R.Mathematical implementation of hybrid fast Fourier transform and discrete wavelet transform for developing graphical user interface using visual basic for signal processing applicationsBrahimC.ZehangH.GvanC.PhuaS.TeeK.KengK.Learning EEG-based spectral-spatial patterns for attention level measurementsProceedings of the IEEE International Symposium on Circuits and Systems (ISCAS '09)2009Taipei, Taiwan14651146810.1109/ISCAS.2009.5118043JunyaF.TouyamaH.HiroseM.Extracting Alpha band modulated using visual spectral attention without flickering stimuli using common spatial patternProceedings of the 80th Annual International IEEE EMBS Conference2008British Columbia, CanadaVancouverAlfredoS.JammesD.EsteveD.MendolaG.Driver Hypovigilance diagnosis using wavelets and statistical analysisProceedings of the International Conference on intelligent transportation system200216216710.1109/ITSC.2002.1041207SeungL.AbullavB.Analysis of attention deficit hyperactivity disorder in EEG using wavelet transform and self organized mapsProceedings of the International conference on control, Automation and Systems2010Singapore