1. Introduction

MPE

Mathematical Problems in Engineering

1563-5147 1024-123X

Hindawi Publishing Corporation

297587

10.1155/2013/297587

297587

Research Article

Wavelet Packet Transform Based Driver Distraction Level Classification Using EEG

Wali

Mousa Kadhim

¹ Murugappan

Murugappan

² Ahmmad

Badlishah

¹ Hong

Wei-Chiang

School of Computer and Communication Engineering

Universiti Malaysia Perlis, 01000 Kangar

Malaysia

unimap.edu.my

School of Mechatronic Engineering

Universiti Malaysia Perlis, 01000 Kangar

Malaysia

unimap.edu.my

2013

12 11 2013

2013 09 03 2013 26 08 2013 26 08 2013

2013

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

We classify the driver distraction level (neutral, low, medium, and high) based on different wavelets and classifiers using wireless electroencephalogram (EEG) signals. 50 subjects were used for data collection using 14 electrodes. We considered for this research 4 distraction stimuli such as Global Position Systems (GPS), music player, short message service (SMS), and mental tasks. Deriving the amplitude spectrum of three different frequency bands theta, alpha, and beta of EEG signals was based on fusion of discrete wavelet packet transform (DWPT) and FFT. Comparing the results of three different classifiers (subtractive fuzzy clustering probabilistic neural network, K-nearest neighbor) was based on spectral centroid, and power spectral features extracted by different wavelets (db4, db8, sym8, and coif5). The results of this study indicate that the best average accuracy achieved by subtractive fuzzy inference system classifier is 79.21% based on power spectral density feature extracted by sym8 wavelet which gave a good class discrimination under ANOVA test.

1. Introduction

In many countries distraction is responsible for many car accidents. The National Highway Traffic Safety administration (NHTSA) estimates that 100,000 of police-reported crashes because of driver fatigue happened each year [1]. Thereby, it is important to develop automatic detectors of this state. Most of the automatic detection methods are based on analyzing the driver behavior to detect abnormal actions [2] or using image processing technique to monitor and evaluate his head position and eye movement or blinking [3, 4]. Drowsiness can also be identified through electroencephalographic (EEG) signals, which contain alertness information [5]. The EEG plays fatal role into measuring the electrical activity of the brain [6]. Different signal processing techniques, like wavelet transform (WT) [5], means comparison test [7], independent component analysis [8] with different classifiers such as neural networks (NNs) [8–10], and Fussy Logic [11], have been applied to detect drowsiness in EEG signals. Driving is a complex task in which different skills and functions are combined simultaneously; therefore monitoring drivers’ attention regarding brain resources is a strong challenge for researchers and analytics in the field of cognitive brain research and brain interface computer. The level of interference from task-irrelevant stimulus information (conflict), reflected in slowed responses and decreased accuracy for incompatible relative to compatible stimuli, is found to be reduced after the processing of an incompatible as compared with a compatible stimulus [12]. Recently, soft computing had been seen as an attractive alternative, and several methods were developed for trajectory design and robot motion control using neurofuzzy techniques [13]. Data-driven approaches have been widely applied to solve industrial problems encountered in the real life, including control engineering, instrumentation and measurement, computer security, intelligent transportation systems, and vehicles [14]. Causes of distractions during driving were quite widespread, including eating, drinking, talking with passengers, use of cell phones, reading, fatigue, problem solving, and using in-car equipment such as GPS, media player, and in-vehicle entertainment, thus making it likely that the problem of driver is inattention [15–18]. Many researchers have proposed a lot of methods to detect attention change using physiological changes such as eye blinking, heart rate, pulse rate or skin electric potential, and especially brain wave [19]. EEG based methods mainly focus on the monitoring of the alertness variation of driver fatigue due to drowsiness, whereas the detecting approaches of alertness change in tasks requiring sustained attention have been seldomly explored [20]. This work has two objectives: (1) to select the optimal wavelet function for getting the better classification accuracy from the alpha, theta, and beta band features and (2) to determine the classifier which gives better average and individual classification rate. In our work, we have used audiovisual stimuli for evoking four different levels such as neutral, low, medium, and high distraction. Two features, spectral centroid and power spectral density (PSD), are derived using wavelet transform on theta, alpha, and beta band. These numerical features are classified using three different classifiers, namely, K-nearest neighbor (KNN), probabilistic neural network (PNN), and fuzzy inference system. In our last recent work, we used PNN classifier to classify driver drowsiness level (sleepy state) and achieved 61% based on db4, and we expect that this accuracy would be more, if fuzzy classifier had been used [21]. This paper adds on significant solution for driver distraction level related to EEG bands and their position in the packet of the wavelet transform explored mathematically by their designated equations. In this work, a set of four distraction stimuli, namely, media player, GPS, mental task, and SMS message are induced by using audio-visual stimuli. The rest of this paper is organized as follows. In Section 2, we summarize the research methodology by elucidating the data acquisition process. Sections 3 and 4 explain feature extraction using wavelet transform and classification of distraction level by different classifiers, respectively. Section 5 illustrates the overview of the results and discussion of this present work, and conclusions are given in Section 6.

2. Data Acquisition

Mobile phone is considered as the main reason of driver distraction compared to other distraction reasons such as GPS, music and video players, and mental thinking. Therefore, we applied these four distractions to develop suitable database for this work using EEG signals. Figure 1 shows a simulated environment of real driving in one of our university laboratories based on simulation driving software. Infrared camera had been used to capture the driver face image for data validation after finishing the experiment.

Protocol flow. (a) Subject distracted by mobile (mental task). (b) Driving car environment. (c) Subject distracted by GPS. (d) Electors position and signals.

(a) (b) (c) (d)

Before start driving, the subject was asked to initially keep eyes closed for 2 min duration followed by another two minutes for open eyes. After this neutral initialization, the driver was asked to drive for 30 minutes containing different tasks of distraction, each 2 minutes duration, such as media player, GPS, mental thinking by answering few mental questions through mobile phone, and finally he or she should type and send SMS messages. Through this protocol and according to the continuous performance test (CPT), we can determine whether the subject is in low, medium, or high level distraction according to his/her time response through relooking to the screen and controlling the steering wheel continuously. For the first 30 subjects, we first determined visually the 1 sec. duration of distraction (like typing in GPS or SMS messages) and considered as low level. Secondly, for the medium level, the continuous 2 sec. distraction time was extracted, whereas the continuous 3 sec. distraction time is assumed to be as a high level.

In this work, 50 subjects (43 Males and 7 Females) in the age range of 24 years to 34 years have participated. Emotive EEG system is used to acquire the EEG signals over the complete scalp through 14 electrodes (FP1, FP2, F7, F8, F3, F4, T7, T8, P7, P8, O1, O2, A1, and A2). All the electrodes are placed over the subject scalp based on international 10–20 system of electrode placement. EEG signals are acquired at a sampling frequency of 128 Hz and band pass was filtered between 0.05 Hz and 60 Hz. The reference electrode and ground electrode are placed on right and left ear lobes. The impedance of the electrodes is kept below 5 KΩ.

3. Feature Extraction

Brain electrical signals are time-varying and nonstationary signals, which have different frequency elements at different times. Indeed, the EEG signals cannot be considered as stationary even under short duration, since it can exhibit considerable short term nonstationary [22]. Therefore, DWT is a more suitable method to decompose the EEG signal into its different frequency bands and retain the signal information in both time and frequency domain unlike FFT or STFT [22, 23]. In this work, the spectrum features from the EEG signals for different distraction levels are derived from three frequency bands, namely, theta, alpha, and beta, by applying four different wavelets (db4, db8, sym8, and coif5). These wavelet functions have been chosen due to their near optimal time frequency localization properties. Moreover, the waveforms of these wavelets are similar to the waveforms to be detected in the EEG signal, the orthogonal property, and optimal number of filter coefficients for reducing the computational complexity. Therefore, extraction of EEG signals features is more likely to be successful [23]. Due to the nonstationary nature of EEG signals, we need to analyze them onto basis functions created by dilation and shifting the mother wavelet function. In general, the characteristic nature of mother wavelet function should be similar in shape to the original signal under processing. The extracted wavelet coefficients provide a compact representation that shows the energy distribution of the EEG signal in time and frequency [24].

The researchers are utilizing discrete wavelet packet transform (DWPT) for efficient frequency band localization. DWPT decomposes both high and low frequency component of the input signal into any level of decomposition as shown in Figure 2, unlike normal wavelet transform which decomposes only the approximation coefficients in the subsequent levels. In this work, DWPT is used to obtain three frequency bands, namely, theta (4–8 Hz), alpha (8–12 Hz), and beta (14–32 Hz) frequency bands, for distraction detection. PSD estimates of noise signals from a finite number of its samples are based on three fundamentally different approaches, namely, parametric, nonparametric, and subspace method. Though the computation complexity is higher during the PSD computation using DWPT and FFT approach, it gives good classification accuracy on efficiently distinguishing the distraction levels. As a beginning of this research, we computed the PSD feature through DWPT and FFT. In future, we aim to analyze the significance of PSD through DWPT alone for distraction levels classification.

Figure 2

Five level EEG signal decomposition using discrete wavelet packet transform (DWPT).

The mathematical derivation of the approximation coefficients (CA0, CA1, and CAK) is by taking the N samples of the input signal S and extend it to N*=N+2(M-2)+C, as C is a constant which is equal to 0 for even N or 1 for odd N [25]. This extension is highly needed to make matching between the numbers of input samples with the wavelet filter coefficients, and this thing should be applied on each input to any level. Therefore the new extended signal S is as follows: (1)S=[S0,S1,S2,…,SN*-1]. And by applying wavelet decomposition on this S signal by performing convolution of the input samples with low pass filter coefficients of M coefficients as shown in Figure 3 to produce (N*-M)/2 approximation coefficients, we have (2)CA0=S0*h0+S1*h1+⋯+SM-1*hM-1,CA1=S2*h0+S3*h1+⋯+SM+1*hM-1, ⋮CA((N*-M)/2)=S(N*-M)*h0+S(N*-M+1)*h1+⋯+S(N*-1)*hM-1. Convolution of the input signal samples with high pass filter coefficients produces the first level detail coefficients (CD₀, CD₁, and CDK) as follows: (3)CD0=S0*g0+S1*g1+⋯+SM-1*gM-1,CD1=S2*g0+S3*g1+⋯+SM+1*gM-1, ⋮CD((N*-M)/2)=S(N*-M)*g0+S(N*-M+1)*g1+⋯+S(N*-1)*gM-1. The generalized equation for deriving approximation coefficients and detail coefficients for wavelet decomposition is given as (4)CAK=h0S2*K+h1S2*K+1+⋯+hM-1S2*K+M-1=∑i=0M-1hi*Si+2*K,CDK=g0S2*K+g1S2*K+1+⋯+gM-1S2*K+M-1=∑i=0M-1gi*Si+2*K. The basic relation between the input samples and filter coefficients (low pass and high pass) for generating approximation and detail coefficients for any level “b” can be stated as (5)CAb,K=∑i=0M-1hi*CAb-1,i+2*K,CDb,K=∑i=0M-1gi*CDb-1,i+2*K. The general wavelet packet transform equations for deriving theta band (level 4, part 1), alpha band (level 4, part 2), beta 1 band (level 5, part 7) and beta 2 band (level 2, part 1) as shown in Figure 2 are given in (6) to (9), respectively, based on db4 (M=8) as follows: (6)CD4(1),K=∑i=07gi*CD3(0),i+2*K,(7)CA4(2),K=∑i=07hi*CD3(1),i+2*K,(8)CD5(7),K=∑i=07gi*CD4(3),i+2*K,(9)CD2(1),K=∑i=07gi*CA1(0),i+2*K.

Figure 3

WPT of two levels.

3.1. Amplitude Spectrum

Amplitude spectrum is defined as the magnitude of the Fourier transform of a time-domain signal. Every signal can be written as a sum of sinusoids with different amplitudes and frequencies. It can have other names like spectral density, voltage spectrum, power spectrum, and spectral intensity which describes how the power of a signal or time series is distributed over the different frequencies. The frequency spectrum of a time-domain signal is a representation of that signal in the frequency domain. The frequency spectrum can be generated via a Fourier transform of the signal, and the resulting values are usually presented as amplitude and phase, both plotted versus frequency. A signal can be broken into short segments (sometimes called frames), and spectrum analysis may be applied to these individual segments. In this work, the average amplitude of the FFT output of EEG bands wavelet transformed is used to derive two different features, namely, spectral centroid and PSD.

3.1.1. Power Spectral Density (PSD)

Spectral analysis is the distribution of power over frequency. Spectral analysis finds applications in many fields such as speech analysis, monitoring vibration, economics, and sonar systems. In medicine, spectral analysis of various signals measured from a patient, such as electrocardiogram (ECG) or electroencephalogram (EEG) signals, can provide useful material for diagnosis. A random signal usually has finite average power and, therefore, can be characterized by an average power spectral density as (10)PSD(i)=∑i=0N‍|X(k)|2, where X(k) represent the out of FFT and k is the position of the FFT components.

3.1.2. Spectral Centroid Frequency

Spectral centroid frequency is commonly known as subband spectral centroid [7, 10]. The spectral centroid is used to find the center value of the groups for each frequency band. Spectral centroids feature extraction technique was widely used in audio recognition because of its robustness to recognize the dominant frequency and to extract EEG features for stress identification [12, 13]. In this work, the author tried to use this feature for EEG classification. The spectral centroid (C) is calculated using the following formula: (11)C=∫kX(k)dk∫X(k)dk.

3.1.3. Features Extraction Algorithm

(1)

Load the input EEG signal from 14 channels.

(2)

Apply 4th order Butterworth IIR band-pass filter and followed by notch filter to remove the effects of noises and artifacts.

(3)

Perform the framing on the preprocessed signal with duration of 1 second.

(4)

Decompose the EEG signal into five levels using the chosen wavelet function (db4, db8, sym8, and coif5) to extract the wavelet coefficients for theta, alpha, beta 1, and beta 2 frequency bands through DWPT.

(5)

Perform FFT for each frequency band to get the frequency spectrum (12)X(k)=∑n=0N-1x(n)wkn, where w=e-j(2π/N),

where k = position of sample after FFT, x(n) is the input wavelet coefficients corresponding to any of the four frequency bands, n is the number of input sample positions, and N is the maximum length of the input wavelet coefficients.

(6)

Determine the absolute value of FFT to get the PSD and C of the spectrum of each band.

(7)

Add the amount of this PSD and C of each band in this specified channel to the total mean of the said values of each band over the 14 channels

(8)

Take the average of PSD and C of each band by dividing by 14.

(9)

Repeat the above steps from 4 to 8 for the next 1 sec. EEG and continue to perform the analysis for all the active EEG channel.

4. The Classifiers

A standard classification problem generally follows a two-step procedure which consists of training and testing phases. During the training phase, a classifier is trained to achieve the optimal separation for the training data set. Then, in the testing phase, the trained classifier is used to discriminate new samples with unknown class information. As the predictability of the features may vary, an exhaustive method was used to select the best combination of features. That is, try all possible combinations of features and pick up those with best performances. In this paper, three different classifiers have been used to compare the results and choose the most suitable classifier for this distraction level classification purpose.

4.1. PNN Classifier

In this work, PNN architecture is constructed using newpnn () function in MATLAB 7.0. The PNN model is one among the supervised learning networks and has many features different from those of other networks in the learning processes. The data training set was used to train designed PNN. The PNN is tested with testing data set to show the impact on classification rate. The spread value (σ) of the radial basis function (RBF) was used as a smoothing factor, and classifier accuracy was examined with different values of σ. The first step of training the PNN network is by selecting the optimal spread values which control the spread of the RBF functions. If the spread value is too large, then the model will not be able to closely fit the function if the spread value is too small, the model will over fit the data because each training point will have too much influence. In this work smoothing factor of 0.1 value has been used to classify the hypovigilance level.

4.2. <inline-formula><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M41"><mml:mrow><mml:mi>K</mml:mi></mml:mrow></mml:math></inline-formula>-Nearest Neighbor Classifier

The algorithm of classification of new test feature vector is determined by the class of its K-nearest neighbors. This classifier memorizes all vectors in the tanning sets and then compares the test vector with them. Therefore this classifier is called memory based learning. KNN algorithm is based on Euclidian distance metrics to locate the nearest neighbors. The Euclidian distance between the two points X and Y is explained as in (13) (13)Dist(X,Y)=∑i=1D(Xi-Yi)2, where D is the number of coordinates. In this work the K-nearest neighbor value is varied from 2 to 9. The optimal value of K is selected based on the higher classification rate.

4.3. Fuzzy Subtractive Clustering

Fuzzy subtractive (FS) clustering is a fast, one-pass algorithm for estimating the number of clusters and the cluster centers in a set of data. This technique depends upon the measure of the density of data points in the feature space. The aim is to find areas in the feature space with high densities of data points. The point with the highest number of neighbors is considered as the center of a specific cluster. The algorithm will remove the data points within a prespecified fuzzy radius. This process will check all the data points. The radii variable is a vector of entries between 0 and 1 that specifies a cluster center’s range. Small radii values will generate few large clusters. Recommended values for radii should be between 0.2 and 0.5. In this work, a value of 0.5 for all the radii was chosen because this leads to fewer membership functions and less computation time, without losing accuracy. Once the inputs for hypovigilance classification are selected, input membership functions must be determined. The Gaussian membership function shown in Figure 4 is selected since it has continuous derivability. The function is given by μ(x)=e-(x-m)2/2σ2. This function is based on two factors, m and σ, as they represent the center and the width of the Gaussian function, respectively. MATLAB fuzzy logic toolbox provides an important function to generate FS system. This function constructs a set of rules to model the data organization based on the subtractive clustering centers and clusters to allocate antecedent membership functions.

Figure 4

Gaussian member ship function with m=5 and σ=2.

4.4. Data Preparation for Classification

The requirement of generating classifier system is to divide the training data into two data sets. Firstly, an input data set which has 6 values of two features F1 and F2 over three bands (θ, α, and β) [F1,θ, F1,α, F1,β, F2,θ, F2,α, and F2,β], where F1 and F2 represent centroid frequency and power spectral density features, respectively. Hence each vector of the overall 200 vectors contains 6 values. Therefore the overall data inputs are 1200 values over 50 subjects for four levels (50*4*6). Secondly, an output data set (1, 2, 3, or 4) is used for one output. The output is either 1 for neutral or 2 for low level or 3 for medium level, or 4 for high level. These points were placed into a single output data set with 200 values, each 50 values for one class, where 60% of the vectors are used as training (120) and 40% as testing (80).

5. Results and Discussion

This research work is intended to investigate the effects of distraction due to cognitive, visual, and auditory distraction using different stimuli. In this work, we utilized the potential of localizing the frequency bands in EEG signals through DWPT and fusion with FFT for efficient feature extraction to get efficient distraction classification. The significance of these two features, spectral centroid and PSD, are checked based on Analysis of Variance (ANOVA) test over each wavelet (db4, db8, sym8, and coif5) as shown in Table 1.

Table 1

ANOVA test of centroid and PSD features over db4, db8, sym8, and coif5 for each distraction level.

	Neutral	Low	Medium	High	P
db4
Centroid	5.3 ± 50.8	15.5 ± 217.8	10.1 ± 239.8	9.3 ± 286.4	<0.001
PSD	0.41 ± 7.5	1.12 ± 18.8	0.013 ± 0.0007	0.019 ± 0.0039	<0.001
db8
Centroid	1.29 ± 5.12	9.19 ± 113.9	4.6 ± 64.3	4.15 ± 53.8	<0.001
PSD	0.0008 ± 5.4 E - 06	6.78 ± 213.9	0.005 ± 0.0001	0.005 ± 0.0002	<0.001
sym8
Centroid	1.26 ± 3.27	5.6 ± 49.6	5.4 ± 78.7	4.2 ± 61.7	<0.001
PSD	0.0008 ± 4.46 E - 06	2.3 ± 60.5	0.004 ± 8.6 E - 5	0.005 ± 0.0002	<0.001
coif5
Centroid	2.02 ± 5.19	15.6 ± 395.4	10.7 ± 372.7	7.66 ± 199.9	<0.001
PSD	0.002 ± 0.00003	0.03 ± 0.003	0.048 ± 0.024	0.04 ± 0.02	<0.001

All the results are presented as mean ± SD with P values. The ANOVA test with P values generally less than 0.005 suggests that these features measures can be used as classification features. We extracted PSD and centroid frequency features from the amplitude spectrum and performed ANOVA test on four classes of distraction (neutral, low, medium, and high). These two features give excellent P values under ANOVA test as shown in Table 1. Features are computed from 3-second window of the 14 EEG channels, and ANOVA test is used to check if the mean values are different for the different classes. Table 1 shows the results of the amplitude spectrum parameters for different wavelets over the four levels of distraction. The mean centroid frequency magnitude after neutral state seems to decrease from low to medium to high distraction EEG based on db4. Therefore, both said parameters are suitable for differentiating and classification. For db8 these two features cannot differentiate the medium from the high distraction. When sym8 is applied, the mean centroid frequency magnitude starts decreasing from low to medium to high distraction EEG, and the two features are very weak in medium distraction state. Therefore, it is easy to distinguish this state from low and high distraction. It is obvious under coif5 that the centroid is decreasing from low to medium to high state, while PSD almost shows no significant changes. Finally, we concluded that sym8 wavelet is the most suitable wavelet for distraction classification, therefore it gives maximum classification achievement of 79.10% as shown in Tables 2 and 3 using PSD feature for fuzzy classifier which its input vectors distribution is shown in Figure 5 and its structure is shown in Figure 6. Therefore, we considered this wavelet for subsequent analysis.

Table 2

Classification accuracy of different classifiers for different wavelets over 4 distraction levels for both two features.

	Neutral		Low		Medium		High		Average
	Centroid	PSD	Centroid	PSD	Centroid	PSD	Centroid	PSD	Centroid	PSD
Fuzzy
db4	71.17	77.73	67.08	79.21	79.32	84.80	64.87	70.99	70.61	78.18
db8	64.17	73.24	77.29	89.90	75.99	71.73	63.61	79.41	70.27	78.57
sym8	62.51	72.79	82.61	91.99	81.01	70.60	63.65	79.21	72.45	79.21
coif5	65.36	74.94	63.19	66.30	76.81	78.22	62.68	63.55	67.01	70.75
PNN
db4	62.39	58.13	58.14	73.56	65.88	83.20	31.19	63.57	54.40	69.62
db8	52.64	50.31	72.92	90.79	57.56	64.26	25.92	75.48	52.26	70.21
sym8	25.00	49.29	69.22	91.29	77.50	63.31	53.91	79.22	56.41	70.78
coif5	55.42	68.52	50.93	34.01	58.50	60.84	24.80	25.42	47.41	47.20
KNN
db4	44.29	72.33	34.93	61.69	75.79	77.75	55.26	45.49	52.57	64.32
db8	25.31	66.34	60.25	90.13	71.54	46.35	52.98	65.04	52.52	66.97
sym8	50.70	65.53	78.55	91.29	69.00	44.66	28.20	70.91	56.61	68.10
coif5	31.29	52.45	24.74	56.91	70.57	73.49	50.74	53.00	44.34	58.97

Bold values refers to maximum average values.

Table 3

Classification results of KNN, PNN, and fuzzy classifiers over 4 distraction levels based on sym8 using centroid and PSD features.

Distraction based on sym8	% CR	SEN.	SPEC.	TPR	FNR
Neutral
Centroid
KNN	51.29	53.50	46.86	48.85	44.64
PNN	62.39	65.50	56.15	58.95	53.03
Fuzzy	71.17	74.73	64.05	67.26	60.49
PSD
KNN	72.33	75.95	65.10	68.35	61.48
PNN	58.13	61.04	52.32	54.94	49.41
Fuzzy	77.73	81.61	69.96	73.45	66.07

Low
Centroid
KNN	52.93	54.68	49.44	51.01	47.69
PNN	58.14	61.05	52.33	54.94	49.42
Fuzzy	67.08	70.43	60.37	63.39	57.01
PSD
KNN	61.69	64.77	55.52	58.30	52.44
PNN	73.56	77.24	66.20	69.51	62.53
Fuzzy	79.21	83.17	71.29	74.85	67.33

Medium
Centroid
KNN	75.79	79.58	68.21	71.62	64.42
PNN	65.88	69.18	59.30	62.26	56.00
Fuzzy	79.32	83.29	71.39	74.96	67.43
PSD
KNN	77.75	81.64	69.98	73.48	66.09
PNN	83.20	87.37	74.88	78.63	70.72
Fuzzy	84.80	89.04	76.32	80.14	72.08

High
Centroid
KNN	55.26	58.02	49.73	52.22	46.97
PNN	51.19	52.75	48.07	49.47	46.51
Fuzzy	64.87	68.11	58.38	61.30	55.14
PSD
KNN	45.49	47.76	40.94	42.99	38.67
PNN	63.57	66.75	57.21	60.07	54.04
Fuzzy	70.99	74.54	63.89	67.08	60.34

Average
Centroid
KNN	52.57	55.19	47.31	49.67	44.68
PNN	54.40	57.12	48.96	51.41	46.24
Fuzzy	70.61	74.14	63.55	66.73	60.02
PSD
KNN	64.32	67.53	57.88	60.78	54.67
PNN	69.62	73.10	62.66	65.79	59.17
Fuzzy	79.21	82.09	70.36	73.88	66.45

The distribution of the input vectors to the fuzzy classifier over 4 distraction levels: (a) 120 training and (b) 80 testing vectors.

(a) (b)

Figure 6

Fuzzy system structure for two features of 14 EEG channels.

Sensitivity and specificity are commonly used performance measures of binary classification tests. Sensitivity is defined as the proportion of actual positives which are correctly identified as positive, and specificity is the proportion of negatives which are correctly identified as negative. These parameters, namely, accuracy, sensitivity, specificity, true positive rate (TPR), and false negative rate (FNR) can be calculated as follows: (14)Accuracy=TP+TNTP+TN+FP+FN,Sensitivity=TPTP+FN, Specificity=TNFP+FN,TPR=TPTP+FP, FNR=FNFN+FP, where TP is the true positive, TN is the true negative, FP is the false positive, and FN is the false negative.

Table 3 summarizes the classification accuracy (% CR), sensitivity, specificity, TPR, FNR of KNN, PNN, and fuzzy classifiers for the two features (centroid and PSD) under db8. The best performance of classification of 79.21% was achieved by fuzzy using PSD feature based on sym8 as shown in Table 3 with an average sensitivity of 82.09%, specificity of 70.36%, TPR of 73.88%, and FNR of 66.45%. The KNN and PNN classifiers produce maximum classification accuracy of 64.32% and 69.72%, respectively, both based on same wavelet (sym8) and same feature (PSD) as shown in Table 3. Therefore, sym8 wavelet can be considered as the dominant wavelet type to get good accuracy of classification of different levels of distraction based on PSD feature.

Table 4 shows the comparison between the maximum mean distraction classification rate of the previous researchers work and the present work. From this table, the maximum mean classification rate of 92% is achieved on classifying two classes [28]. The maximum classification rate of 89.4% is achieved on classifying two classes based on Fisher linear discrimination method [26]. Junya et al. [27] got maximum classification rate of 75.9% on classifying three classes based on hybrid of physical and performance methods mentioned in Section 1. However the present recognition system used 50 subjects and achieved the average maximum mean rate of 98.7% and 79.21% on classifying two and four different levels of distraction, respectively.

Table 4

Comparison of maximum mean distraction classification rate of present work with that of earlier works.

Reference	Physiological signal	Database	Feature extraction	Classifier	% Accuracy
[26]	EEG	5 Subjects 15 Ch	FFT	Fisher linear discriminate 2 Class	89.4
[27]	EEG, EOG	5 Subjects 30 Ch	FFT	SVM 3 Classes	75.9
[28]	EEG, vehicle behaviour	4 Subjects 15 Ch	WT	SVM 2 Class	92
[29]	EEG	13 Subjs 2 Channels	WT	Self orgnizing map 3 Classes	60
Present work	EEG	50 Subjects 14 Ch	DWPT + FFT	Fuzzy 2 Classes	98.7
Present work	EEG	50 Subjects 14 Ch	DWPT + FFT	Fuzzy 4 Classes	79.21

6. Conclusion

Most of the research works have discussed the classification of driver distraction into two levels based on EEG frequency bands (distracted or nondistracted). In addition, many of the researchers have not attempted to investigate different types of distraction stimuli in the literature. This paper present amplitude spectrum of the three bands (theta, alpha, and beta) of the EEG signal which has been proposed along with the hybrid scheme based on DWT and FFT. Fusions of the above two methods give more significant results on extraction of centroid and PSD features under ANOVA analysis. The proposed methodology has been tested on 50 subjects and provides maximum accuracy of 79.21% using sym8 and subtractive fuzzy inference system for PSD feature with an average sensitivity of 82.09% and of 70.36%. However, we focus on strengthening this present database with more number of subjects for developing a generalized driver distraction detection system using the proposed methodology.

NHTSA

Drowsly driver detection and warning system for commercial vehicle drivers: field proportional test design, analysis, and progress

Washington, DC, USA, National Highway Traffic Safety Administration, http://www.nhtsa.dot.gov/

Ueno

Kaneda

Tsukino

Development of drowsiness detection system

Proceedings of the Vehicle Navigation and Information Systems Conference

1994

15 20

Hayami

Matsunaga

Shidoji

Matsuki

Detecting drowsiness while driving by measuring eye movement: a pilot study

Proceedings of the 5th Internation Conference on Intelligent Transportation Systems

2002

Singapore

156 161

Smith

Shah

da Vitoria Lobo

Determining driver visual attention with one camera

IEEE Transactions on Intelligent Transportation Systems 2003 4 4 205 218

2-s2.0-1342332858

10.1109/TITS.2003.821342

Subasi

Automatic recognition of alertness level from EEG by using neural network and wavelet coefficients

Expert Systems with Applications 2005 28 4 701 711

2-s2.0-17844371713

10.1016/j.eswa.2004.12.027

Crespel

Gélisse

Bureau

Genton

Atlas of Electroencephalography 2006 2 1st

Paris, France

John Libbey Eurotext

Picot

Charbonnier

Caplier

On-line automatic detection of driver drowsiness using a single electroencephalographic channel

Proceedings of the 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS '08)

August 2008

3864 3867

2-s2.0-61849168100

Makeig

Bell

A. J.

Jung

T. P.

Sejnowski

T. J.

Independent component analysis of electroencephalographic data

Advances in Neural Information Processing Systems 1996 8

Cambridge, Mass, USA

MIT Press

145 151

Eskandarian

Mortazavi

Evaluation of a smart algorithm for commercial vehicle driver drowsiness detection

Proceedings of the IEEE Intelligent Vehicles Symposium (IV '07)

June 2007

Istanbul, Turkey

553 559

2-s2.0-47849090182

Yuan

Weichih

Kuo

T. B. J.

Liang-Yu

A portable device for real time drowsiness detection using novel active dry electrode system

Proceedings of the 31st Annual International Conference of the IEEE Engineering in Medicine and Biology Society: Engineering the Future of Biomedicine (EMBC '09)

September 2009

3775 3778

2-s2.0-77951018290

10.1109/IEMBS.2009.5334491

Lin

C.-T.

Liang

S.-F.

Chen

Y.-C.

Hsu

Y.-C.

L.-W.

Driver's drowsiness estimation by combining EEG signal analysis and ICA-based fuzzy neural networks

Proceedings of the 2006 IEEE International Symposium on Circuits and Systems (ISCAS '06)

May 2006

2125 2128

2-s2.0-34547296519

Egner

Multiple conflict-driven control mechanisms in the human brain

Trends in Cognitive Sciences 2008 12 10 374 380

2-s2.0-52049122113

10.1016/j.tics.2008.07.001

Khoukhi

Data-driven multi-stage motion planning of parallel kinematic machines

IEEE Transactions on Control Systems Technology 2010 18 6 1381 1389

2-s2.0-77954471746

10.1109/TCST.2009.2036600

J.-X.

Hou

Z.-S.

Notes on data-driven system approaches

Acta Automatica Sinica 2009 35 6 668 675

2-s2.0-67650553656

10.3724/SP.J.1004.2009.00668

Horberry

Anderson

Regan

M. A.

Triggs

T. J.

Brown

Driver distraction: the effects of concurrent in-vehicle tasks, road environment complexity and age on driving performance

Accident Analysis and Prevention 2006 38 1 185 191

2-s2.0-27644483179

10.1016/j.aap.2005.09.007

Dukic

Hanson

Falkmer

Effect of drivers' age and push button locations on visual time off road, steering wheel deviation and safety perception

Ergonomics 2006 49 1 78 92

2-s2.0-30844456582

10.1080/00207540500422320

Hancock

P. A.

Lesch

Simmons

The distraction effects of phone use during a crucial driving maneuver

Accident Analysis and Prevention 2003 35 4 501 514

2-s2.0-0037988749

10.1016/S0001-4575(02)00028-3

Crundall

Van Loon

Underwood

Attraction and distraction of attention with roadside advertisements

Accident Analysis and Prevention 2006 38 4 671 677

2-s2.0-33646359919

10.1016/j.aap.2005.12.012

French

A model to predict fatigue degraded performance

Proceedings of the 7th Conference on Human Factors and Power Plants

September 2002

416 419

2-s2.0-0036439970

Bil

Zhang

Chen

Study on real-time detection of alertness based on EEG

Proceedings of the IEEE/ICME International Conference on Complex Medical Engineering (CME '07)

May 2007

1490 1493

2-s2.0-48149111635

10.1109/ICCME.2007.4381994

Mousa

M. K.

Murugappan

Badlishah

A. R.

PNN based driver drowsiness level classification using EEG

Journal of Theoretical and Applied Information Technology 2013 52 3

Acharya

U. R.

Faust

Sree

S. V.

Alvin

A. P. C.

Krishnamurthi

Seabra

J. C. R.

Sanches

Suri

J. S.

Atheromatic symptomatic vs. asymptomatic classification of carotid ultrasound plaque using a combination of HOS, DWT and texture

Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS '11)

September 2011

Boston, Mass, USA

4489 4492

2-s2.0-84055193902

10.1109/IEMBS.2011.6091113

Wali

K. M.

Murugappan

Badlishah

A. R.

Zheng

Development of discrete wavelet transform (DWT) toolbox for signal processing applications

Proceedings of the International Conference on Biomedical Engineering (ICoBE '12)

February 2012

Penang, Malaysia

27 28

Rizon

Murugappan

Nagarajan

Yaacob

Asymmetric ratio and FCM based salient channel selection for human emotion detection using EEG

WSEAS Transactions on Signal Processing 2008 4 10 596 603

2-s2.0-58249096164

Wali

K. M.

Murugappan

Badlishah

A. R.

Mathematical implementation of hybrid fast Fourier transform and discrete wavelet transform for developing graphical user interface using visual basic for signal processing applications

Journal of Mechanics in Medicine and Biology 2012 12 5

Brahim

Zehang

Gvan

Phua

Tee

Keng

Learning EEG-based spectral-spatial patterns for attention level measurements

Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS '09)

2009

Taipei, Taiwan

1465 11468

10.1109/ISCAS.2009.5118043

Junya

Touyama

Hirose

Extracting Alpha band modulated using visual spectral attention without flickering stimuli using common spatial pattern

Proceedings of the 80th Annual International IEEE EMBS Conference

2008

British Columbia, Canada

Vancouver

Alfredo

Jammes

Esteve

Mendola

Driver Hypovigilance diagnosis using wavelets and statistical analysis

Proceedings of the International Conference on intelligent transportation system

2002

162 167

10.1109/ITSC.2002.1041207

Seung

Abullav

Analysis of attention deficit hyperactivity disorder in EEG using wavelet transform and self organized maps

Proceedings of the International conference on control, Automation and Systems

2010

Singapore