Extraction of Optimal Measurements for Drowsy Driving Detection considering Driver Fingerprinting Differences

Contributions of measurements for detecting drowsy driving are determined by calculation parameters, which are directly related to the accuracy of drowsiness detection. The previous studies utilized the same Uniﬁed Calculation Parameters (UCPs) to compute each driver’s measurements. However, since each driver has unique driving behavior characteristics, namely, driver ﬁngerprinting, Individual Drivers’ Best Calculation Parameters (IDBCPs) making measurements more discriminative for drowsiness are various. Regardless of the diﬀerence in driver ﬁngerprinting among the drivers being tested, using UCPs instead of IDBCPs to compute measurements will limit the drowsiness-detection performance of the measurements and reduce drowsiness-detection accuracies at the individual driver level. Thus, this paper proposed a model to optimize calculation parameters of individual driver’s measurements and to extract individual driver’s measurements that eﬀectively distinguish drowsy driving. Through real vehicle experiments, we collected naturalistic driving data and subjective drowsy levels evaluated by the Karolinska Sleepiness Scale. Eight nonintrusive drowsiness-related measurements were calculated by double-layer sliding time windows. In the proposed model, we ﬁrstly applied the Wilcoxon test to analyze diﬀerences between measurements of the awake state and drowsy state, and constructed the ﬁtness function reﬂecting the relationship between the calculation parameters and measurement’s drowsiness-detection performance. Secondly, the genetic algorithms were used to optimize ﬁtness functions to obtain measured IDBCPs. Finally, we selected measurements calculated by IDBCPs that can distinguish drowsy driving to constitute individual drivers’ optimal drowsiness-detection measurement set. To verify the advantages of IDBCPs, the measurements calculated by UCPs and IDBCPs were, respectively, used to build driver-speciﬁc drowsiness-detection models: DF_U and DF_I based on the Fisher discriminant algorithm. The mean drowsiness-detection accuracies of DF_U and DF_I were, respectively, 85.25% and 91.06%. It indicated that IDBCPs could enhance measurements’ drowsiness-detection performance and improve the drowsiness-detection accuracies. This paper contributed to the establishment of personalized drowsiness-detection models considering driver ﬁngerprinting diﬀerences.


Introduction
Drowsy driving is a typically dangerous driving behavior, which seriously threatens traffic safety and causes substantial financial costs to individuals and society [1][2][3]. Drowsy driving is a common phenomenon that results from multiple underlying reasons, including prolonged driving hours and sleep deprivation [3,4]. A review of previous studies indicated that there was an obvious association between drowsy driving and the risk of traffic accidents [4,5].
e studies pointed out that 7% of all accidents and 16.5% of all road casualties were related to drowsy driving [6]. And, in the European Union, about 20% of commercial transport crashes were caused by drowsy drivers [7]. It was found that the risk of a near-crash event was significantly increased when drivers were drowsy after a night shift [8]. Besides, it is a problem to judge whether a traffic accident is caused by drowsiness [9]. Unlike drunk driving, many accidents related to drowsiness have not been reported due to the lack of objective criteria for judging drowsiness occurrence, and the actual hazards of drowsiness driving may be more serious [10]. erefore, it is of great significance to study the accurate real-time anti-drowsiness warning system.
Drowsiness-detection models are at the core of the antidrowsiness warning systems, which are mainly divided into two categories: intrusive and nonintrusive [11,12]. e intrusive drowsiness-detection models rely on contact devices to collect physiological data, such as electroencephalograms [13], electromyography [14], etc. For example, researchers [15] obtained the signal of heart rate variability (HRV) from surface electrocardiogram and trained drowsiness detector using 7 features derived from HRV, and the positive predictive value reached 0.96. Although the drowsiness-detection using physiological signal is more accurate, the intrusiveness of collecting data limits the practicability [14]. e nonintrusive drowsiness-detection models use noncontact sensors to collect data nonintrusively to detect drowsy driving, such as steering wheel angle, lane position, eye movement data, etc. [14,16]. e nonintrusive drowsiness-detection models are more practical because they have little interference with drivers and fewer restrictions [14]. So, nonintrusive drowsiness-detection methods have always been a hotspot in drowsy driving research [12,17]. As reported in much literature, the researchers established various drowsiness-detection models based on nonintrusive measurements, such as lane position derived measurements [18], steering wheel angle derived measurements [17], and the ocular movement derived measurements [19] and obtained relatively accurate drowsiness-detection results.
us, this paper concentrated on nonintrusive measurements and models of drowsiness detection. e sensitivity of nonintrusive measurements to drowsiness depends on the time window setting method and calculation parameters of measurements, which greatly determines the accuracy of drowsiness-detection [18,19]. Many studies concerning measurements and detection methods of drowsy driving have been conducted, which can be found in some review literature [11,12,14,[20][21][22]. is paper focuses on the optimization of calculation parameters of nonintrusive drowsiness measurements. erefore, we summarize some literature that typically uses nonintrusive measurements to detect drowsiness and gives specific computational parameters. Table 1 is about the apparatus, time window setting, and parameters for nonintrusive measurements calculation in partial drowsy driving studies. It can be seen that time window setting and calculation parameters of nonintrusive measurements in studies are inconsistent, which indicates that these factors affect research outcomes of drowsy driving. erefore, the calculation parameters of the measurements need to be optimized to improve the drowsinessdetection performance of nonintrusive measurements.
Most of the drowsy driving studies calculated the measurements of each driver by the same Unified Calculation Parameters (UCPs), without considering individual differences in measurements' calculation parameters, which are results of driver fingerprinting differences among drivers. It has been discovered that each person not only has unique physical characteristics but also behavioral characteristics [31]. For example, every person has his or her behavioral characteristics in shopping, writing, reading, etc. Driver fingerprinting is each driver's unique driving behavior characteristics during the driving duration [32], which are reflected in the steering wheel angle, speed, eye movement, and other driving behavior data [33,34]. e driver fingerprinting is stable and unique [33][34][35]. So, several researchers have tried to describe driver fingerprinting to identify the drivers. Finker et al. [35] collected multiple CAN-bus signals by field experiments, and cepstral feature extraction techniques were used to analyze brake pedal signals and gas pedal signals. ey used artificial neural networks to learn driver fingerprinting characteristics and performed driver-identification. e identification accuracy yielded 84.6%. Xun et al. [36] used actual vehicles to collect naturalistic driving behavior data, including speed, steering wheel angle, accelerator pedal signal, etc. ey built a model to achieve driver fingerprinting based on a convolutional neural network and support vector domain description and used driver fingerprinting to accurately identify drivers. e above studies utilized the differences in driver fingerprinting among drivers to accurately identify drivers, which proves that drivers have stable driving behavior characteristics that are different from others. And, the differences in driver fingerprinting were also the reason for the individual differences in some aspects of drowsy driving studies, such as the calculation parameters, data distribution, and drowsiness-detection threshold of drowsiness-related measurements [37][38][39].
Many scholars have noticed individual differences caused by driver fingerprinting differences in drowsy driving detection research [18,38,40,41]. Most drowsiness-detection models were generalized models that were trained by blending measurements of all drivers, and these models detected the drowsiness of each driver utilizing the same drowsiness threshold [14]. e differences in driver fingerprinting were the important reasons for the low reliability of the nonintrusive generalized drowsiness-detection models [39,42,43]. It was frequently mentioned that, due to the driver fingerprinting differences, drivers differed considerably in data distribution of measurements such as SDLP [39], the standard deviation of steering wheel movement (SDSWM) [38], PERCLOS [39], eye closure duration [43], etc. For instance, Inger et al. [37] analyzed the correlation between self-reported drowsiness level and blink duration and SDLP. e results indicated that there were obvious differences among drivers in the distribution of blink duration and SDLP under various levels of drowsiness. ey also pointed out that the drowsiness-detection at the individual driver level suffered from systematic errors if the drowsy driving was detected by the same drowsiness-detection threshold for all drivers. Moreover, researchers explored reasons for differences in driver fingerprinting that affected the drowsiness-identification. Silveira, et al. adopted features derived from electrocardiographs and eye movement signals to build the classifiers of subject-dependent driver sleepiness. ey found that physiological signals presented obvious individual characteristics, while subjectindependent drowsiness classifiers that ignored individual differences performed worse [44]. Similarly, Persson, et al. used HRV measurements to build a drowsiness classifier based on machine learning and pointed out that the accuracy decreased dramatically for a new driver because of individual differences [45]. Yan et al. [41] analyzed driver fingerprinting differences in drowsiness-detection models and found that differences in measurement distribution among drivers could be beyond differences caused by drowsiness when all drivers' measurements were mixed to train models, which decreased the correlation between measurements and drowsiness. erefore, the accuracies of drowsiness-detection models without driver fingerprinting differences were impaired.
For eliminating the negative influence of driver fingerprinting differences on drowsiness-detection accuracy, scholars attempted to use measurements of individual drivers to establish driver-specific models and achieve more accurate drowsiness-detection [39,41,43,46,47]. Chu et al. [47] recorded naturalistic driving data by actual vehicles, and trained drowsiness-detection models by the speed and lane departure value of individual drivers, based on the combined model of neural network and support vector machine. e drowsiness-identification rate of the model for some drivers exceeded 90%, which verified that driver-specific models could improve the accuracy of drowsiness-detection. You, et al. [46] handled differences in eye movement behavior data among drivers, extracted ocular measurements like eyes aspect ratio, and used ocular measurements of individual drivers to establish the driver-specific drowsiness-detection models. e accuracy of the proposed model reached 94.8%, which outperformed generalized drowsiness-detection models. Wang et al. [39] collected multisource driving behavior data by simulating driving experiments and utilized 23 nonintrusive measurements of individual drivers to build the personalized drowsy driving models based on the multilevel logit model. e results verified that the drowsiness-detection accuracy of the personalized model considering driver fingerprinting differences was higher. Naurois et al. [48] used data of a group of drivers to train the drowsiness-detection model based on artificial neural networks and used adaptive learning to personalize the drowsiness-detection model for a new driver. e model enhanced drowsiness-detection performance by roughly 40%. e above driver-specific drowsiness-detection models improved the accuracy of drowsiness-detection, which was attributed to adopting individual drivers' specific drowsiness-detection thresholds. However, in the above driverspecific models, measurements of each driver were still computed by UCPs rather than Individual Drivers' Best Calculation Parameters (IDBCPs), which restrained contributions of measurements for detecting drowsiness. Due to differences in driver fingerprinting among drivers, IDBCPs that made measurements more discriminative for drowsiness were various [18,49]. For some drivers, IDBCPs of measurement are very different from UCPs. And, using UCPs to compute individual driver's measurements restrains the drowsiness-detection performance of measurements, which decreases the accuracy of driver-specific drowsiness-detection models. Similarly, Zhang et al. [18] found that there were considerable differences in IDBCPs and using the UCPs to calculate measurements might weaken the correlation between measurements and drowsiness levels for some drivers. erefore, the purpose of this paper was to build a novel model to optimize the measurements' calculation parameters of the individual driver. And, measurements calculated by IDBCPs which could validly distinguish drowsiness were chosen to compose individual drivers' optimal measurement set of drowsy driving. is paper contributes to drowsiness measurement calculations and the establishment of drowsiness-detection models considering driver fingerprinting differences. e rest of this paper is arranged as follows. In Materials and Methodology, we described the experiment details, measurement calculation method, the extraction model of optimal drowsy driving measurements of individual drivers, and the validation method of the drowsiness-detection advantages of measurements calculated by IDBCP. In Results, we presented the driver fingerprinting differences of measurements using UCPs, the results of the model, and the drowsiness-detection advantages of using IDBCPs to calculate nonintrusive measurements. e discussion of the results, the important contributions, and the application prospects of this paper are introduced in Discussion. And, the summary of research work, research limitations, and future work are given in Conclusions.

Materials and Methodology
2.1. Experimental Design 2.1.1. Participants. According to the actual proportion of male and female commercial vehicle drivers locally, 35 male drivers and 5 female drivers were recruited as participants and numbered. All participants were professional drivers with rich driving experience and proficient driving skills. eir age ranged from 34 years old to 57 years old (mean � 46.83, SD � 5.62); the driving age ranged from 3 years to 32 years (mean � 16.53, SD � 6.10). Participants did not have any sleep-related disorders and other physical or mental illnesses, and they took no pharmaceuticals within 1 month before the experiments. On the co-pilot, we arranged one coach with 30 years of driving experience as a safety officer to accompany every participant. e safety officer took countermeasures during emergencies and was trained to inquire and record the participants' self-reported drowsiness level using the Karolinska Sleepiness Scale (KSS). e KSS ranges from one to nine, where one and nine, respectively, represent extremely alert and extremely drowsy [50].
e safety officer has a rich long-distance driving experience and strong anti-fatigue abilities. To prevent the safety officer from using KSS due to excessive fatigue, the safety officer should have a good rest when arriving at the service area. Besides, during driving duration, the safety officer kept relatively alert by drinking coffee and refreshing energy beverages and listening to pop music, etc.

Devices.
e major experimental devices and routes are shown in Figure 1.
e experimental vehicle was modified from the actual vehicle, equipped with a variety of sensors. e multisource naturalistic driving behavior data were collected timely. e three-way HD camera (frame rate was 30 Hz) could capture the driver's facial characteristics, driving operation details, and the traffic environment outside the vehicle including road linearity and traffic flow. e HD cameras were Logitech C910. e driver state sensor (sampling frequency was 60 Hz) was W5Y-DSSV3 (AR-ES5430SM) produced by Seeing Machines, which used an image recognition algorithm to monitor the driver's eyes and eyelid movements.
e Mobileye C2-270 (sampling frequency was 15 Hz) could detect lane lines and output the distance between the vehicle center and the lane lines. e inertial navigation system (sampling frequency was more than 20 Hz) could collect acceleration, velocity, and GPS, which was RT2500 developed by Oxford Technical Solutions.
e industrial computer could store raw multiplesource experimental data, which was SIMATIC Rack PC 847B of SIEMENS. e steering wheel angle sensor was LWS 5 of BOSCH (sampling frequency was more than 20 Hz), which collected the steering wheel angle. Besides, other materials were also used, such as the KSS, demographic information scale, etc.

Implementation Process.
Before experiments, the participants were notified of the important information including the experimental process and the possible risk, and each participant signed informed consent. Participants were trained to understand KSS correctly for ensuring the reliability of self-reported drowsiness levels. en, participants were familiarized with the experimental vehicles and made sure not to take beverages, such as caffeine, that would affect the measure of drowsiness. On the night before the experiment, participants kept a normal schedule and ensured adequate sleep duration. Before driving, participants were required to report their sleep duration the previous night and current KSS. In the experiment, participants were required to drive the car along the route shown in Figure 1 with their usual driving habits. e experimental road section was G70 (twoway four-lane) Expressway from Wuhan to Xiangyang, China. e road section was flat and relatively straight. Its traffic flow was simple. So the influence of road shape and other vehicles' disturbance on driving behavior could be eliminated such as steering and lane-keeping, which was out of the scope of the current work. e factor of weather, such as snow, rain, and fog, can also affect naturalistic driving behavior [51], which can interfere with the analysis of the effects of drowsiness on driving behaviors. To control irrelevant variables, all experiments were carried out under the condition of well-lit weather to ensure that the temperature, humidity, and other environmental factors were consistent. Besides, each participant started the experiment at 9 Am on weekdays to avoid the influence of traffic volume changes. All participants entered the G70 highway from the Fuhe toll station. After about 2 hours, participants reached the Suizhou service area and took a 1-hour break. en, participants drove to Xiangyang north toll station, turned around at this point, and returned to the start point. e total driving time was about 6 hours and the total travel was about 600 kilometers in this experiment. During the process of driving, the noise level was controlled to keep a comfortable environment and prevent noise from influencing the physiological and psychological state of participants. Participants were required to drive at a speed limit of 120 km/h. e same safety officer inquired to record 40 participants' self-report KSS every 5 minutes.
After completing the whole experiment, participants were paid for their participation. And, the experimental designs met the guidelines of the local ethics committee and the personal information of participants was properly secured. Finally, 35 participants' qualified experimental data were obtained. Valid data of 5 participants were not collected due to poor experimental conditions or equipment problems.

Data Preprocessing.
According to the flow in Figure 2, raw experimental data were preprocessed to obtain the drowsy driving measurements that meet the requirements of the model.

Synchronizing Multisource Data.
Based on the timestamp recorded by the sensor, we synchronized multisource data including videos, self-reported KSS, lane position, steering wheel angle, velocity, and so on. Because the driver's drowsiness level was evaluated every 5 minutes, we divided the raw data into samples within 5 minutes to avoid the sample crossing two KSS.

Choosing Data of the Continuous Driving Scene.
e driving behavior in the continuous driving scene on the motorway was chosen to analyze. e driving operation is more monotonous when the driver is driving continuously on the motorway, and the risks of drowsy driving are greater because of the high speed [39]. According to the videos outside the vehicle, we extracted the driving duration under the continuous driving scene and selected the data of the continuous driving scene.

Setting Double-Layer Sliding Time Window and Calculating Measurements.
e research in the field experiments found that the drowsy driving state generally lasted 15-75 seconds, while the typical drowsy operation characteristics duration was generally 5-20 seconds [30]. When measurements are calculated by data in the entire drowsy state duration, the calculating average effect covers the drowsy operating characteristics [1,18]. us, the doublelayer sliding time window was proposed to divide raw data into samples and calculate measurements.
Firstly, we set the first-layer sliding step (S 1 ) and time window (T 1 ) within every 5 minutes. e experimental data in T 1 were data of a sample, and T 1 was a fundamental unit for calculating measurements and detecting drowsiness. e data in one T 1 could be used to calculate one measurement value, which was one measurement sample. Secondly, within each T 1 time window, we set the second-layer sliding step (S 2 ) and time window (T 2 ). e data in T 2 were used to calculate the measurement. To magnify the drowsy driving characteristics in the sample (T 1 ), the maximum measurement for all T 2 in every T 1 was chosen as the final measurement value of the sample (T 1 ). For instance, when calculating SDLP, if T 1 � 60 s, S 2 � 5 s, T 2 � 20 s, then T 1 included nine T 2 and the lane position data in each T 2 can be used to calculate one SDLP; we chose the maximum SDLP of nine T 2 as the final SDLP of the sample T 1 . e number of T 2 was determined by the size of S 2 . e smaller the S 2 is, the more T 2 can be obtained to select max drowsiness measurement value within T 1 . But if the amount of T 2 was too large, the calculation amount of the model would increase and the calculation speed of the model would reduce. Besides, there are certain gaps between typical fatigue operating features, S 2 cannot be too small. us, for reducing the amount of computation, avoiding or missing the best S 2 , and reducing the degree of freedom, we design the optimization range of S 2 to be an integer between 2 and 5.

Dividing Measurement Samples into Samples of Awake State and at of Drowsy State.
e KSS of T 1 was defined as the KSS of measurement samples. Referring to the literature [18,52], measurement samples with KSS less than 7 belonged to samples of the awake driving state; otherwise, they belonged to samples of the drowsy driving state. In Figure 3, we drew the KSS overtime of 35 participants. e KSS equals zero means stop because of rest or some breakdowns. As shown in Figure 3, with the increase in driving time, KSS gradually increased. Before and after rest in the service area, the KSS peak appeared with the increase of driving time. And after the rest in the service area, KSS decreased to a certain extent, but KSS rapidly increased with the increase of driving time.

Measurement Calculation.
Drowsiness can affect the driver's steering wheel rotation, eye movement, and other aspects [12]. According to related literature [12,18,39], eight drowsiness-related nonintrusive measurements of different categories were chosen to analyze. In the previous studies, measurements of all drivers were calculated by UCPs. For comparing with the drowsiness-detection contributions of measurements calculated by IDBCPs, we also used suitable UCPs to calculate measurements.
We utilized the Wilcoxon test (introduced in Section 2.4.2) to analyze the differences between measurements of the awake state and that of drowsy state. For a participant, if P value < 0.05, the measurement can distinguish drowsy driving [53]. When measurements calculated by UCPs were used to build drowsiness-detection models without considering individual differences, the number of participants whose drowsiness-driving can be detected by the model using unified parameters should be as many as possible. Given UCPs, we calculated measurements by UCPs and did the Wilcoxon test on individual driver's measurements. e more the participants with P value < 0.05 are, the better the UCPs are. And for a participant, the same original experimental data were used to calculate UCPs and IDBCPs, so the experimental setting had no influence on the results. us, through iterating various UCPs and comparing the number of drivers whose drowsiness can be detected by measurements calculated by different UCPs, we found the suitable UCPs of each measurement which can make the number of participants whose P value < 0.05 maximum.
According to reference [18,30], we designed the value ranges of UCPs. T 1 corresponds to the duration of the drowsy state (15 s-75 s), and T 2 corresponds to the duration of the typical characteristics of drowsiness (5 s-20 s). erefore, the value range of T 1 and T 2 is designed to be 15-75 and 5-20, respectively. e frequencies of sensors were above 20 Hz, and every second contains a lot of data, reflecting the driver's operation details. erefore, the drowsiness characteristics can be obtained by using the related smaller T 2 to calculate measurements. And, the larger T 2 can contain the nondrowsy data and weaken the drowsiness characteristics. S 1 and S 2 affect the quantity of T 1 and T 2 , respectively. Using small S 2 is convenient for selecting max drowsiness measurement value within T 1 . However, if S 2 is too small, it will increase the number of T 2 and increase the amount of model computation. Also, there are gaps among typical fatigue operating features. To make sure the best quality and reduce the calculation amount, we set the range of S 1 and S 2 to be 2-5 and 7-10, respectively, to balance the amount of computation and reduce the degree of freedom.   Journal of Advanced Transportation Calculation formulas of some complicated measurements are as follows.
② SRR measures the frequency of sampling points whose angle exceeds the wheel angle threshold, which can reflect the stability of steering wheel control [18].
N thr is the number of sampling points whose angle exceeds the thr and N all is the total number of sampling points in the time window.
④ VSA can eliminate the influence of road curvature on steering wheel angle to some extent, which reflects the relationship between drowsiness and steering wheel angle [54].
MSWA and SDSWA are, respectively, the mean value and standard deviation of the steering wheel angle in the time window.
⑧ SDPE describes the variation of PERCLOS in the time window, which reflects the blink frequency.
PE i is the PERCLOS at sampling point i, and n pe is the total number of PERCLOS sampling points in the time window. e PERCLOS is directly output by DSS. Within a certain period, DSS calculates the proportion of time the eyes are at least 80 percent closed. en, for increasing the volume of data, DSS obtains the PERCLOS of each sampling point through interpolation. And, the sampling frequency of PERCLOS is about 60 Hz. erefore, we can calculate the mean and standard deviation of PERCLOS in T 2 .

Extraction Model of Individual Driver's Optimal
Drowsiness-Detection Measurements 2.4.1. Structure of the Model. e framework for the extraction model of optimal drowsiness-detection measurements of individual drivers based on the Wilcoxon test and genetic algorithm (GA) is shown in Figure 4.
As shown in Figure 4, the model is divided into three parts. Measurements of the individual driver are input into this model to obtain IDBCPs and compose the individual driver's optimal drowsiness-detection measurement set. In part A, through performing the Wilcoxon test on measurements of the awake state and that of drowsy states, we obtain |Z-statistics| (|Z|) representing the drowsiness-detection performance of measurements and construct the   fitness function reflecting the correspondence between the calculation parameters and measurements' drowsinessdetection performance. In part B, the GA is used to optimize the fitness function and obtain the measurements' IDBCPs. In part C, we complete the Wilcoxon test on measurements that are computed by IDBCPs. We choose the measurements whose P value < 0.05 to compose the individual driver's optimal drowsiness-detection measurement set.

Establishing Fitness Function of GA.
Firstly, the measurement samples are computed by the given calculation parameters. And, according to KSS, measurement samples are divided into samples of the awake state or that of the drowsy state.
Secondly, we perform the Wilcoxon test on measurements of the awake state and that of the drowsy state and obtain Z-statistics. Measurement sample sizes of the awake state and that of the drowsy state are different and samples are not normally distributed. erefore, the Wilcoxon test was used to analyze the differences between the measurement samples of the awake state and that of the drowsy state.
e Wilcoxon test is to analyze whether differences between unpaired samples from two groups are statistically significant without requesting normal distribution [53]. e bigger the |Z|, the greater the difference between measurement samples of the awake state and drowsy state. us, |Z| represents the drowsiness-detection performance of measurements. e method of calculating Z-statistics is as follows.
Measurement samples in the awake driving state are A: A 1 , ..., A i , ..., A n , and measurement samples in the drowsy driving state are D: D 1 , ..., D j , ..., D m . e sample size of the awake state and that of the drowsy state is, respectively, n and m. Measurement samples of awake state and that of drowsy state are mixed. e mixed measurement samples are sorted in the ascending order.   Journal of Advanced Transportation R i and R j are the rank in mixed samples of the awake sample A i and the drowsy sample D j , respectively.N(0, 1) means that the data are normally distributed.
Finally, the fitness function is established as follows: C(C 1 , C 2 , . . . , C j ) is a vector consisting of independent variables and C j was the j-th calculation parameter of the measurement. Take SRR in Table 2 as an example; the calculation parameters of SRR are S 1 , T 1 , S 2 , T 2 , and r. erefore, C(C 1 , C 2 , . . . , C j ) of SRR is C(S 1 , T 1 , S 2 , T 2 , Thr). We can get the corresponding |Z i | when a set of calculation parameters C i (C i1 , C i2 , . . . , C ij ) is given. erefore, the fitness function reflects the relationship between the calculation parameters and the drowsiness-detection performance (|Z|) of the measurement. And, the larger the fitness value (|Z|) is, the better the calculation parameters of the measurement are.

Optimizing Calculation Parameters Using GA.
e genetic algorithm is a method of searching for the optimal solution based on population intelligence with advantages of simplicity, high efficiency, and good robustness [55]. us, we used GA to optimize the fitness function (|Z| � E(C)) to obtain measurements' IDBCPs. Chromosomes are individuals, and multiple individuals constitute a population. e flow chart of GA is shown in Figure 5. And the main steps of GA are introduced as follows: Encoding measurement's calculation parameters: a chromosome consists of calculation parameters of measurements that need to be optimized. We use binary coding and decoding because of its advantages of high search efficiency and easy convergence, and convert calculation parameters into binary strings of different lengths according to the solution space and accuracy of the calculation parameters. For instance, when we take SDLP in Table 2, the calculation parameters of SDLP are S 1 , T 1 , S 2 , T 2 , and for SDLP, chromosome � [S 1 , T 1 , S 2 , T 2 ] � (6, 45, 4, 18) 10 � (110 , 101101, 100, 10010) 2 . Calculating fitness value: the chromosome which represents measurement's calculation parameters is input into the function (|Z| � E(C)).
rough the Wilcoxon test on measurements, the fitness value (|Z|) of the chromosome is obtained. Choosing reproduction: chromosomes with larger fitness values are more likely to be selected for reproduction (crossover or mutation) [55]. e probability of the chromosome being selected equals to |Z i |/ |Z i |. Z i is the fitness value of chromosome i. And, we obtain better chromosomes by roulette selection. Crossover: two chromosomes are randomly selected to crossover according to the cross probability (P cr ), and then a pair of nodes are randomly selected for the exchange to generate two new chromosomes.
Mutation: a chromosome is randomly select to mutate according to the mutation probability (P mu ). And, then a node on its binary string is randomly selected to replace it with an allele value. Relative process parameters of GA: learn from related research [55], parameters of GA are designed as follows, N � 50 (N is the population size), P cr � 0.6, P mu � 0.01. Termination condition: after the genetic algebra reaches 200, GA is ended. Finally, by decoding, the binary optimal solution is converted to decimal, and IDBCPs of measurements are output.
In this paper, for fusing multiple measurements of a sample (T 1 ) to identify drowsiness, all measurements should have the same S 1 and T 1 . First, some groups (S 1 , T 1 ) are generated by enumeration. en, all measurements use the same (S 1 , T 1 ), and the GA is used to optimize the remaining calculation parameters to obtain the maximum |Z max | of each measurement under the same (S 1 , T 1 ). Finally, (S 1 , T 1 ) with the largest sum of all measurements' |Z max | is chosen as the best (S 1 , T 1 ) of all measurements. Finally, (S 1 , T 1 ) with the largest sum of all measurements' |Z max | is chosen as the best (S 1 , T 1 ) of all measurements, which together with other optimal parameters constitutes the measurements' IDBCPs.

Composing Individual Driver's Optimal Drowsiness-Detection Measurement Set.
Although we optimize the calculation parameters of individual driver's measurements in part B, whether the measurements calculated by IDBCPs can effectively detect drowsy driving needs further test. us, we perform Wilcoxon test on measurements calculated by IDBCPs, and then measurements with P-value < 0.05 are selected to compose the optimal drowsiness-detection measurement set of the individual driver.

Validation Model of Individual Driver's Measurements Calculated by IDBCPs.
For verifying the drowsiness-detection advantages of measurements' IDBCPs, for the individual driver, the multiple nonintrusive measurements of individual drivers were integrated to build driver-specific drowsiness-detection models based on the Fisher discriminant algorithm. We, respectively, used drowsiness-detection measurements calculated by UCPs and IDBCPs to build two kinds of driver-specific drowsiness-detection models named DF_U and DF_I. e drowsiness-detection advantages of using IDBCPs are illustrated by comparing drowsiness-detection accuracies of DF_U and DF_I.
Fisher discriminant algorithm is an effective classification method, which simplifies the problem by projecting the points in the high-dimensional space [56].
Journal of Advanced Transportation 9 DS 1 and DS 2 are the function values of the awake state and drowsy state, respectively. α(α 1 , α 2 , . . . , α 6 ), β(β 1 , β 2 , . . . , β 6 ), b 1 , b 2 are the algorithm coefficient, which is obtained by using training samples to train the model. P i (i � 1, 2) is the drowsiness-detection measurement of individual drivers such as SDLP and SRR. P i (i � 1, 2) is the probability that the sample belongs to the awake state or drowsy state. e driving state with a higher probability is the identification result of the model.

Results
To save paper space, 6 participants (No. 5,No. 11,No. 14,No. 18,No. 23,and No. 29) were randomly selected as examples to display the related results in this section.

Driver Fingerprinting Differences of Measurements Using
UCPs. We took SDLP as an example to display driver fingerprinting differences of measurements calculated by UCPs. Figure 6 shows the radar charts composed of the quartiles of the SDLP of 6 participants in the awake state and drowsy state.
In Figure 6, all closed lines were not circular but irregular polygons. It meant that the distributions of SDLP calculated by the same calculation parameters of 6 participants were different, which was the outcome of driver fingerprinting differences in lane control behavior. ere were differences in the median of SDLP among participants in the awake state and drowsy state. For instance, in the awake state, the medians of No. 11 and No. 18 were maximum (0.35) and minimum (0.20), respectively. And, in the drowsy state, the medians of No. 11 and No. 18 were maximum (0.41) and minimum (0.24), respectively. Besides, the ranges of SDLP among participants were also different. In the awake state, the differences between the upper and lower quartiles of No. 11 and No. 18 were the maximum (0.18) and minimum (0.07), respectively. In the drowsy state, the differences between the upper and lower quartiles of No. 14 and No. 18 were the maximum (0.21) and minimum (0.12), respectively.

Optimal Drowsiness-Detection Measurements of Individual Participants.
All samples were divided into the training sample set and testing sample set, with the proportions of 70% and 30%, respectively. Using stratified random sampling, 70% of the data in awake and drowsy states were, respectively, selected to form the training sample set, and the remaining data were the testing sample set. e stratified random sampling was repeated three times, and the final result was taken as the average value of repeated calculations. Table 3 presents the results of models in Figure 4 of the example participants. And in Table 3, |Z| is |Z-sta-tistics| of measurements calculated by IDBCPs, and * means P value < 0.05.
As shown in Table 3, most measurements using IDBCPs could distinguish drowsy driving (P value < 0.05). However, for some participants, although the measurements' calculation parameters were optimized, the measurements calculated by IDBCPs could not still distinguish drowsy driving (P value < 0.05). For example, the P value of SDSWM of No. 11 exceeded 0.05. erefore, SDSWM could not be selected to compose an optimal drowsiness-detection measurement set of No. 11. Participant No. 14 was chosen as an example to display the optimization process of SRR's calculation parameters. e training samples of SRR were input into the model in Figure 4. As is shown in Figure 7, the fitness value gradually increased with iterations, which indicated that the model could optimize the calculation parameters and improve the drowsiness-identification performance of measurements.

Comparison of Measurements Calculated by IDBCPs and
UCPs.
e testing samples were used to analyze the change of drowsiness-detection performance of measurements calculated by IDBCPs. In Table 4, we took SDSWM as an example, and |Z| of SDSWM calculated by UCPs (|Z|-UCPs) and IDBCPs (|Z|-IDBCPs) were, respectively, listed. In Table 4, for every participant, |Z|-IDBCPs was bigger than |Z|-UCPs, which meant that using IDBCPs to calculate SDSWM could improve the contribution of SDSWM for drowsiness detecting. However, the improvement extent for participants was various, that of No. 18 and No. 29 was, respectively, maximum (2.33) and minimum (0.66). It could   (17) No. 11 (23) No. 14 (31) No. 18 (30) No. 23 (35) No. 29 (41) Measurements To study the distribution differences between measurements calculated by UCPs and that calculated by IDBCPs, in Figure 8, we chose No. 18 to, respectively, draw the boxplots of SDSWM calculated by UCPs and IDBCPs in the awake state and drowsy state.
In Figure 8, whether the SDSWM was calculated by UCPs or IDBCPs, the median of the SDSWM in the drowsy state was greater than that in the awake state. Although there is still an overlap between SDSWM of awake state and that of drowsy state, by performing the Wilcoxon test on SDSWM of awake state and that of drowsy state, P value was less than 0.05, which illustrated differences between SDSWM in waking state and SDSWM in the drowsy state was statistically significant, and drowsy driving could be distinguished by using SDSWM. It indicated that the drowsiness impaired the ability to control the steering wheel and reduced the stability of the steering wheel movement, which was consistent with previous study conclusions [12]. Besides comparing with SDSWM calculated by UCPs, the overlap decreases when using IDBCPs to calculate SDSWM. e difference between the median of SDSWM calculated by UCPs of the awake state and that of the drowsy state was 0.05, while the difference between the median of SDSWM calculated by IDBCPs of the awake state and that of the drowsy state was 0.07. e result showed that when measurements were calculated by IDBCPs, distribution differences between measurements in the awake state and that in the drowsy state were more significant, and its drowsiness-detection performance was stronger.

Verification of Individual Driver's Optimal Drowsiness-Detection Measurements.
Drowsiness-detection advantages of measurements calculated by IDBCPs were verified by comparing drowsiness-detection accuracies of DF_U and DF_I. For the individual participant, the optimal drowsiness-detection measurements calculated by IDBCPs were fused to build DF_I, and the same drowsiness-detection measurements calculated by UCPs were also fused to build DF_U. For individual drivers, according to stratified sampling, 80% of the samples were, respectively, extracted from measurements in the awake state and that in the drowsy state to compose the training sample set, and the rest constituted the testing sample set. e training samples were used to train driver-specific drowsiness-detection models based on the Fisher discriminant algorithm. In Table 5, for DF_U and DF_I, we displayed the mean, standard deviation, minimum, and maximum of drowsiness-detection results across all participants.
As shown in Table 5, the related drowsiness-detection results of DF_U were better than that of DF_I. e mean drowsiness-detection accuracy across 35 participants of DF_I was 91.06%, which was higher than that of DF_U (85.25%).
e results indicated building driver-specific models based on measurements calculated by IDBCPs could improve drowsiness-detection accuracies.
For every participant, Figure 9 shows the comparison of the drowsiness-detection accuracy of DF_U and DF_I. In Figure 9, for each example participant, the accuracy of DF_I was higher than that of DF_U. Besides, for different participants, the accuracy improvements equalling the accuracy of DF_I minus the accuracy of DF_U were various. e accuracy improvement of No. 6 was maximum with the value of 8.11% (DF_U � 81.73%, DF_I � 89.84%), whereas that of No. 34 was minimum with the value of 1.56% (DF_U � 86.89%, DF_I � 88.45%). e possible interpretation was that the IDBCPs of some participant were similar to UCPs and the improvement of the drowsinessdetection performance of measurements using IDBCPs is limited.

Discussion
By summarizing the above results, important insights and contributions to the development of drowsiness-detection methods considering driver fingerprinting differences were obtained. ere are individual differences in data distribution of measurements using UCPs (Figure 6), which is consistent with the previous research [37]. e reason for this phenomenon is that there are differences in drivers' raw driving behavior data, which is the typical manifestation of driver fingerprinting. Similar to the studies [33,34], it is indicated that individual drivers have unique characteristics in driving behavior and physiology, and there are differences in the driver fingerprints among the drivers. Drowsinessdetection thresholds are the average of all drivers when all drivers' mixed measurement data are used as a whole to train models. And, driver fingerprinting differences make drowsiness-detection of the individual driver to suffer from systematic error. For example, the individual driver's drowsiness-detection threshold is below the average threshold (like No. 18 in Figure 6), the driver's drowsiness cannot be detected when measurements exceed this individual driver's drowsiness-detection threshold but not reach the average threshold of all drivers. erefore, it is worthy to study accurate drowsiness-detection methods considering driver fingerprinting differences. e major achievement of this paper is to propose a model to optimize individual driver's calculation parameters of measurements considering driver fingerprinting differences and extract individual driver's optimal drowsinessdetection measurements. It is found that the IDBCPs of measurements among drivers are significantly different (Table 3), which is attributed to driver fingerprinting differences among drivers. Figure 8 confirms that using IDBCPs to calculate individual drivers' measurements can enhance the contribution of measurements for detecting drowsy driving. Similar to existing studies [18,32,36], every driver has unique driver fingerprinting characteristics, and the effect of drowsiness on the driving behavior of each driver is also different [10], which causes many individual differences in the raw data related to the drowsiness. For example, the time length of the drowsy state, duration of typical drowsy driving behavior characteristics, and distribution of drowsiness-related driving behavior data of each driver are different. Without considering the driver fingerprinting differences, using UCPs to calculate measurements causes measurements to fail to accurately reflect drowsy driving behavior characteristics of individual drivers, which limits the drowsiness-detection performance of nonintrusive measurements at the individual driver level.
us, it is suggested to optimize measurements' calculation parameters of individual drivers to obtain IDBCPs suitable for individual driver's driver fingerprinting characteristics at different drowsiness levels, which can strengthen the sensitivity of measurements to drowsiness.
Comparison results of drowsiness-detection accuracy in Figure 9 illustrate that DF_I outperforms DF_U, which verifies the drowsiness-detection advantages of using IDBCPs to calculate individual driver's measurements. e reason is that DF_I utilizes measurements calculated by IDBCPs with stronger drowsiness-identification performance to build models. e previous study establishes the driver-specific drowsiness-detection model in a simulated driving environment using measurements calculated by UCPs [39]. Although the accuracy of DF_U (84.16%) is lower than that of previous research (88.60%), there are many interferences in the real driving environment increasing complexities and difficulties of detecting drowsiness. us, the drowsiness-detection accuracy of DF_U is ideal. Moreover, the average drowsiness-detection accuracy of DF_I is higher than that of the previous research [39], and accuracies of some drivers reach more than 93.00% (No. 14 and No. 23 in Figure 9). It is demonstrated that the drowsiness of individual drivers can be reliably detected with higher accuracy using nonintrusive measurements calculated by IDBCPs, which is also inferred in the previous study [18].
As mentioned in the existing studies, the calculation and selection of measurements is an important factor affecting drowsiness-detection accuracy [26,39]. erefore, the model and results of this paper can be applied to develop personalized anti-drowsiness systems in real-world conditions. Individual driver's optimal drowsiness-detection measurements calculated by IDBCPs should be used to train driver-specific drowsiness-detection models. According to the results in Figure 9, the drowsiness-detection accuracy can be improved by using measurements calculated by IDBCPs having higher drowsiness-detection performance. It is worth highlighting that for some drivers whose accuracy improvement of DF_I is high, it is more indispensable to build the driver-specific drowsiness-detection models using measurements calculated by IDBCPs.

Conclusions
Due to driver fingerprinting differences, the IDBCPs matching individual driver's behavior characteristics may be very different from UCPs, and using IDBCPs to calculate measurements enhances measurements' drowsiness-detection performance for individual drivers. erefore, the purpose of this paper was to propose a model to optimize measurements' calculation parameters of individual drivers. Firstly, the naturalistic driving data and KSS of 35 participants were collected through field experiments. Eight nonintrusive measurements related to drowsiness were chosen and computed using the doublelayer slide time window. en, based on Wilcoxon test and GA, we established a model (Figure 4) to extract individual drivers' optimal drowsiness-detection measurements. Finally, based on the Fisher discriminant algorithm, the two kinds of driver-specific drowsinessdetection models were built using measurements calculated by UCPs and IDBCPs, respectively. And, drowsiness-detection advantages of using IDBCPs to calculate measurements were illustrated.
In the present paper, we obtained individual driver's optimal drowsiness-detection measurements calculated by IDBCPs, and the results of example participants are shown in Table 3. Figure 6 verifies that there are individual differences in nonintrusive measurements calculated by UCPs, which is attributed to the driver fingerprinting differences among drivers. is paper verifies that for individual drivers, the drowsiness-detection performance of measurements calculated by IDBCPs is better than that of measurements calculated by UCPs. In addition, the results showed that the average drowsiness-detection accuracy of DF_I was 6.25% higher than that of DF_U, which indicated that using nonintrusive measurements calculated by IDBCPs to establish the driver-specific drowsiness-detection models improved drowsiness-detection accuracy at the individual driver level.
Admittedly, there are some limitations in this paper. For instance, we used participants' self-report KSS as ground truth for drowsiness, and the drowsy levels were relatively subjective. Besides, the amount of experimental data of some individual drivers was not very sufficient which led to insufficient model training and not very high drowsiness-detection accuracy for some participants. For a new user, firstly, we need to label the data with drowsiness and awakeness, which is cumbersome and a limitation in practice. However, for some special drivers such as dangerous goods transport drivers and longdistance bus drivers, it is still very worthwhile to establish a personalized model based on measurements calculated by IDBCPs to improve the drowsiness-detection accuracy.
In the future, we will adopt the more objective and precise standard as the ground truth for drowsiness such as electroencephalogram. And, higher performance models such as artificial neural networks and Bayesian networks will be used to build drowsiness-detection models to reach better accuracies. Ultimately, this research can provide references for the calculation of drowsiness measurements using naturalistic driving data, and guide the establishment of drowsiness-detection models considering driver fingerprinting differences, which can accelerate the development of personalized anti-drowsiness driving active safety systems in vehicles. Meanwhile, the efforts highlight the advantages of studying driver fingerprinting differences and promote the application of driver fingerprinting in the field of dangerous driving behavior.

Data Availability
e raw data used in this study are available from the corresponding author on request.

Conflicts of Interest
e data indicated in the findings have not been made available due to data privacy.