Operating Comfort Prediction Model of Human-Machine Interface Layout for Cabin Based on GEP

In view of the evaluation and decision-making problem of human-machine interface layout design for cabin, the operating comfort prediction model is proposed based on GEP (Gene Expression Programming), using operating comfort to evaluate layout scheme. Through joint angles to describe operating posture of upper limb, the joint angles are taken as independent variables to establish the comfort model of operating posture. Factor analysis is adopted to decrease the variable dimension; the model's input variables are reduced from 16 joint angles to 4 comfort impact factors, and the output variable is operating comfort score. The Chinese virtual human body model is built by CATIA software, which will be used to simulate and evaluate the operators' operating comfort. With 22 groups of evaluation data as training sample and validation sample, GEP algorithm is used to obtain the best fitting function between the joint angles and the operating comfort; then, operating comfort can be predicted quantitatively. The operating comfort prediction result of human-machine interface layout of driller control room shows that operating comfort prediction model based on GEP is fast and efficient, it has good prediction effect, and it can improve the design efficiency.


Introduction
As a complicated human-machine environment system, cabin is the working space to perform the task of observation and manipulation, centralizing display instrument, manipulator, signal, alarm, and other terminal interfaces, for example, aerospace manned cabin, deep-sea submersibles cabin, engineering machinery cab, drilling rig control room, control room in nuclear power plant, and automobile cabin. The internal structure of these cabins is complex; the operators need to rely on vision, hearing, and touching to get information from the instruments and the outside world and make judgments quickly then immediately through locomotive organ such as hand and foot to manipulate correctly. In this state, the comfort of operating posture is an important factor to affect the operators' work load, fatigue, health, and even safety, which should be considered in the humanmachine interface design with emphasis. In the humanmachine interface design for cabin, the layout design of all sorts of manipulators directly determines different operating postures, and the different operating postures directly affect operating comfort, convenience, and efficiency. Comfortable operating posture is advantageous to keep good matching relation between locomotive organ and manipulators. Therefore, the operating comfort is an important basis to evaluate layout design of human-machine interface for cabin [1].
Comfortable feeling is a kind of subjective feeling combined with the experience between physiological and psychological perception and affected by various factors such as work environment, duration, and different task [2][3][4][5]. The comfort evaluation research is usually divided into two categories: subjective evaluation and objective evaluation. Through a variety of evaluation scale directly, subjective evaluation has the advantages of low cost and simple operation steps. However, relying on subjective description of different subjects, the feeling of subjects themselves, the difference of patience, the mood, and other subjective factors and test environment have influence on the comfort evaluation results obtained by statistical analysis; the evaluation results given by different subjects may have difference [6][7][8][9][10][11]. Objective evaluation can be proceeded by physical model experiment using ergonomic experiment equipment such as 3D motion capture system and pressure distribution measurement system and through anthropometry, biomechanical characteristic, body pressure distribution, operating posture and so on; comfortable feeling can be obtained based on observation, recording, and measuring objectively and indirectly [12][13][14][15][16][17].
At present, operating posture research focuses on using camera, driving posture monitoring system, 3D motion analysis, and other equipment to conduct the measurement, statistics, and analysis of postures. This kind of method is time-consuming and expensive and cannot be used in the design process early. Moreover, there is lack of effective modeling method in the comfort evaluation of operating posture. In order to evaluate the layout design scheme of human-machine interface for cabin in the early stage of design process and reduce the times of rework and production of physical prototype, shorten the cycle from design to manufacturing and cost [18]; on the basis of Chinese adult's body parameters to establish virtual human body model, this paper attempts to use CATIA software to do simulation experiment [19] and simulate and analyze the human operating postures. Then, the relation model of joint angles and operating comfort is established based on GEP algorithm. Through the limb angles to predict the comfort of operating posture, a simple and effective evaluation method is provided to evaluate the human-machine interface layout.

Prediction Method of Operating
Comfort Based on GEP 2.1. Outline of the Proposed Method. The implementation steps of the proposed method are shown in Figure 1. First of all, take CATIA software as the simulation platform, build the library of layout design model, human body model, and operating posture, simulate the operating process, output the limb angles of head, chest, waist, shoulder, upper arm, and forearm in various operating postures, and use the ergonomic evaluation module of CATIA to evaluate the comfort. Second, analyze the data collected from simulation, factor analysis is used to reduce the dimension of variables affecting the operating comfort, effectively reduce the input variables of prediction model, and eliminate the correlation between the input data. Then, using the function finding ability of GEP algorithm, establish the relation model of joint angles and operating comfort. Finally, the example of the operating comfort prediction of driller in driller control room verifies the effectiveness of the proposed method. to establish a comfort model of operating posture. Using mathematical method to describe each part size of the human body and relative position, it is used for the analysis of working posture and operating range and has nothing to do with the characteristics of the human body volume [20]. Joint connects different limb and is the pivot to transmit force and torque, allowing the body to move normally. Taking the joints as points and the bones between the joints as chains, the human torso and limbs are connected, and the skeletal system of human body modeling is formed. As shown in Figure 2, the human body skeleton model is simplified as follows: the Computational Intelligence and Neuroscience upper limb is divided into upper arm, forearm, and hand; lower limb is divided into thigh, leg, and feet; trunk is divided into head, shoulder, chest, and waist.

The
Comfort of Joint. The human body's different joint angles form different postures. To study comfortable working posture, the comfortable joint angles of the human body should be studied firstly. Most operation in cabin is given priority to sitting posture, so this paper chooses six parameters which are the most close to operating comfort of upper limb as variables of ergonomic characteristic. The six parameters are head, chest, waist, shoulder, upper arm, and forearm. The joint angles between adjacent limbs are used to describe the working posture.
The human body has many joints, and every joint has multiple degrees of freedom (DOF); thus many motions can be realized precisely. Joint motion can be regarded as the rotation around axis; the type of joints determines its form of motion. According to the DOF of the activity of articular surface relative to the joints of each other, the joints are divided into three kinds: DOF 1, DOF 2, and DOF 3. The DOF of joints involved in upper limb operating is shown in Figure 3.
Range of motion (ROM) of joints depends on statistical numerical [21]; it can be expressed as ≤ ≤ , = 1, 2, . . . , , where and are the lower limit and upper limit of ROM of joints, respectively, and is the number of joints. Discomfort is the value function where the respective deviation from the center position of a joint determines the discomfort. The comfort of center position of joint is expressed as ; then the discomfort relative to comfortable position can be represented as − . The comfortable feeling of each joint is not the same; weight should be introduced to express the relative comfortable degree between each joint; weighted value is taken as judgment function of comfort: where ∑ =1 = 1. Driving posture is an important issue in vehicle design process; many scholars and institutions have carried out a number of studies on optimal driving posture, preferred angles, seat comfort, and so on. Referring to the research results in vehicle driving posture and ergonomics [22], the ROM is divided into three levels: comfortable range, less comfortable range, and uncomfortable range, and the specific angles are shown in Table 1. In different operating postures, the changes of joint angles of each of the human body parts are different. The division of comfort of joint angles is the basis of operating comfort evaluation as in Table 1.

Brief Introduction of GEP.
Refer to biological genetic gene expression patterns, Portuguese scientist Candida Ferreira proposed a revolutionary new member in the evolutionary computation family-GEP, which combined with the advantages of genetic algorithm and genetic programming [23,24]. Because GEP is not dependent on the specific areas of problems, it has very strong robustness for kinds of problems, and it is widely applied to formula discovery, function mining, association rules discovery, factorization, sunspots forecasting, and other fields.
GEP is a kind of new data mining technique, which has ideal efficiency. Because GEP combines the advantages of genetic algorithm and genetic programming, its efficiency is higher than GA or GP 2∼4 orders of magnitude in solving complex problems. Using adaptive random search method, GEP is able to discover formula which can describe data inherent law from the data, without relying on any Upper arm prior knowledge, showing strong accuracy and universality.
Research on improvement and application of GEP algorithm has been attracting more and more attention, but the GEP has not been applied in the field of operating comfort prediction. So, this paper attempts to carry out the research on operating comfort prediction model based on GEP.
In GEP, a computer program is coded into fixed length of linear symbol strings and then when calculating the individual fitness, chromosomes (genotype) are expressed as different shapes and sizes of expressing tree (phenotype). Gene is the basic unit to constitute chromosome. Chromosome represents the feasible solution to solve the problem and consists of one or more genes. Formalization definition of gene can be expressed as a six-tuple: where Ω represents the set of genetic elements, namely, linear string. represents the function set and contains mathematical function and custom function and so forth. represents the terminal set and generally includes variables, nonparameter functions, and constants. Φ represents the set of genetic operators, such as variation and translocation. ℎ represents the length of gene head; the length of gene tail and total length of gene can be calculated by the length of gene head. represents the fitness value, calculated by the fitness function.

The
Steps of GEP Algorithm. As one kind of evolutionary algorithm, the operation process of GEP is similar to genetic algorithm. First of all, randomly generate initial population containing a certain number of individuals, and evaluate the fitness of these chromosomes. Then, on the basis of the fitness valve, choose the individuals as the next generation of population, conduct genetic operation on the selected individuals, and generate offspring with new features. The new individuals enter into the next round of the survival of the iterative process, and the process is repeated until the terminal condition is satisfied. The main steps of function mining by GEP are as follows [25].
Step 1. Code the individuals and create the initial population. Population contains a number of individuals (chromosomes), and chromosome is composed of more than one gene (see Table 2). Genes are connected by linking function. Head contains function set ( ) or terminal set ( ), and tail contains terminal set ( ).
Step 2. Calculate the fitness value. The fitness value of each individual is calculated by fitness function. And the fitness value reflects the extent of excellence of individual to achieve the optimal solution in the course of evolution. If the optimal individual meets ending condition, it should be transferred to the output and if not, it should be transferred to genetic operation steps, and then it produces offspring with new characteristics. In GEP, in order to evaluate the matching degree between the data calculated by the expression and training data, Ferreira put forward two evaluation models: the fitness function based on absolute error and the fitness function based on relative error.
Step 3. Carry out a series of genetic operations to produce a new generation of population using evolution principles to Computational Intelligence and Neuroscience 5 guide the evolution, including (1) keeping the best individual, (2) selection, (3) replication, (4) mutation, (5) transposition, and (6) recombination. If the mutation occurs in gene head, all the symbols can be selected. If the mutation happens in gene tail, only the terminal symbol can be selected. The snippets or string (transposable elements) combined by adjacent gene elements will be inserted into the other position of the chromosomes by transposition operator. Basic transposition operator contains IS, RIS, and gene transposition. Recombination is the process where two chromosomes will be selected from parent chromosomes randomly, interchange of some components, and generate new offspring. Basic recombination operator contains 1-point, 2-point, and gene recombination.

Case
Take the operating comfort evaluation of console layout of a certain type drilling rig as an example to illustrate the implementation of the proposed method. Use CATIA software to establish the human body and the product model, integrate the human physiological characteristics, simulate the operating postures, and realize the visualization of dynamic process in human-computer interaction. At the same time, making full use of ergonomic evaluation criteria and algorithm, operating comfort is analyzed and evaluated quantitatively. Using the data obtained from the simulation evaluation as training sample and validation sample, the relation model of joint angles and operating comfort is established based on GEP. Through limb angles to predict the comfort of operating posture, the basis of evaluation and optimization of human-machine interface layout design for cabin is provided.

Establish the Chinese Virtual Human Body Model in CATIA.
The ergonomic design module of CATIA integrates four submodules: the human builder, human measurements editor, human activity analysis, and human posture analysis [26]. Human joints exist the maximum ROM, the software can test whether the location of manipulators within the reachable area; and quantify the comfortable degree. But CATIA only has five kinds of the human body model: the United States, Canada, France, Japan, and Korea. Therefore, in order to get accurate evaluation results, the Chinese virtual human body model needs to be established firstly. Create a data file of the human body model which must follow certain form. A population file contains four segments; the form is as follows: The segment of MEAN STDEV needs each measurement numerical value of Chinese adult body size, including the mean and the standard deviation. Each item takes up one line with the pattern of "⟨variable⟩ ⟨mean⟩ ⟨standard deviation⟩" to describe a variable.
The segment of CORR needs correlative numerical values between any pairs of variables; the correlation between any two variables is defined in −1.0∼1.0. It expresses the dependencies between two variables. The absolute value of correlation is higher; the dependencies between variables are higher.
According to the above format, the human body dimension data from the Chinese standard GB10000-88 is wrote in order; a complete database file of the human body dimension can be established. Take .sws as extension name, the file can be uploaded by user defined population database in CATIA. Detailed constructive process can be found in [27]. Before the posture evaluation, the preferred angle and corresponding score of each DOF of joints must be defined. When evaluating the comfort of body parts, based on the angle of DOF and score in current posture, the software will conclude evaluation score by interpolation and weighted arithmetic.

Operating Comfort Evaluation
(1) Set Up the Preferred Angle and Corresponding Score of Each DOF of Joints. According to Table 1, to divide comfortable ROM of head, chest, waist, shoulder, upper arm, and forearm, edit the angle of locomotive parts of the human body, and set up the comfort score. For example, select the upper arm of the virtual human body model in Figure 4, edit the angular limitations and the preferred angles of DOF 2 (adduction/abduction). The range is divided into three levels: comfortable range, less comfortable range, and uncomfortable range. The range is divided into five regions and different levels, respectively, show different colors. Meanwhile, in the blue area (−5 ∘ ∼25 ∘ ), upper arm is comfortable and the score is 9 points, while, in the yellow area (−10 ∘ ∼ −5 ∘ , 25 ∘ ∼60 ∘ ), upper arm is less comfortable and the score is 7 points, whereas, in the red area (−18 ∘ ∼ −10 ∘ , 60 ∘ ∼80 ∘ ), upper arm is uncomfortable and the score is 5 points.
(2) The Score of Operating Comfort. Simulate the driller's operating process (see Figure 5) and evaluate the comfort in different operating posture. According to the operating task, edit the posture of head, torso, arm, leg, and foot. After posture editing (see Figure 6), the quantitative analysis of the virtual human model's posture is carried out. Figure 7 shows the comfort score when the driller of 90 percentile handing control handles. In this operating posture, the comfort score of each part of the human body model is as follows: 9.48 6 Computational Intelligence and Neuroscience

The Statistics and Analysis of Simulation Data
(1) The Statistics of Comfort Score. Take the right hand operating the console in Figure 5, for example; in turn, simulate the posture of operating the 22 manipulators in practical work. Take the average score of each DOF as comfort score of every part; the statistic results of comfort score are shown in Table 3.
(2) Factor Analysis. With 16 joint angles to describe the operating comfort being too complicated, dimension reduction is required. Factor analysis is a method of multivariate statistical analysis, which studies how to make numerous original variables condense into a few factors with the least amount of information loss and how to make the factors have certain named explanatory [28].
First of all, check whether the data is suitable for factor analysis. Input the data in Table 3 into SPSS Statistics 19.0 software; by Bartlett's test of sphericity and Kaiser-Meyer-Olkin (KMO) test, the relationship between variables is tested. The statistics observed value is 416.785 in Bartlett's test of sphericity; since the corresponding probability of value is close to 0, less than the significance level of ( equal to 0.05), it can be regarded as that there is significant difference between the correlation coefficient matrix and unit matrix. The value of KMO is 0.744; according to the KMO metrics provided by Kaiser, the original variables are appropriate to conduct factor analysis.  According to the principle of "usually select the number of eigenvalues as factor number when cumulative variance contribution rate is greater than 0.85," the factors are extracted by the method of principal component analysis. Four factors are extracted; the corresponding cumulative variance contribution rate reaches 86.311% (shown in Table 4) and meets the above principle.
In Table 5, the data shown in bold in each column represent the joint angles as having high loading on the four factors, respectively. For example, 12 , 15 , and 1 have high load on the first factor; the first factor mainly explains these three variables of joint angle. In this way, the dimension of 16 variables is reduced to 4 comfort impact factors and can reflect most of the information of the original variables.
By regression method, the factor score coefficient is estimated. Component sore coefficient matrix is calculated, Computational Intelligence and Neuroscience 7 Factor analysis is used to reduce the variable (namely, joint angle) dimension and eliminate the correlation between the variables so as to reduce the independent variable inputting in GEP later. The data after dimension reduction are shown in Table 6.

Operating Comfort Prediction Model Based on GEP.
Through the analysis of the data in Table 6, study the influence of the joint angles on operating comfort. Then, GEP algorithm is used for data mining, and the operating comfort prediction model of human-machine interface layout for cabin is established.

Select Fitness Function.
The evolution results in every generation are evaluated by fitness function; the individuals with bigger fitness value are retained and have a higher chance to reproduce. According to the characteristics of the problem and based on the mean square error (MSE), this paper constructed the fitness function. The smaller the MSE value the bigger the fitness value of individual. The largest fitness value is 1000. The fitness function is defined as follows: where = (1/ ) ∑ =1 ( − ) 2 is the MSE of experimental samples. is the total number of training set samples; is the output value of the th sample of the th individual, calculated by the mathematical expressions acquired by GEP algorithm modeling; is the target value of the th sample. The larger the fitness value of individual the better the individual. Stop condition of algorithm is the fact that the fitness value of best individual achieves the required accuracy or the program achieves the maximum evolutionary generations.

Determine the Organizational Structure of Individual.
Determine the genetic structure and the length of gene head. According to the complexity of the problem to define the length of gene head ℎ, the length of gene tail and the length of gene head ℎ satisfy the relationship that = ℎ ( − 1) + 1. And represents the maximum number of operation numbers in function set. So it can guarantee that the genes are legitimate. Suppose that a gene is composed of the elements in {+, −, * , /, sin, cos, 1 , 2 , 3 , 4 }, so = 2. If ℎ = 6, then = 6(2 − 1) + 1 = 7. Randomly generate a legitimate GEP gene as follows: where the top six represent the gene head, and the rest represent the gene tail. The corresponding expression tree is shown in Figure 8.
In GEP coding, the length of each gene is fixed, including the front effective K-expression and the back of filler components. From top to bottom and left to right, the expression tree in Figure 8 is traversed, and K-expression of expression tree is obtained. The effective length of the gene has 12 characters: + + sin − * / 4 3 1 2 3 2 . So the mathematical expression is as follows:

Operating Comfort Prediction.
Determine the genetic control parameters before the algorithm running, including the size of population, the upper limit of evolution generation, and the probability of each genetic operator. The main operating parameters of GEP model are shown in Table 7. In function set, Sqrt represents square root; Ln represents return to natural logarithm of a number; 2 represents square; Avg2 represents mean of two variables. In terminal set, 1 , 2 , 3 , and 4 represent four comfort impact factors extracted by factor analysis. 15 groups of simulation data are randomly selected from 22 groups of data in Table 6 as training set and the rest 7 groups as validation set. Through Visual Basic programming, the best individual is obtained after running multiple times, using computer with Inter Core i7-4500U CPU and 8 GB RAM. The best individual expression tree is shown in Figure 9; each expression tree represents a gene, and 3 genes are connected by linking function "+" to form a chromosome. The best individual translated into mathematical expression is as follows: (7) Figure 10 shows the curve fitting of GEP algorithm in the training set. MSE and -square are used to verify the validity of the algorithm and the ability of the prediction. The formulas are as follows: where , , avg represent the predicted value, the actual value, and the actual average value, respectively. The range of -square is [0, 1], the closer to 1, showing that the four variables have stronger ability to predict   the operating comfort. The calculation result shows that the MSE and -square of training set are 0.0009 and 0.9883, respectively. In order to verify the validity of GEP model, the rest 7 groups of data are taken into the model for verification. The curve fitting of validation set is shown in Figure 11. And the MSE and -square are 0.0031 and 0.9538, respectively, which achieve the ideal effect.  to predict the above 22 sets of data, respectively, and the predicted values and the relative error are shown in Table 8. The comparison chart of actual value and predicted value by two kinds of prediction model is shown in Figure 12.

Model
In Table 8, the average relative error of operating comfort prediction obtained by GEP model and BP model is 0.37% and 1.89%, respectively. In comparison, the average relative error of GEP model is smaller and the prediction accuracy is higher. So, the GEP model has high fitting degree.

Results and Discussion.
In order to validate the presented operating comfort prediction method of human-machine interface layout, the method of questionnaire survey is adopted, and the operating comfort of operating 14 manipulators on console shown in Figure 13 is evaluated again. 20 drillers (all men) are invited to evaluate operating comfort with scoring criteria from 0 to 10. The questionnaire results are compared with the prediction results using the evaluation method proposed in this paper. Table 9 shows that the deviation between the score of questionnaire and abovementioned method is among 0.04∼ 0.51, and the average deviation of 14 operating comfort scores is 0.211. Therefore, the operating comfort prediction method of human-machine interface layout for cabin put forward in this paper can accurately predict the drillers' feel and assist the designer to design human-machine interface layout.  In addition, in accordance with the method of establishing the virtual human body model put forward in Section 3.1, other countries, regions, and even special population can be customized, which can be used for ergonomic evaluation of some particular products. The ROM can be divided according to different industries and job characteristics.

Conclusion
Comfort is a kind of subjective feeling, and it is difficult to quantify. In the process of operation, controlled by the feedback mechanism of the human body movement, the body always keeps each joint at a high comfort level as much as  possible. Utilizing this adjustment mechanism, the operating comfort evaluation data is obtained by CATIA software. Then, GEP algorithm is applied for ergonomic analysis of human-machine interface layout for cabin. With 22 groups of evaluation data as the prediction model's data base, according to GEP algorithm to realize the complex functions' automatic modeling, the operating comfort prediction model is established. The example of operating comfort prediction of human-machine interface layout for driller control room proves that GEP has strong nonlinear and global search ability to find function, with the predicted results close to the target, and it has high prediction accuracy. With the limited training samples, GEP also can get accurate results. The comfort prediction model constructs the coupling relationship between joint angles and operating comfort, providing a solution for rapid assessment.