Statistical Optimization of Process Parameters for Lipase-Catalyzed Synthesis of Triethanolamine-Based Esterquats Using Response Surface Methodology in 2-Liter Bioreactor

Lipase-catalyzed production of triethanolamine-based esterquat by esterification of oleic acid (OA) with triethanolamine (TEA) in n-hexane was performed in 2 L stirred-tank reactor. A set of experiments was designed by central composite design to process modeling and statistically evaluate the findings. Five independent process variables, including enzyme amount, reaction time, reaction temperature, substrates molar ratio of OA to TEA, and agitation speed, were studied under the given conditions designed by Design Expert software. Experimental data were examined for normality test before data processing stage and skewness and kurtosis indices were determined. The mathematical model developed was found to be adequate and statistically accurate to predict the optimum conversion of product. Response surface methodology with central composite design gave the best performance in this study, and the methodology as a whole has been proven to be adequate for the design and optimization of the enzymatic process.


Introduction
Triethanolamine-(TEA-) based esterquat has been the primary ingredient in European fabric softeners and is becoming the global molecule of choice for various industries [1]. Esterquat cationic surfactant was considered as a type of biodegradable material utilized as a textile softening agent.There have been an increasing number of researchers who concern the biodegradable esterquat cationic surfactant since the beginning of the 1990s [2]. In addition, they are highly biodegradable and biocompatible because their ester bonds are easily hydrolyzed [3][4][5]. Besides biodegradability additional advantages such as excellent softening properties, suitability for various fabrics, and simple preparation procedures have been discovered by the use of esterquat cationic surfactants as textile softening agents [2].
In this work, triethanolamine and oleic acid were chosen as substrates to design an optimal model reaction which will lead to high conversion rate utilizing lipase from Candida antarctica (Novozym 435) as a biocatalyst in the organic solvent system. The investigated reaction conditions included enzyme amount, reaction time, reaction temperature, the molar ratio of substrates, and agitation speed. The major aim of this study was to model the effect of process parameters on the reaction yield. The most important stages in a process were modeling and optimization to improve a system and increase the efficiency of the process without increasing the cost. All process parameters are selected to conduct the optimization by using response surface methodology (RSM) [6][7][8][9][10]. The optimization of process has been reported by artificial neural network (ANN) in our previous study [11]. Subsequently, the simulated result in optimum conditions 2 The Scientific World Journal from the response surface methodology (RSM) and ANN was compared.
Prior to doing the statistical analyses by RSM, experimental data were inspected and explored the nature of variables by several normality tests. It has been observed that only a few researchers have paid attention to the application of right and accurate statistical techniques in order to validate experimental data [12]. Statistical methods are based on various underlying assumptions. One common assumption is that a random variable is normally distributed. In many statistical analyses, normality is often conveniently assumed without any empirical evidence or test. However, normality is critical in many statistical methods. Testing of assumptions usually involves obtaining descriptive statistics on variables [13]. Descriptive statistics provide important information about variables to be analyzed. Mean, median, and mode measure central tendency of a variable. Measures of dispersion include variance, standard deviation, and range. Researchers may draw a histogram, stem-and-leaf plot, or box plot to see how a variable is distributed [14]. When this assumption is violated, interpretation and inference may not be reliable or valid [15]. The usual processes in the statistical assessment of a data set are as follows: screen the data for outliers or blunders, plot the data to detect asymmetry and tail weight, calculate the indices of sample shape (i.e., skewness and kurtosis), perform tests of normality, and if the data is normal use parametric statistics for further analysis [15]. In order to test the validity of a normal distribution, quantitative tests need to be employed, such as Kolmogorov-Smirnov, Liliefors, and Shapiro-Wilks. In this study, normality tests also included the Kolmogorov-Smirnov (Lilliefors modification) and the Shapiro-Wilk for checking the normal distribution validity of variables.

Materials.
Novozym 435, Candida antarctica lipase B immobilized on a macroporous acrylic resin (10,000 propyl laurate units per gram), was purchased from Novo Nordisk A/S (Bagsvaerd, Denmark). The enzyme is a granular product with a particle size of 0.2-0.6 mm. The bulk density of Novozym 435 is 350-450 kg/m 3 . n-Hexane obtained from J. T. Baker (USA) was used as the organic solvent. Oleic acid and triethanolamine were purchased from Merck, Germany. All other chemicals used in this study were of analytical reagent grade.

Experimental
Design. The optimization study was carried out in accordance with the experimental design with 5 factors and 5 levels with 50 experimental points. The fractional factorial designs consisted of 32 factorial points, 10 axial points (two axial points on the axis of each design variable at a distance of 1.75 from the design center) and eight center points. The generalized response surface model is shown by (1), and the variables and their levels selected for the study were represented in Table 1: where (conversion %) represents the response variable, 0 is the constant term, represents the coefficients of the linear parameters, represents the variables, represents the coefficients of the quadratic parameter, represents the coefficients of the interaction parameters, and is the residual associated to the experiments.

Enzymatic Esterification and Analysis of Samples.
The reactions were performed in 2000 mL reactor, and specified volumes of hexane were added as solvent. The reactor consisted of a screw cap and a glass flask with a capacity of 2 liters and an inner diameter of 10 cm. A four-bladed impeller (4.5 cm in diameter) was immersed in the reaction mixture a 2 cm height from the bottom of the flask to provide agitation effect. The impeller was connected by a shaft to motor for speed controlling purpose. A baffle was connected to the cap and immersed in the reaction mixture. The reaction temperature was controlled by immersing reactor in a temperaturecontrolled water bath. The reactions were catalyzed by various amounts of Novozym 435 from 1.5 to 8.5% w/w of oleic acid for experimental design at different temperature (51. .75 ∘ C) and agitation speed (137.5-662.5 r.p.m.) values. The studied ranges of the substrates were 708 mmol for OA as a constant amount, while concentrations of TEA were varied according to Table 1 for the experimental design. All experiments were carried out in the range of 2-30 h, as shown in Table 1. The basic points for the design were selected from a preliminary study in laboratory scale [16] by using Taguchi design (data not shown).
At the end of the reaction periods, 30 mL aliquot was withdrawn from the system using a syringe. The reaction sample was terminated by dilution with 10 mL of ethanolacetone (50 : 50, v/v). The enzyme particles were then separated by filtration, and the remaining free acid in the reaction mixture was determined by titration of the aliquots of reaction mixture against standard NaOH. The amount of reacted acid was determined from the values obtained for the control (without enzyme) and test samples. The ester formed was expressed as equivalent to conversion of the acid [17]. The ester formation was confirmed by thin-layer chromatography (TLC) using chloroform : methanol (95 : 5) solvent system. Further identification for ester formation was The Scientific World Journal 3

Testing Experimental Data for Normality.
Normal is used to describe a symmetrical, bell-shaped curve, which has the greatest frequency of scores in the middle, with smaller frequencies towards the extremes [18]. Normality can be assessed to some extent by obtaining skewness and kurtosis values. Table 2 shows descriptive statistics to check the skewness and kurtosis values for five variables at three levels of each of them. For other levels conversions percentage was constant when variables were placed in ±1.75 levels, and they have been omitted.
The results showed that skewness ranged between −0.925 and 0.532 (acceptable range of normality is between −2.0 and +2.0). The values of kurtosis ranged between −0.848 and 1.111 (acceptable range of normality is between −5.0 and +5.0) [19]. As a result, the skewness and kurtosis values indicate almost normal distribution. However, these descriptive statistics do not provide conclusive information about normality, and testing normality needs to use some other statistics tests. SPSS software provides two different statistics for testing normality. The Shapiro-Wilk and Kolmogorov-Smirnov tests were used for data distribution analysis. Both tests similarly demonstrated that the data set was normally distributed. As shown in Table 3, the values of Shapiro-Wilk and Kolmogorov-Smirnov tests confirm null hypothesis that the variable are normally distributed ( ≥ 0.05). Since the number of observations is less than 2,000, however, Shapiro-Wilk test will be appropriate to this case.  Table 4. Evaluation of coefficients of the empirical models and their statistical analyses were carried out using central composite design.
Fitting of the data to various models (linear, 2FI, quadratic, and cubic) and their subsequent analysis of variance showed that TEA-based esterquat synthesis was most suitably described with a quadratic model. The model was modified based on the insignificancy of some model terms. The final reduced model to predict the conversion % of TEA-based esterquat catalyzed by Novozym 435 is shown as follows: (Conversion%) = 53.56 + 1.
where matches product conversion % and 1 , 2 , 3 , 4 , and 5 match to coded values for the enzyme amount (% w/w), reaction time (h), reaction temperature ( ∘ C), the molar ratio of substrates (mole), and agitation speed (r.p.m.), respectively. The positive sign in front of the terms indicates a synergistic effect while the negative sign indicates an antagonistic effect. Negative values of coefficient estimates denote negative influence of parameters on the reaction. It was observed that all the linear coefficients from the model gave positive effect except the coefficient estimate for the molar ratio of substrates ( 4 ) in the model of percentage conversion. This may be due to that the percentage of conversion was negatively affected by the presence of the higher ratio of oleic acid as the ratio of oleic acid/triethanolamine. From the equation, the conversion of enzymatic reaction has linear and quadratic effects by the five process variables. The model was found to have coefficient of determination value ( 2 ) of 0.9201, which means that 92.01% of the total variation in the results was attributed to the independent variables investigated. When 2 approaches unity, the better empirical model fits the actual data [20]. Normally, a regression model having an 2 value higher than 0.9 was considered as model having a very high correlation [21]. Hence, the 2 value in this regression model is relatively high, which indicates a good agreement between predicted and experimental conversion of TEA-based esterquat reaction. Figure 1 summarizes correlation between experimental values and predicted values by using the developed model. Figure 1(a) shows the actual values versus predicted values of the product conversion %, which indicated a good agreement between actual and predicted responses. A residual plot allowed visual assessment of the distance of each observation from the fitted line (Figure 1(b)). The residuals randomly scattered in a constant width band about the zero line. Figure 1(c) shows the histogram of the residuals in  The Scientific World Journal allowed visual assessment of the assumption. As observed, the measurement errors in the response variable were normally distributed, and the histogram of the residuals revealed a normal distribution overlay. Statistical analysis based on ANOVA for the response surface quadratic model is presented in Table 5. The value for the model is less than 0.05, which indicates that it is a significant and desirable model. Besides, the value of < 0.0001 indicates that there is only a 0.01% chance that a "modelvalue" this large could occur due to noise in the experiments. The "Lack of Fit -value" of 0.43 implies that lack of fit is not significant relative to pure error. Thus, it is possible quantitatively judge if the model represents the observations satisfactorily. (2) show that interactions between variables have significant effect on the conversion% of enzymatic reaction of TEAbased esterquat. Therefore, instead of studying single variable the interactions will be investigated, which is significant and important for a comprehensive optimization study. Figure 2(a) shows the effects of different reaction time and agitation speed on the conversion % of product in threedimensional surface response. Generally, increased reaction time and agitation speed resulted in an increase percentage of conversion until agitation speed reached 523 r.p.m. The response started to decrease after the agitation speed exceeded 523 r.p.m. even at the higher reaction time. However, it was observed that reaction time showed a significant effect to the reaction conversion at the higher agitation speed. Increasing agitation speed had increased the external mass transfer rates between the bulk phase of the reaction mixture and surface of enzyme; moreover, higher reaction time also promoted collision time between enzyme and substrate molecules. As shown in Figure 2(b), the reaction with the enzyme amount of 5.80% w/w led to the maximum percentage of conversion. Response surface plot for interaction between enzyme amount and reaction temperature was generated with reaction time fixed at 16 h, the molar ratio of substrates (OA : TEA) 2 : 1 mole, and agitation speed 400 r.p.m. The percentage conversion of product increased by increase ongoing from 3 to 5.80% w/w and thereafter decreased with further increase to 7% w/w. However, higher temperatures tended to induce enzyme inactivation due to denaturation processes [22,23]. These results were similar to those in most reviewed papers, namely, that Novozym 435 was optimally used at temperatures between 40 ∘ C and 60 ∘ C [24,25]. Figure 2(c) represents the effect of varying amount of enzyme and agitation speed on the synthesis of TEA-based esterquat with constant condition for other independent variables (reaction temperature of 60 ∘ C, reaction time of 16 h, and substrate molar ratio of 2 : 1 mole). From Figure 2(c), while the enzyme amount and agitation speed increased, the conversion of esterquat was increased as the agitation speed reached 523 r.p.m. in the enzyme amount of 5.80% w/w. However, the effect of enzyme amount variable was lower than the effect of agitation speed variable. Increase in agitation speed caused the substantial increase in the specific interfacial area between the substrate and the enzyme present in the nonaqueous phase by reducing the droplet size [26,27]. A negative effect in percentage of conversion was detected with agitation speed greater than 523 r.p.m. This may be due to adverse shear effect caused by impeller at higher agitation speed. Typically, the immobilized enzyme was driven radially from impeller against the wall of the reactor, forcing the breakage, especially at high agitation speed [28]. Finally, Figure 2(d) shows the effect of varying the amounts of enzyme and molar ratio of substrates on the esterification reaction of oleic acid and triethanolamine while reaction time and reaction temperature are fixed at 16 h and 60 ∘ C, respectively. It was shown that the maximum conversion of esterquat was obtained when the enzyme amount was 11.6 g and increased with the lower molar ratio of substrates. However, increase in acyl donor showed less significant increase in the esterification conversion, on the other hand, and resulted in slight decrease of percentage conversion at the high amount of enzyme 14 g and the molar ratio of substrates of 3 : 1 mole. This was due to the limiting factor caused by triethanolamine, which was significant at the high amount of oleic acid and hence reduced the percentage of conversion.

Optimization by Response Surface Methodology and Model
Validation. The next step in the present study was to determine the effects of five independent variables (enzyme amount, reaction time, reaction temperature, molar ratio of substrates, and agitation speed) shown in Table 6, along with the mean predicted values for enzymatic reaction product. For this purpose, the response surface methodology, using a central composite design, was adopted for finding optimal conditions. Experiment was then carried out under the recommended conditions and resulting response was compared to the predicted values. The optimum reaction parameters were enzyme amount of 4.77% w/w, reaction time of 24 h, reaction temperature of 61.9 ∘ C, substrates molar ratio (OA : TEA)   The Scientific World Journal be used to adequately describe the relationship between the independent variables and response.

Conclusion
In the present paper, RSM was used to optimize the enzymatic reaction conditions. A central composite design was applied to optimize the experimental conditions for synthesis of TEAbased esterquat at 2000 mL scale. The normality test was investigated as an initial step in process capability studies for better results and higher accuracy. Considering normality tests, the results indicated that all of the data and distributions were close to expected values under normality. The variables include enzyme amount, reaction time, reaction temperature, substrates molar ratio, and agitation speed. Quadratic mathematical model was suggested for synthesis of TEA-based esterquat. Analysis of variance corroborates the accuracy of the model by using high value (33.60), very low value (<0.0001), nonsignificant lack of fit, and the coefficient of determination ( 2 = 0.9201). A conversion percentage of 63.57% was attained, which was good compared to the predicted amount of 65.08%, with the relative standard error percentage (RSE) 2.32%. The comparison of RSM and ANN (QP) indicated that the RSM had less RSE% rather than ANN (QP) method (3.98%). The methodology as a whole has proven that RSM is adequate for the design and optimization of the enzymatic process.