A Wiener-Laguerre model with artificial neural network (ANN) as its nonlinear static part was employed to describe the dynamic behavior of a sequencing batch reactor (SBR) used for the treatment of dye-containing wastewater. The model was developed based on the experimental data obtained from the treatment of an effluent containing a reactive textile azo dye, Cibacron yellow FN-2R, by Sphingomonas paucimobilis bacterium. The influent COD, MLVSS, and reaction time were selected as the process inputs and the effluent COD and BOD as the process outputs. The best possible result for the discrete pole parameter was α=0.44. In order to adjust the parameters of ANN, the Levenberg-Marquardt (LM) algorithm was employed. The results predicted by the model were compared to the experimental data and showed a high correlation with R2>0.99 and a low mean absolute error (MAE). The results from this study reveal that the developed model is accurate and efficacious in predicting COD and BOD parameters of the dye-containing wastewater treated by SBR. The proposed modeling approach can be applied to other industrial wastewater treatment systems to predict effluent characteristics.
1. Introduction
The reactive dye-containing effluents from dye manufacturing and application industries can cause serious environment pollution due to the toxicity and slow degradation of dyes [1]. In addition, the presence of dyes in water is highly visible and affects water transparency and aesthetics even in low concentrations. Therefore, the effluents must be treated before being released into the environment.
In recent years, researchers have shown interests in biological treatment of wastewaters with high concentrations of dyes [2, 3]. Treatment of these polluted wastewaters requires high effectiveness and low cost dye removal processes [4]. Sequencing batch reactor (SBR) is a promising biological system for treating dye-containing wastewaters [5, 6]. This system is cost efficient and flexible to handle different feed characteristics. Furthermore, its operation is easier than other biological methods [1]. However, the SBR process, like other biological processes, is highly nonlinear, time varying, and subject to significant disturbances [7]. Modeling the treatment process can provide better understanding, design, operation, and control of the process [8].
The ability of artificial neural networks (ANNs) in black-box modeling of nonlinear systems with complicated structure has made them the most popular tool for modeling of biological processes [9]. In recent years, recurrent neural networks are developed based on common NARX (Nonlinear AutoRegressive eXogenous model) topology in order to characterize the behaviors of dynamic systems. In such models, the past terms of input and output signals are used as the network inputs [10]. In literature, different applications of NARX networks have been reported for monitoring [11, 12], controlling [13], optimization [14], and simulation [15, 16] of wastewater treatment process plants. In spite of good performances of NARX networks, they are still suffering from some drawbacks such as the need for prior terms information of input and output variables. Furthermore, the presence of output feedback may cause prediction errors to deteriorate the model performances [10].
Wiener-type models are a class of block-oriented representation of nonlinear systems, where a dynamic linear part is cascaded by a static nonlinear part [17]. In recent years, Wiener-type models have been successfully applied to identify nonlinear dynamics in biological processes [18–20]. Employing standard multilayer perceptron (MLP) networks as nonlinear mapping function in Wiener models structure is proposed as an alternative solution to develop nonlinear models [21]. The linear part of Wiener models is often chosen to be as formal autoregressive with exogenous input (ARX) models. In this case, in order to attain high performance, a large amount of training data for estimating the order of model, time delays, and parameters would be required. A big obstacle is the high cost of preparing training data, where special instruments and data acquisition equipment for online system monitoring are required.
A combination of orthogonal functions as the linear dynamic part and artificial neural networks as the memoryless nonlinear part of Wiener models was proposed to deal with this problem [22, 23]. Laguerre functions are the most common orthogonal functions used in Wiener model structure. Simple structure and capability to give accurate description of systems with rather small number of tunable parameters are the reasons they have been widely employed for linear and nonlinear system identification [24].
In this study, a Wiener-Laguerre model is employed to describe the nonlinear behavior of an SBR used for dye wastewater treatment. The proposed method has not been previously used for biological treatment modeling of dye-containing wastewaters. An aerobic bacterium, Sphingomonas paucimobilis, was used for decolorizing of the effluent containing a reactive textile azo dye, Cibacron yellow FN-2R. S. paucimobilis has been shown previously to be efficient for degradation of azo and triphenylmethane dyes [25, 26].
2. Material and Methods2.1. Microorganism
Sphingomonas paucimobilis was isolated from a closed drainage system located in Perai industrial area, Perai, Penang, Malaysia, where several heavy industries were located. The microorganism was grown on slant agar at 35°C for 2 days under static conditions.
2.2. Media Composition
A synthetic dye-containing wastewater was used in this study. Seed culture medium consisted of distilled water 1000 mL, powdered Cibacron yellow FN-2R as the main carbon source, urea as the nitrogen source, and K2HPO4 and KH2PO4 as the phosphorous sources and the nutrients. The composition of the wastewater gave a C/N/P ratio of approximately 100/10/1 by adding NH4Cl and KH2PO4. The initial pH was adjusted to 7 using NaOH.
2.3. Preparation of the Synthetic Dye Wastewater
Cibacron yellow FN-2R is one of the main reactive dyes that are used in textile industries. Reactive Cibacron FN-2R was purchased from Sigma-Aldrich. The dye solution of various concentrations 250, 500, 1000, and 1500 mg/L were treated in four reactors with successively increasing of dye concentration and were analyzed for COD, BOD, and MLVSS everyday according to the standard methods [27].
2.4. Experimental Setup
In this study, four cylindrical Plexiglass reactors with 14 cm diameter and 46 cm height were used. The working volume and influent flow rate were 1.6 L and 3.0 L/d, respectively. Four air pumps and four mixers were used for continuous aeration and mixing. The speed of impeller was adjusted to 300 rpm to maintain the dissolved oxygen concentration at DO > 2 ppm. After acclimatization period, different concentrations of the dye and 2 mL of bacterium (S. paucimobilis) were added to each reactor. The pH of the mixture was adjusted to 7 after stirring at a constant temperature (35°C) for a certain period.
2.5. Input-Output Selection
In order to develop accurate and reliable models, appropriate data sets should be collected from the variables, which are governing the behavior of the process. A set of variables including the influent COD, MLVSS, and reaction time as the process inputs and the effluent COD and BOD as the process output were prepared. Two sets of data including 147 data points (in 147 days) and 90 data points (in 90 days) are chosen for training and validation of the models, respectively. As a result, the input/output vectors would be as
(1)Input=[CODin,MLVSS,Time],Output=[BODout,CODout].
For increasing the performance of modeling process, the recorded data are normalized in the range of [0,1] by dividing on the maximum value of each datum column.
3. Model Structure and System Identification
Based on the obtained experimental data, appropriate nonlinear models were developed to predict the concentrations of COD and BOD in the effluent.
Wiener-type model is a well-known block-oriented representation of nonlinear systems, in which a linear dynamic part is cascaded with static nonlinearities. As it is shown in Figure 1, the sequences of input variable u(k) are mapped into the intermediate variable v(k) through the transfer function H(z), where the model output y(k) is estimated throughout the nonlinear mapping function f(·). Many different options are available for the linear part and nonlinear mapping functions.
Wiener type model configuration.
Here, the considered Wiener model is a combination of Laguerre basis filters as the linear part and a feed-forward NN for nonlinear mapping. In this case, without loss of generality by considering a SISO (single-input and single-output) system, the nonlinear dynamics can be represented as a discrete-time nonlinear mapping on filtered input terms,
(2)y(k)=f(l0(k)*u(k),l1(k)*u(k),…,li(k)*u(k)),
where li(k) denotes the Laguerre filters. This structure is a NN realization of the Wiener-Laguerre model [21].
As it is presented in (1), three inputs are chosen as the dominant variables on the effluent COD and BOD. In this case, each process model can be considered as a multi-input single-output (MISO) model, as it is presented in Figure 2.
Wiener-Laguerre network model.
3.1. Laguerre Filters
Laguerre functions can be represented as a set of discrete-time transfer functions in z-domain,
(3)Li(z,α)=1-α21-αz-1(-α+z-11-αz-1)ii=0,1,2,…,
where α∈{ℜ:|α|<1} is the pole parameter. The dominant pole “α” determines the rate of exponential decay for Laguerre functions responses, which can be captured through optimization or by experiments [28]. The first section of Li(z,α) is a first-order low-pass filter, which is followed by (i-1) identical all-pass filters. The orthonormal bases are formed by a set of Laguerre functions on L2[0,∞), which is defined on z-domain. Consider
(4)〈Ln,Lm〉=12πj∮Ln(z)Lm(z-1)dzz=δn,m.
For physical systems, the limited order of Laguerre stages can be employed to approximate system dynamics. In this case, for a given truncated order “a,” the transfer function H(z) for a SISO linear discrete-time system is obtained as follows:
(5)H(z)=Y(z)U(z)=∑i=0abi(α)Li(z,α),
in which U(z) and Y(z) are input and output of the system and bi(α) are the Laguerre coefficients [28]. By considering α=0, Li(z,α) will turn to regular delay operators and H(z) to the usual FIR (finite input response) model. Introducing Vi as the output of Laguerre filters,
(6)Vi=Li(z,α)U(z)i=0,1,2,…,a.
Thus, for a linear system, the output of an “a-order” Laguerre network is obtained as the linear combination of the Laguerre filters output as
(7)Y(z)=∑i=0abi(α)Vi.
The linear combination of the Laguerre filters’ output can be replaced by a multilayer perceptrons (MLP) as the nonlinear mapping function in order to deal with nonlinearities. In order to increase the performances of Laguerre networks, it is required that the pole parameters and the order of the Laguerre filters have to be estimated.
Developing a first order linear model for the process could be helpful to find appropriate process time constant. For this aim, the influent and the effluent CODs can be considered as the model input and output signals. We have
(8)y(s)u(s)=kpτps+1,
in which kp and τp are the process gain and time constant. The parameters of linear model can be captured using the MATLAB System Identification Toolbox (Ver. 7.4.2). The best possible result for the linear system pole was s=-1/τp=-0.82, as the model output fit on the real data was about 52.1%. The equivalent discrete pole parameter can be captured by estimating α=exp(sT) in the z-domain, where the sampling time T is equal to 1. This results in α=0.44.
The step responses of Laguerre filters (normalized) are shown in Figure 3. It is noted that by increasing the order of Laguerre filters, the structure of the NN part would be more complicated and therefore computational efforts for the nonlinear part training would increase. A major advantage of Laguerre network is its ability to describe the behavior of nonlinear systems by high-truncated order models. Choosing the order of filter a=8 is an appropriate tradeoff between the complexity and the accuracy of models. The pole parameters and filter order are chosen to be 0.44 and 9, respectively, for all input variables.
The step responses of Laguerre bases filters.
3.2. Neural Network
A multilayer feed-forward neural network (FFNN) is considered as the nonlinear part of the proposed models. A FFNN is developed by a number of cascaded layers that are interconnected by weight coefficients to the neighboring layers nodes. Neurons are the fundamental part of NN and responsible for information processing, which are consisting of weighting coefficients, activation functions, and biases. The structure of FFNN is presented in Figure 4. It is noted that, in this work, no bias connection was considered.
The structure of feed-forward neural network.
Generally, a FFNN may have one or more hidden layers; however, a network with a single hidden layer could approximate many systems with an acceptable degree of accuracy. In many cases, choosing the appropriate number of neurons in the hidden layer and the type of the activation functions is known as the dominant parameters on the model performances. The number of neurons in the input/output layers is chosen with respect to the number of input/output variables of the investigated system, which are 27 and 1 for input and output layers, respectively. For a single hidden-layer NN, it is recommended that number of hidden neurons (Nh) be chosen with respect to the geometric pyramid rule proposed as
(9)Nh=βNu×Ny,
where Nu and Ny are the number of network inputs and outputs, respectively, and β is multiplication factor. The value of β should be selected in the range of 0.5<β<2 depends on complexity of system [29]. As a result, by considering β=1.8, the number of neurons in the hidden layer Nh would be equal to 10.
The best activation functions may often be selected through trial and error. For the hidden layer, activation functions are chosen to be hyperbolic tangent sigmoid function and the output layer chosen as linear transfer function.
In order to adjust the parameters of NN, the second-order derivative-based Levenberg-Marquardt (LM) algorithm was employed. The LM is an accelerated neural networks training algorithm, which is rather used for adjusting the parameters of moderate-sized MLP. The model weight parameters (W) are tuned through iteration by the following equation:
(10)Wk+1=Wk-[JT(wk)J(wk)+μI]-1JT(wk)E(wk),
where J is the Jacobian matrix that is captured by first derivation of network errors with respect to its parameters and μ is some nonnegative value known as the learning parameter [30]. The objective function for training process was considered as mean square of error (MSE),
(11)e=1P∑i=1P(yi*-yi)2,
where yi* and yi are the target and actual outputs for the ith pattern, respectively, and P is the total number of training patterns. The error vector E is calculated as follows:
(12)E=[e1e2⋯eP]T.
The MATLAB Neural Network Toolbox (ver. 7.0.1) was employed to perform the process of model training.
4. Simulations and Results
The proposed modeling approach was applied to the experimental data. The performance of the models was measured by the coefficient of determination (R2) and mean absolute error (MAE) between the predicted values of the model and the experimental values, which were calculated by (13) and (14), respectively. Consider
(13)R2=1-∑i=1n(yi*-yp(i))2∑i=1n(yi*-y-)2,(14)MAE=1n∑i=1n|yi*-yp(i)|,
where y- is the average of y over the n data, and yi* and yp(i) are the ith target and predicted responses, respectively.
In Figure 5(a), the responses of the developed model for prediction of BOD are presented. In addition, the correlation between the experimental and predicted values is presented in Figure 5(b). Figure 6(a) shows the responses of the developed model for prediction of COD, whereas the correlation between the experimental and predicted values of COD are depicted in Figure 6(b).
The responses of the developed model (a). Correlation between the actual values and the values predicted by the model for BOD (b).
The responses of the developed model (a), Correlation between the actual values and the values predicted by the model for COD (b).
The results indicate the accuracy and ability of the proposed model to predict the selected parameters perfectly.
The prediction accuracy of the developed models was also evaluated by performing a comparison between the responses of the models and the actual values. For this propose, the error functions were calculated, where the error is defined as the difference between the predicted values by the models and the experimental data. Here, the upper bound error (Max(|e|)), lower bound error (Min(|e|)), MAE, and R2 are calculated and presented in Table 1. The results show a small deviation between the models predicted values and the experimental data. A significantly high R2 and a low MAE obtained indicate the superior data fitting and prediction capability of the developed model.
Error functions for the developed model predicting BOD and COD parameters.
Max (|e|)
Min (|e|)
Mean
MAE
R2
BOD
5.3985
1.28e-4
0.0173
0.3201
0.9998
COD
11.5073
2.78e-4
−0.0367
0.7696
0.9987
BOD and COD are two major parameters for examining the quality of discharged wastewater. Their measurements need significant time and commitment to make proper adjustments in the wastewater treatment process. The model proposed has the ability to provide good generalization performance in capturing nonlinear relationships between parameters to estimate COD and BOD of the treated wastewater.
5. Conclusion
The present study aimed to explore the potential of the Wiener-Laguerre network model in estimation of BOD and COD of the output stream of a SBR system. A multilayer feed-forward neural network was considered as the nonlinear part of the proposed model. The performance of the model was found to be reasonably good with a high R2 (0.99) and a small deviation between the predicted and experimental values. The proposed model can be used as a flexible alternative to the first-order models commonly used for long-term prediction of BOD and COD parameters.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
HaoO. J.KimH.ChiangP. C.Decolorization of wastewater20003044495052-s2.0-0033763999SabaB.KhalidA.NazirA.KanwalH.MahmoodT.Reactive black-5 azo dye treatment in suspended and attach growth sequencing batch bioreactor using different co-substrates20138555656210.1016/j.ibiod.2013.05.005KhouniI.MarrotB.AmarR. B.Treatment of reconstituted textile wastewater containing a reactive dye in an aerobic sequencing batch reactor using a novel bacterial consortium2012871101192-s2.0-8485703958410.1016/j.seppur.2011.11.030KoprivanacN.Jovanović-KolarJ.BosanacG.MeixnerJ.Studies on wastewater decolorization by the precipitation/flocculation process19924633793842-s2.0-38249007762LimP. E.ErC. C.Treatment of dye-containing wastewater by sequencing batch reactor with powdered activated carbon addition2000751-275872-s2.0-0034032394MohanS. V.RaoN. C.SarmaP. N.Simulated acid azo dye wastewater treatment using suspended growth configured sequencing batch reactor (SBR) under anoxic-aerobic-anoxic microenvironment20097125342-s2.0-70350399644YooC. K.VillezK.LeeI. B.RosénC.VanrolleghemP. A.Multi-model statistical process monitoring and diagnosis of a sequencing batch reactor20079646877012-s2.0-3394715468910.1002/bit.21220GüçlüD.DursunŞ.Artificial neural network modelling of a large-scale wastewater treatment plant operation2010339105110582-s2.0-7795808320610.1007/s00449-010-0430-xAlmeidaJ. S.Predictive non-linear modeling of complex data by artificial neural networks200213172762-s2.0-003646917510.1016/S0958-1669(02)00288-4MenezesJ. M. P.Jr.BarretoG. A.Long-term time series prediction with the NARX network: an empirical evaluation20087116–18333533432-s2.0-5654909015610.1016/j.neucom.2008.01.030HongS. H.LeeM. W.LeeD. S.ParkJ. M.Monitoring of sequencing batch reactor for nitrogen and phosphorus removal using neural networks20073533653702-s2.0-3424974359310.1016/j.bej.2007.01.033HanH. G.ChenQ. L.QiaoJ. F.An efficient self-organizing RBF neural network for water quality prediction20112477177252-s2.0-7996089136510.1016/j.neunet.2011.04.006ZengG. M.QinX. S.HeL.HuangG. H.LiuH. L.LinY. P.A neural network predictive control system for paper mill wastewater treatment20031621211292-s2.0-004301191310.1016/S0952-1976(03)00058-7HamzaouiY. E.HernándezJ. A.Silva-MartínezS.BassamA.ÁlvarezA.Lizama-BahenaC.Optimal performance of COD removal during aqueous treatment of alazine and gesaprim commercial herbicides by direct and inverse neural network20112771–33253372-s2.0-7996018115510.1016/j.desal.2011.04.060ChenJ. C.ChangN. B.ShiehW. K.Assessing wastewater reclamation potential by neural network model20031621491572-s2.0-004150913010.1016/S0952-1976(03)00056-3RádulyB.GernaeyK. V.CapodaglioA. G.MikkelsenP. S.HenzeM.Artificial neural networks for rapid WWTP performance evaluation: methodology and case study2007228120812162-s2.0-3394771223510.1016/j.envsoft.2006.07.003BiagiolaS. I.FigueroaJ. L.Wiener and Hammerstein uncertain models identification20097911329633132-s2.0-6764942159610.1016/j.matcom.2009.05.004QuagliniV.PrevidiF.ControR.BittantiS.A discrete-time nonlinear Wiener model for the relaxation of soft biological tissues20022419192-s2.0-003611879210.1016/S1350-4533(01)00113-8KalafatisA. D.WangL.CluettW. R.Linearizing feedforward-feedback control of pH processes based on the Wiener model20051511031122-s2.0-444430532910.1016/j.jprocont.2004.03.006SongI. H.RheeH. K.Application of Wiener type predictive controller to the continuous solution polymerization reactor20061598618642-s2.0-33745782704ŁawryńczukM.Computationally efficient nonlinear predictive control based on neural Wiener models2010741–340141710.1016/j.neucom.2010.03.014MahmoodiS.PoshtanJ.Jahed-MotlaghM. R.MontazeriA.Nonlinear model predictive control of a pH neutralization process based on Wiener-Laguerre model200914633283372-s2.0-5834908345910.1016/j.cej.2008.06.010ArtoV.HannuP.HalmeA.Modeling of chromatographic separation process with Wiener-MLP representation20011154434582-s2.0-003548003010.1016/S0959-1524(00)00053-6WahlbergB.MäkiläP. M.On approximation of stable linear dynamical systems using Laguerre and Kautz functions19963256937082-s2.0-003014327610.1016/0005-1098(95)00198-0SanayeiY.IsmailN.TengT. T.MoradN.Biological treatment of reactive dye (Cibacron yellow FN_2R) by sequencing batch reactor performance200934407140772-s2.0-77953505267AyedL.AchourS.KhelifiE.CherefA.BakhroufA.Use of active consortia of constructed ternary bacterial cultures via mixture design for Congo Red decolorization enhancement201016224955022-s2.0-7795527518710.1016/j.cej.2010.05.050APHA2005New York, NY, USAAmerican Public Health AssociationMasnadi-ShiraziM. A.AleshamsM.Laguerre discrete-time filter design20032911731922-s2.0-003633526410.1016/S0045-7906(01)00028-3PalitA. K.PopovicD.2005London, UKSpringerJangJ. S. R.SunC. T.MitutaniE.19971stUpper Saddle River, NJ, USAPrentice-Hall