Development of Hybrid Models for a Vapor-Phase Fungi Bioreactor

Giorgia Spigno¹ and Stefania Tronci²

¹ Istituto di Enologia e Ingegneria Agro-Alimentare, Università Cattolica del Sacro Cuore, Via Emilia Parmense 84, 29122 Piacenza, Italy (unicatt.it)

² Dipartimento di Ingegneria Meccanica, Chimica e dei Materiali, Università degli Studi di Cagliari, Piazza D’Armi, 09123 Cagliari, Italy (unica.it)

Academic Editor: Juan A. Almendral

Research Article. Mathematical Problems in Engineering (MPE), 2015, Article ID 801213. ISSN 1563-5147, 1024-123X. Hindawi Publishing Corporation. DOI: 10.1155/2015/801213

Received 7 December 2014; Accepted 18 May 2015

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This study is aimed at the development of a model for an experimental vapour-phase fungi bioreactor, which could be derived in a simple way using the available measurements of a pilot-plant reactor, without the development of ad hoc experiments for the evaluation of fungi kinetics and the estimation of parameters related to biofilm characteristics. The proposed approach is based on hybrid models, obtained by the connection of the mass balance equation (used in traditional phenomenological models) with a feedforward neural network (used in black-box modelling), and the proper use of statistical tools for the model assessment and system understanding. Two different hybrid models were developed and compared by proper performance indexes, and their capability to predict the biological complex phenomena was demonstrated and compared to that of a first-principle model.

1. Introduction

The harmful effects of the emissions of volatile organic compounds (VOCs) on the environment and human health have prompted the development of a wide range of off-gas treatment technologies [1]. The degradation of pollutant compounds by means of biological systems is attractive for several reasons: low cost of the process, absence of toxic by-products, ambient condition for operation, and high efficiency of the process.

Among the different biodegradation methods, biofiltration is emerging because of its efficiency in treating large volumes of air contaminated by VOCs [1–3]. In this context, fungi-based biofiltration can support an enhanced mass transfer of hydrophobic VOCs due to the high hydrophobicity of the fungal cell wall and the ability of fungi to colonize the empty space in the biofilter with their aerial hyphae [4]. The possibility of using such processes for industrial applications depends on a good knowledge of the complex phenomena occurring in the biological systems; therefore, considerable effort has been devoted to bioprocess modeling [5–9]. System models support design, management, and process control.

Traditionally, biofilter models are based on mass balance principles and require a good knowledge of the underlying physics of the process, such as information on the specific growth rate of microbes, biofilm thickness and density, diffusivity values, partition coefficient, yield, and biofilm distribution [9]. Significant efforts have been made to develop methods and equations for estimating key design parameters of biofiltration processes such as Henry constants, interfacial areas, and active biomass in biofilters [10]. It is worth noting that a precise and reliable model requires a high experimental effort, sometimes with elaborate technical facilities and expertise, in order to properly estimate the model parameters. For instance, Dorado et al. [11] showed that the determination of kinetic parameters is a demanding task, due to the difficulty of reproducing the experimental system and the necessity of calibrating each model for each specific experimental condition. Even if the first-principle models proposed in the literature are generally able to give a good description of the considered biofilter system [8–11], a limited understanding of the biological processes, along with the high uncertainties in the model parameter estimation, may restrain the applications of such models.

The aim of the present work is to propose a simple modeling approach minimizing the measurement and experimental effort, while retaining model efficiency. To this end, data-driven (or black-box) models are a possible alternative to describe complex biological systems, with the advantage that, in principle, they can be constructed without any knowledge of the process, if a proper amount of data is available. In particular, neural networks (NNs) have shown good reconstruction capabilities in different application fields [12–15] and, recently, in the modeling of biological waste-gas treatment systems [16–21]. As underlined by Rene et al. [19], the main issues with black-box models concern (i) the availability of a large amount of data from which it is possible to extract the information on the underlying system, (ii) the choice of the network architecture, and (iii) the selection of input variables from which it is possible to infer the process outputs. Sometimes, it may be convenient to take advantage of the a priori knowledge of the nonlinear system using the macroscopic balances (i.e., mass and energy balances) and introducing NNs only for the description of some phenomena. The resulting model is called hybrid [19, 22–24], and it has proved to be successful for dynamic systems, with better generalization features and, in addition, identifiable with a reduced set of data with respect to an equivalent black-box model. To the best of our knowledge, such hybrid models have been recently applied to waste-water treatment bioreactors [25] and food technology [26, 27] but never to a gas-phase bioreactor.

In previous works [9, 28] the authors applied a phenomenological model to a bioreactor, where hexane was removed from a contaminated stream, and the obtained results were compared with experimental data. The biofiltration process was conducted in a vapor-phase fixed-bed bioreactor, containing a biological phase, the fungus Aspergillus niger, immobilized on a support. Different experiments were conducted both on the lab-scale reactor and in batch conditions for assessing, respectively, fluid dynamic properties and kinetic parameters. Even if the first-principle model allowed a good reconstruction of the average removal efficiency of the biofilter at different pollutant loads, model uncertainty was evidenced through sensitivity analysis, showing that partition coefficient, maintenance coefficient, and available specific surface, which had been determined partly by theoretical and simulation approaches, were the parameters with the greatest influence on the final removal efficiency of the bioreactor.

Following these premises, the aim of this work is to propose an alternative modeling approach based on the available measurements easily acquired in a vapor-phase fungi pilot plant. For this purpose, the first-principle model previously developed [28] has been modified by introducing a neural network for the description of the removal rate of the pollutant, which represents the most difficult modeling task, because it requires the determination of the specific kinetic rate expression (which is usually a complex function of state variables) and of the distribution of the nonuniform biofilm coverage. This information on the reacting biosystem was extracted from the input-output experimental data of the pilot plant, using a neural network in two different hybrid models: (i) a heterogeneous model where the neural network describes the kinetic rate in the biological phase and (ii) a homogeneous model where the neural networks approximate the flux of hexane at the biolayer/air interface. The two different structures have been compared and the assessment of the models has been aided by using proper statistical tools.

2. Experimental Apparatus and Conditions

A lab-scale bioreactor inoculated with a strain of Aspergillus niger was used for the treatment of an artificially polluted hexane airstream. The system consisted of two identical columns connected in series; therefore reactor performance could refer either to each single column (single configuration) or to a unique reactor of double length (double configuration). Each column was a jacketed glass column of overall height 0.40 m and internal diameter 25 mm, with a stainless steel net at 40 mm from the bottom to sustain the packing material (expanded clay in granular form with average diameter 3–5 mm) and sampling ports for the substrate and air supply and for the outlet gas flow. The contaminated airstream was artificially created by mixing two distinct flows supplied by a compressor: the first one was passed through a humidifying system; the second one was sparged through a vessel containing liquid hexane at 30°C. The gas flow rate was set to 4 · 10⁻³ m³/h, corresponding to an empty bed residence time (EBRT) of 159 s. Optimal EBRT should be in the range from 15 to 60 s, but the minimum value is actually dictated by the given set of off-gas composition and filter conditions, the pollutant removal efficiency (RE), or the maximum outlet concentration allowed by regulations. In our case preliminary trials with a double flow rate and a corresponding EBRT of 80 s had led to an almost zero RE and poor mycelium development [28].
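As a quick check of the residence times quoted above, the EBRT can be recomputed as empty bed volume divided by gas flow rate. The sketch below assumes that the packed height is the 0.40 m column height minus the 40 mm below the supporting net; this bed height is an inference from the description, not a value stated in the text, and the function name is illustrative.

```python
import math

def ebrt_seconds(bed_height_m, diameter_m, flow_m3_per_h):
    """Empty bed residence time: empty bed volume divided by gas flow rate."""
    bed_volume = math.pi * (diameter_m / 2) ** 2 * bed_height_m  # m^3
    return bed_volume / flow_m3_per_h * 3600.0  # convert hours to seconds

# Assumption: packed height = column height minus the 0.04 m below the net.
BED_HEIGHT = 0.40 - 0.04

single = ebrt_seconds(BED_HEIGHT, 0.025, 4e-3)        # operating flow rate
doubled_flow = ebrt_seconds(BED_HEIGHT, 0.025, 8e-3)  # preliminary trials
print(round(single), round(doubled_flow))
```

Under this assumption the two values round to the 159 s and 80 s reported for the operating and doubled flow rates.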

Although the system was located in a conditioned room in order to work as close as possible to a constant temperature of 30°C, the optimal value for fungal growth, daily temperature monitoring showed that the temperature actually varied from 19°C to 30°C.

More details about the bioreactor inoculation procedure, the development of the experiments, and the results obtained from this reactor were already reported [9, 28].

Reactor performance was represented in terms of removal efficiency (RE) calculated from the inlet ($C_{G0}$) and outlet ($C_{Ge}$) gas concentration, as reported in the following:

$$\mathrm{RE} = \frac{C_{G0} - C_{Ge}}{C_{G0}}. \tag{1}$$

3. Modeling Approach

Starting from the first-principle model of the biofiltration system, a hybrid model approach has been proposed where empirical models (e.g., neural networks) are used to describe the most critical phenomena of the considered biosystem.

3.1. First-Principle Model

Before describing the hybrid models developed in the present paper, the first-principle model previously proposed [9] to describe the biofiltering system of Section 2 is recalled:

$$\frac{D v}{H^{2}} \frac{\partial^{2} S_G}{\partial \zeta^{2}} - \frac{U_g}{H} \frac{\partial S_G}{\partial \zeta} + \frac{D_e \alpha A}{\delta^{*}} \left. \frac{\partial S_F}{\partial \eta} \right|_{\eta = 0} = 0 \tag{2a}$$

$$\frac{C_{G0} D_e}{\delta^{*2}} \frac{\partial^{2} S_F}{\partial \eta^{2}} - \frac{X_F \mu_{max}}{Y} \frac{S_F}{K_S / C_{G0} + S_F + C_{G0} S_F^{2} / K_I} - X_F m_s = 0 \tag{2b}$$

with the following boundary conditions:

$$\begin{aligned}
\zeta = 0,\; 0 \le \eta \le 1: &\quad \frac{D v}{U_g H} \frac{\partial S_G}{\partial \zeta} = S_G - 1 \\
\zeta = 1,\; 0 \le \eta \le 1: &\quad \frac{\partial S_G}{\partial \zeta} = 0 \\
\eta = 0,\; 0 \le \zeta \le 1: &\quad S_F = \frac{S_G}{m} \\
\eta = 1,\; 0 \le \zeta \le 1: &\quad \frac{\partial S_F}{\partial \eta} = 0.
\end{aligned} \tag{2c}$$

In the above equations $S_G$ and $S_F$ are the dimensionless pollutant concentrations, respectively, in the gas phase and in the biofilm phase, obtained by dividing the actual concentrations, $C_G$ and $C_F$, by the inlet concentration $C_{G0}$; $U_g$ is the superficial gas velocity; $\zeta$ is the dimensionless reactor height calculated with respect to the reactor length ($h/H$, with $h$ being the position in the column); and $\eta$ is the dimensionless biolayer thickness calculated with respect to the effective one ($\theta/\delta^{*}$, with $\theta$ being the position in the biolayer and $\delta^{*}$ the effective biolayer thickness). The model has six parameters related to the system fluid dynamics, biofilm characteristics, and mass transfer: $D$ is the dispersion coefficient in the reactor; $v$ is the bed porosity; $D_e$ is the effective diffusion coefficient of the pollutant in the biolayer; $A$ is the biolayer surface area per unit volume of the reactor; $\alpha$ is the fraction of $A$ covered by the biofilm; and $m$ is the pollutant air/biofilm distribution coefficient. For the specific growth rate expression of the fungi growing on hexane, Monod kinetics with an Andrews type inhibition was assumed, where $K_S$ is the saturation constant; $K_I$ is the inhibition constant; $m_s$ is the maintenance coefficient; $\mu_{max}$ is the maximum specific growth rate; $X_F$ is the biofilm density; and $Y$ is the biomass yield coefficient.
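To make the shape of the consumption term in (2b) concrete, the sketch below evaluates the Andrews-inhibited growth term plus the maintenance term as a function of the dimensionless biofilm concentration. Setting the derivative of the growth term to zero gives a maximum at $S_F = \sqrt{K_S K_I}/C_{G0}$, which the helper `peak_concentration` returns. All numerical values and function names are hypothetical and chosen only so that the example runs; they are not the parameters estimated in [9].

```python
import math

def andrews_rate(s_f, c_g0, k_s, k_i, xf_mumax_over_y, xf_ms):
    """Consumption term of (2b): Andrews-inhibited growth plus maintenance."""
    growth = xf_mumax_over_y * s_f / (k_s / c_g0 + s_f + c_g0 * s_f ** 2 / k_i)
    return growth + xf_ms

def peak_concentration(c_g0, k_s, k_i):
    """Dimensionless S_F at which the growth term is largest."""
    return math.sqrt(k_s * k_i) / c_g0

# Hypothetical parameter values, chosen only to make the sketch runnable.
params = dict(c_g0=5.0, k_s=0.5, k_i=8.0, xf_mumax_over_y=100.0, xf_ms=2.0)
s_peak = peak_concentration(params["c_g0"], params["k_s"], params["k_i"])

rate_at_zero = andrews_rate(0.0, **params)     # maintenance term only
rate_at_peak = andrews_rate(s_peak, **params)  # largest consumption rate
rate_below = andrews_rate(0.9 * s_peak, **params)
rate_above = andrews_rate(1.1 * s_peak, **params)
```

The rate rises with concentration up to the peak and then declines, which is the substrate-inhibition behavior that the neural network in the hybrid model has to recover from outlet data alone.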

The model has many parameters, which had been evaluated [9] partly through a trial-and-error method and partly using the knowledge of the process obtained from experimental data and from the literature (Table 1). The proposed model showed good performance, but sensitivity analysis revealed model uncertainty, principally due to parameters related to the biological phase, which are difficult to evaluate experimentally. The main critical experimental points concern the evaluation of biodegradation kinetics and the prediction of biomass film distribution, which strongly influence degradation and imply large data variability. This aspect hampers the construction of a bioreactor model, which should be robust with respect to data variation or the presence of outliers. It is also important to note that some parameters are quite difficult to estimate experimentally, like the partition coefficient $m$, while biofilm thickness and biofilm surface are impossible to measure and can only be adjusted by fitting. Furthermore, the inoculated fungus Aspergillus niger can develop as filamentous mycelium and spores, and in the previous study it was not possible to recognize which form was responsible for hexane degradation.

Table 1

Estimation of parameters in the first-principle model.

Model parameter	Estimation
D	Experimental
D e	Theoretical
α	Experimental/theoretical
V biomass	Experimental/theoretical
Δ	Experimental/theoretical
K S	Experimental
X F · μ max / Y	Experimental
K I	Theoretical/simulation
m S	Theoretical/simulation
m	Simulation

3.2. Neural Networks

A feedforward neural network (FNN) has been used here, coupled with the first-principle model, in order to describe the phenomena occurring in the biofilter system that are related to pollutant degradation kinetics. A general FNN is represented in Figure 1, for $n_1$ inputs, represented by the vector $z_1$, and one output, $z_3$. In more detail, the input signals $z_1$ are scaled by the adjustable parameters, called weights, $w_1(i,j)$; then all the contributions are summed and processed by the activation function $f_e$ (3a). The resulting signal, $z_2$, is scaled by the weights $w_2(i,j)$. Its components are summed and mapped into the vector $z_3$ by the activation function $f_o$ (3b):

$$z_{2,i} = f_e\left( \sum_{j=1}^{n_1} w_1(j,i)\, z_{1,j} + w_1(n_1+1,i)\, b \right) \tag{3a}$$

$$z_{3,i} = f_o\left( \sum_{j=1}^{n_2} w_2(j,i)\, z_{2,j} + w_2(n_2+1,i)\, b \right) \tag{3b}$$

where $b$ represents the bias term.
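The two-layer mapping in (3a) and (3b) can be sketched in a few lines. The example below stores each weight matrix so that `weights[j][i]` links signal `j` to neuron `i`, with the last row multiplying the constant bias signal, mirroring the index convention of the equations. The default activation functions (logistic and identity), the weights, and the function names are illustrative assumptions.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def identity(x):
    return x

def layer(signals, weights, activation, bias=1.0):
    """One layer of (3a)/(3b): weights[j][i] links signal j to neuron i;
    the last row of weights multiplies the constant bias signal."""
    augmented = list(signals) + [bias]
    n_out = len(weights[0])
    return [
        activation(sum(weights[j][i] * augmented[j] for j in range(len(augmented))))
        for i in range(n_out)
    ]

def fnn(z1, w1, w2, f_e=sigmoid, f_o=identity):
    z2 = layer(z1, w1, f_e)    # (3a): inputs -> hidden signals
    return layer(z2, w2, f_o)  # (3b): hidden signals -> output

# Hypothetical weights: 1 input, 2 hidden neurons, 1 output.
w1 = [[1.0, -1.0],   # input -> hidden neurons 1 and 2
      [0.0, 0.0]]    # bias  -> hidden neurons 1 and 2
w2 = [[2.0], [2.0],  # hidden neurons 1 and 2 -> output
      [0.5]]         # bias -> output
z3 = fnn([0.3], w1, w2)
```

With these weights the two hidden signals are `sigmoid(0.3)` and `sigmoid(-0.3)`, which sum to one, so the output is 2.5 up to rounding.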

Figure 1

Scheme of the feedforward neural network.

Neural networks are data-driven models and, in principle, it is not necessary to have a deep knowledge on the physicochemical phenomena governing the process. They are universal approximators [29]. Therefore they should be able to model any nonlinear system if the proper network structure is used, that is, the right network input variables and the number of hidden neurons.

In this work, a knowledge-based approach supported by statistical tools has been used to identify the inputs of the network, which are the variables affecting the consumption of reactant. With regard to hidden neurons, it is important to underline that they cannot be determined from the knowledge of the process, because they elaborate signals that have lost the physical meaning of the inputs. In this case, a trial-and-error method has been used for the choice of the number of hidden units: starting from the general consideration that a parsimonious model is preferred, the number of hidden units has been evaluated from the simplest model, with only one hidden neuron and adding one more neuron until a significant change in the model performance was observed. Then, the input and hidden layers were augmented with an extra neuron, the bias, which provides a constant output signal equal to one.

3.3. Hybrid Models

Considering the drawbacks underlined for the first-principle model in Section 3.1, the aim of the present work is to find a simple modeling approach, which could be easily applied also to industrial bioreactors, where the possibility of carrying out experimental measurements for biomass characterization is scarce. The approach proposed here is based on hybrid models, obtained by the integration of an FNN in the first-principle model ((2a)–(2c)). Aiming at the best compromise between simplicity of description and prediction capabilities of the model, two different approaches have been considered to reconstruct the biological system behavior: a heterogeneous and a homogeneous model.

3.3.1. Heterogeneous Hybrid Model (GM1)

The identification of the kinetic law occurring in the biological phase is one of the most critical points when describing biological systems, because the kinetic constants may vary significantly with process conditions [30]. The neural network, indicated as $f_{NN1}(z_1)$, has been therefore introduced in the reactor model ((2a)–(2c)) to estimate the hexane reaction rate along with the term $X_F m_s$; therefore the following heterogeneous or two-phase hybrid model is obtained ((4a) and (4b)):

$$\frac{D v}{H^{2}} \frac{\partial^{2} S_G}{\partial \zeta^{2}} - \frac{U_g}{H} \frac{\partial S_G}{\partial \zeta} + \frac{D_e \alpha A}{\delta^{*}} \left. \frac{\partial S_F}{\partial \eta} \right|_{\eta = 0} = 0 \tag{4a}$$

$$\frac{C_{G0} D_e}{\delta^{*2}} \frac{\partial^{2} S_F}{\partial \eta^{2}} - f_{NN1}(z_1) = 0 \tag{4b}$$

along with the boundary conditions reported in Section 3.1. In the above formulation $z_1$ is the input vector of the network (cf. Figure 1), which consists of the selected independent variables affecting the reaction rate. It will be defined later.

The use of a neural model has the immediate consequence that it can be applied directly on the experimental data obtained in the reactor configuration used for the biofiltration of the polluted stream, without the necessity of conducting ad hoc experiments for kinetics identification and parameter estimation, as those previously carried out [9].

3.3.2. Homogeneous Hybrid Model (GM2)

A further simplification of the model has been obtained by using the neural network to describe the flux of the hexane at the gas-biological phase interface, avoiding also the estimation of the parameters related to the biofilm characteristics. As a result, a homogeneous model is obtained, where the derivative of concentration in the biological phase is modeled by a neural network, indicated as $f_{NN2}(z_1)$, as reported in

$$\frac{D v}{H^{2}} \frac{\partial^{2} S_G}{\partial \zeta^{2}} - \frac{U_g}{H} \frac{\partial S_G}{\partial \zeta} + f_{NN2}(z_1) = 0. \tag{5}$$

In this case only the fluid dynamic characteristics of the system need to be estimated, and the reactor efficiency is obtained by integrating (5) along with the boundary conditions with respect to $\zeta$ reported in Section 3.1. Again, the input network variables, indicated with $z_1$, will be defined later.
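The integration of (5) can be illustrated with a small finite-difference sketch. Here the trained network is replaced by a hypothetical first-order sink, $f_{NN2} = -k S_G$, so that the problem becomes linear and can be solved with one tridiagonal (Thomas) sweep; a real hybrid model would evaluate the network instead and iterate. The coefficients `a` $= Dv/H^2$ and `b` $= U_g/H$, the rate constant `k`, the upwind discretization of the convective term, and the function names are all assumptions of this sketch, not the authors' numerical method or values.

```python
def solve_gas_phase(a, b, k, n=200):
    """Solve a*S'' - b*S' - k*S = 0 on 0 <= zeta <= 1 with the gas-phase
    boundary conditions of (2c): (a/b)*S' = S - 1 at zeta = 0, S' = 0 at zeta = 1.
    a = D*v/H**2, b = U_g/H; -k*S is a hypothetical stand-in for f_NN2."""
    h = 1.0 / n
    lower = [0.0] * (n + 1)
    diag = [0.0] * (n + 1)
    upper = [0.0] * (n + 1)
    rhs = [0.0] * (n + 1)
    # Inlet condition, discretized with a one-sided difference.
    diag[0], upper[0], rhs[0] = -(a / (b * h) + 1.0), a / (b * h), -1.0
    # Interior nodes: central difference for dispersion, upwind for convection.
    for i in range(1, n):
        lower[i] = a / h**2 + b / h
        diag[i] = -2.0 * a / h**2 - b / h - k
        upper[i] = a / h**2
    # Outlet condition: zero gradient.
    lower[n], diag[n] = -1.0, 1.0
    # Thomas algorithm: forward elimination, then back substitution.
    for i in range(1, n + 1):
        m = lower[i] / diag[i - 1]
        diag[i] -= m * upper[i - 1]
        rhs[i] -= m * rhs[i - 1]
    s = [0.0] * (n + 1)
    s[n] = rhs[n] / diag[n]
    for i in range(n - 1, -1, -1):
        s[i] = (rhs[i] - upper[i] * s[i + 1]) / diag[i]
    return s

def removal_efficiency(a, b, k):
    """RE = 1 - S_G at the reactor exit (zeta = 1), cf. (1)."""
    return 1.0 - solve_gas_phase(a, b, k)[-1]

# Hypothetical coefficients (1/h); not the values used in the paper.
re_none = removal_efficiency(a=2.0, b=20.0, k=0.0)
re_slow = removal_efficiency(a=2.0, b=20.0, k=5.0)
re_fast = removal_efficiency(a=2.0, b=20.0, k=20.0)
```

With no sink the profile stays at the inlet value and the removal efficiency is zero; increasing the sink strength raises the removal efficiency, which is the sensitivity the network weights exploit during training.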

4. Development of the Neural Network

The experimental data available for parameter estimation and neural model validation were 290 outlet concentration values, at constant gas flow rate of 4 · 10⁻³ m³ h⁻¹, according to the single and double configuration, at different inlet concentration and temperature values, spanning, respectively, from 1 to 20 g m⁻³ and from 19 to 30°C. All the used concentration values were collected at regime condition, that is, when, after an adaptation period of about two weeks, biomass development was not visually observed anymore and steady-state conditions could be assumed [9].

Data have been randomized and divided into a training set (90%) and a test set (10%). The former has been further divided into two subsets: one for parameter estimation, that is, the training proper (80% of all data), and one for cross-validation (10% of all data). The cross-validation set is used to ensure the generalization capability of the neural model [29]. In more detail, the training phase is stopped when the objective function calculated on the cross-validation data reaches its minimum.
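The partition described above can be sketched as a shuffle followed by slicing. The example uses a plain random shuffle with a fixed seed, both of which are assumptions made for reproducibility of the illustration; it does not reproduce the interval-based selection of test points described in Section 6.

```python
import random

def split_indices(n_points, seed=0):
    """Shuffle indices and split 80/10/10 into training, cross-validation, test."""
    indices = list(range(n_points))
    random.Random(seed).shuffle(indices)
    n_train = round(0.8 * n_points)
    n_cv = round(0.1 * n_points)
    train = indices[:n_train]
    cross_val = indices[n_train:n_train + n_cv]
    test = indices[n_train + n_cv:]
    return train, cross_val, test

train, cross_val, test = split_indices(290)
print(len(train), len(cross_val), len(test))
```

For the 290 available points this gives 232 training, 29 cross-validation, and 29 test points.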

The number of data points used for the development of the hybrid models is large compared with that of other artificial neural network models developed for different waste-gas treatment systems [19], and it meets the requirement that a data-driven model needs a high number of experimental patterns in order to give a good estimate of the process.

The training of the network has been developed by means of the Levenberg-Marquardt optimization algorithm. Unlike more common applications of neural network modeling, the variables to be estimated, which are reaction rate (GM1) and pollutant flux at the air/biolayer interface (GM2), are not experimentally measured; therefore FNN parameter estimation has been based on the error between the experimental and calculated concentration at the reactor exit. The latter is obtained through the integration of the reactor model, assuming that the error between the experimental and calculated concentration is exclusively due to reaction rates for GM1 and due to the hexane flux for GM2. In particular, the following objective function $\Phi(w)$ has been used for both models:

$$\Phi(w) = \sum_{i=1}^{N_t} \left( \frac{C_{Ge}(i) - C_{Gc}(i)}{C_{Ge}(i)} \right)^{2} \tag{6}$$

where $w$ represents the weights (parameters) of the neural model, $N_t$ is the number of data used for the training, and $C_{Ge}(i)$ and $C_{Gc}(i)$ are, respectively, the experimental and calculated concentration at reactor exit.
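The objective function (6) is a sum of squared relative errors, so each data point contributes in proportion to its fractional mismatch rather than its absolute one. A minimal sketch follows; the concentration values are hypothetical, and in the actual procedure the calculated values would come from integrating the reactor model with the current network weights.

```python
def objective(c_exp, c_calc):
    """Phi(w) of (6): sum of squared relative errors at the reactor exit."""
    return sum(((ce - cc) / ce) ** 2 for ce, cc in zip(c_exp, c_calc))

# Hypothetical outlet concentrations (g/m^3).
c_exp = [2.0, 4.0, 10.0]   # measured
c_calc = [1.0, 5.0, 10.0]  # computed from the reactor model
phi = objective(c_exp, c_calc)
```

In this example the same absolute error of 1 g/m³ contributes 0.25 at the low concentration and 0.0625 at the higher one, for a total of 0.3125.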

The minimum search was carried out for every network structure starting from one hundred randomly generated initial $w$ vectors and retaining the weights leading to the lowest error calculated on the cross-validation set. This approach addresses the problem of generalization, that is, obtaining good performance with data that do not belong to the training set [29]. The cross-validation was also used to examine the prediction capabilities of the neural network with respect to sample variation and to assess model robustness.

The development of the neural kinetic model should capture, as well as possible, the essential characteristics of the functional relationships between inputs (i.e., concentration and temperature) and outputs (reaction rate). In other words, the neural kinetic model must also provide consistent derivatives of the reaction rate with respect to the concentration and temperature.

5. Selection of the Model

The construction of the two neural models has been accomplished by selecting the model inputs, the number of hidden neurons, and the activation functions. The selection of the best structure has been based on the following performance indexes evaluated on the training data:

(i) Coefficient of determination $R^2$, which measures the variance explained by the model and is defined as

$$R^{2} = 1 - \frac{\sum_{i=1}^{N_t} \left( C_{Ge}(i) - C_{Gc}(i) \right)^{2}}{\sum_{i=1}^{N_t} \left( C_{Ge}(i) - \bar{C}_G \right)^{2}} \tag{7}$$

where $\bar{C}_G$ is the average concentration value calculated for the $N_t$ experimental points of the training set;

(ii) Standardized residuals, $d(i)$, which indicate whether there are deterministic features that have not been predicted by the model, defined as [31]

$$d(i) = \frac{C_{Ge}(i) - C_{Gc}(i)}{\sqrt{\dfrac{1}{N_t - N_w} \sum_{j=1}^{N_t} \left( C_{Ge}(j) - C_{Gc}(j) \right)^{2}}} \tag{8}$$

where $N_w$ is the number of neural model parameters. Nonrandom behavior of residuals with respect to calculated RE reveals the inadequacy of the model to capture system behavior. The Kolmogorov-Smirnov test is also used to assess the goodness of fit [32–34].
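Both performance indexes are straightforward to compute once experimental and calculated outlet concentrations are available. The sketch below follows (7) and (8), assuming that the denominator of (8) is the square root of the residual variance estimate with $N_t - N_w$ degrees of freedom; the data and the number of weights are hypothetical.

```python
import math

def r_squared(c_exp, c_calc):
    """Coefficient of determination, as in (7)."""
    mean = sum(c_exp) / len(c_exp)
    ss_res = sum((e - c) ** 2 for e, c in zip(c_exp, c_calc))
    ss_tot = sum((e - mean) ** 2 for e in c_exp)
    return 1.0 - ss_res / ss_tot

def standardized_residuals(c_exp, c_calc, n_weights):
    """Residuals divided by their estimated standard deviation, as in (8)."""
    residuals = [e - c for e, c in zip(c_exp, c_calc)]
    scale = math.sqrt(sum(r * r for r in residuals) / (len(residuals) - n_weights))
    return [r / scale for r in residuals]

# Hypothetical data: five points and a model with a single parameter.
c_exp = [1.0, 2.0, 3.0, 4.0, 5.0]
c_calc = [1.5, 2.0, 2.5, 4.0, 5.0]
r2 = r_squared(c_exp, c_calc)
d = standardized_residuals(c_exp, c_calc, n_weights=1)
```

Here the residual sum of squares is 0.5 against a total of 10, so $R^2 = 0.95$, and the two nonzero standardized residuals have magnitude $\sqrt{2}$.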

5.1. Selection of NN Structure in GM1

The use of neural network in the two-phase model GM1 allows using a priori information on the reactor fluid dynamics and biofilm phase. In particular, the following parameters previously reported [9] have been used: D , D e , A , α , δ ∗ , and m . The kinetic law has been, on the other hand, extracted from the experimental data using the proper input variables, in particular hexane concentration in the biofilm and reactor temperature. The results obtained for the training showed that temperature does not affect the reaction rate in the considered experimental conditions, confirming the correctness of isothermal assumption in the phenomenological model [9].

The number of hidden neurons has been selected by varying it in the range 1–4, and the best structure, consisting of two hidden neurons, has been chosen on the basis of residual analysis and $R^2$. Training results evidenced better model behavior when a sigmoidal activation function $f_e$ was applied to the weighted input signals and a linear activation function $f_o$ to the weighted hidden signals, instead of using the nonlinear one for both layers. The analytical forms of the activation functions are reported in

$$f_e = \frac{1}{1 + \exp\left[ -\left( \sum_{j=1}^{n_1} w_1(j,i)\, z_{1,j} + w_1(n_1+1,i)\, b \right) \right]}, \qquad f_o = \sum_{j=1}^{n_2} w_2(j,i)\, z_{2,j} + w_2(n_2+1,i)\, b. \tag{9}$$

A summary on the structures and performance indexes for different network is reported in Table 2, while Figure 2(a) shows the results obtained for the training data set in terms of comparison between experimental removal efficiency (RE) and calculated ones.

Table 2

Network structure and performance indexes of the two hybrid models.

Model	Model inputs	Needed parameters	Input neurons	Hidden neurons	Training R²	Training MSE	Test R²	Test MSE
GM1	C F	D , D e , A , α , δ ∗ , m	1	2	0.824	0.0076	0.83	0.0059
GM2	C G 0 , C G	D	2	2	0.844	0.0067	0.88	0.004

Figure 2

Training results (removal efficiency RE versus inlet pollutant concentration C G 0 ) for heterogeneous hybrid model GM1: (a) comparison between experimental (one reactor configuration: [ ∗ ] , two reactors’ configuration: [ + ] ) and GM1 data (one reactor configuration: [ □ ] , two reactors’ configuration: [ ○ ] ) and (b) standardized residuals with respect to predicted RE (one reactor configuration: [ □ ] , two reactors’ configuration: [ ○ ] ).

It is worth noting that the variability of the data is quite large due to the variations of biomass activity with respect to inlet pollutant concentration, as evidenced in [28]. The hybrid model GM1 follows the average trend of the RE with respect to the inlet hexane concentration. Modeling errors may exceed the measurement error, which is equal to 5%, but it is important to underline that the estimated RE correctly tends to unity, for both reactor configurations, as $C_{G0}$ tends to zero.

Furthermore, a measure of the quality of the fitting is obtained by plotting the standardized residuals with respect to the calculated RE (Figure 2(b)). The trend appears without a deterministic structure, indicating that the obtained model captures the essential features of the data. More than 95% of the data used for training fall in the region (−2, 2), indicating that residuals can be reasonably modeled as the outcome of a normal random variable. This is also corroborated by the Kolmogorov-Smirnov test [33, 34], which does not reject the null hypothesis of Gaussian residuals at a significance level of 5%.
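The two residual checks mentioned above can be sketched as follows: the fraction of standardized residuals inside (−2, 2), and a one-sample Kolmogorov-Smirnov statistic against the standard normal distribution. The residual values are hypothetical, and the critical value uses the common large-sample approximation $1.36/\sqrt{n}$ for a 5% significance level, which is an assumption of this sketch rather than the procedure of [33, 34].

```python
import math

def normal_cdf(x):
    """Cumulative distribution function of the standard normal."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def ks_statistic(sample):
    """Largest gap between the empirical CDF and the standard normal CDF."""
    xs = sorted(sample)
    n = len(xs)
    gap = 0.0
    for i, x in enumerate(xs):
        cdf = normal_cdf(x)
        # Compare just after and just before the empirical jump at x.
        gap = max(gap, (i + 1) / n - cdf, cdf - i / n)
    return gap

def fraction_within(sample, bound=2.0):
    """Share of values strictly inside (-bound, bound)."""
    return sum(1 for x in sample if -bound < x < bound) / len(sample)

# Hypothetical standardized residuals; real ones would come from the model.
residuals = [-1.6, -1.1, -0.7, -0.4, -0.1, 0.2, 0.5, 0.8, 1.2, 2.3]
d_n = ks_statistic(residuals)
critical = 1.36 / math.sqrt(len(residuals))  # large-sample 5% approximation
inside = fraction_within(residuals)
normality_not_rejected = d_n < critical
```

A statistic below the critical value means that the Gaussian hypothesis is not rejected at the chosen level; it does not prove normality, and for small samples exact tables are preferable to the approximation used here.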

5.2. Selection of the NN Structure for GM2

The neural network used in the homogeneous model has to describe two steps of the hexane degradation; in particular it should extract from the experimental data the phenomenon of both adsorption and reaction in the biological phase. On the other hand, GM2 needs less information on the biological phase and kinetics, therefore requiring less experimental effort in terms of ad hoc experiments and sophisticated system analysis. In particular, only the dispersion coefficient ($D$) has been used (see (5)).

The selected network inputs are inlet hexane concentration and concentration of hexane along the reactor. Again, results do not evidence temperature effects on RE, leading to a network with two inputs.

The number of hidden neurons leading to the best results has been, in this case, equal to two and, again, a sigmoidal and a linear activation function have been used, respectively, in the input and hidden neurons.

The principal aspects of the GM2 model are reported in Table 2, and the results for the training set are shown in Figure 3(a). As for GM1, modeling errors may be higher than measurement errors, but it is important to note that residuals (Figure 3(b)) again seem randomly distributed with respect to the estimated variable, and the amount of residuals enclosed in the (−2, 2) region is less than 95%. Applying the Kolmogorov-Smirnov test [33, 34] to the residuals with a significance level of 5%, the null hypothesis of Gaussian residuals is again not rejected.

Figure 3

Training results (removal efficiency RE versus inlet pollutant concentration C G 0 ) for homogeneous hybrid model GM2: (a) comparison between experimental (one reactor configuration: [ ∗ ] , two reactors’ configuration: [ + ] ) and GM2 data (one reactor configuration: [ □ ] , two reactors’ configuration: [ ○ ] ) and (b) standardized residuals with respect to predicted RE (one reactor configuration: [ □ ] , two reactors’ configuration: [ ○ ] ).

6. Test Results

The ability of the two hybrid models to predict the biofiltration system behavior has been evaluated by comparing calculated values with the experimental data selected for testing purposes. To analyze the model performance for a wide range of pollutant inlet concentration, the interval of variation of C G 0 1–20 g m⁻³ has been divided into nineteen subintervals of unitary length. Test data (10% of all the data points) have been randomly selected with more than one sampling from intervals with larger amount of data. Because those data have not been used during the network training, this comparison shows the performance of the developed models in unknown situations.

The test of GM1 is reported in Figure 4, where the removal efficiency calculated with the hybrid model for the single (Figure 4(a)) and double (Figure 4(b)) configuration reactor is compared with the experimental data, with a mean square error for the total points equal to 0.0059. Considering the significant variability of the experimental RE used for training and validation of the model, the reconstruction capability of the hybrid model is quite good. This is also evidenced by the closeness of the GM1 estimate to the RE calculated using the first-principle model [28], which has a mean square error equal to 0.0064. This means that the information contained in the data available at reactor exit allows a good reconstruction of the kinetic law, which rules hexane degradation in the biofilm phase. As a further confirmation of the capability of the hybrid model approach, the kinetic rate predicted by the neural model and that obtained in [9] (cf. (2b)) are reported in Figure 5. Even if the estimates of the two models do not coincide, the qualitative behavior of the kinetic law is well reproduced using only concentration data calculated at one point of the reactor. This result indicates the possibility of using this approach to obtain information on the functional form of a kinetic law, when this is unknown for the problem at hand.

Figure 4

Test results (removal efficiency RE versus inlet pollutant concentration C G 0 ): comparison among heterogeneous hybrid model [ ○ ] , first-principle model [ Δ ] , and experimental data [ + ] for (a) single and (b) double configuration reactor.

Figure 5

Comparison between kinetic rate calculated with the phenomenological model as ( X F μ m a x / Y ) ( S F / ( K S / C G 0 + S F + C G 0 S F 2 / K I ) ) [ Δ ] and neural network ( f N N 1 ) plus the constant term X F m s [ ○ ] as a function of dimensionless pollutant concentration in the biological phase ( S F ).

The GM2 test results are reported in Figures 6(a) and 6(b), where the comparison with the experimental data shows again a fairly good performance of the model. In this case the distance between the hybrid model and the experimental data is smaller than in the previous case, the mean square error being equal to 0.004, but there is a larger mismatch with the first-principle model. This is because GM2 used only information on the system fluid dynamics, extracting all the information regarding biofilm characteristics, kinetics, and mass transfer phenomena from the experimental data (Table 2), with the advantage of reducing ad hoc experiments and avoiding the use of difficult analytical procedures to calculate the model parameters related to the biofilm characteristics.

Figure 6: Test results (removal efficiency RE versus inlet pollutant concentration $C_{G0}$): comparison among homogeneous hybrid model (○), first-principle model (Δ), and experimental data (+) for (a) single and (b) double configuration reactor.

7. Conclusions

The modeling problem of a gas-phase bioreactor was solved by resorting to hybrid models, which combine material balances (used in traditional phenomenological models) with neural networks that describe the most complex phenomena present in the process. Two different options were investigated: (i) a heterogeneous model, where the neural network describes the rate of consumption of the pollutant in the biological phase, and (ii) a homogeneous model, where the neural network reconstructs the flux of hexane at the interface, avoiding the integration of the mass balance in the biological phase. Model construction was based on a large amount of experimental data, characterized by a large variability, from which both hybrid models were able to capture the deterministic features of the observations (as demonstrated by residual analysis). It is worth noting that the neural model was not used in the traditional way, in which it is trained to model directly observed variables. In the present paper neither the kinetic rate nor the mass transfer term was measured; the neural networks therefore had to learn their function from an indirect measure, namely the outlet pollutant concentration.
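The indirect use of the network can be illustrated with a minimal sketch. It rests on simplifying assumptions that are not the paper's formulation: the gas-phase balance is reduced to plug flow without axial dispersion, $dS_G/dz = -f_{NN}(S_G)$ with $S_G(0) = 1$, the network takes only $S_G$ as input, and the weights are hypothetical and untrained. The function names are illustrative.

```python
import math


def feedforward(s, w1, b1, w2, b2):
    """One-hidden-layer network: tanh hidden units, linear output."""
    hidden = [math.tanh(w * s + b) for w, b in zip(w1, b1)]
    return sum(w * h for w, h in zip(w2, hidden)) + b2


def removal_efficiency(sink, n_steps=200):
    """Integrate dS/dz = -sink(S) over z in [0, 1] with RK4; return 1 - S(1)."""
    dz = 1.0 / n_steps
    s = 1.0  # dimensionless inlet concentration, S_G(0) = C_G0 / C_G0
    for _ in range(n_steps):
        k1 = -sink(s)
        k2 = -sink(s + 0.5 * dz * k1)
        k3 = -sink(s + 0.5 * dz * k2)
        k4 = -sink(s + dz * k3)
        s += dz * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return 1.0 - s


# Hypothetical, untrained weights chosen only to give a positive sink term.
W1, B1 = [1.0, 0.5], [0.0, 0.1]
W2, B2 = [0.6, 0.4], 0.0


def neural_sink(s):
    return feedforward(s, W1, B1, W2, B2)


re_predicted = removal_efficiency(neural_sink)
```

In such a scheme the network output is never compared with a measured rate. Only the integrated result, the predicted removal efficiency, is compared with the data, so training would adjust the weights to reduce the mismatch at the reactor exit.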

The main contribution of the proposed modeling approaches is the reduction of the time and resources needed for experimental activities aimed at identifying the kinetic law and the mass transfer phenomena of the bioreactor, which is the most demanding task when modeling biological systems. The satisfactory reconstruction capabilities of the neural networks in both the heterogeneous and the homogeneous model have been demonstrated by comparing the hybrid model predictions with those obtained using the first-principle model. The good match between the predictions indicates that neural networks are able to extract information on specific complex phenomena from simple observations of the laboratory reactor exit concentration. Furthermore, through the analysis of residuals [31, 32] and the Kolmogorov-Smirnov test [33, 34] applied to the hybrid models, it was possible to corroborate the isothermal assumption used in the phenomenological model.
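A Kolmogorov-Smirnov check on residuals can be sketched in a few lines. The example below assumes, for illustration only, that the residuals are compared with a normal distribution whose mean and standard deviation are estimated from the same sample; this section does not state which distributions were actually compared. The residual values and function names are hypothetical.

```python
import math


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def ks_statistic_normal(residuals):
    """Largest gap between the empirical CDF of the standardized
    residuals and the standard normal CDF."""
    n = len(residuals)
    mean = sum(residuals) / n
    std = math.sqrt(sum((r - mean) ** 2 for r in residuals) / (n - 1))
    standardized = sorted((r - mean) / std for r in residuals)
    d = 0.0
    for i, value in enumerate(standardized, start=1):
        f = normal_cdf(value)
        # Check the gap just after and just before each jump of the empirical CDF.
        d = max(d, i / n - f, f - (i - 1) / n)
    return d


# Hypothetical residuals (model minus experiment), for illustration only.
residuals = [-0.05, 0.02, 0.01, -0.03, 0.04, 0.00, -0.01, 0.03]
d_stat = ks_statistic_normal(residuals)

# Large-sample 5% threshold for a fully specified distribution.
threshold = 1.36 / math.sqrt(len(residuals))
```

Because the mean and standard deviation are estimated from the data, this threshold is conservative, and for a sample this small tabulated critical values should be preferred.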

The results showed that the behavior of the heterogeneous model was closer to that of the first-principle model than that of the homogeneous one, with good prediction of the reaction rate law. This model should therefore be recommended. On the other hand, the homogeneous model used less a priori information on the biosystem, which might reduce its performance when it is used at different operating conditions (e.g., changes of gas flow rate). To overcome this limitation, a proper experimental campaign (i.e., with ad hoc trials at different flow rates) followed by model recalibration could be carried out to improve the model's descriptive capabilities and widen its applicability. This approach is easier to implement in a full-scale industrial plant, which makes the homogeneous model preferable to the heterogeneous one in that setting.

The proposed methodology allows the construction of parsimonious models, which are useful for on-line application and also require less computational time during training. In particular, hybrid models based on neural networks can be useful for process monitoring when concentration measurements are delayed, where it is advantageous to have a simple tool to recognize quickly whether the system is going out of control and to take corrective measures. Furthermore, the availability of the model provides an effective tool for on-line optimization of the process.

Nomenclature

$A$: Biolayer surface area per unit volume of the reactor (m⁻¹)

$C_G$: Concentration of the pollutant in the air at position $h$ along the biofilter (g/m³)

$\bar{C}_G$: Average concentration value calculated for the experimental points of the training set (g/m³)

$C_G^{e}$: Experimental concentration

$C_G^{c}$: Calculated concentration

$C_{G0}$: Concentration of the pollutant in the air at the inlet of the biofilter (g/m³)

$C_{Ge}$: Concentration of the pollutant in the air at the outlet of the biofilter (g/m³)

$C_F$: Concentration of the pollutant at a position $\theta$ in the biolayer at a point $h$ along the column (g/m³)

$D$: Dispersion coefficient in the reactor (m²/h)

$D_e$: Effective diffusion coefficient of the pollutant in the biolayer (m²/h)

$H$: Reactor height (m)

$K_I$: Inhibition constant (g/m³)

$K_S$: Saturation constant in the specific growth rate expression of a culture growing on the pollutant (g/m³)

$N_t$: Number of experimental points

$N_w$: Number of neural model parameters

$R^2$: Coefficient of determination

RE: Removal efficiency

$S_G$: Dimensionless pollutant concentration in the gas phase ($C_G/C_{G0}$)

$S_F$: Dimensionless pollutant concentration in the biological phase ($C_F/C_{G0}$)

$U_g$: Superficial gas velocity (m/h)

$Y$: Yield coefficient of the culture on the pollutant (g-biomass/g-compound)

$X_F$: Biofilm density (g-dry cells/m³ biofilm)

$b$: Bias term of the neural model

$f_e$: Activation function of the hidden neurons

$f_o$: Activation function of the output neurons

$f_{NN1}(z)$: Neural network for the heterogeneous hybrid model

$f_{NN2}(z)$: Neural network for the homogeneous hybrid model

$h$: Position in the column; $h = 0$ at the entrance and $h = H$ at the exit

$m_S$: Maintenance coefficient (g-hexane/g-biomass/h)

$m$: Pollutant air/biofilm distribution coefficient

$p_r$: Parameter vector related to kinetics

$w_1(j, i)$: Weight connecting the $j$th input and the $i$th hidden neuron

$w_2(j, i)$: Weight connecting the $j$th hidden neuron and the $i$th output neuron

$x$: Dimensionless biolayer thickness ($\theta/\delta^{*}$)

$z$: Dimensionless reactor height ($h/H$)

$z_1$: Input vector of neural network

$z_2$: Signal vector of neural network from the hidden to the output layer

$z_3$: Output vector of neural network.

Greek Letters

$\alpha$: Fraction of $A$ covered by the biofilm

$\delta^{*}$: Effective biolayer thickness (m)

$\eta$: Dimensionless biolayer thickness ($\theta/\delta^{*}$)

$\mu_{max}$: Maximum specific growth rate (h⁻¹) in Monod kinetics, kinetic constant in Andrews kinetics (Monod-type equation with substrate inhibition)

$\nu$: Bed porosity

$\theta$: Position in the biolayer (m); $\theta = 0$ at the air/biofilm interface and $\theta = \delta^{*}$ at the biofilm/support interface

$\zeta$: Dimensionless reactor height ($h/H$).

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

References

[1] Kennes, "Biotechniques for air pollution control and bioenergy," Journal of Chemical Technology and Biotechnology, vol. 87, no. 6, pp. 723–724, 2012. doi:10.1002/jctb.3829

[2] Quan; Zhang; Zhao, "Long-term operation of a compost-based biofilter for biological removal of n-butyl acetate, p-xylene and ammonia gas from an air stream," Biochemical Engineering Journal, vol. 32, no. 2, pp. 84–92, 2006. doi:10.1016/j.bej.2006.09.005

[3] Zehraoui; Hassan, A. A.; Sorial, G. A., "Biological treatment of n-hexane and methanol in trickle bed air biofilters under acidic conditions," Biochemical Engineering Journal, vol. 77, pp. 129–135, 2013. doi:10.1016/j.bej.2013.06.001

[4] Arriaga; Revah, "Removal of n-hexane by Fusarium solani with a gas-phase biofilter," Journal of Industrial Microbiology and Biotechnology, vol. 32, no. 11-12, pp. 548–553, 2005. doi:10.1007/s10295-005-0247-9

[5] Arriaga; Revah, "Mathematical modeling and simulation of hexane degradation in fungal and bacterial biofilters: effective diffusivity and partition aspects," Canadian Journal of Civil Engineering, vol. 36, no. 12, pp. 1919–1925, 2009. doi:10.1139/l09-090

[6] Kraakman, N. J. R.; Rocha-Rios; van Loosdrecht, M. C. M., "Review of mass transfer aspects for biological gas treatment," Applied Microbiology and Biotechnology, vol. 91, no. 4, pp. 873–886, 2011. doi:10.1007/s00253-011-3365-5

[7] Vergara-Fernández; Hernández; Revah, "Phenomenological model of fungal biofilters for the abatement of hydrophobic VOCs," Biotechnology and Bioengineering, vol. 101, no. 6, pp. 1182–1192, 2008. doi:10.1002/bit.21989

[8] Salehahmadi; Halladj; Zamir, S. M., "Unsteady-state mathematical modeling of a fungal biofilter treating hexane vapor at different operating temperatures," Industrial and Engineering Chemistry Research, vol. 51, no. 5, pp. 2388–2396, 2012. doi:10.1021/ie2014718

[9] Spigno; de Faveri, D. M., "Modeling of a vapor-phase fungi bioreactor for the abatement of hexane: fluid dynamics and kinetic aspects," Biotechnology and Bioengineering, vol. 89, no. 3, pp. 319–328, 2005. doi:10.1002/bit.20336

[10] Bordel; Muñoz; Díaz, L. F.; Villaverde, "Mechanistic model for evaluating the performance of suspended growth bioreactors for the off-gas treatment of VOCs," Biochemical Engineering Journal, vol. 38, no. 3, pp. 395–405, 2008. doi:10.1016/j.bej.2007.08.004

[11] Dorado, A. D.; Baquerizo; Maestre, J. P.; Gamisans; Gabriel; Lafuente, "Modeling of a bacterial and fungal biofilter applied to toluene abatement: kinetic parameters estimation and model validation," Chemical Engineering Journal, vol. 140, no. 1–3, pp. 52–61, 2008. doi:10.1016/j.cej.2007.09.004

[12] Bellos, G. D.; Kallinikos, L. E.; Gounaris, C. E.; Papayannakos, N. G., "Modelling of the performance of industrial HDS reactors using a hybrid neural network approach," Chemical Engineering and Processing, vol. 44, no. 5, pp. 505–515, 2005. doi:10.1016/j.cep.2004.06.008

[13] Bhutani; Rangaiah, G. P.; Ray, A. K., "First-principles, data-based, and hybrid modeling and optimization of an industrial hydrocracking unit," Industrial and Engineering Chemistry Research, vol. 45, no. 23, pp. 7807–7816, 2006. doi:10.1021/ie060247q

[14] Tronci; Baratti; Servida, "Monitoring pollutant emissions in a 4.8 MW power plant through neural network," Neurocomputing, vol. 43, no. 1–4, pp. 3–15, 2002. doi:10.1016/s0925-2312(01)00617-8

[15] Gong; Liu, "Embedded artificial neural network-based real-time half-wave dynamic resistance estimation during the A.C. resistance spot welding process," Mathematical Problems in Engineering, vol. 2013, Article ID 862076, 7 pages, 2013. doi:10.1155/2013/862076

[16] Chairez; García-Peña; Cabrera, "Dynamic numerical reconstruction of a fungal biofiltration system using differential neural network," Journal of Process Control, vol. 19, no. 7, pp. 1103–1110, 2009. doi:10.1016/j.jprocont.2008.12.009

[17] Elías; Ibarra-Berastegi; Arias; Barona, "Neural networks as a tool for control and management of a biological reactor for treating hydrogen sulphide," Bioprocess and Biosystems Engineering, vol. 29, no. 2, pp. 129–136, 2006. doi:10.1007/s00449-006-0062-3

[18] Rene, E. R.; Veiga, M. C.; Kennes, "Experimental and neural model analysis of styrene removal from polluted air in a biofilter," Journal of Chemical Technology and Biotechnology, vol. 84, no. 7, pp. 941–948, 2009. doi:10.1002/jctb.2130

[19] Rene, E. R.; Kim, J. H.; Park, H. S., "Immobilized cell biofilter: results of performance and neural modeling strategies for NH₃ vapor removal from waste gases," Aerosol and Air Quality Research, vol. 9, no. 3, pp. 379–384, 2009. doi:10.4209/aaqr.2008.10.0046

[20] Rene, E. R.; Estefanía López; Veiga, M. C.; Kennes, "Neural network models for biological waste-gas treatment systems," New Biotechnology, vol. 29, no. 1, pp. 56–73, 2011. doi:10.1016/j.nbt.2011.07.001

[21] Zamir; Halladj; Saber; Ferdowsi; Nasernejad, "Biofiltration of hexane vapor: experimental and neural model analysis," Clean – Soil, Air, Water, vol. 39, no. 9, pp. 813–819, 2011. doi:10.1002/clen.201000525

[22] Porru; Aragonese; Baratti; Servida, "Monitoring of a CO oxidation reactor through a grey model-based EKF observer," Chemical Engineering Science, vol. 55, no. 2, pp. 331–338, 2000. doi:10.1016/S0009-2509(99)00328-0

[23] Safavi, A. A.; Nooraii; Romagnoli, J. A., "A hybrid model formulation for a distillation column and the on-line optimisation study," Journal of Process Control, vol. 9, no. 2, pp. 125–134, 1999. doi:10.1016/S0959-1524(98)00041-9

[24] Tronci; Medde; Baratti, "An hybrid model for the hydrotreatment of gasoil, ICheaP-9," Chemical Engineering Transactions, vol. 17, pp. 1233–1238, 2009.

[25] Kumar, B. S.; Venkateswarlu, "Estimating biofilm reaction kinetics using hybrid mechanistic-neural network rate function model," Bioresource Technology, vol. 103, no. 1, pp. 300–308, 2012. doi:10.1016/j.biortech.2011.10.006

[26] Saraceno; Curcio; Calabrò; Iorio, "A hybrid neural approach to model batch fermentation of 'ricotta cheese whey' to ethanol," Computers and Chemical Engineering, vol. 34, no. 10, pp. 1590–1596, 2010. doi:10.1016/j.compchemeng.2009.11.010

[27] Saraceno; Aversa; Curcio, "Advanced modeling of food convective drying: a comparison between artificial neural networks and hybrid approaches," Food and Bioprocess Technology, vol. 5, no. 5, pp. 1694–1705, 2012. doi:10.1007/s11947-010-0477-3

[28] Spigno; Pagella; Fumi, M. D.; Molteni; de Faveri, D. M., "VOCs removal from waste gases: gas-phase bioreactor for the abatement of hexane by Aspergillus niger," Chemical Engineering Science, vol. 58, no. 3–6, pp. 739–746, 2003. doi:10.1016/s0009-2509(02)00603-6

[29] Principe; Euliano, N. R.; Lefebvre, W. C., Neural and Adaptive Systems: Fundamentals Through Simulation, Wiley, New York, NY, USA, 1999.

[30] Kennes; Veiga, M. C., Bioreactors for Waste Gas Treatment, Kluwer Academic, Dordrecht, The Netherlands, 2001.

[31] Neumann, M. B.; Gujer, "Underestimation of uncertainty in statistical regression of environmental models: influence of model structure uncertainty," Environmental Science and Technology, vol. 42, no. 11, pp. 4037–4043, 2008. doi:10.1021/es702397q

[32] Montgomery, D. C., Design and Analysis of Experiments, 7th edition, John Wiley & Sons, Hoboken, NJ, USA, 2008.

[33] D'Agostino, R. B.; Stephens, M. A., Goodness-of-Fit Techniques, Marcel Dekker, New York, NY, USA, 1986.

[34] Grosso; Galan; Baratti; Romagnoli, J. A., "On the prediction and shaping of the PSD in crystallization operations," Computer Aided Chemical Engineering, vol. 28, pp. 805–810, 2010. doi:10.1016/s1570-7946(10)28135-x