1. Introduction

JNM

Journal of Nanomaterials

1687-4129 1687-4110

Hindawi

10.1155/2017/9702384

9702384

Research Article

An Accurate PSO-GA Based Neural Network to Model Growth of Carbon Nanotubes

http://orcid.org/0000-0003-3157-7796

Asadnia

Mohsen

¹ Khorasani

Amir Mahyar

² Warkiani

Majid Ebrahimi

³ Hayashi

Yasuhiko

Department of Engineering

Macquarie University

Sydney

NSW 2109

Australia

mq.edu.au

School of Engineering

Deakin University

Waurn Ponds

VIC 3125

Australia

deakin.edu.au

School of Biomedical Engineering

University of Technology

Sydney

NSW 2007

Australia

uts.edu.au

2017

692017

2017 12 04 2017 10 07 2017 25 07 2017 692017

2017

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

By combining particle swarm optimization (PSO) and genetic algorithms (GA) this paper offers an innovative algorithm to train artificial neural networks (ANNs) for the purpose of calculating the experimental growth parameters of CNTs. The paper explores experimentally obtaining data to train ANNs, as a method to reduce simulation time while ensuring the precision of formal physics models. The results are compared with conventional particle swarm optimization based neural network (CPSONN) and Levenberg–Marquardt (LM) techniques. The results show that PSOGANN can be successfully utilized for modeling the experimental parameters that are critical for the growth of CNTs.

Australian Research Council

DP170103704

1. Introduction

Increasing requirements for high manufacturing efficiency such as low throughput time, better product quality, and cheaper finished parts are still driving the equipment manufacturers and the fabrication industries in their search for new technologies. The increasing requirements drive the industry to search for smaller transistors, where size reductions result in higher clock frequencies and lower power dissipation. CNTs show promise in satisfying the need for smaller transistors as a result of their physical and electrical properties [1, 2]. The mechanisms involved in growing CNTs are often complex with numerous experimental parameters that need precise control and often the growth-rate is very slow. Moreover, the growth process involves precursor materials, hydrocarbons, carrier gases, expensive equipment, and high thermal budget. Generating a simulator platform that allows us to optimize the growth parameters before conducting trial and error optimization through experiments will greatly save time and money. Therefore, in this paper, we present a novel algorithm by combining both particle swarm optimization (PSO) and genetic algorithm (GA) which predicts the experimental growth results of CNT and allows making parameter optimization less cumbersome. Ton et al. [3] have presented a numerical piecewise nonlinear approximation of the nonequilibrium mobile charge density to be used in the modeling of CNT transistors. Similarly, Yamacli and Avci [4] have developed a parameterisable model of CNTFET nanoelectronics. Recent advances in ANNs have been made possible through the efforts to model and simulate the behaviour of CNTS. Through the imitation of biological systems and analysis of brain structures these ANNs have developed human-like performance [3–6].

However, ANNs used for pattern classification and optimization problems often suffered from issues such as finding appropriated architecture to perform the satisfactory modeling performance [7]. Considerable research has been conducted in the development of new architectures and learning algorithms of the neural networks to achieve this objective, such as modular neural networks [8], hybrid neural networks [9], and evolutionary algorithms and evolutionary programming [10].

Seeking improved performance of conventional neural networks, researchers [11, 12] have turned to GA. Three evolutionary operations are required to implement a GA: selection, crossover, and mutation. It has been found in experiments that, with large training samples, the convergence speed for the GA would be significantly reduced [13]. Furthermore, application of the crossover operation in GA to neural networks can result in what is known as the “permutation problem.” Consequently, the employment of GA is seen as a generally complicated process. Recently, other evolutionary techniques such as PSO have been applied in other branches of engineering [14, 15]. By contrast, the PSO algorithm does not have two of the evolutionary operators of GA (crossover and mutation). The reduction in parameters yields a faster convergence which is easier to implement [16]. Accordingly, PSOs are suitable for approaching dynamic or problems that change rapidly over time [17].

The superiority of PSO and GA to the BP algorithm stems from the ability of PSO and GA trained ANNs to deal with nondifferentiable functions and work without gradient information. But one of the most notorious problems with the application of crossover to neural networks is known as falling in the local minimums and failing to converge. This problem is more visible when the number of data sets is not enough. Another common problem in conventional NNs is overfitting. If the number of weights of NN exceeds the number of data sets for the training of NN to some extent, “overfitting” may occur. GA are capable of isolating global optimums and, however, converge at low speed to this optimum. On the other hand, PSOs converge quickly albeit at a greater risk of isolating and being trapped at a local optimum. The optimization problem that arises from attempts to balance these two techniques can be solved by particle swarm intelligence. In order to overcome the downsides associated with each algorithm a combination of GA and PSO may be used which is referred to as GAPSO.

In GA, the binary strings of the initial population are generated randomly, so different runs of GA often give similar results. The idea of PSOGANN is to select these initial populations appropriately by using PSONN.

As mentioned before, in the case of lack of enough training data for network, neither conventional PSO nor GA based NN can provide a proper learning method for training NNs. Having a robust algorithm that could be trained with less training data could be a promising method for the application of NNs in microelectromechanical system (MEMS) fabrication and many other engineering fields.

In this study, a novel PSO-GA based neural network is proposed for improving the training capacity of neural network. To evaluate the performance of the proposed PSO algorithm, the training capacity of improved PSO-based ANN is first tested and then compared to that of a conventional PSO-based ANN and a back propagation-based ANN, using the experimental data obtained from carbon nanotube growth process. Optimal back propagation-based neural network architecture is designed using MATLAB Neural Network Toolbox. Programs of PSOGANN and conventional PSO-based ANN are accomplished in C++.

2. Genetic Algorithm

A GA emulates the evolutionary characteristic of survival of the fittest. At each phase, encoded chromosomes are simulated; the algorithm establishes the strength of each chromosome. The chromosomes mutate with crossover producing the next generation. Then the process repeats. The input parameters for GA are a set of solutions (the chromosomes of the GA) and a fitness function defining success characteristics and stopping criteria. At each step of the algorithm chromosomes are first evaluated for suitability against the success characteristics. Subsequently successful chromosomes are randomly pooled to mate. Pairs of chromosomes in this pool randomly share genetic information with each other. The chromosomes are evaluated against the stopping criteria. Then the process repeats if the criteria are not met. These features of GA make it adequate for handling large, nonlinear problems with unpredictable results. Relying on multipoint search and algorithmic features, the chance of convergence to the universal optimal solution is much higher than the chance of falling into a local optimal solution. GA has a positive track record successfully having dealt with problems in a variety of fields, including but not limited to optimization, fuzzy logic, NN, expert systems, and scheduling [11].

3. IPSOGA Based ANN

Particle swarm optimization is a population based stochastic optimization algorithm. In PSO algorithm the potential solutions, called particles, including weights and thresholds vectors fly through the problem space by following the current optimal particles. During training, after initializing PSO parameters using a group of random particles (solutions), optimal solution is achieved through the solution space [9]. Therefore, the velocity and position of the obtained optimal solution are updated according to its own experience and global cooperation. Despite regular PSOs converging rapidly on solutions, they may often become trapped within local maxima and minima [12]. To obviate this problem and improve its training capacity, a developed PSO algorithm is proposed. An improvement to the PSO algorithm is proposed by considering best and worst case particle positions. By nature of the algorithm best case positions will gravitate towards the optimal positions, away from worst case positions [13, 18].

Individual particles in the swarm are represented by n-dimensional position and velocity vectors contained in S, the search space:(1)Xi=xi1,xi2,…,xinTES,Vi=vi1,vi2,…,vinTES.

The evaluation of each particle is performed against the success function, with individual best positions being cumulatively stored in a position vector:(2)Pi=Pi1,Pi2,…,PinT.

A global optimum position, Pg, is established from an evaluation of individual positions.

Between each iteration the new velocity of each particle is calculated from the distance to the global best position, the local best position, and an inertia weight from the previous velocity:(3)Vt+1=w∗Vt+c1r1Pit-Xit+c2r2Pgt-Xit,where cn∗rn yields a randomly distributed acceleration coefficient.

The updated position can be given by taking the sum of the previous position and current velocity over the next iteration:(4)Xt+1=Xt+Vt+1.The proposed algorithm (PSOGANN) is developed by taking advantage of both PSO and GA into the training process. A stopping criterion (which can be either the maximum number of iterations or reaching to a certain MSE) is imposed and if the PSO is unable to meet the stop criteria, the best population for GA (including weights and biases) is determined by PSO and GA will again search for the best parameter set. This process will continue until the stop criterion is satisfied. Figure 1 shows the flow chart of the proposed algorithm and details are presented below.

Figure 1

Flow chart of the proposed neural network trained by hybrid PSO and GA algorithms.

Step 1.

Initialization of PSOGANN parameters: This includes (a) determination of the initial PSONN parameters (c1, c2, c3, w, Vmax, and Vmin); (b) select weights and biases for the network randomly (first iteration); (c) selecting initial position and velocity vectors for all the particles (randomly); (d) selecting initial values of Pg(t) and Pi(t) randomly; (e) determination of number of circuits of group 1, which is number of generations where PSO can try to meet the stopping criteria in each step before its current best particle (Pg(t)) is saved as one of the GA’s populations; (f) number of circuits of group 2, which is number of initial populations in GA.

Step 2.

Compute fitness of individual particles by the feedforward network.

Step 3.

(a) Perform PSO operators to find the best PSONN parameters. (b) Update weights and thresholds according to equations (1) and (2) until “counter 1 > number of circuits of group 1” is satisfied.

Step 4.

(a) Best position of PSO algorithm saves as an initial population for GA; (b) counter 1 resets; (c) algorithm continues to search for the optimal PSO parameters for the current set of network weights and biases until the tolerance is met in Step 3.

Step 5.

(a) If the tolerance is not met after the maximum number (counter 2), perform GA algorithm by initial populations which are saved by PSO in previous steps. (b) Continue until stop criterion is met in Step 4.

4. Carbon Nanotube Growth Process 4.1. Sample Preparation

A 4′′ silicon wafer was oxidized with an oxide layer of 1 μm thickness on both sides, which could function as a buffer layer to circumvent the interaction of catalyst particles with the silicon during CNT growth process. Then a layer of Fe catalyst was deposited onto its top surface with the nominal thickness of 2 nm by electron beam evaporation. After slicing the wafer into small samples, the sample was kept in the CVD quartz chamber for CNT growth.

4.2. Growth Process

In these experiments, we set the initial temperature ramping rate as 50°C/min and the final CNT growth temperature TG = 725°C in the control program, and the pressure inside the chamber was maintained at about 11 Torr. Based on the temperature profile of the substrate as shown in Figure 2(A), the growth could be divided into 3 steps. Similar to our previous CVD processes [16, 17, 19], the gas mixture of hydrogen (H₂) and argon (Ar) was provided throughout the 3 steps: Ar functions as the carrier gas and helps to dilute the acetylene (C₂H₂) concentration, while H₂ acts as the reductive agent to refresh the activity of catalyst particles during the growth [20]; the carbon source C₂H₂ was only introduced in the second step to initiate and maintain the CNT growth. The gas flow rate was controlled and monitored in situ with mass flow controller.

Figure 2

CNT growth process plot. (a) Schematics of the substrate temperature profile during the growth of CNTs. ((b) and (c)) SEM images of grown CNTs. (d) The coaxial walls of a single multiwalled CNT.

To begin with (starting from time t=t0), the temperature T increases dramatically from initial temperature T0 (usually its room temperature) towards the growth temperature TG (725°C here). The temperature ramping rate is not constant with time; instead it decreases as T is approaching TG.

When the temperature reached TG (t1), the second step began. The catalyst layer went through some minutes of pretreatment (annealing), so as to further turn its thin film morphology into isolated small particles by increasing surface tension. Then at time t2, the C₂H₂ gas was introduced into the chamber to initiate the CNT growth. After 30 minutes of growth, C₂H₂ supply was stopped at t3, and the system started to cool down until below 200°C, when the samples could be taken out for characterizations. The surface morphology and the length of the as-grown CNTs on substrate were characterized with scanning electron microscopy (SEM), Hitachi S-3500 N. Typical SEM image of the CNT mat is shown in Figure 2(B)(a), the as-grown CNT mat is perpendicular to the substrate top surface, with uniform thickness of about 320 μm, and the closer look in Figure 2(B)(b) reveals that these CNTs are densely packed with a bit wavy entanglement between them. The transmission electron microscopy (TEM, model: FEI Titan) is used to characterize the structure of CNTs with very high resolution. As shown in Figure 2(B)(c)-(d), the CNTs grown here are multiwall carbon nanotubes (MWCNTs) with 10–30 walls and 10–30 nm in outermost diameter.

5. Modeling Results and Discussions

In this study, five input patterns were used during the CNT growth (values of C₂H₂, Ar, H₂, pretreatment, and growth duration) and length of CNT is considered as the only output. The set of training data is comprised of 90% experimental data (43 groups). 10% (5 groups) of data was randomly set aside for testing purposes. Stop criteria were selected with either 1500 as iterations or 0.005 as minimum error (MSE) condition for all networks. A preliminary analysis was performed on conventional NNs which has not been presented in this paper to establish a control for comparison with the proposed PSO ANN. The LM-NN performed best in both the training and test data sets out of traditional NNs, yielding the lower mean-square-error, MSE. Tables 1 and 2 compare the performance of CPSONN LM-NN and IPSOGANN. In both training and testing phases IPSOGANN proved superior to CPSONN and LM-NN. The data suggests IPSOGANN can provide a 55% (MSE) improvement over CPSONN and 80% (MSE) improvement over LM-NN. During the training it is also found that in terms of speed of convergence (number of needed iterations to meet the stop criteria) PSONNGA is approximately 80% faster as compared to CPSONN (see Figure 3).

Table 1

Optimized parameters for LM-NN, CPSONN, and PSOGANN parameters.

	V m a x	V m i n	C 1	C 2	W	Number of particles	Architecture
LM-NN	—	—	—	—	—	—	5-6-3-1
CPSONN	6	−6	2.0	1.9	0.9	30	5-4-4-1
PSOGANN	7	−5	1.5	2.1	0.9	30	5-7-3-1

Table 2

Comparison between LM-NN, CPSONN, and IPSONN for (a) Model I and (b) Model II.

	LM-NN		CPSONN		PSOGANN
	Train	Test	Train	Test	Train	Test
MSE (m2)	0.244	0.269	0.091	0.106	0.041	0.057
R 2	0.760	0.789	0.895	0.873	0.918	0.902
NS	0.712	0.745	0.807	0.778	0.886	0.854

Figure 3

Convergence speed for PSONN and PSOGANN algorithms.

6. Conclusion

This study proposed a novel algorithm based on PSO and GA for training ANNs (PSOGANN). Application of the proposed algorithm for modeling growth of CNTs is discussed. In particular, proposed model has demonstrated about 40% improvement in offline training average error in comparison to those of conventional PSO-based ANN algorithm. PSOGANN can be trained extremely quickly, which makes it possible to perform a large number of evaluations required by GA. This method is less sensitive to the permutation problem and improves the results of the evolved networks. This method can highly solve some critical issues associated with traditional neural network systems such as overfitting and falling in local minimum.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

Majid Ebrahimi Warkiani would like to acknowledge the support of the Australian Research Council through a Discovery Project Grant (DP170103704).

Bosnick

Ban

Hiebert

Shi

Huang

Lister

Mleczko

Organic vapor adsorption on in situ grown carbon nanotube films

Carbon 2011 49 11 3639 3644

2-s2.0-79958293937

10.1016/j.carbon.2011.04.067

Prasad

H. C.

Hashmi

S. A. R.

Naik

Bhargaw

H. N.

Improved shape memory effects in multiwalled-carbon-nano-tube reinforced thermosetting polyurethane composites

Journal of Applied Polymer Science 2017 134 7

2-s2.0-84992379104

10.1002/app.44389

44389

Ton

R. M.

Munteanu

Cocina

G. C.

Concept of artificial neural network (ANN) and its application in cerebral aneurism with multi walls carbon nanotubes (MWCNT)

Proceedings of The 10th Wseas International Conference on Neural Networks

2009

Yamacli

Avci

Neural network modeling of voltage-dependent resistance of metallic carbon nanotube interconnects: An ab initio study

Expert Systems with Applications 2010 37 12 8014 8018

2-s2.0-77957829521

10.1016/j.eswa.2010.05.089

Zarei

Khataee

A. R.

Ordikhani-Seyedlar

Fathinia

Photoelectro-Fenton combined with photocatalytic process for degradation of an azo dye using supported TiO₂ nanoparticles and carbon nanotube cathode: neural network modeling

Electrochimica Acta 2010 55 24 7259 7265

10.1016/j.electacta.2010.07.050

2-s2.0-77956479795

Zarei

Niaei

Salari

Khataee

A. R.

Removal of four dyes from aqueous medium by the peroxi-coagulation method using carbon nanotube-PTFE cathode and neural network modeling

Journal of Electroanalytical Chemistry 2010 639 1-2 167 174

2-s2.0-75249099805

10.1016/j.jelechem.2009.12.005

Farahnakian

Razfar

M. R.

Moghri

Asadnia

The selection of milling parameters by the PSO-based neural network modeling method

International Journal of Advanced Manufacturing Technology 2011 57 1-4 49 60

2-s2.0-80053573850

10.1007/s00170-011-3262-1

Carmantini

G. S.

beim Graben

Desroches

Rodrigues

A modular architecture for transparent computation in recurrent neural networks

Neural Networks 2017 85 85 105

2-s2.0-84994319086

10.1016/j.neunet.2016.09.001

Razfar

M. R.

Asadnia

Haghshenas

Farahnakian

Optimum surface roughness prediction in face milling X20Cr13 using particle swarm optimization algorithm

Proceedings of the Institution of Mechanical Engineers, Part B: Journal of Engineering Manufacture 2010 224 11 1645 1653

2-s2.0-78649486645

10.1243/09544054JEM1809

Sinha

Malo

Deb

Evolutionary algorithm for bilevel optimization using approximations of the lower level optimal solution mapping

European Journal of Operational Research 2017 257 2 395 411

10.1016/j.ejor.2016.08.027

MR3566866

2-s2.0-84994762013

Bhoskar

Kulkarni

O. K.

Kulkarni

N. K.

Patekar

S. L.

Kakandikar

G. M.

Nandedkar

V. M.

Genetic Algorithm and its Applications to Mechanical Engineering: A Review

2624 2630

2-s2.0-84947331667

10.1016/j.matpr.2015.07.219

Asadnia

Chua

L. H. C.

Qin

X. S.

Talei

Improved particle swarm optimization-based artificial neural network for Rainfall-Runoff modeling

Journal of Hydrologic Engineering 2014 19 7 1320 1329

10.1061/(asce)he.1943-5584.0000927

Soleymani Yazdi

M. R.

Razfar

M. R.

Asadnia

Using particle swarm optimization based neural network for modeling of thrust force drilling of PA-6/ Nanoclay Nanocomposites

Applied Mechanics and Materials 2010 34-35 722 726

2-s2.0-78650863602

10.4028/www.scientific.net/AMM.34-35.722

Khorasani

A. M.

Asadnia

Saadatkia

Modeling of TiC-N Thin Film Coating Process on Drills Using Particle Swarm Optimization Algorithm

Arabian Journal for Science and Engineering 2013 38 6 1565 1571

2-s2.0-84878074604

10.1007/s13369-013-0600-7

Das

Pattnaik

P. K.

Padhy

S. K.

Artificial Neural Network trained by Particle Swarm Optimization for non-linear channel equalization

Expert Systems with Applications 2014 41 7 3491 3496

10.1016/j.eswa.2013.10.053

2-s2.0-84892373106

Miao

Wang

Local synthesis of aligned carbon nanotube bundle arrays by using integrated micro-heaters for interconnect applications

Nanotechnology 2009 20 29

2-s2.0-67651165315

10.1088/0957-4484/20/29/295303

295303

Wang

Miao

Chen

Tan

C. M.

Aligned carbon nanotubes for through-wafer interconnects

Applied Physics Letters 2007 91 4

2-s2.0-34547445650

10.1063/1.2759989

042108

Soleymani Yazdi

M. R.

Razfar

M. R.

Asadnia

Modelling of the thrust force of the drilling operation on PA6-nanoclay nanocomposites using particle swarm optimization

Proceedings of the Institution of Mechanical Engineers, Part B: Journal of Engineering Manufacture 2011 225 10 1757 1771

2-s2.0-84855916549

10.1177/0954405411399018

Miao

Yan

Shen

Erratum: Growth of horizontally aligned dense carbon nanotubes from trench sidewalls (Nanotechnology (2011) 22 (265614))

Nanotechnology 2011 22 47

2-s2.0-80655127969

10.1088/0957-4484/22/47/479502

479502

Ohashi

Chen

G. Y.

Stolojan

Silva

S. R. P.

The role of the gas species on the formation of carbon nanotubes during thermal chemical vapour deposition

Nanotechnology 2008 19 44

2-s2.0-58149234273

10.1088/0957-4484/19/44/445605

445605