Identification and Classification of Atmospheric Particles Based on SEM Images Using Convolutional Neural Network with Attention Mechanism

Accurate identification and classification of atmospheric particulates can provide the basis for their source apportionment. Most current research studies mainly focus on the classification of atmospheric particles based on the energy spectrum of particles, which has the problems of low accuracy and being time-consuming. It is necessary to study the classification method of atmospheric particles with higher accuracy. In this paper, a convolutional neural network (CNN) model with attention mechanism is proposed to identify and classify the scanning electron microscopy (SEM) images of atmospheric particles. First, this work established a database, Qingdao 2016–2018, for atmospheric particles classification research. +is database consists of 3469 SEM images of single particulates. Secondly, by analyzing the morphological characteristics of single particle SEM images, it can be divided into four categories: fibrous particles, flocculent particles, spherical particles, and mineral particles. +irdly, by introducing attention mechanism into convolutional neural network, an Attention-CNNmodel for the identification and classification of the four types of atmospheric particles based on the SEM images is established. Finally, the Attention-CNNmodel is trained and tested based on the SEM images database, and the results of identification and classification for four types of particles are obtained. Under the same SEM images database, the classification results from Attention-CNN are compared with those of CNN and SVM. It is found that Attention-CNN has higher classification accuracy and reduces significantly the misclassification number of particles, which shows the focusing effect of attention mechanism.


Introduction
Atmospheric particulates refer to the micro-solid or liquid matters suspending in the atmosphere. Particulate matters whose aerodynamic equivalent diameters are less than or equal to 2.5 μm are called PM2.5. PM2.5 has small particle size and large surface area, and most of them are nanoscale spheres and ellipsoidal spheres. ey can carry a large number of toxic and harmful substances and stay in the atmosphere for a long time.
is has a great impact on physical health by penetrating deep into lung and exacerbating chronic respiratory and pulmonary diseases [1]. Also, long-term exposure to PM2.5 will increase the incidence of diseases and mortality [2,3]. Besides, PM2.5 in the atmosphere affects global climate: soot aerosol particles are major contributors to global warming through their absorption of solar radiation [4], and the PM2.5 response to climate change is complex and variable with different regions [5]. At present, the study of PM2.5 has become an important issue in the prevention and control of global air pollution.
Atmospheric particulates have different morphological characteristics and chemical composition. e study of single particle can well reflect its physical and chemical characteristics and plays an important role in the formation mechanism, transport process, chemical activity, and environmental impact of particulate matter. Many scholars have studied the particle size distribution, micromorphology, hygroscopicity, optical characteristics, element composition, and mixing state of single particles. Genga et al. [6] applied principal component analysis and hierarchical cluster analysis on PM10, whose aerodynamic equivalent diameter is less than or equal to 10 μm, to investigate the relationship among the chemical and morphological parameters. Based on the scanning electron microscope-energy dispersive X-ray spectroscopy (SEM-EDS), the microscopic images of the particles were processed, and the area, aspect ratio, roundness, fractal dimension, box width, box height, and perimeter of the particles were obtained through image analysis. Chithra and Shiva Nagendra [7] measured the chemical characterization of suspended particulate matter of indoor and outdoor in Chennai city during monsoon, winter, and summer seasons using ion chromatography and inductively coupled plasma optical emission spectrometry. Morphological characteristics of particles were analyzed by scanning electron microscopy (SEM) coupled with energy dispersive X-ray spectrometry, and the distribution of the chemical composition of particulate matter was obtained. Dye et al. [8] collected the fine urban aerosol (<1 μm) using direct sampling of urban aerosol onto porous carbon films. Aerosol particles morphology characteristics were analyzed using transmission electron microscopy (TEM), and the fractal dimensions and diameter of particles were measured and analyzed. Akram et al. [9] investigated the morphology, microstructure, and chemical composition of a variety of particles emitted from coal-fired power plants, steel plants, and vehicle exhausts by SEM and TEM. e investigation indicated that the particles from these three sources were different in morphology, microstructure, and chemical composition. Sphere aggregates were generally the most abundant components, with silicon and aluminum as major elements. Pipal et al. [10] investigated the shapes, morphology, and elemental composition of aerosols in PM10 and PM2.5 in Agra located in north central India using SEM-EDS and concluded that SEM-EDS was a convenient method to identify the sources of particulate air pollution emissions. Zhao et al. [11] investigated the morphology, chemical characteristics, and source apportionment of PM2.5 samples collected from five sites in Jiaxing. SEM and TEM were used to analyze the PM2.5 and the results showed that soot aggregates and coal-fired fly ash were the most abundant components in the samples. e results revealed that the PM2.5 concentrations in winter and spring were higher than in autumn and summer. Li and Shao [12] applied TEM and SEM to study morphologies, sizes, and compositions of aerosol particles during the fog and nonfog episode in Beijing. e number size distributions of mineral particles in fog and nonfog episodes were obtained and the results demonstrated that particles mainly included Si-rich, Carich, and S-rich. Pipal et al. [13] used medium volume air sampler to sample the PM2.5 and PM10 particles at roadside and a semirural site over Agra, India. Particle number concentrations of PM2.5 and PM10 were measured by a Grimm aerosol spectrophotometer. e morphology and elemental composition analyses of particulate matter were performed by SEM and Energy Dispersive Spectrometry (EDS), respectively. Bahadar Zeb et al. [14] studied the morphology, size, and composition of PM2.5 and PM10 by using Fourier Transform Infrared Spectroscopy and SEM with energy dispersive X-ray spectroscopy. e abovementioned studies on the morphology of atmospheric particles mainly focus on the study of the morphology characteristics of atmospheric particles by SEM and TEM, as well as the analysis of characteristics and composition of particles by EDS. Because the background of SEM images of atmospheric particulates is complex and has many noises, it is time-consuming and labor-intensive for classification of atmospheric particles based on EDS, and an effective classification method of atmospheric particulates is needed for further investigation.
As the rising of machine learning, deep neural network, especially CNN, which has good data process capability, has gained in-depth applications in image identification and classification. In this paper, the CNN with attention mechanism applied to the identification and classification of SEM images of atmospheric particles is studied. e main contributions are as follows: (1) e SEM images database of atmospheric particulates including four types of morphological characteristics is established, which contains fiber particulates, flocculent particulates, spherical particulates, and mineral particulates. (2) In order to improve the identification and classification accuracy of CNN, the attention mechanism is introduced into CNN, and the Attention-CNN model for identifying and classifying SEM images of atmospheric particles is established. (3) e Attention-CNN model is trained and tested using the SEM images database of atmospheric particles. e results indicate that the proposed Attention-CNN model outperforms CNN and SVM. e rest of the paper is organized as follows. In Section 2, the research status related to the identification and classification of atmospheric particles is introduced. In Section 3, the processes of sample collection and SEM image acquisition of atmospheric particles are provided and the morphological characteristics of atmospheric particulates SEM images are analyzed. In Section 4, the Attention-CNN model is constructed. In Section 5, the experiment results and analysis are presented. Finally, a brief conclusion and recommendations for future work are presented in Section 6.

Literature Review
Some scholars have applied deep CNN to the study of the concentration of particulate matter and made a series of achievements. Huang and Kuo [15] combined CNN with long-term and short-term memory (LSTM) to predict PM2.5 concentration.
e results showed that the forecasting accuracy of the proposed CNN-LSTM model was the best compared with the traditional machine learning methods. Chakma et al. [16] combined machine learning with natural images, classified natural images with different PM2.5 concentrations based on deep CNN, and estimated the PM2.5 concentration. Wen et al. [17] proposed a spatiotemporal convolutional long short-term memory neural network extended (C-LSTME) model for predicting PM2.5 concentration. In the model, high-level spatiotemporal features were extracted through the combination of CNN and long-short-term memory neural network (LSTM-NN).
e results showed that the presented model had achieved better performance. Soh et al. [18] proposed PM2.5 prediction model using a combination of multiple neural networks, including an artificial neural network, a CNN, and a long-short-term memory to extract spatial-temporal relations. Experiments employing Taiwan and Beijing datasets showed that the proposed model achieved excellent performance. Li et al. [19] proposed a deep CNN model for exploiting the spatial influence remotely sensed PM2.5 concentration. e results demonstrated that the deep CNN model could be well applied in the field of spatially analyzing remotely sensed big data. e accuracy of the deep CNN is much higher than that of geographically weighted regression. Bo et al. [20] combined image and weather information to estimate PM2.5 indices using CNN and support vector regression (SVR) techniques. First, CNN was used to predict the PM2.5 index based on image information, and then the PM2.5 predicted by CNN and two weather features, humidity and wind speed, were combined to yield final estimated PM2.5 index using a created SVR model.
In addition, because of its powerful image learning ability, CNN is widely used in image classification and several achievements have been achieved. Ciocca et al. [21] investigated the use of CNN-based features for food recognition and retrieval, in which the architecture of CNN network was based on the Residual Network with 50 layers. Steinbrener et al. [22] used CNN to classify smaller datasets of hyperspectral images of fruits and vegetables. e proposed approach made use of the additional information from hyperspectral images to achieve improved accuracy with a limited set of training images. Deepak and Ameer [23] applied a pretrained deep network to classify brain tumor using transfer learning. e experiment showed the system recorded the best classification accuracy compared with all the related works. Wu et al. [24] studied different CNN algorithms for the classification of face skin diseases, which was based on the clinical images. e results demonstrated that CNNs have the ability to recognize facial skin diseases and the models using transfer learning achieved a higher average precision. Zhou et al. [25] proposed a CNN model based radiomics approach on shear-wave elastography to classify malignant and benign breast tumors. e proposed method was designed to directly extract features from the dataset without the prerequisite of segmentation and manual operation. Compared with the existing methods, the method had superior performances. In [26], a mathematical model based on transfer learning is used, in which a CNN architecture, VGG-16 trained on ImageNet dataset, was used as a feature extractor. Experimental results demonstrated that an excellent accuracy was achieved. In [27][28][29][30][31][32], hyperspectral images were classified based on CNN and its improved methods, and the superiority of the corresponding methods was verified through experiments. Ma et al. [33] presented a hybrid CNN for classification of pollution levels based on images with PM2.5 values. e method achieved excellent performance on classification of air pollution levels from a single captured image.
Due to the focusing effect of the attention mechanism, it is added to the CNN to assign different weights to different information, filter out unimportant information, and enable the model to focus on key information areas of the image, thereby improving the identification and classification accuracy of the CNN model. Li et al. [34] established a largescale attention-based glaucoma (LAG) database and proposed an attention-based CNN for glaucoma detection, called AG-CNN. e experiment results from testing over LAG database and another public glaucoma database showed that the AG-CNN significantly advanced the state of the art in glaucoma detection. Karthik et al. [35] exploited the features learned by CNN at various processing hierarchies using the attention mechanism and achieved an overall accuracy of 98% on the validation sets in the 5-fold crossvalidation. Li et al. [36] proposed a deep learning-based method for urban PM2.5 concentration prediction, which comprised a one-dimensional CNN, LSTM, and attentionbased network. e results indicate that the method is capable of dealing with PM2.5 concentration prediction and outperforms six baseline methods. Song et al. [37] presented a highly discriminative network, namely, attention branch based convolutional neural network (ABCNN), to distinguish the differences between leaves features. e ABCNN was tested on special dataset of LeafSnap with highly similar tree leaves and achieved 91.43% classification accuracy, which is higher than previous methods.
In the above-cited works for the study of atmospheric particulates, obtaining SEM images is the basic method. SEM images of atmospheric particulates can be used to analyze the morphology of particles, which are widely used in the study of atmospheric particles. In addition, CNN has strong image learning and classification ability, and it is mainly used to analyze and predict the concentration of particulates in the study of atmospheric particulates. ere are few researches on the application of deep CNN in the classification of atmospheric particulates based on SEM images. In [16,33,36], CNN and its improved algorithms were used to classify and study the natural images of atmospheric particulates, mainly solving the problem of particulate matter concentration and pollution level. Morphological characteristics of atmospheric particulates SEM images, affecting particulate source apportionment, should be further studied. In this paper, the SEM images of atmospheric particulates are discussed based on the morphological characteristics of PM2.5. e CNN and attention mechanism are applied to identify the SEM images of atmospheric particles and realize the classification of four types of particles.

Sampling Filter Membranes for Atmospheric Particulates.
Generally, there are three kinds of sampling filter membranes for atmospheric particulates: quartz fiber filter membrane, cellulose acetate filter membrane, and polycarbonate filter membrane. Under the scanning electron microscope, the morphology of these three filter membranes can be observed from the microscopic level. As shown in Figure 1(a), the quartz fiber filter membrane shows complicated fibrous particles in the SEM. Because of the large thickness of quartz fiber filter membrane, tiny particulates are easy to be embedded in the fiber gap, which makes it inconvenient for operators to obtain images. However, the membrane has high temperature resistance, corrosion resistance, good air permeability, and high flow rate sampling. e morphology of cellulose acetate filter membrane in the SEM presents rough pore morphology, as shown in Figure 1(b). e particulates are embedded in the holes or gaps. e cellulose acetate filter membrane has the advantages of being thinner than the quartz fiber filter membrane, good air permeability, relatively complete adsorbed particulates, and relatively taking less time to find particulate. However, due to the influence of membrane fiber, it is easy to occlude particles. e microscopic morphology of the polycarbonate filter membrane is very clear in the SEM, and the surface presents a smooth microporous shape, as shown in Figure 1(c). SEM image shows that the particulates adhere to the surface of the membrane and the holes are tiny. As background, polycarbonate filter membrane is very suitable for analyzing of PM2.5 using image processing. Polycarbonate filtration membrane also has disadvantages. Due to the small holes and poor permeability, smaller air flow and longer sampling time are required when collecting samples. e three filter membranes have their own advantages and disadvantages. In this paper, deep learning method is adopted to identify atmospheric particulates. SEM images with different backgrounds are more conducive to the applicability of the network model. erefore, all three filter membranes are used for sample collection.

Preparation of Samples.
e fine particulate sampler and medium flow PM2.5 sampler are used for sampling PM2.5. Sampling filters are quartz fiber filters, cellulose acetate filters with 0.8 μm pore size, and polycarbonate filters with 0.1 μm pore size. e sampling flow of particles was calibrated before and after sampling, and blank filter membrane was left on-site for sampling to analyze whether the sampling process was contaminated.
In order to obtain the SEM images of PM2.5, three steps are needed. First, make observation sample: an area of size 5 mm × 10 mm was cut on the filter randomly and adhered to SEM sample piles made of aluminium by conductive adhesives. Second, coat the observation sample with platinum: samples were placed in the spray tank and coated with platinum on the surface by ion sputtering under condition of vacuum.
ird, sample piles were placed in scanning electron microscopy (US FEI Nova Nano SEM 450), and the SEM images of PM2.5 were obtained under high vacuum mode with an accelerating voltage of 20 KV and magnification of 20,000 times.

Morphological Characteristics of Atmospheric
Particulates SEM Images. Based on previous research by our group [38], the observation and analysis of a large number of PM2.5 SEM images, it is found that there are four types of particulates with obvious morphological characteristics, namely, fibrous particulates, flocculent particulates, spherical particulates, and mineral particulates. e SEM images of the four types of atmospheric particulates are shown in Figure 2.
e morphological characteristics of fibrous particulates are obvious and their surface is smooth and long. ey are generally derived from minerals, plant fibers, and artificial fibers dispersed in the air. Fibrous particulates mostly belong to primary aerosol, which are not transformed by secondary chemical reaction in the atmosphere. Flocculent particulates are formed by a large number of 30-50 nm round particles gathered together, mainly from the fine particles emitted by motor vehicles adsorbing other elements in the air through secondary transformation and condensation. eir morphological characteristics are chain and cluster in general, which are easy to distinguish [39].
Spherical particulates are generally known as coalfired fly ash. Due to their smooth spherical characteristics, they are easily identified in atmospheric particles. e main sources of spherical particulates are coal combustion, biomass combustion, or garbage incineration, the formation mechanism of which is the melting of solid fly ash at high temperature. e size of spherical particles is micron generally, and their surface can also adsorb other ultrafine particles or secondary particles.
Mineral particulates generally come from dust and products of secondary atmospheric chemical reactions. Fugitive dust mainly includes road fugitive dust, soil fugitive dust, building fugitive dust, and industrial fugitive dust.
eir morphological characteristics are diverse and generally manifested as mineral form or regular block, which are difficult to distinguish. Mineral particulates come from fine particulate matter in windblown dust on the surface of Earth. Regular mineral particulates generally come from chemical reactions in the atmosphere.

Convolutional Neural Network.
Convolutional neural network is one of the common frameworks of deep learning, which has been widely used because of its unique advantages in image processing. For image processing, convolution is applied to the digital image spatial domain to directly calculate the original pixel space, that is, to obtain important features in the image by multi-convolution kernel operation. At the same time, the back propagation algorithm based on gradient descent is used to train the weights of the network, which has good robustness to translation, scaling, tilt, and so on, and finally the accurate identification and classification of the image is achieved. CNN applied to image identification and classification mainly includes five parts: input layer, convolution layer, pooling layer, full connection layer, and output layer, as shown in Figure 3. Among them, the convolution layer uses convolution kernel to obtain image features and realize filtering and strengthening of image features. e pooling layer reduces the feature dimension of convolution layer by downsampling, compresses the amount of data and parameters, and reduces overfitting. e full connection layer connects all input nodes in series into vectors and outputs image features.

Attention Mechanism.
Attention mechanism is a kind of model similar to the attention function of human. It draws on the principle that people will focus their attention on a specific location at a specific time and reduce their attention to other locations. e function of attention mechanism can be interpreted as screening important information from multiple information, focusing on important information, and ignoring unimportant information. Attention mechanism uses the method of assigning different weights to the input features of the model to focus on important information. e focus process of important information is the calculation process of weight coefficients.
e higher the importance of information, the greater the weight coefficients assigned. e principle of attention mechanism [40] is shown in Figure 4, in which X � (x 1 , x 2 , . . ., x n ) represents the input at time t, H � (h 1 , h 2 , . . ., h n ) indicates the hidden state information in the input sequence, and each h i (i � 1,2, . . ., n) contains the information learned from the input sequence by CNN. e information in h i mainly focuses on the input data around the ith input sequence, and y t is the output at time t, which can be expressed as where g(·) is CNN, s t is hidden output-state at time t, and c t represents context state vector obtained through learning. e calculation of s t can be expressed as follows: e calculation formula of c t is as follows: where a ti is the attention parameter, which corresponds to the weight of h i ; that is, its calculation formula is as follows: where e ti is an alignment model, which evaluates the matching degree between the output at time t and the information at the ith input. It can be calculated by learning the hidden state s t−1 and the h i . e calculation formula is as follows: where tanh(·) is the activation function, W a and U a are the weight parameters, and b a is the bias parameter. From the above, it can be seen that the attention mechanism can improve the learning of the influence relationship of different model states by training the influence weight of each input x i on the output y i , so that the model can focus on the information that the input data has a greater impact on the output and improve the accuracy of the output y i .

Attention-CNN Model.
By introducing attention mechanism into CNN, the network can adaptively allocate weight information for the data of the whole network through the output of last moment and the input of current moment. e key information of the image can be focused on, so as to improve the accuracy and adaptability of the classification.
In this paper, atmospheric particulates are classified by identifying the SEM images. It is necessary to identify the morphology characteristics of particulates in the SEM images. However, SEM images are mainly composed of particulates and filter membrane which is equivalent to the background. If attention mechanism is integrated into CNN to recognize particles in SEM images, attention to features of particle shape can be focused on, and that to the background and other non-particle shape features can be reduced. e identification accuracy of particles can be improved, and then the classification accuracy is improved.

Complexity
Based on this, Attention-CNN model is established by integrating attention mechanism into CNN. e Attention-CNN is used to identify and classify the SEM images of atmospheric particles, and its structure is shown in Figure 5. Attention-CNN model includes input layer, convolution layer, attention layer, full connection layer, and output layer.
Since the input is the SEM image of four types of particles, the input layer is four nodes, that is, the labeled SEM images of four types particles. e convolution layer is set as four layers, in which the convolution kernels of each layer are 8 × 3 × 3, 16 × 5 × 5, 32 × 3 × 3, and 32 × 3 × 3 successively, and attention layer is added after the convolution layer of each layer to realize the weight distribution. Pooling layer is added to the first and last convolution layer. e output of the convolution layer is the input of the full connection layer, and the node number of full connection layer is set as 64. e output layer divides the particles into 4 categories, and the number of output layer node is 4.
In this paper, the TensorFlow model is used to construct the Attention-CNN model. e main parameters of the model are analyzed as follows.

Activation Function.
A nonlinear function can be introduced between the layers of the neural network by setting the activation function, and the output of the node is no longer linear, so it can fit various curves and enhance the expressive ability of the network. In Attention-CNN model, two activation functions, Relu (rectified linear unit) and softmax function, are used in the hidden layer and output layer, respectively.
Relu can solve gradient dispersion in the process of parameter transfer. When Relu function is greater than 0, its derivative is 1. It is simple to find the gradient, which can improve the convergence rate of gradient descent greatly. e expression of Relu function is as follows: Softmax maps the output of multiple neurons to (0, 1) and performs multiple classifications. Assuming there is an input array, i represents the ith element; the softmax value of this element is calculated by where n represents all input elements.

Optimization Strategy.
In the whole training process of deep learning model, it is necessary to use optimization algorithm to iterate the model parameters and reduce the loss function value. Some parameter optimization algorithms have been proposed and applied in related fields [41][42][43][44]. Because the Adam optimization algorithm makes full use of the first-order moment mean and second-order moment mean of the gradient to calculate the adaptive learning rate parameters, and the momentum factor is included in the updating process [45], in this paper, the Adam algorithm is chosen as the optimization strategy in Attention-CNN. e calculation process of Adam is as follows:  Complexity where m t is first-order moment estimate, v t is second-order momentum term, β 1 , β 2 are dynamic values, g t is the gradient of the cost function after t iterations, m t is correction value of first moment, v t is correction value of second moment, W t is the parameters of the t iteration model, and ε is a small number that can avoid the zero denominator.

Loss Function.
Loss function is used in the training of neural network to evaluate the difference between the predicted value and the real value and is a standard to test the performance of the model. In this paper, cross-entropy cost function is selected as loss function in Attention-CNN, which can be expressed as where x is the sample, n is the sum of the samples, a is the output value, and y is the actual value. e gradient of w is calculated as follows: where σ(z) − y is the error between the output and the actual value.

Database.
According to the PM2.5 data collected by three kinds of sampling filter membranes from seven environmental monitoring sites in Qingdao, SEM images of atmospheric particulates were obtained, which contained four types of particulates: fibrous particulates, flocculent particulates, spherical particulates, and mineral particulates. e collected SEM images were transmitted to the computer, and a total of 525 SEM images were obtained. Since the SEM images usually have the information bar of images, which include the shooting time, magnification, scale, and other information of the images, before image processing, it is necessary to cut the SEM images so as to remove the image information bar to avoid affecting the processing of SEM images. By cropping and rotating processing, 3469 single-particle SEM images of four types of atmospheric particles are obtained and the database is set, that is, Qingdao 2016-2018. According to the Attention-  Table 2.

Results and Discussion.
e Attention-CNN model in Section 4.3 is trained and tested based on the SEM images database of four types of atmospheric particles. At the same time, in order to verify the performance of Attention-CNN model, the CNN and SVM which are commonly used in image classification are also applied to the identification and classification of SEM images of atmospheric particles. e structure of CNN is the same as that of the Attention-CNN model, but there is no attention layer. e three models were trained and tested using the same SEM images database. In Attention-CNN and CNN, the Relu function is selected as the activation function of convolution layer, softmax function as the activation function of output layer, Adam optimizer as the network optimization strategy, and the cross-entropy cost function as the loss function, the training time is set as 200 iterations, and the batch size is set as 128.
After Attention-CNN, CNN, and SVM are trained and tested, the corresponding results are obtained. e accuracy and loss curves of training samples and test samples of Attention-CNN are obtained, as shown in Figure 6. Figure 7 shows the classification results of four types of particles obtained by three models.
It can be seen from Figure 6(a) that the classification accuracy of training samples gradually increases and tends to be stable with the number of iterations increasing. e classification accuracy of the test samples increases rapidly at the initial stage of the iteration. After the initial fluctuations, with the number of times increasing, the accuracy of the test samples is close to the accuracy of the training samples and is basically stable. As can be seen from Figure 6(b), similar to the changes in the accuracy of the training samples and the test samples, the loss function of the training samples and test samples decreases rapidly with the number of iterations increasing and finally tends to be stable. e changing of classification accuracy and loss function of Attention-CNN model with the number of iterations increasing shows that the design of Attention-CNN model is reasonable.
As shown in Figure 7, the Attention-CNN model is superior to the other two models in the classification of four Complexity types of atmospheric particles. Classification accuracy of fibrous and flocculent reaches 98.56%, and classification accuracy of mineral is the least, which is 94.33%. e classification accuracy of the four types of particles using Attention-CNN has been improved by 0.77%, 2.59%, 0.68%, and 1.44%, respectively, compared with using CNN and increased by 2.79%, 6.44%, 1.06%, and 7.39%, respectively, compared with using SVM. is shows that the Attention-CNN model can classify the SEM images of atmospheric particles effectively.
In addition, in order to investigate the advantages of Attention-CNN over CNN and SVM in identifying and classifying SEM images of atmospheric particulates, the performances of three models are compared by the confusion matrix, as shown in Figure 8.
According to Figure 8, the following conclusions can be obtained: (1) For fibrous particles, Attention-CNN, CNN, and SVM all have the problem of misclassifying them as    Complexity mineral particles. However, Attention-CNN misclassifying fibrous particulates as mineral particles is in the least number, followed by CNN and SVM, and SVM also has the problem of misclassifying them as flocculent particulates.
(2) For flocculent particulates, Attention-CNN and CNN have classified them into the other three types of particulates mistakenly. SVM misclassifies them as fiber particulates and mineral particulates. But the total number of particulates misclassified using Attention-CNN is the least, followed by CNN and SVM. Among them, the number of mineral particles that have been misclassified is the largest. e misclassified number of Attention-CNN, CNN, and SVM is 6, 24, and 33 in order. It can be seen that the number of flocculent particles that have been misclassified as mineral particles by Attention-CNN is significantly reduced, which indicates the focusing effect of attention mechanism.
(3) For spherical particles, Attention-CNN misclassifies them as mineral particles. CNN misclassifies them as fibers particles and mineral particles. SVM misclassifies them as the other three types of particles. Among them, the number of those misclassified as mineral particles is the largest. e number of spherical particles misclassified as mineral particles by Attention-CNN, CNN, and SVM is 13, 27, and 26, respectively, which also showed the focusing function of the attention mechanism. (4) For the mineral particles, the three models have the problem of misclassifying them as other particles. Attention-CNN and CNN tend to misclassify them into spherical particles, and SVM tends to misclassify them as flocculent particles. However, the total number of misclassified particles using Attention-CNN is the smallest, followed by CNN and SVM.
In summary, it can be obtained that Attention-CNN has a better classification performance than CNN and SVM, especially for fibrous particles, flocculent particles, and spherical particles. Compared with CNN, Attention-CNN significantly reduces the number of particles misclassified after adding attention mechanism to CNN, which indicates the focusing effect of the attention mechanism. Meanwhile, all of the three models have the problem of misclassifying fibrous particles, flocculent particles, and spherical particles as mineral particles, and the number of mineral particles misclassified is the largest compared with other particles, which shows that mineral particles have diverse morphological characteristics and are easily confused with other types of particulates.
In order to further evaluate the classification performance of Attention-CNN, four evaluation indexes, precision, recall, specificity, and F1-score, are used and their calculation formulas are as follows: where TP, FP, TN, and FN are the number of SEM images of PM2.5 classified as true positive, false positive, true negative, and false negative, respectively. F1-score is an evaluation index combining recall and precision, which is used to comprehensively reflect the overall results.
According to the test results of Attention-CNN, the above four evaluation indexes are obtained and compared with the results of CNN and SVM, as shown in Table 3.
From Table 3, it can be found that, in the identification and classification of fibrous particulates, spherical particulates, and flocculent particulates, the four evaluation indexes of Attention-CNN are superior to CNN and SVM. For the identification and classification of mineral particulates, recall and F1-score of Attention-CNN are better than CNN and SVM, and precision and specificity are inferior to CNN but better than SVM. In general, the proposed Attention-CNN model is superior to CNN and

Conclusion
In this paper, samples of PM2.5 are collected and their SEM images are obtained based on scanning electron microscopy. By analyzing the characteristics of SEM images of atmospheric particulates, they are divided into four types of typical particulates: fibrous particulates, flocculent particulates, spherical particulates, and mineral particulates.
e Attention-CNN model for the identification and classification of atmospheric particles SEM images is established by integrating attention mechanism into CNN, which includes input layer, convolution layer, attention layer, full connection layer, and output layer. After Attention-CNN model is trained and tested by the SEM images database of atmospheric particulates, the classification results with high classification accuracy are obtained, which indicates the proposed Attention-CNN model is reasonable to identify and classify atmospheric particles SEM images.
In addition, the Attention-CNN model is compared with CNN and SVM models based on the same database. e results indicate Attention-CNN is superior to CNN and SVM in the classification of four types of atmospheric particles, which has higher classification accuracy and better evaluation indexes. Meanwhile, compared with CNN, Attention-CNN significantly reduces the number of particles misclassified after adding attention mechanism to CNN, which indicates the focusing effect of the attention mechanism.
is study provides a reliable method for the automatic identification and classification of atmospheric particles. However, the morphological characteristics of atmospheric particulates are complex and diverse. is paper only focuses on the identification and classification of four types of particles and the types of particles can be expanded in the future. Moreover, how to introduce a better algorithm to improve the identification and classification accuracy of particulates with morphological diversity using CNN is also the content of further research.

Data Availability
e data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare that they have no conflicts of interest.