Automatic Impervious Surface Area Detection Using Image Texture Analysis and Neural Computing Models with Advanced Optimizers

Up-to-date information regarding impervious surface is valuable for urban planning and management. The objective of this study is to develop neural computing models used for automatic impervious surface area detection at a regional scale. To achieve this task, advanced optimizers of adaptive moment estimation (Adam), a variation of Adam called Adamax, Nesterov-accelerated adaptive moment estimation (Nadam), Adam with decoupled weight decay (AdamW), and a new exponential moving average variant (AMSGrad) are used to train the artificial neural network models employed for impervious surface detection. These advanced optimizers are benchmarked with the conventional gradient descent with momentum (GDM). Remotely sensed images collected from Sentinel-2 satellite for the study area of Da Nang city (Vietnam) are used to construct and verify the proposed approach. Moreover, texture descriptors including statistical measurements of color channels and binary gradient contour are employed to extract useful features for the neural computing model-based pattern recognition. Experimental result supported by statistical test points out that the Nadam optimizer-based neural computing model has achieved the most desired predictive accuracy for the data collected in the studied region with classification accuracy rate of 97.331%, precision = 0.961, recall = 0.984, negative predictive value = 0.985, and F1 score = 0.972. Therefore, the model developed in this study can be a helpful tool for decision-makers in the task of urban land-use planning and management.


Introduction
Urban impervious surface, developed by anthropogenic activities, is one of the most crucial land cover forms. e impenetrable surface areas consist of buildings, roads, parking lots, sidewalks, pavements, and many others. ese surfaces prevent the absorption of water into the soil. Previous works have pointed out the impact of impervious surface areas on water quality and the frequency/intensity of downstream runoff [1][2][3][4][5]. erefore, they have been identified as a key indicator used in evaluating urbanization influences on surrounding natural environment and ecosystem [6].
Due to such reasons, up-to-date information regarding impervious surface is of paramount importance for supporting urban land management/planning, detection of unplanned built-up areas, study of regional land-use pattern, and ecosystem monitoring [5,[7][8][9][10]. In developing countries including Vietnam, the conventional approach for obtaining such information is field survey. Nevertheless, this approach is very time-consuming and requires considerable effort in data collection, processing, and storing. erefore, a quick and cost-effective method for substituting this conventional approach is a practical need for municipal land-use managers.
Following this trend of research, scholars and practitioners have increasingly relied on remote sensing and geographic information system (GIS) technologies to improve the productivity and accuracy of the impervious surface detection task [24,25]. ese technologies have been proven to be viable tools for surveying urban landscapes which are rapidly changing and providing timely information regarding urban growth [26][27][28][29]. Based on remotely sensed images, statistical and machine learning models can be constructed for automatic impervious surface extraction [8].
Lo [30] developed a computer-based model for analyzing remote sensing data obtained from Landsat image; this model only relied on spectral information of image pixels to derive land form categories. Zha et al. [29] performed builtup areas mapping with the utilization of normalized difference vegetation index and normalized difference built-up index; the proposed model analyzed Landsat ematic Mapper images and achieved an accuracy of 92.6%. A multivariate statistical analysis approach has been put forward in [28] for characterizing urban growth; this approach could reduce the modeling error to less than 10%. Yang et al. [5] employed a combination of Landsat ETM+ and highresolution imagery to construct a decision tree-based impervious surface mapping. Multilayer perceptron neural network and support vector machine have been used in [6] to classify image samples obtained from Landsat-5 TM Imager.
Zhang et al. [31] integrated spectral information and multivariate texture to extract numerical features from remotely sensed image; the one-class support vector machine is then used for pattern classification. Zhang et al. [32] investigated the capability of random forest approach for impervious surface estimation with a combined utilization of synthetic aperture radar and optical remote sensing images. A backpropagation neural network has been constructed by Patel and Mukherjee [33] to extract the impervious features using Landsat ematic Mapper data. Son et al. [34] introduced an impervious surface fraction algorithm (ISFA) for automatic impervious surface extraction; this algorithm is applied with Landsat data and attains an accuracy of 92.8%. Gupta et al. [7] compared the performances of supervised maximum likelihood algorithms, index-based classification, and neural classification and points out that the neural classification model achieves the most desired outcome.
It can be seen from the literature that most of the previous works have employed the medium resolution opensource image dataset such as Landsat ematic Mapper to extract impervious surface [35]. Because of the complex texture of urban landscape, these coarse resolution images feature certain limitations on impervious surface mapping. Xu et al. [36] investigated the use of the 10 m resolution Sentinel-2A dataset for impervious area extraction and pointed out the superiority of high-resolution data over the conventional 30 m resolution Landsat dataset. Misra et al. [8] attempted to employ high-resolution image obtained from Sentinel-2 to improve the quality of impervious surface detection result; the authors rely on spectral angle mapper, support vector machine, and neural network to carry out pattern recognition task.
Among the machine learning approaches employed in remote sensing and GIS field, neural computing models have been extensively employed and remain effective tools for recognizing patterns in remotely sensed images [37][38][39][40][41][42][43][44]. It is because neural computational models with their capability of universal function approximator are capable of learning and recognizing complex patterns [45]. Nevertheless, the employed neural computing approaches have mainly relied on the conventional gradient descent for model training [7,8,37,46]. Although this conventional training method can help to attain acceptable results in many application cases, it also suffers from slow convergence rate and trapping in local optimal [47]. ese facts definitely reduce the generalization and accuracy of prediction models constructed by neural computational approaches. erefore, there is a pressing need to investigate and apply advanced training algorithms to mitigate the disadvantages of the conventional gradient descent.
In recent years, various advanced gradient-based optimization algorithms have been proposed and used for training neural computing models. However, few research works have investigated these state-of-the-art algorithms in constructing neural computing models used for remote sensing-based impervious surface detection. erefore, this study is an attempt to fill this gap in the current literature. e advanced optimizers of adaptive moment estimation (Adam) [48], a variation of Adam called Adamax [48], Nesterov-accelerated adaptive moment estimation (Nadam) [49], Adam with decoupled weight decay (AdamW) [50], and a new exponential moving average variant (AMSGrad) [51] are employed for automatic impervious surface extraction.
In addition, Da Nang city (Vietnam) has been selected as the study area. Image texture analysis technique including statistical measurements of color channels [52] and binary gradient contour [53] are used to extract useful features from remotely sensed images obtained from Sentinel-2 satellite. e extracted features are then employed by neural computing models for automatic impervious surface detection in the study area. erefore, one major contribution of the current study is to establish an advanced hybridization of machine learning and image processing used for constructing an impervious surface map for the study area of Da Nang city. e subsequent sections of the article are organized as follows: the research methodology is reviewed in the second section. e proposed neural computing model trained by the aforementioned advanced optimizers used for impervious surface area detection is presented in the next section, followed by the fourth section which reports experimental results. Several concluding remarks on the current study are stated in the final section.

General Description of the Study Area.
Da Nang city is located in the Central Vietnam (refer to Figure 1). Its latitude is between 15°15′20″N and 16°14′10″N; its longitude is from 107°18′30″E to 108°20′00″E [54]. It is a port city located on the coast of the East Sea. In 2015, Da Nang had a population of 1,046,876 and an area of 1,285.4 km 2 [55]. is city is divided into 8 districts: 6 urban districts and 2 rural districts [56].
Da Nang is ranked as the fourth largest city by population in Vietnam and serves as an economic base in the service and industrial sectors in Central Vietnam. Due to such reasons, the population of this city is rising rapidly from approximately 673,000 in 1997 to about 1 million in 2014 [57].
is population growth leads to a significant urban expansion. erefore, Da Nang city is selected as the study area in this article.

e Image Data Used.
e image data obtained from the Sentinel-2 on March 13, 2020, is used in this study to perform impervious surface extraction. e bands of 4 (red), 3 (green), and 2 (blue) with spatial resolution of 10 m are selected to compose the image of the study area. e size of each image file (i.e., bands 2, 3, and 4) is 235,484 KB. A full-scene map of Da Nang city (5559 × 3444 pixels) is presented in Figure 2. It is noted that these Sentinel-2's bands have been opened in Sentinel Application Platform (SNAP) software package [58]. e original Sentinel-2's bands obtained from USGS [59] are converted to TIF format using the geometric operation of resampling supported by the SNAP software package. For more details of the SNAP software documentation, readers are guided to articles provided in [60]. Moreover, the used map projection of the obtained images is Universal Transverse Mercator (UTM) within Zone 48N-Datum World Geodetic System (WGS) 84. Based on the original composed image, the contrast enhancement technique of histogram equalization (refer to Figure 3) is employed to create a better image for subsequent analysis. e purpose of histogram equalization is to construct an image with equally distributed brightness levels [61]. is image processing technique meliorates the global contrast of the original image and highlights the image texture. In addition, to facilitate the process of impervious surface detection, pixels at mountainous regions covered by cloud and large beach areas are cast out by masking operation [62]. In addition, the normalized difference vegetation index (NDVI) [63] is computed to remove large water bodies from the study area. Via experimentation with the collected image, pixels belonging to large water bodies are associated with negative NDVI values and can be effectively excluded.
e NDVI computation requires the band 4 (red) and band 8 (near-infrared band); it is obtained via the following equation: Computational Intelligence and Neuroscience 3 where NIR and B4 denote a near-infrared band and band 4, respectively.

Statistical Measurements of Image Bands.
For the purpose of impervious surface detection, the statistical measurements of bands 4 (red), 3 (green), and 2 (blue) are employed in this study. It is noted that this research performs impervious surface detection for each image patch of 10 × 10 pixels. us, to derive statistical measurements, the first-order histogram of an image patch S denoted as P (I) is computed as follows [52,64]: where b denotes a band index, N I,b is the number of pixels having the value of I, and PN denotes the number of pixels within an image patch.

Binary Gradient Contour (BGC) for Texture
Discrimination. Texture is a crucial tool for visual perception in computer vision. Since image patches of impervious surface and pervious surface can have different properties of coarseness, roughness, directionality, contrast, and regularity, using texture analysis can be helpful to delineate them. is study employs the BGC technique [53] for the task of texture discrimination. e BGC combines the analyses of local structures and occurrences to derive texture features. Notably, this method has the advantages of low computational expense and invariant to monotonic illumination changes [65]. It also possesses good discriminative powers demonstrated in previous studies [53,66].
Essentially, the BGC carries out a pairwise comparison of adjacent pixels located in one or more closed paths along the periphery of a neighborhood of the size 3 × 3 pixels [65]. Fernandez et al. [53] put forward three versions of BGC which are the single-loop (BGC 1 ), double-loop (BGC 2 ), and triple-loop (BGC 3 ) descriptors. To ease the description of these descriptors, a square image patch S m,n is denoted as follows: where I m,n denotes the central pixel.
Based on the definition of S, the single-, double-, and triple-loop BGC formulas are expressed as follows [53]:

Artificial Neural Network (ANN) for Pattern Classification.
A general structure of an ANN for two-class pattern recognition tasks is presented in Figure 4. is model typically contains an input layer, a hidden layer, and an output layer [67]. In this study, the input layer receives signals in the form of texture information. e hidden layer consists of M neurons which process the texture information to yield the input of the softmax activation function. e usual activation function used by the neurons in the hidden layer is the log-sigmoid function. e softmax activation function converts its input signals to class probabilities within the range of 0 and 1. It is noted that interactions among neurons are expressed by connection weights. ese sophisticated interactions permit the overall neural computing model to learn and infer complex mapping relationships [68]. e knowledge learnt by a neural computing model is stored in matrices of connection weights. Herein, W L0L1 denotes the matrix of connection weights between the input and hidden layer. W L1L2 represents that between the hidden and the softmax layer. Let X ∈ R D denote the matrix of input feature. e computation process of an ANN model can be compactly defined as follows: where b 0 and b 1 denote two bias vectors of the input and hidden layers, respectively, f A represents the activation function, and δ represents the softmax activation function. e softmax activation function is given by where CN � 2 denotes the number of output classes.  [69,70] with minibatch mode [71]. For the task of data classification, the cross entropy cost function is often used as the objective function (E) for training a neural computing model [72]. e crossentropy function is given by

e Employed
where N d denotes the number of data samples; T and Y represent the actual and predicted class labels, respectively.

e Network's Optimizers
Gradient Descent with Momentum (GDM). e conventional method of gradient descent with momentum (GDM) is widely employed for training neural networks and is used as the benchmark method in this study. Via the GDM, the weights of a neural computing model are adapted as follows: where w t and w t+1 are the previous and updated network weights, E represents the objective function, and α L and λ M are the learning rate and the momentum term, respectively. Adaptive Moment Estimation (Adam). e Adam, introduced by Yoshua and Yann [48], can be considered as a general algorithm for first-order gradient-based optimization of stochastic objective functions. One notable advantage of this optimizer is that it is capable of adaptively fine-tuning the learning rate parameter during the training process. e Adam relies on information obtained from the average of the second moments of the gradients. is optimizer also utilizes an exponentially decaying average of past gradients. In addition, this optimizer requires an initial setting of three hyperparameters: the step size α and the two exponential decay rates (β 1 � 0.9 and β 2 � 0.9999). When the gradient of model parameters is computed, the optimized parameters of a neural computing model are adapted via [48] where m t and v t denote the bias-corrected first moment estimate and the bias-corrected second raw moment estimate, respectively. Adamax. e Adamax [48] is a variant of the Adam in which the update rule for model weights is to scale their gradients inversely proportional to a L p norm of their current and previous gradients. e neural network's weights are updated as follows: where u t � 0 at t � 0; u t represents the biased second raw moment estimate. Nesterov-Accelerated Adaptive Moment Estimation (Nadam). e Nadam optimizer, described in [49], attempts to incorporate Nesterov-accelerated adaptive moment estimation into the Adam. e major advantage of this integrated approach is that the employed adaptive moment estimation helps to perform highly accurate step in the gradient direction via updates of model parameters with the momentum step before the computation of the gradient [73]. e update rule of the Nadam is stated as follows [49,73]: where ...
Output layer Hidden layer Softmax Input layer Figure 4: General structure of the employed ANN model used for two-class pattern recognition. 6 Computational Intelligence and Neuroscience Adam with Decoupled Weight Decay (AdamW). e AdamW [50] optimizer integrates weight decay into the original Adam. e weight decay is a widely used approach for regularizing the network weights. It is because large weights may lead to an overfitted model. Accordingly, the update rule of the AdamW algorithm is given by where λ denotes a hyperparameter. A New Exponential Moving Average Variant (AMSGrad). e AMSGrad optimizer [51] attempts to improve the convergence of the Adam optimizer by the employment of long-term memory of past gradient. To avoid poor convergence and trapping in local optima, Reddi et al. [51] argues that the maximum of past squared gradients v t should be used for parameter update instead of the exponential average employed by the Adam optimizer. e following equation is used to update the neural network's parameter: where v t denotes the updated bias-corrected 2 nd raw moment estimate.

The Proposed Neural Computing Model with Advanced Optimizers for Automatic Impervious Surface Detection
is section of the article presents the general description of the proposed neural computational method employed for automatic impervious surface detection. e proposed model is an integration of image texture analysis, neural network-based pattern recognition, and advanced optimizers used for neural network training. An overview of the data processing and the training phase of the proposed neural computing model used for impervious surface detection is demonstrated in Algorithm 1. e general structure of the newly developed model is presented in Figure 5. It is noted that the proposed neural computational model used for impervious surface detection has been developed in Visual C#.NET environment (Framework 4.6.2) and performed with the ASUS FX705GE-EW165T (Core i7 8750H, 8 GB Ram, 256 GB solid-state drive). e model operation can be divided into four steps: (i) Data preprocessing (ii) Image data sampling (iii) Image texture computation (iv) Neural computing model training and prediction

Data Preprocessing.
In this step, the original Sentinel-2's bands are opened in the SNAP software package and converted to TIFF format. e image process technique of histogram equalization is employed to enhance the contrast of the original image. As mentioned earlier, the NDVI is also calculated using the obtained bands to cast out large water bodies from the study area.

Image Data Sampling.
To establish the neural network model for automatic impervious surface area detection, it is required to prepare a training dataset with assigned ground truth labels. is study has sampled pervious and impervious areas within the map of the study area (refer to Figure 6). Each sample with the size of 100 × 100 pixels is used to create nonoverlapped image patches with the size of 10 × 10 pixels. In total, there are 3,000 image patches that are generated from image samples. To ensure a balanced dataset, the numbers of the negative (pervious surface) and positive (impervious surface) samples are both 1,500. Based on these image patches, the image texture computation methods can be carried out to extract useful feature for the pattern recognition phase.

Image Texture Computation.
Using image samples generated from the previous step, the texture analysis methods using statistical measurements of color channels and the BGC can be performed. e texture computation process converts image samples of the negative (pervious surface) and the positive (impervious surface) classes into numerical features. ese numerical features are subsequently used for the task of pattern recognition performed by the neural computing models. e statistical measurements of the three color channels include the mean, standard deviation, skewness, kurtosis, entropy, and range indices. Since the number of the employed bands obtained from the Sentinel-2 is 3, there are 6 × 3 � 18 features attained from statistical measurements of color channels (refer to Figure 7).
In addition, the BGC is performed with the three versions of single-loop (BGC 1 ), double-loop (BGC 2 ), and tripleloop (BGC 3 ). Each of them produces a histogram which describes the texture information of image samples. is study computes the measurements of mean, standard deviation, skewness, kurtosis, and entropy from each histogram. Hence, the BGC texture descriptors yield 5 × 3 � 15 features (refer to Figure 8).
us, the total number of features extracted from the used texture descriptors is 33. Moreover, to facilitate the data classification based on the employed neural computing model, the texture-based features have been preprocessed by the Z-score data normalization. e Z-score equation is given by  8 Computational Intelligence and Neuroscience where X o and X ZN denote the original and the standardized feature, respectively; m X and s X represent the mean and the standard deviation of the original feature, respectively.

Neural Computing Model Training and Prediction.
As stated earlier, a dataset including 3,000 instances and 33 features has been prepared to train and verify the neural computing approach used for impervious surface detection. Each instance of the dataset has the class label of either pervious (denoted as 0) or impervious (denoted as 1). Each data record contains texture characteristic of an image region within the map of the study area. As mentioned earlier, the statistical measurements of color channels and statistical measurements of the three BGC variants (BGC 1 , BGC 2 , and BGC 3 ) are used as texture descriptors. e neural computing model is used to generalize a decision boundary that can distinguish data instances of the two categories of pervious and impervious surfaces. Accordingly, the original dataset has been randomly split into two mutual exclusive sets: a training set (70%) and a testing set (30%). e first set is used for model construction. e latter set is used to evaluate the model's predictive capability. It is noted that the neural computing models in this study are trained with the mini-batch mode [74]. Accordingly, the training data are split  Computational Intelligence and Neuroscience into small batches and these batches are used to calculate the model error and the gradients of the neural computing models' parameters. e training process of the neural computing model aims at adapting the two matrices of W L0L1 and W L1L2 that specify the model structure.
e size of the first matrix, which is the connection weight between the input and the hidden layer, is M x (N I + 1) matrix where M and N I represent the number of neurons in the hidden layer and the number of input features, respectively. Herein, N I � 33 which is equal to the number of features extracted from the employed texture descriptors.
It is noted that the selection of the tuning parameters of the neural computing model in this study is based on recommendation of previous works and experimental trials using the collected dataset. Based on the suggestion of Heaton [75] and Tien Bui et al. [44], the number of neurons in the hidden layer in this study is set to be (2/3)N I + N O , where N O � 2 denotes the number of the output classes. In addition, the log-sigmoid is chosen as the activation function since it is commonly used for constructing shallow neural network models used for pattern classification [76,77]. e softmax activation function is employed in the final layer to yield class probabilities within the range of 0 and 1 [67,78]. Moreover, the number of training epochs is also required to be set appropriately. It is worth noticing that this tuning parameter may strongly affect the training outcome. An insufficient number of epochs can result in an underfitted model. Meanwhile, an excessive number of epochs may lead to an overfitted model. In this study, via several trial-and-error experiments with the collected dataset, the suitable number of training epoch is found to be 100.
When the number of neurons in the hidden layer is determined, the size of the matrices that contain connection weights can be specified. Herein, the size of the W L1L2 , which stores connection weights between the hidden and output layer is N O x (M + 1). us, the total number of variables needed to be identified by the employed optimizers is N R x N I + N O x N R + 2. In this study, the optimizers of GDM, Adam, Adamax, Nadam,

Experimental Results
As stated earlier, to train and verify the neural computing model used for impervious surface area detection, the extracted dataset has been divided into two sets of training (70%) and testing (30%) datasets. In addition, to alleviate the undesired effect of randomness on data sampling and to accurately assess the generalization capability of the newly developed model, the training/testing data sampling processes have been performed 20 times. In each time, 30% of the dataset, which corresponds to 900 instances, is randomly drawn out to form the testing dataset. e rest of the dataset including 2100 instances is used for model training. Based on the model configuration in the previous section, the employed neural computing model is an artificial neural network consisting of 33 neurons in the hidden layer. e log-sigmoid is used as the activation function in the hidden layer. In the output layer, the softmax function is utilized to derive the probability of the two class labels of impervious surface and pervious surface.
In addition, to evaluate the prediction results of the employed neural computing models, classification accuracy rate (CAR), precision, recall, negative predictive value (NPV), and F1 score are calculated as follows [79]: where TP, TN, FP, and FN represent true-positive, truenegative, false-positive, and false-negative instances, respectively. e outcomes of the artificial neural network models optimized by the used optimizers obtained from the training and testing phases are reported in Tables 1 and 2. It is observable that the neural computing models optimized by the Nadam have achieved the most desired performance with CAR � 97.331%, precision � 0.961, recall � 0.984, NPV � 0.985, and F1 score � 0.972. As can be seen from Tables 1 and 2, the prediction performances obtained from the training phase (CAR � 97.967%) and testing phase (CAR � 97.311%) of the Nadam-based model are relatively close to each other. is fact indicates that the Nadam-optimized neural computing model used for impervious surface detection does not suffer from overfitting issue. e experimental results also demonstrate that the selected number of training epochs is reasonable and help to prevent both overfitting and underfitting. e Adam optimizer is the second best approach (CAR � 97.050%), followed by the AdamW (CAR � 97.028%), Adamax (CAR � 96.572%), AMSGrad (CAR � 96.556%), and GDM (CAR � 93.389%). e model result comparison is also graphically presented by Figures 9 and 10.    Figure 11: Results of the Wilcoxon signed-rank test: (a) results of p values and (b) test outcomes. Note. e symbols ++, +, --, and -denote a significant win, a win, a significant loss, and a loss.

Computational Intelligence and Neuroscience
Original map Impervious cell detection Impervious surface map Figure 13: Demonstrations of the model classification outcomes with small-scale maps. Note. A red cell and a white cell denote impervious and pervious areas, respectively.
Computational Intelligence and Neuroscience 13 Furthermore, to confirm the statistical difference of each pair of the neural computing models used for impervious surface detection, the Wilcoxon signed-rank test with significance level (p value) � 0.05 is used. e test results are provided in Figure 11. Observed from the test outcomes, all of the advanced optimizers including Adam, Adamax, Nadam, AdamW, and AMSGrad significantly outperformed the conventional GDM. e Nadam as the best approach achieves three significant wins and two wins. Notably, the benchmark method of GDM gets five significant losses. e average convergence records of all the employed optimizers are also provided in Figure 12.
e experimental results have demonstrated the superiority of the Nadam optimizer in constructing the neural computing model-based impervious surface detection for the study area. e outstanding performance of the Nadam algorithm can be explained by the fact that this advanced optimizer is a combination of the powerful Adam and Nesterov-accelerated gradient (NAG) approaches. e Adam optimizer has a significant advantage of computing adaptive learning rates for each parameter of the neural computing model [73]. Moreover, since Nesterov momentum provides a correction factor to the standard method of momentum, the NAG often results in good training performance [74].
e Nadam algorithm harnesses the advantages of the Adam and NAG approaches. erefore, this optimizer has achieved the most desired performance for the collected dataset.
Since the Nadam-optimized neural computing model, denoted as Nadam-NCM, has achieved an outstanding accuracy of 97.311%, this model can be employed to accomplish the objective of impervious surface mapping in a reliable manner. Figure 13 demonstrates the application of the Nadam-based model in detecting impervious surface for small-scaled maps. e impervious surface map of the study area is provided in Figure 14. Based on the classification result, the impervious surface areas account for roughly 18.25% of the study area.

Concluding Remarks
Up-to-date information regarding the impervious surface areas is crucial for the task of land-use planning, monitoring, and management. is study investigates the employment of neural computing models trained by the advanced optimizers used for automatic impervious surface area detection. e conventional GDM algorithm and the advanced optimizers of the Adam, Adamax, Nadam, AdamW, and AMSGrad are employed to train the neural computing models used for the pattern recognition task of interest. Experimental results supported by the Wilcoxon signed-rank test points out that the Nadam-optimized neural computing model has achieved the most desired predictive accuracy with CAR � 97.311%. erefore, this model can potentially serve as an effective tool for extracting built-up impervious surfaces at regional scale. Future extensions of the current work may include the following: 14 Computational Intelligence and Neuroscience accuracy of the impervious surface detection problem (iv) Incorporating state-of-the-art regularization techniques (e.g., dropout regularization) into the training process of the neural computing models (v) Investigating the possibility of using metaheuristic algorithms to meliorate the model training performance (vi) e employment of other advanced texture descriptors for improving the classification accuracy rate (vii) Incorporation of statistical and metaheuristicbased feature selection methods into the current model to further enhance the prediction accuracy (viii) Investigation of capabilities of other advanced machine learning models (e.g., deep learning and Markov models) for impervious surface extraction (ix) Developing intelligent models for predicting time series of remotely sensed impervious surface data with other advanced neural computing models including recurrent neural network and long short-term memory.

Data Availability
e dataset used to support the findings of this study has been deposited in the repository of GitHub (https://github. com/NhatDucHoang/NCM_ISD_DaNang).

Conflicts of Interest
e author confirms that there are no conflicts of interest regarding the publication of this article.