Mathematical Problems in Engineering (MPE)

ISSN 1563-5147, 1024-123X

Hindawi Publishing Corporation

Article ID 974638

doi:10.1155/2012/974638

Research Article

Landslide Susceptibility Assessment in Vietnam Using Support Vector Machines, Decision Tree, and Naïve Bayes Models

Dieu Tien Bui^{1, 2}, Biswajeet Pradhan³, Owe Lofman¹, and Inge Revhaug¹

Academic Editor: Wei-Chiang Hong

¹ Department of Mathematical Sciences and Technology, Norwegian University of Life Sciences, P.O. Box 5003IMT, 1432 Aas, Norway (umb.no)

² Faculty of Surveying and Mapping, Hanoi University of Mining and Geology, Dong Ngac, Tu Liem, Hanoi, Vietnam (humg.edu.vn)

³ Department of Civil Engineering, Spatial and Numerical Modelling Research Group, Faculty of Engineering, Universiti Putra Malaysia, Selangor, 43400 Serdang, Malaysia (upm.edu.my)

2012

17 7 2012

2012 01 04 2012 24 04 2012

2012

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

The objective of this study is to investigate and compare the results of three data mining approaches, the support vector machines (SVM), decision tree (DT), and Naïve Bayes (NB) models for spatial prediction of landslide hazards in the Hoa Binh province (Vietnam). First, a landslide inventory map showing the locations of 118 landslides was constructed from various sources. The landslide inventory was then randomly partitioned into 70% for training the models and 30% for the model validation. Second, ten landslide conditioning factors were selected (i.e., slope angle, slope aspect, relief amplitude, lithology, soil type, land use, distance to roads, distance to rivers, distance to faults, and rainfall). Using these factors, landslide susceptibility indexes were calculated using SVM, DT, and NB models. Finally, landslide locations that were not used in the training phase were used to validate and compare the landslide susceptibility maps. The validation results show that the models derived using SVM have the highest prediction capability. The model derived using DT has the lowest prediction capability. Compared to the logistic regression model, the prediction capability of the SVM models is slightly better. The prediction capability of the DT and NB models is lower.

1. Introduction

Vietnam is identified as a country that is particularly vulnerable to some of the worst manifestations of climate change such as sea level rise, flooding, and landslides. In recent years, together with flooding, landslides have been widespread and recurrent in the northwest mountainous areas of Vietnam and have caused substantial economic losses and property damage. Landslides usually occur during heavy rainfall in the rainy season from May to October every year. In particular, in the Hoa Binh province during the rainy seasons of 2006 and 2007, large landslides occurred frequently due to heavy rainfall. Most of these landslides occurred on cut slopes and alongside roads in mountainous areas. Landslide disasters can be reduced through an understanding of landslide mechanisms, prediction, hazard assessment, early warning, and risk management [1]. Therefore, studying landslides and determining measures to mitigate losses are urgent tasks. However, studies on landslides in Vietnam are still limited to a few case studies [2–5]. Through scientific analyses of these landslides, we can assess and predict landslide-prone areas, offering potential measures to decrease landslide damage [6, 7].

Preparation of a spatial prediction map of landslide hazard is considered the first important step for landslide hazard mitigation and management [8]. The spatial probability of landslide hazards can be expressed as the probability of spatial occurrence of slope failures under a set of geoenvironmental conditions [9]. However, due to the complex nature of landslides, producing a reliable spatial prediction of landslide hazard is not easy. For this reason, various approaches have been proposed in the literature. Reviews of these approaches have been carried out by Guzzetti et al. [10], Wang et al. [11], and Chacón et al. [12]. In recent years, some soft computing approaches have been applied for landslide hazard evaluation, including fuzzy logic [7, 13–20], neuro-fuzzy [3, 15, 21, 22], and artificial neural networks [6, 23–29]. In general, the quality of landslide susceptibility models is affected by the methods used [30]. For this reason, comparisons of those methods with the conventional methods have been carried out using different datasets. Some researchers found that soft computing methods outperform the conventional methods [31–35]; however, other authors found no differences in overall predictive performance [36]. In general, soft computing approaches improve the maps of landslide hazard areas both qualitatively and quantitatively, and the spatial results are appealing [37].

In more recent years, data mining approaches such as SVM, DT, and NB have been considered for landslide studies [38, 39]. They belong to the top 10 data mining algorithms identified by the IEEE [40]. In the case of SVM, the main advantage of this method is that it can use large input data with fast learning capacity. This method is well suited to nonlinear high-dimensional data modeling problems and provides promising perspectives in landslide susceptibility mapping [41]. Micheletti et al. [42] stated that SVM methods can be used for landslide studies because of their ability to deal with high-dimensional spaces effectively and with high classification performance. In the case of DT, according to Yeon et al. [43], the probability of observations that belong to the landslide class can be used to estimate indexes of susceptibility. Saito et al. [44] used a decision tree model for landslide susceptibility mapping in the Akaishi Mountains (Japan) and stated that the decision tree model has appropriate accuracy for estimating the probabilities of future landslides. Nefeslioglu et al. [45] applied a DT in the metropolitan area of Istanbul (Turkey) and obtained a landslide model with good prediction accuracy. Yeon et al. [43] concluded that DT can be used efficiently for landslide susceptibility mapping. In the case of NB, although the method has been successfully applied in many domains [46], its application in landslide susceptibility assessment may still be limited. NB is a popular and fast supervised learning algorithm for data mining applications based on the Bayes theorem. The main advantage of NB is that it can process a large number of variables, both discrete and continuous [47]. NB is suitable for large-scale prediction of complex and incomplete data [48]. The main potential drawback of this method is that it requires independence of attributes. However, this method is considered to be relatively robust [49].

The main objective of this study is to investigate and compare the results of three data mining approaches, that is, SVM, DT, and NB, for the spatial prediction of landslide hazards in the Hoa Binh province (Vietnam). The main difference between this study and the aforementioned works is that SVM with two kernel functions (radial basis and polynomial kernels) and NB were applied for landslide susceptibility modeling. To assess these methods, the susceptibility maps obtained from the three data mining approaches were compared to those obtained by the logistic regression model reported by the same authors [2]. The computation process was carried out using MATLAB 7.11 and LIBSVM [50] for SVM and WEKA ver. 3.6.6 (The University of Waikato, 2011) for DT and NB.

2. Study Area and Data Used

2.1. Study Area

Hoa Binh has an area of about 4,660 km² and is located between the longitudes 104°48′E and 105°50′E and the latitudes 20°17′N and 21°08′N in the northwest mountainous area of Vietnam (Figure 1). The province is hilly with elevations ranging between 0 and 1,510 m, with an average value of 315 m and a standard deviation of 271.5 m. The terrain gradient computed from a digital elevation model (DEM) with a spatial resolution of 20×20 m is in the range from 0° to 60°, with a mean value of 13.8° and a standard deviation of 10.4°.

Figure 1

Landslide inventory map of the study area.

There are more than 38 geologic formations that have cropped out in the province (Figure 2). Six geological formations, Dong Giao, Tan Lac, Vien Nam, Song Boi, Suoi Bang, and Ben Khe, cover about 72.8% of the total area. The main lithologies are limestone, conglomerate, aphyric basalt, sandstone, silty sandstone, and black clay shale. The ages of rocks vary from the Paleozoic to Cenozoic with different physical properties and chemical composition. Five major fracture zones pass through the province causing rock mass weakness: Hoa Binh, Da Bac, Muong La-Cho Bo, Son La-Bim Son, and Song Da.

Figure 2

Geologic map of the study area.

The soil types are mainly ferralic acrisols, humic acrisols, rhodic ferralsols, and eutric fluvisols that account for 80% of the total study area. Land use is comprised of approximately 7.5% populated areas, 14.5% agricultural land, 52.6% forest land, 21% barren land and nontree rocky mountain, 0.4% grassland, and 4% water surface.

The study area receives heavy, high-intensity rainfall, especially during tropical rainstorms, with an average annual precipitation varying from 1353 to 1857 mm (data shown for the period 1973–2002). Precipitation is most abundant from May to October, a period that accounts for 84–90% of the annual precipitation. Rainfall usually peaks in August and September, with an average of around 300 to 400 mm per month. The climate is typical of the monsoonal region: hot, rainy, and highly humid. January is usually the coldest month with an average temperature of 14.9°C, whereas the warmest month is July with an average temperature of 26.7°C.

Landslides occurred mostly in the rainy season when heavy rains exceeded 100 mm per day and continued for three days. Landslides also occurred when rainfall continued for five to seven days with rainfall larger than 100 mm on the last day. For example, landslides occurred in the Doc Cun and Doi Thai areas in September 2000 when the 7-day accumulated rainfalls were 308 and 383 mm, respectively. Many landslides occurred on 5 October 2007 in the Thung Khe, Toan Son, Phuc San, Tan Mai, Doc Cun, and surrounding areas, with 3-day accumulated rainfalls ranging from 334 to 529 mm.

2.2. Data

Landslides are assumed to occur in the future under the same conditions as for the past and current landslides [10]. Therefore, a landslide inventory map has been considered to be the most important factor for prediction of future landslides. The landslide inventory map portrays the spatial distribution of a single landslide event (a single trigger) or multiple landslide events over time (historical) [51]. For the study area, the landslide inventory map (Figure 1) constructed by Tien Bui et al. [2] was used to analyze the relationships between landslide occurrence and landslide conditioning factors. The map shows 118 landslides that occurred during the last ten years, including 97 landslide polygons and 21 rock fall locations. The size of the largest landslide is 3,440 m², the smallest is 380 m², and the average landslide size is 3,440 m².

Based on previous research carried out by Tien Bui et al. [2], ten landslide conditioning factors are selected to build landslide models and to predict spatial distribution of the landslides in this study. They are slope angle, slope aspect, relief amplitude, lithology, soil type, land use, distance to roads, distance to rivers, distance to faults, and rainfall.

The slope angle, slope aspect, and relief amplitude were extracted from a DEM that was generated from national topographic maps at the scale of 1 : 25,000. The slope angle map was constructed with 6 categories (Figure 3(a)). The slope aspect map was constructed with nine classes: flat, north, northeast, east, southeast, south, southwest, west, and northwest. The relief amplitude, which represents the maximum difference in height per unit area [52], was constructed with 6 categories: 0–50 m, 50–100 m, 100–150 m, 150–200 m, 200–250 m, and 250–532 m. For the construction of the relief amplitude map, different sizes of the unit area were tested to choose the best one (20×20 pixels) using the focal statistic module in the ArcGIS 10 software.

Figure 3

Landslide conditioning factor maps: (a) slope, (b) lithology, (c) soil type, and (d) land use.

The lithology and faults were extracted from four tiles of the Geological and Mineral Resources Map of Vietnam at the scale of 1 : 200,000. This is the only geological map available for the study area. The lithology map (Figure 3(b)) was constructed with seven groups based on clay composition, degree of weathering, estimated strength, and density [53, 54]. The distance-to-faults map was constructed by buffering the fault lines with 5 categories as: 0–200 m, 200–400 m, 400–700 m, 700–1,000 m, and >1,000 m. The soil type map (Figure 3(c)) was constructed with 13 categories. The land-use map (Figure 3(d)) was constructed with twelve categories.

A road network that undercuts slopes was extracted from the topographic map at the scale of 1 : 50,000. A distance-to-roads map was constructed with 4 categories: 0–40 m, 40–80 m, 80–120 m, and >120 m. A hydrological network that undercuts slopes was also extracted from the topographic map at the scale of 1 : 50,000. A distance-to-rivers map was then constructed with 4 categories: 0–40 m, 40–80 m, 80–120 m, and >120 m.

The rainfall map was prepared using the value of maximum rainfall of eight days (seven rainfall days plus a last day of rainfall larger than 100 mm) for the period from 1990 to 2010, using the Inverse Distance Weighted (IDW) method. The precipitation data were extracted from a database of the Institute of Meteorology and Hydrology in Vietnam.
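
As a rough illustration of IDW interpolation, the following minimal sketch estimates a value at an unsampled point from surrounding gauges. The station coordinates, the rainfall values, and the power of 2 are hypothetical assumptions; the text does not state the settings that were actually used.

```python
import math

def idw(x, y, stations, power=2.0):
    """Inverse-distance-weighted estimate at (x, y).

    stations: list of (station_x, station_y, value) tuples.
    power: distance exponent (2.0 is an assumed, commonly used default).
    """
    numerator = 0.0
    denominator = 0.0
    for sx, sy, value in stations:
        dist = math.hypot(x - sx, y - sy)
        if dist == 0.0:
            # The query point coincides with a station: return its value.
            return value
        weight = 1.0 / dist ** power
        numerator += weight * value
        denominator += weight
    return numerator / denominator

# Hypothetical gauges: (x in km, y in km, 8-day maximum rainfall in mm).
stations = [(0.0, 0.0, 400.0), (10.0, 0.0, 600.0)]
midpoint_rain = idw(5.0, 0.0, stations)   # equidistant from both gauges
at_station_rain = idw(0.0, 0.0, stations)
```

A point equidistant from two gauges receives their plain average, and the estimate approaches a gauge value as the point approaches that gauge.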

3. Landslide Susceptibility Mapping Using SVM, DT, and NB Models

3.1. Support Vector Machines (SVM)

Support vector machines are a relatively new supervised learning method based on statistical learning theory and the structural risk minimization principle [55]. Using the training data, SVM implicitly maps the original input space into a high-dimensional feature space. Subsequently, in the feature space, the optimal hyperplane is determined by maximizing the margins of class boundaries [56]. The training points that are closest to the optimal hyperplane are called support vectors. Once the decision surface is obtained, it can be used for classifying new data.

Consider a training dataset of instance-label pairs $(x_i, y_i)$ with $x_i \in R^n$, $y_i \in \{1, -1\}$, and $i = 1, \ldots, m$. In the current context of landslide susceptibility, $x$ is a vector of the input space that contains slope angle, lithology, rainfall, soil type, slope aspect, land use, distance to roads, distance to rivers, distance to faults, and relief amplitude. The two classes $\{1, -1\}$ denote landslide pixels and no-landslide pixels. The aim of the SVM classification is to find an optimal separating hyperplane that can distinguish the two classes, that is, landslides and no landslides, from the mentioned set of training data.

For the case of linearly separable data, a separating hyperplane can be defined as

$$y_i (w \cdot x_i + b) \ge 1 - \xi_i, \tag{3.1}$$

where $w$ is a coefficient vector that determines the orientation of the hyperplane in the feature space, $b$ is the offset of the hyperplane from the origin, and $\xi_i$ are the positive slack variables [57].

The determination of an optimal hyperplane leads to solving the following optimization problem using Lagrangian multipliers [58]:

$$\text{Maximize } \sum_{i=1}^{m} \alpha_i - \frac{1}{2} \sum_{i=1}^{m} \sum_{j=1}^{m} \alpha_i \alpha_j y_i y_j (x_i \cdot x_j), \quad \text{subject to } \sum_{i=1}^{m} \alpha_i y_i = 0, \; 0 \le \alpha_i \le C, \tag{3.2}$$

where $\alpha_i$ are Lagrange multipliers, $C$ is the penalty, and the slack variables $\xi_i$ allow for penalized constraint violation.

The decision function, which will be used for the classification of new data, can then be written as

$$g(x) = \operatorname{sign}\left(\sum_{i=1}^{m} y_i \alpha_i (x_i \cdot x) + b\right). \tag{3.3}$$

In cases when it is impossible to find the separating hyperplane using the linear kernel function, the original input data may be transferred into a high-dimensional feature space through some nonlinear kernel functions. The classification decision function is then written as

$$g(x) = \operatorname{sign}\left(\sum_{i=1}^{m} y_i \alpha_i K(x_i, x) + b\right), \tag{3.4}$$

where $K(x_i, x)$ is the kernel function evaluated between the training point $x_i$ and the new point $x$.
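
To make the kernel decision function concrete, here is a minimal sketch that evaluates it with the RBF and polynomial kernels. The support vectors, multipliers, and bias below are hypothetical toy values, not parameters fitted in this study; in practice they come from the trained model (for example, from LIBSVM).

```python
import math

def rbf_kernel(a, b, gamma):
    """RBF kernel: exp(-gamma * ||a - b||^2)."""
    sq_dist = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return math.exp(-gamma * sq_dist)

def poly_kernel(a, b, gamma, degree):
    """Polynomial kernel: (gamma * a.b + 1)^degree."""
    dot = sum(ai * bi for ai, bi in zip(a, b))
    return (gamma * dot + 1.0) ** degree

def decision(x, support_vectors, labels, alphas, bias, kernel):
    """Return +1 (landslide) or -1 (no landslide) for the input vector x."""
    score = sum(y * a * kernel(sv, x)
                for sv, y, a in zip(support_vectors, labels, alphas))
    return 1 if score + bias >= 0 else -1

# Hypothetical toy model with two support vectors in a 2-D input space.
support_vectors = [[0.9, 0.9], [0.1, 0.1]]
labels = [1, -1]
alphas = [1.0, 1.0]
bias = 0.0

def kernel(a, b):
    return rbf_kernel(a, b, gamma=0.25)

pred_high = decision([0.8, 0.7], support_vectors, labels, alphas, bias, kernel)
pred_low = decision([0.2, 0.1], support_vectors, labels, alphas, bias, kernel)
```

Only the training points with nonzero multipliers contribute to the sum, which is why the trained model needs to store just the support vectors.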

The choice of the kernel function is crucial for successful SVM training and classification accuracy [59]. There are four types of kernel function groups that are commonly used in SVM: linear kernel (LN), polynomial kernel (PL), radial basis function (RBF) kernel, and sigmoid kernel (SIG). The LN is considered to be a specific case of RBF, whereas the SIG behaves like the RBF for certain parameters [60]. According to Keerthi and Lin [61], the LN is not needed when the RBF is used. In general, the classification accuracy of the SIG may not be better than that of the RBF [62]. Therefore, in this study, only two kernel functions, RBF and PL, were selected. According to Zhu et al. [63], the main advantage of using RBF is that it has good interpolation abilities. However, it may fail to provide longer-range extrapolation. In contrast, PL has better extrapolation abilities at lower-order degrees but requires higher-order degrees for good interpolation. The formulas and their parameters are shown in Table 2.

The performance of the SVM model depends on the choice of the kernel parameters. For the RBF-SVM, the regularization parameter (C) and the kernel width (γ) are the two parameters that need to be determined, whereas for the PL-SVM three parameters need to be determined: C, γ, and the degree of the polynomial kernel (d). Parameter C controls the tradeoff between training errors and margin, which helps to control overfitting of the model. A large value of C leads to few training errors, whereas a small value of C generates a larger margin and thus increases the number of training errors [64]. Parameter γ controls the degree of nonlinearity of the SVM model. Parameter d defines the degree of the polynomial kernel.

The process of selecting the combination of parameters that produces the best classification result is considered to be an important research issue in the data mining area [65]. Many methods have been proposed, such as heuristic parameter selection [66], the gradient descent algorithm [67], the Levenberg-Marquardt method [68], and the cross-validation method [69]. However, the grid search method, which is widely used in the determination of SVM parameters, is still considered to be the most reliable optimization method [70] and was selected for this study. Firstly, the range and step size of each parameter were determined. Secondly, the grid search was performed by varying the SVM hyperparameters over these ranges. Finally, the performance of every combination was assessed to find the best parameters. However, the grid search is only suitable for the adjustment of a small number of parameters due to its computational complexity [71].

3.2. Decision Tree (DT)

A DT is a hierarchical model composed of decision rules that recursively split independent variables into homogeneous zones [72]. The objective of DT building is to find the set of decision rules that can be used to predict outcome from a set of input variables. A DT is called a classification or a regression tree if the target variables are discrete or continuous, respectively [73]. DT has been applied successfully in many real-world situations for classification and prediction [74].

The main advantage of DT models is their capability of modeling complex relationships between variables. They can incorporate both categorical and continuous variables without strict assumptions with respect to the distribution of the data [75]. In addition, DTs are easy to construct and the resulting models can be easily interpreted. Furthermore, the DT model results provide clear information on the relative importance of input factors [76]. The main disadvantage of DTs is that they are susceptible to noisy data and that multiple output attributes are not allowed [77].

Many algorithms for constructing decision tree models have been proposed in the literature, such as classification and regression tree (CART) [78], chi-square automatic interaction detector decision tree (CHAID) [79], ID3 [80], and C4.5 [81]. In this study, the J48 algorithm [82], which is a Java reimplementation of the C4.5 algorithm, was used. C4.5 uses an entropy-based measure as the selection criterion and is considered to be the fastest algorithm for machine learning with good classification accuracy [83]. Given a training dataset $T$ with subsets $T_i$, $i = 1, 2, \ldots, s$, the C4.5 algorithm constructs a DT using a top-down, recursive-splitting technique. A tree structure consists of a root node, internal nodes, and leaf nodes. The root node contains all the input data. An internal node can have two or more branches and is associated with a decision function. A leaf node indicates the output of a given input vector.

The procedure of DT modeling consists of two steps: (1) tree building and (2) tree pruning [84]. Tree building begins by selecting the input variable with the highest gain ratio as the root node of the DT. The training dataset is then split based on the root values, and subnodes are created. For discrete input variables, a subnode of the tree is created for each possible value. For continuous input variables, two subnodes are created based on a threshold that is determined in the threshold-finding process [81]. In the next step, the gain ratio is calculated for all the subnodes individually, and the process is repeated until all examples in a node belong to the same class. Such nodes are called leaf nodes and are labeled with class values.

Since the tree obtained in the building step may have a large number of branches and may therefore cause over-fitting [85], the tree needs to be pruned for better classification accuracy on new data. Two types of tree pruning exist: pre-pruning and post-pruning. In the case of pre-pruning, the growing of the tree is stopped when a certain criterion is satisfied, whereas in the post-pruning case the full tree is constructed first, and then the ending subtrees are replaced by leaves based on a comparison of the error of the tree before and after replacing the subtrees.

The information gain ratio for attribute $A$ is as follows:

$$\mathrm{GainRatio}(A, T) = \frac{\mathrm{Gain}(A, T)}{\mathrm{SplitInfo}(A, T)}, \tag{3.5}$$

where

$$\mathrm{Gain}(A, T) = \mathrm{Entropy}(T) - \sum_{i=1}^{s} \frac{|T_i|}{|T|} \mathrm{Entropy}(T_i), \quad \mathrm{SplitInfo}(A, T) = -\sum_{i=1}^{s} \frac{|T_i|}{|T|} \log_2 \frac{|T_i|}{|T|}. \tag{3.6}$$

A DT can estimate the probability of belonging to a specific class, and this probability is used to predict the probability of landslide pixels. The estimated probability is based on the natural frequency at the tree leaf. However, this frequency might not give sound probabilistic estimates; therefore, Laplace smoothing [86] was used in this study.
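
The gain ratio and the smoothed leaf probability can be sketched in a few lines. This is an illustrative sketch for a discrete attribute only, not the J48 implementation; the toy attribute values are hypothetical, and the Laplace form (k + 1)/(n + number of classes) is the common textbook definition, assumed here because the text does not spell it out.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def gain_ratio(values, labels):
    """Gain ratio obtained by splitting `labels` on a discrete attribute."""
    n = len(labels)
    subsets = {}
    for v, y in zip(values, labels):
        subsets.setdefault(v, []).append(y)
    gain = entropy(labels) - sum(len(s) / n * entropy(s) for s in subsets.values())
    split_info = -sum(len(s) / n * math.log2(len(s) / n) for s in subsets.values())
    return gain / split_info if split_info > 0 else 0.0

def laplace_probability(n_class, n_leaf, n_classes=2):
    """Laplace-smoothed class probability at a leaf (assumed textbook form)."""
    return (n_class + 1) / (n_leaf + n_classes)

# Hypothetical toy data: 1 = landslide, 0 = no landslide.
labels = [1, 1, 0, 0]
informative = ["near", "near", "far", "far"]   # separates the classes perfectly
uninformative = ["a", "b", "a", "b"]           # carries no class information

ratio_informative = gain_ratio(informative, labels)
ratio_uninformative = gain_ratio(uninformative, labels)
leaf_probability = laplace_probability(n_class=3, n_leaf=3)
```

Without smoothing, a leaf holding three landslide cells and nothing else would report a probability of exactly 1; the smoothed estimate of 0.8 avoids such overconfident values at small leaves.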

3.3. Naïve Bayes (NB)

An NB classifier is a classification system based on Bayes' theorem that assumes that all the attributes are fully independent given the output class, which is called the conditional independence assumption [48]. The main advantage of the NB classifier is that it is very easy to construct without needing any complicated iterative parameter estimation schemes [40]. In addition, the NB classifier is robust to noise and irrelevant attributes. This method has been successfully applied in many fields [87].

Given an observation consisting of $k$ attributes $x_i$, $i = 1, 2, \ldots, k$ (where $x_i$ is a landslide conditioning factor), and an output class $y_j$, $j \in \{\text{landslide}, \text{no landslide}\}$, NB estimates the probability $P(y_j \mid x_1, \ldots, x_k)$ for all possible output classes. The prediction is made for the class with the largest posterior probability as

$$y_{NB} = \underset{y_j \in \{\text{landslide}, \text{no landslide}\}}{\operatorname{argmax}} \; P(y_j) \prod_{i=1}^{k} P(x_i \mid y_j). \tag{3.7}$$

The prior probability $P(y_j)$ can be estimated using the proportion of the observations with output class $y_j$ in the training dataset. The conditional probability is calculated using

$$P(x_i \mid y_j) = \frac{1}{\sqrt{2\pi}\,\delta} e^{-(x_i - \mu)^2 / 2\delta^2}, \tag{3.8}$$

where $\mu$ is the mean and $\delta$ is the standard deviation of $x_i$ within class $y_j$.
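
The Gaussian NB rule can be sketched directly from the prior and the per-class mean and standard deviation of each attribute. The training rows below are hypothetical normalized factor values, and the function names are illustrative; this is not the WEKA implementation used in the study. Log probabilities are summed instead of multiplying densities, which gives the same argmax while avoiding numerical underflow.

```python
import math

def log_gaussian(x, mean, std):
    """Log of the Gaussian density with the given mean and standard deviation."""
    return -((x - mean) ** 2) / (2 * std ** 2) - math.log(math.sqrt(2 * math.pi) * std)

def fit(rows, labels):
    """Return {class: (prior, [(mean, std) for each attribute])}."""
    model = {}
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        stats = []
        for column in zip(*members):
            mean = sum(column) / len(column)
            var = sum((v - mean) ** 2 for v in column) / len(column)
            # A small floor keeps the density defined for constant columns.
            stats.append((mean, math.sqrt(var) + 1e-9))
        model[cls] = (len(members) / len(rows), stats)
    return model

def predict(model, row):
    """Class with the largest log posterior (up to a shared constant)."""
    def score(cls):
        prior, stats = model[cls]
        return math.log(prior) + sum(
            log_gaussian(x, m, s) for x, (m, s) in zip(row, stats))
    return max(model, key=score)

# Hypothetical training rows with two normalized conditioning factors each.
rows = [[0.74, 0.90], [0.90, 0.63], [0.58, 0.90],
        [0.26, 0.10], [0.10, 0.10], [0.42, 0.37]]
labels = [1, 1, 1, 0, 0, 0]   # 1 = landslide, 0 = no landslide

model = fit(rows, labels)
pred_a = predict(model, [0.80, 0.80])
pred_b = predict(model, [0.20, 0.20])
```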

3.4. Performance Evaluation

The performance of the trained landslide models was assessed using several statistical evaluation criteria based on counts of true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN).

TP rate (sensitivity) measures the proportion of landslide pixels that are correctly classified as landslides and is defined as TP/(TP + FN). TN rate (specificity) measures the proportion of non-landslide pixels that are correctly classified as non-landslide and is defined as TN/(TN + FP). Precision measures the proportion of pixels classified as landslides that are actual landslide occurrences and is defined as TP/(TP + FP). Overall accuracy is calculated as (TP + TN)/total number of training pixels. The F-measure combines precision and sensitivity into their harmonic mean and is defined as 2*Precision*Sensitivity/(Precision + Sensitivity) [88].
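
These criteria follow directly from the four counts. The sketch below uses hypothetical counts, not results from this study, and computes the F-measure as the harmonic mean of precision and sensitivity.

```python
def classification_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity, precision, accuracy, and F-measure."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "accuracy": accuracy,
        "f_measure": f_measure,
    }

# Hypothetical confusion-matrix counts for a balanced set of 200 pixels.
metrics = classification_metrics(tp=90, fp=20, tn=80, fn=10)
```

For these counts the sensitivity is 0.90, the specificity 0.80, the accuracy 0.85, and the F-measure 180/210, or about 0.857.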

In order to measure the reliability of the landslide susceptibility models, the Cohen kappa index (κ) [89–91] was used to assess the model classification compared to chance selection:

$$\kappa = \frac{P_C - P_{\exp}}{1 - P_{\exp}}, \tag{3.9}$$

where $P_C$ is the proportion of pixels that are correctly classified as landslide or non-landslide and is calculated as (TP + TN)/total number of training pixels. $P_{\exp}$ is the expected agreement and is calculated as ((TP + FN)(TP + FP) + (FP + TN)(FN + TN))/(total number of training pixels)².

A κ value of 0 indicates that no agreement exists between the landslide model and reality, whereas a κ value of 1 indicates perfect agreement. A negative κ value indicates poor agreement. A κ value in the range (0.80–1) is considered an indicator of almost perfect agreement, while a value in the range (0.60–0.80) indicates substantial agreement between the model and reality. For a value in the interval (0.40–0.60), the agreement is moderate, and values of (0.20–0.40) and <0.2 indicate fair and slight agreement, respectively [92].
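
A minimal sketch of the kappa computation and of the interpretation scale quoted above is given below. The counts are hypothetical, and the expected agreement uses the standard definition in which the products of the marginal totals are divided by the squared number of pixels.

```python
def cohen_kappa(tp, fp, tn, fn):
    """Cohen's kappa for a two-class confusion matrix."""
    n = tp + fp + tn + fn
    p_correct = (tp + tn) / n
    p_expected = ((tp + fn) * (tp + fp) + (fp + tn) * (fn + tn)) / n ** 2
    return (p_correct - p_expected) / (1 - p_expected)

def agreement_label(kappa):
    """Verbal interpretation following the ranges quoted in the text."""
    if kappa < 0:
        return "poor"
    if kappa < 0.2:
        return "slight"
    if kappa < 0.4:
        return "fair"
    if kappa < 0.6:
        return "moderate"
    if kappa < 0.8:
        return "substantial"
    return "almost perfect"

# Hypothetical counts: 200 pixels, 100 per class.
kappa = cohen_kappa(tp=90, fp=20, tn=80, fn=10)
label = agreement_label(kappa)
```

Here the observed agreement is 0.85 and the expected agreement is 0.5, so κ is 0.7, which falls in the substantial range.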

3.5. Preparation of the Training and the Validation Datasets

In this study, a total of ten landslide conditioning factors were used. They are slope angle, lithology, rainfall, soil type, slope aspect, land use, distance to roads, distance to rivers, distance to faults, and relief amplitude. For each conditioning factor, a map was generated. These maps were then converted into a pixel format with a spatial resolution of 20×20 m. In the next step, frequency ratio values [93] were calculated for all categories based on the landslide grid cells. Based on these ratio values, each category was assigned an attribute number, which was then rescaled to the range 0.1 to 0.9 (Table 1) using the Max-Min normalization procedure [94] as follows:

$$v' = \frac{v - \operatorname{Min}(v)}{\operatorname{Max}(v) - \operatorname{Min}(v)} (U - L) + L, \tag{3.10}$$

where $v'$ is the normalized data matrix, $v$ is the original data matrix, and $U$ and $L$ are the upper and lower normalization boundaries.
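
The preparation steps can be traced with the relief amplitude rows of Table 1. In this sketch the frequency ratio is taken as the landslide pixel percentage divided by the class pixel percentage, and attribute numbers are assigned by ascending frequency ratio; the ranking rule is an assumption that happens to reproduce the relief amplitude attributes, since the text does not state how attribute numbers were assigned. Function names are illustrative.

```python
def frequency_ratio(landslide_pct, class_pct):
    """Share of landslide pixels in a class divided by the class's area share."""
    return landslide_pct / class_pct

def rank_attributes(ratios):
    """Attribute numbers 1..n by ascending frequency ratio (assumed rule)."""
    order = sorted(range(len(ratios)), key=lambda i: ratios[i])
    ranks = [0] * len(ratios)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return ranks

def min_max_normalize(values, lower=0.1, upper=0.9):
    """Rescale values linearly to the interval [lower, upper]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) * (upper - lower) + lower for v in values]

# Relief amplitude classes 0-50, 50-100, ..., 250-532 m (percentages from Table 1).
class_pct = [27.00, 23.97, 22.98, 14.75, 7.06, 4.24]
landslide_pct = [1.10, 25.43, 41.04, 20.12, 8.41, 3.90]

ratios = [frequency_ratio(l, c) for l, c in zip(landslide_pct, class_pct)]
attributes = rank_attributes(ratios)
normalized = [round(v, 2) for v in min_max_normalize(attributes)]
```

Because the percentages in Table 1 are rounded, the recomputed frequency ratios can differ from the tabulated ones in the third decimal place.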

Table 1

Normalized classes of landslide conditioning factors used.

Data layers	Class	Class pixels (%)	Landslide pixels (%)	Frequency ratio	Attribute	Normalized classes
Slope angle (°)	0–10	42.82	0.20	0.005	2	0.26
	10–20	29.13	29.93	1.028	4	0.58
	20–30	20.25	54.75	2.704	5	0.74
	30–40	6.84	14.31	2.094	6	0.90
	40–50	0.93	0.80	0.862	3	0.42
	>50	0.04	0.00	0.000	1	0.10

Slope aspect	Flat (−1)	0.06	0.00	0.000	1	0.10
	North (0–22.5 and 337.5–360)	12.02	4.70	0.391	2	0.20
	Northeast (22.5–67.5)	14.56	11.81	0.811	6	0.60
	East (67.5–112.5)	12.06	7.81	0.648	5	0.50
	Southeast (112.5–157.5)	12.04	14.51	1.206	7	0.70
	South (157.5–202.5)	12.90	22.72	1.761	8	0.80
	Southwest (202.5–247.5)	14.60	26.33	1.804	9	0.90
	West (247.5–292.5)	11.31	7.11	0.628	4	0.40
	Northwest (292.5–337.5)	10.46	5.01	0.478	3	0.30

Relief amplitude (m)	0–50	27.00	1.10	0.041	1	0.10
	50–100	23.97	25.43	1.061	3	0.42
	100–150	22.98	41.04	1.786	6	0.90
	150–200	14.75	20.12	1.364	5	0.74
	200–250	7.06	8.41	1.190	4	0.58
	250–532	4.24	3.90	0.920	2	0.26

Lithology	Group 1	4.08	6.31	1.546	6	0.77
	Group 2	39.62	33.43	0.844	4	0.50
	Group 3	32.55	27.13	0.833	3	0.37
	Group 4	11.65	21.62	1.856	7	0.90
	Group 5	1.18	0.00	0.000	1	0.10
	Group 6	5.62	7.81	1.389	5	0.63
	Group 7	5.29	3.70	0.700	2	0.23

Land use	Populated area	7.53	14.01	1.862	10	0.75
	Orchard land	3.71	2.50	0.674	7	0.54
	Paddy land	9.17	4.10	0.448	5	0.39
	Protective forestland	8.58	20.32	2.368	12	0.90
	Natural forestland	31.91	15.62	0.489	6	0.46
	Productive forestland	11.72	22.62	1.930	11	0.83
	Water	3.97	1.00	0.252	4	0.32
	Annual crop land	1.60	0.20	0.125	3	0.25
	Nontree rocky mountain	4.08	7.21	1.767	9	0.68
	Barren land	16.95	12.41	0.732	8	0.61
	Specially used forestland	0.36	0.00	0.000	2	0.17
	Grass land	0.43	0.00	0.000	1	0.10

Soil type	Eutric fluvisols	3.49	6.11	1.751	12	0.83
	Degraded soil	0.03	0.00	0.000	3	0.23
	Limestone mountain	14.42	15.12	1.048	9	0.63
	Ferralic acrisols	36.53	43.84	1.200	10	0.70
	Rhodic ferralsols	8.97	3.40	0.379	7	0.50
	Humic acrisols	30.91	28.13	0.910	8	0.57
	Dystric fluvisols	0.73	2.80	3.828	13	0.90
	Dystric gleysols	0.39	0.60	1.524	11	0.77
	Luvisols	0.46	0.00	0.000	4	0.30
	Humic ferralsols	1.15	0.00	0.000	5	0.37
	Populated area	0.44	0.00	0.000	2	0.17
	Water	2.41	0.00	0.000	1	0.10
	Gley fluvisols	0.08	0.00	0.000	6	0.43

Rainfall (mm)	362–470	22.48	27.23	1.211	3	0.63
	470–540	46.40	35.84	0.772	2	0.37
	540–610	22.18	9.01	0.406	1	0.10
	610–950	8.94	27.93	3.125	4	0.90

Distance to roads (m)	0–40	1.40	41.64	29.755	4	0.90
	40–80	1.68	21.52	12.788	3	0.63
	80–120	1.88	4.70	2.509	2	0.37
	>120	95.04	32.13	0.338	1	0.10

Distance to rivers (m)	0–40	3.86	14.41	3.731	4	0.90
	40–80	4.52	12.41	2.747	3	0.63
	80–120	4.82	8.31	1.725	2	0.37
	>120	86.80	64.86	0.747	1	0.10

Distance to faults (m)	0–200	18.09	24.02	1.328	5	0.90
	200–400	15.95	11.61	0.728	2	0.30
	400–700	19.89	24.22	1.218	3	0.50
	700–1,000	14.31	18.42	1.287	4	0.70
	>1,000	31.75	21.72	0.684	1	0.10

Table 2

RBF and PL kernels and their parameters.

Kernel function	Formula	Kernel parameters
RBF	$K(x_i, x_j) = \exp(-\gamma \lVert x_i - x_j \rVert^2)$	$\gamma$
PL	$K(x_i, x_j) = (\gamma x_i^T x_j + 1)^d$	$\gamma$, $d$

In landslide modeling, the landslide data should be split into two parts, training and validation datasets. Without the splitting, it would not be possible to validate the results [95]. In this study, the landslide inventory map with 118 landslides was randomly split into two subsets: subset 1 comprised 70% of the data (82 landslides with 684 landslide grid cells) and was used in the training phase of the landslide models; subset 2 is a validation dataset with 30% of the data (36 landslides with 315 landslide grid cells) used to validate the resulting models and estimate their prediction accuracy.

All of the 684 landslide grid cells in subset 1 were assigned the value of 1. The performance of an SVM model may be seriously degraded when the numbers of landslide and non-landslide grid cells in the training dataset are significantly unbalanced. Therefore, the same number of no-landslide grid cells was randomly sampled from the landslide-free area and assigned the value of −1. In the cases of the DT and NB classifiers, no-landslide grid cells were assigned the value 0. Finally, the values of the ten landslide conditioning factors were extracted for these cells to build a training dataset. This dataset contains a total of 1368 observations, ten input variables, and one target variable (landslide, no landslide).
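
The balanced sampling step might look like the following sketch. Cell identifiers, the size of the landslide-free pool, and the random seed are hypothetical; only the counts of 684 landslide cells and 1368 total observations come from the text.

```python
import random

def build_balanced_training_set(landslide_cells, landslide_free_cells,
                                negative_label=-1, seed=0):
    """Pair every landslide cell with one randomly drawn landslide-free cell."""
    rng = random.Random(seed)
    negatives = rng.sample(landslide_free_cells, len(landslide_cells))
    samples = [(cell, 1) for cell in landslide_cells]
    samples += [(cell, negative_label) for cell in negatives]
    rng.shuffle(samples)
    return samples

# Hypothetical cell identifiers: 684 landslide cells, 5000 candidate free cells.
landslide_cells = list(range(684))
landslide_free_cells = list(range(1000, 6000))

svm_training = build_balanced_training_set(landslide_cells, landslide_free_cells)
dt_nb_training = build_balanced_training_set(landslide_cells, landslide_free_cells,
                                             negative_label=0)
```

The same routine serves all three models, with only the label of the no-landslide class changing between SVM (−1) and DT or NB (0).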

3.6. Training of the Support Vector Machines, Decision Tree, and Naïve Bayes Models and Generation of Landslide Susceptibility Indexes

3.6.1. Support Vector Machines (SVM)

In the case of SVM, model selection, including the search for optimal parameters, plays a crucial role in the performance of the model. In this study, RBF and PL kernel functions were selected. The training process started by searching for the optimal kernel parameters using the grid-search method with cross-validation, which can help to prevent overfitting. Since the number of landslide grid cells in the study area is not large, 5-fold cross-validation was used to find the best kernel parameters. The training dataset was randomly split into 5 equally sized subsets. Each subset was used as a test dataset for the SVM model trained on the remaining 4 subsets, so that the process was repeated five times with each of the five subsets used once as the test dataset.
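
The 5-fold partition can be sketched as follows. The generator name and seed are illustrative; since 1368 observations are not divisible by 5, the folds here differ in size by at most one observation.

```python
import random

def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_indices, test_indices) for each of the k folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test

# 1368 observations, as in the training dataset described in the text.
splits = list(k_fold_indices(1368, k=5))
fold_sizes = [len(test) for _, test in splits]
```

Each observation appears in exactly one test fold, so averaging the five test accuracies uses every training observation once for evaluation.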

With the RBF kernel, the two parameters C and γ need to be determined. The procedure is as follows: (1) set a grid space of (C, γ), where C = 2⁻⁵, 2⁻⁴, …, 2¹⁰ and γ = 2¹⁰, 2⁹, …, 2⁻⁴; (2) for each parameter pair (C, γ) in the grid space, conduct 5-fold cross-validation on the training dataset; (3) choose the parameter pair (C, γ) that has the highest classification accuracy; (4) use the best parameters to construct an SVM model for landslide prediction on new data. The best C and γ were determined as 8 and 0.25, respectively. The correctly classified rate is 91.1%.
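The four-step procedure amounts to an exhaustive search over the (C, γ) grid. The sketch below shows only the search loop. The scoring function `placeholder_score` is a hypothetical stand-in for "train an RBF-SVM and return its 5-fold cross-validation accuracy", and it is constructed to peak at C = 8 and γ = 0.25 purely so that the example has a known answer.

```python
from itertools import product
from math import log2

def grid_search(c_values, gamma_values, cv_accuracy):
    """Return the (C, gamma) pair with the highest cross-validated score."""
    best_pair, best_score = None, float("-inf")
    for c, gamma in product(c_values, gamma_values):
        score = cv_accuracy(c, gamma)
        if score > best_score:
            best_pair, best_score = (c, gamma), score
    return best_pair, best_score

# Grid from the text: C = 2^-5 ... 2^10 and gamma = 2^10 ... 2^-4.
c_grid = [2.0 ** e for e in range(-5, 11)]
gamma_grid = [2.0 ** e for e in range(10, -5, -1)]

def placeholder_score(c, gamma):
    # Hypothetical score, NOT a real SVM evaluation: highest at C = 8, gamma = 0.25.
    return -(abs(log2(c) - 3) + abs(log2(gamma) + 2))

best_pair, best_score = grid_search(c_grid, gamma_grid, placeholder_score)
print(len(c_grid) * len(gamma_grid), best_pair)  # 240 (8.0, 0.25)
```

With 16 values of C and 15 values of γ, the grid holds 240 pairs, so 5-fold cross-validation requires 1200 model fits in total.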

With the PL kernel, the parameters C and d need to be determined. Table 3 shows the results of training the SVM model using different d values. The AUC for the training dataset increases steadily as d increases. However, the AUC for the validation dataset increases only until d equals 3 and then declines as d increases further, which indicates overfitting at higher degrees. Therefore, the SVM model with a polynomial kernel of degree 3 was selected. The correctly classified rate of SVM using the PL kernel is 91.1%. The best C and γ were determined as 1 and 0.3536, respectively.

Table 3

Degree of polynomial kernel versus area under the ROC curves in the training and validation datasets.

Degree of polynomial kernel	AUC (training dataset)	AUC (validation dataset)
1	0.9432	0.9524
2	0.9489	0.9560
3	0.9575	0.9566
4	0.9643	0.9556
5	0.9717	0.9435
6	0.9827	0.9046
7	0.9905	0.8767
8	0.9946	0.8314
9	0.9985	0.8067
10	0.9996	0.8133
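The choice of the polynomial degree can be checked directly against the values in Table 3. The short sketch below copies the table rows and selects the degree with the highest validation AUC. The variable names are illustrative only.

```python
# (degree, training AUC, validation AUC) rows copied from Table 3.
table3 = [
    (1, 0.9432, 0.9524), (2, 0.9489, 0.9560), (3, 0.9575, 0.9566),
    (4, 0.9643, 0.9556), (5, 0.9717, 0.9435), (6, 0.9827, 0.9046),
    (7, 0.9905, 0.8767), (8, 0.9946, 0.8314), (9, 0.9985, 0.8067),
    (10, 0.9996, 0.8133),
]

# Select the degree that maximizes the validation AUC.
best_degree, _, best_valid_auc = max(table3, key=lambda row: row[2])

# Gap between training and validation AUC, a simple indicator of overfitting.
gaps = {d: round(train - valid, 4) for d, train, valid in table3}

print(best_degree, best_valid_auc)  # 3 0.9566
print(gaps[3], gaps[10])            # 0.0009 0.1863
```

The gap between training and validation AUC grows from 0.0009 at degree 3 to 0.1863 at degree 10, which is the overfitting pattern the table describes.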

A detailed accuracy assessment for RBF-SVM and PL-SVM is shown in Tables 4 and 5. Precision, F-measure, and TP rate are high (>90%), whereas the FP rate is low (<10%). This indicates a high classification capacity on the training dataset for the two models. The Cohen kappa indexes are 0.822 and 0.823 for RBF-SVM and PL-SVM, respectively, indicating good agreement between the observed and the predicted values.

Table 4

Detailed accuracy assessment by classes of RBF-SVM, PL-SVM, DT, and NB models.

Model	TP rate (%)	FP rate (%)	Precision (%)	F-measure (%)	Class
RBF-SVM	90.4	8.2	91.7	91.0	Landslide
RBF-SVM	91.8	9.6	90.5	91.1	No landslide
PL-SVM	90.2	7.9	92.0	91.1	Landslide
PL-SVM	92.1	9.8	90.4	91.2	No landslide
DT	95.5	9.5	90.9	93.2	Landslide
DT	90.5	4.5	95.2	92.8	No landslide
NB	83.2	11.0	88.4	85.7	Landslide
NB	89.0	16.8	84.1	86.5	No landslide

Table 5

Performance evaluation of RBF-SVM, PL-SVM, DT, and NB models.

Parameters	RBF-SVM	PL-SVM	DT	NB
Accuracy (%)	91.08	91.15	92.98	86.11
Cohen’s kappa index	0.822	0.823	0.860	0.722
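The quantities reported in Tables 4 and 5 all derive from a 2 × 2 confusion matrix. The sketch below computes them for a single model. The cell counts (618, 66, 56, 628) are not given in the source. They are an assumption, back-calculated from the rounded RBF-SVM rates for the landslide class and the 684 + 684 training observations, and they are used here only to show how the measures relate to one another.

```python
def binary_metrics(tp, fn, fp, tn):
    """Compute TP rate, FP rate, precision, F-measure, accuracy, and Cohen's kappa."""
    total = tp + fn + fp + tn
    tp_rate = tp / (tp + fn)
    fp_rate = fp / (fp + tn)
    precision = tp / (tp + fp)
    f_measure = 2 * precision * tp_rate / (precision + tp_rate)
    accuracy = (tp + tn) / total
    # Agreement expected by chance, from the row and column marginals.
    p_chance = ((tp + fn) * (tp + fp) + (fp + tn) * (fn + tn)) / total ** 2
    kappa = (accuracy - p_chance) / (1 - p_chance)
    return {
        "tp_rate": tp_rate, "fp_rate": fp_rate, "precision": precision,
        "f_measure": f_measure, "accuracy": accuracy, "kappa": kappa,
    }

# Assumed counts (back-calculated, not reported in the source).
m = binary_metrics(tp=618, fn=66, fp=56, tn=628)
print(round(100 * m["accuracy"], 2), round(m["kappa"], 3))  # 91.08 0.822
```

With balanced classes the chance agreement is 0.5, so kappa reduces to 2 × accuracy − 1. This explains why the kappa values in Table 5 track the accuracies so closely.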

3.6.2. Decision Tree (DT)

In the case of DT, the first step is to determine the optimal values of the algorithm parameters, namely the minimum number of instances (MNI) per leaf and the confidence factor (CF). The lower the MNI required per leaf, the more branches are created, resulting in a larger tree that may overfit the data. In contrast, a higher MNI required per leaf results in a smaller tree.

Figure 4 shows the MNI required per leaf versus the classification accuracy. In this test, the MNI per leaf was varied from 1 to 25 in steps of one, and the corresponding classification accuracies were obtained and plotted. The highest classification accuracy, 92.8%, corresponds to an MNI of 6. Therefore, an MNI per leaf of 6 was selected.

Figure 4

Minimum number of instances per leaf versus classification accuracy.

In order to explore the effect of the CF on the classification accuracy, the CF value was varied from 0.1 to 1 using a step size of 0.05, and the corresponding classification accuracy was calculated. The result is shown in Figure 5. The highest classification accuracy occurred with a CF of 0.35; therefore, a CF of 0.35 was selected. With these two parameters determined, the decision tree model was constructed using the J48 algorithm with 10-fold cross-validation. The probability of belonging to the landslide or the no-landslide class for each observation was estimated using Laplace smoothing. The correctly classified rate is 92.9%, and the Cohen kappa index is 0.860. A detailed accuracy assessment of the decision tree model by class is shown in Tables 4 and 5. The TP rate, the precision, and the F-measure are greater than 90%. The FP rates are 9.5% and 4.5% for the landslide and the no-landslide classes, respectively.
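Laplace smoothing of leaf probabilities can be illustrated with the commonly used estimate (k + 1) / (n + c), where k is the number of training instances of the class in the leaf, n is the total number of instances in the leaf, and c is the number of classes. The source does not state which variant was applied, so this form, the function name, and the example leaves are assumptions.

```python
def laplace_leaf_probability(n_class, n_leaf, n_classes=2):
    """Laplace-smoothed probability of a class at a decision tree leaf."""
    return (n_class + 1) / (n_leaf + n_classes)

# Hypothetical pure leaf: 7 training instances, all of them landslide cells.
p_pure = laplace_leaf_probability(7, 7)
# Hypothetical mixed leaf: 20 instances, 15 of them landslide cells.
p_mixed = laplace_leaf_probability(15, 20)
print(round(p_pure, 3), round(p_mixed, 3))  # 0.889 0.727
```

The smoothing keeps small pure leaves from returning probabilities of exactly 0 or 1, which gives a more graded susceptibility index than raw leaf frequencies would.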

Figure 5

Confidence factor used for pruning versus classification accuracy.

Figure 6 depicts the inferred DT model for landslide susceptibility assessment in this study. The size of the tree is 55, including the root node, 26 internal nodes, and 28 leaves (green rectangular boxes). In the leaf nodes, a value of 0.1 indicates the no-landslide class, whereas a value of 0.9 indicates the landslide class. The number in parentheses at each leaf node is the number of instances in that leaf. Some instances are misclassified in some leaves; the number of misclassified instances is given after a slash (Figure 6). The highest number of instances in a leaf node is 288, whereas the lowest is 7. In the top-down induction of the tree, landslide conditioning factors at higher levels of the tree are more important. The relative importance of the landslide conditioning factors is as follows: distance to roads (81.5% in relative importance), slope (71.6%), land use (66.7%), aspect (61.1%), rainfall (61.5%), relief amplitude (61.6%), distance to rivers (60.1%), distance to faults (58.7%), lithology (57.7%), and soil type (52.8%).

Figure 6

Decision tree model for landslide susceptibility assessment for the study area.

3.6.3. Naïve Bayes (NB)

In the case of the NB classifier, the posterior probability is first calculated for each output class (landslide, no landslide), and the observation is then assigned to the class with the largest posterior probability. The NB model was constructed using the WEKA software. The NB model obtained an overall classification accuracy of 86.1%. The TP rate, precision, and F-measure vary from 83% to 89%. The Cohen kappa index of 0.722 indicates substantial agreement between the observed and the predicted values. A summary of the model assessment and performance is shown in Tables 4 and 5.
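The decision rule of the NB classifier can be sketched with a small categorical implementation. This is a simplified illustration and not the WEKA implementation used in the study. It assumes that all conditioning factors are categorical and applies add-one smoothing to the conditional probabilities. The class name, the two example factors, and the toy observations are hypothetical.

```python
from collections import Counter, defaultdict
from math import log

class CategoricalNaiveBayes:
    """Minimal Naive Bayes for categorical features with additive smoothing."""

    def fit(self, rows, labels, alpha=1.0):
        self.alpha = alpha
        self.n = len(labels)
        self.class_counts = Counter(labels)
        # value_counts[(feature_index, label)][value] -> number of occurrences
        self.value_counts = defaultdict(Counter)
        self.feature_values = [set() for _ in rows[0]]
        for row, label in zip(rows, labels):
            for j, value in enumerate(row):
                self.value_counts[(j, label)][value] += 1
                self.feature_values[j].add(value)
        return self

    def log_scores(self, row):
        """Log of prior times likelihood per class (proportional to the posterior)."""
        scores = {}
        for label, count in self.class_counts.items():
            score = log(count / self.n)
            for j, value in enumerate(row):
                numerator = self.value_counts[(j, label)][value] + self.alpha
                denominator = count + self.alpha * len(self.feature_values[j])
                score += log(numerator / denominator)
            scores[label] = score
        return scores

    def predict(self, row):
        scores = self.log_scores(row)
        return max(scores, key=scores.get)

# Toy observations: (slope class, distance-to-road class); 1 = landslide, 0 = no landslide.
rows = [("steep", "near"), ("steep", "near"), ("steep", "far"),
        ("gentle", "far"), ("gentle", "far"), ("gentle", "near")]
labels = [1, 1, 1, 0, 0, 0]

model = CategoricalNaiveBayes().fit(rows, labels)
print(model.predict(("steep", "near")), model.predict(("gentle", "far")))  # 1 0
```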

Once the SVM, DT, and NB models had been trained, they were used to calculate the landslide susceptibility indexes (LSIs) for all the pixels in the study area. The results were then transferred to a GIS and loaded into the ArcGIS 10 software for visualization.

4. Validation and Comparison of Landslide Susceptibility Models

4.1. Success Rate and Prediction Rate for Landslide Susceptibility Maps

The four landslide susceptibility maps were validated by comparing them with the landslide locations using the success-rate and prediction-rate methods [95]. The success-rate results were obtained using the landslide grid cells in the training dataset. Figure 7 shows the success-rate curves of the four landslide susceptibility maps (obtained from the RBF-SVM, PL-SVM, DT, and NB models) in comparison with the logistic regression model. RBF-SVM and logistic regression have the highest areas under the curve, with AUC values of 0.961 and 0.962, respectively. They are followed by PL-SVM (0.956), DT (0.952), and NB (0.935). Based on these results, the capability of correctly classifying the areas with existing landslides is highest for RBF-SVM (practically equal to logistic regression), followed by PL-SVM, DT, and NB.

Figure 7

Success-rate curves and areas under the curves (AUCs) of RBF-SVM, PL-SVM, DT, and NB models in comparison with the logistic regression model.

Since the success-rate method uses the landslide pixels in the training dataset, which have already been used for constructing the landslide models, the success rate may not be suitable for measuring the prediction capability of the models [96]. According to Chung and Fabbri [95], the prediction rate can be used to estimate the prediction capability of landslide models. In this study, the prediction-rate results of the four landslide susceptibility models were obtained by comparing them with the landslide grid cells in the validation dataset. The areas under the prediction-rate curves (AUCs) were then estimated. The closer the AUC value is to 1, the better the landslide model.
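A rate curve of this kind can be computed by ranking all grid cells from highest to lowest susceptibility index and accumulating the share of landslide cells captured as the share of area grows. The sketch below integrates that curve with the trapezoidal rule. It is an assumed reading of the method, since the exact computation is not spelled out here. The function name and the ten-cell example are hypothetical, and tied index values are not treated specially.

```python
def rate_curve_auc(susceptibility, is_landslide):
    """Area under the cumulative landslide-capture curve (success or prediction rate)."""
    n_cells = len(susceptibility)
    n_landslides = sum(is_landslide)
    # Rank cells from the most to the least susceptible.
    order = sorted(range(n_cells), key=lambda i: susceptibility[i], reverse=True)
    area, captured, prev_y = 0.0, 0, 0.0
    for i in order:
        captured += is_landslide[i]
        y = captured / n_landslides          # cumulative share of landslide cells
        area += (prev_y + y) / 2 / n_cells   # trapezoid of width 1 / n_cells
        prev_y = y
    return area

# Hypothetical map of ten cells; the two landslide cells have the highest indexes.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
flags = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
print(round(rate_curve_auc(scores, flags), 3))  # 0.9
```

Passing training landslide cells gives a success-rate AUC, whereas passing validation landslide cells gives a prediction-rate AUC; the computation itself is identical.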

The prediction-rate curves and AUCs of the four landslide susceptibility maps are shown in Figure 8. The results show that the AUCs for the four models vary from 0.909 to 0.955, which indicates that all the models have a good prediction capability. The highest prediction capability is for RBF-SVM and PL-SVM, with AUC values of 0.954 and 0.955, respectively. They are followed by NB (0.935) and DT (0.907). Compared with the logistic regression model (AUC of 0.938), which used the same data, the prediction capability of the two SVM models is slightly better, whereas that of DT and NB is lower.

Figure 8

Prediction-rate curves and areas under the curves (AUCs) of RBF-SVM, PL-SVM, DT, and NB models in comparison with the logistic regression model.

4.2. Reclassification of Landslide Susceptibility Indexes

The landslide susceptibility indexes were reclassified into four relative susceptibility classes: high, moderate, low, and very low. In this study, the classification method proposed by Pradhan and Lee [8] was used to determine landslide susceptibility class breaks based on percentage of area: high (10%), moderate (10%), low (20%), and very low (60%) (Figure 9).
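The area-based class breaks can be sketched as follows: cells are ranked by susceptibility index, and the top 10% of the area is labeled high, the next 10% moderate, the next 20% low, and the remaining 60% very low. The helper name `reclassify_by_area` and the 20-cell example are assumptions made for illustration.

```python
from collections import Counter

# Class shares as percentages of the study area, from highest to lowest susceptibility.
CLASS_SHARES = (("high", 10), ("moderate", 10), ("low", 20), ("very low", 60))

def reclassify_by_area(susceptibility, class_shares=CLASS_SHARES):
    """Assign each cell a susceptibility class using area-percentage breaks."""
    n = len(susceptibility)
    order = sorted(range(n), key=lambda i: susceptibility[i], reverse=True)
    classes = [None] * n
    start, cumulative = 0, 0
    for name, share in class_shares:
        cumulative += share
        end = round(cumulative * n / 100)  # index of the class break in the ranking
        for i in order[start:end]:
            classes[i] = name
        start = end
    return classes

# Hypothetical susceptibility indexes for 20 cells.
lsi = [i / 20 for i in range(20)]
classes = reclassify_by_area(lsi)
counts = Counter(classes)
print(counts["high"], counts["moderate"], counts["low"], counts["very low"])  # 2 2 4 12
```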

Figure 9

Percentage of landslides against percentage of landslide susceptibility maps using the RBF-SVM, PL-SVM, DT, and NB models.

Landslide density analysis was performed on the four landslide susceptibility classes [97]. Landslide density is defined as the ratio of landslide pixels to the total number of pixels in the susceptibility class. In an ideal landslide susceptibility map, the landslide density increases from the very low- to the high-susceptibility class [32]. A plot of the landslide density for the four susceptibility classes of the four landslide susceptibility models (RBF-SVM, PL-SVM, DT, and NB) is shown in Figure 10. The landslide density increases progressively from the very low- to the high-susceptibility class. Figure 11 shows the landslide susceptibility maps produced using the RBF-SVM, PL-SVM, DT, and NB models.

Figure 10

Landslide density plots of four landslide susceptibility classes of RBF-SVM, PL-SVM, DT, and NB models.

Figure 11

Landslide susceptibility maps of the Hoa Binh province (Vietnam) using (a) RBF-SVM, (b) PL-SVM, (c) DT, and (d) NB.

Table 6 shows the characteristics of the four susceptibility classes of the four maps of the study area. The percentages of existing landslide pixels in the high class are 87.2%, 87.5%, 90.7%, and 81.3% for RBF-SVM, PL-SVM, DT, and NB, respectively. In contrast, 80% of the pixels in the study area are in the low- and very-low-susceptibility classes. These maps satisfy two spatially effective rules [98]: (1) the existing landslide pixels should belong to the high-susceptibility class, and (2) the high-susceptibility class should cover only small areas.

Table 6

Characteristics of the four susceptibility zones of the four landslide susceptibility models obtained from RBF-SVM, PL-SVM, DT, and NB models.

Landslide susceptibility class	Percentage of area	Landslide density (RBF-SVM)	Landslide density (PL-SVM)	Landslide density (DT)	Landslide density (NB)
High	10.0	8.719	8.749	9.069	8.128
Moderate	10.0	0.740	0.660	0.571	0.791
Low	20.0	0.221	0.241	0.115	0.371
Very low	60.0	0.017	0.018	0.022	0.057
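The densities in Table 6 appear to be expressed as the percentage of landslide pixels in a class divided by the percentage of area of that class. This reading is an inference from the table values rather than a statement in the source. Under that assumption, multiplying each density by the class area recovers the share of landslide pixels per class, as the sketch below shows.

```python
# Percentage of area per class and landslide densities, copied from Table 6.
area_percent = {"high": 10.0, "moderate": 10.0, "low": 20.0, "very low": 60.0}
density = {
    "RBF-SVM": {"high": 8.719, "moderate": 0.740, "low": 0.221, "very low": 0.017},
    "PL-SVM": {"high": 8.749, "moderate": 0.660, "low": 0.241, "very low": 0.018},
    "DT": {"high": 9.069, "moderate": 0.571, "low": 0.115, "very low": 0.022},
    "NB": {"high": 8.128, "moderate": 0.791, "low": 0.371, "very low": 0.057},
}

def landslide_share(model):
    """Percentage of landslide pixels per class, assuming density = share / area."""
    return {cls: density[model][cls] * area_percent[cls] for cls in area_percent}

shares = {model: landslide_share(model) for model in density}
print([round(shares[m]["high"], 1) for m in density])  # [87.2, 87.5, 90.7, 81.3]
```

For every model the recovered shares sum to about 100%, and the values for the high class match the percentages quoted in the text, which supports this interpretation of the density column.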

5. Discussions and Conclusions

This paper presents a comparative study of three data mining approaches, SVM, DT, and NB, for landslide susceptibility mapping in the Hoa Binh province (Vietnam). The landslide inventory was constructed with 118 polygons of landslides that occurred during the last ten years. A total of ten landslide conditioning factors were used in this analysis: slope angle, lithology, rainfall, soil type, slope aspect, land use, distance to roads, distance to rivers, distance to faults, and relief amplitude. For building the models, a training dataset was extracted from 70% of the landslide inventory, whereas the remaining 30% was used to assess the prediction capability of the models. Using the three data mining algorithms, the landslide susceptibility maps were produced. These maps present spatial predictions of landslides; they do not include information on “when” and “how frequently” landslides will occur.

In the case of SVM, the selection of the kernel function and its parameters plays an important role in landslide susceptibility assessment. For the RBF function, the best kernel parameters C and γ are 8 and 0.25, respectively. For the PL function, the degree of the polynomial has a significant effect on the model. The SVM model with a polynomial degree of 3 has the highest accuracy, with best kernel parameters C and γ of 1 and 0.3536, respectively. In the case of DT, the probability that an observation belongs to the landslide class, estimated using Laplace smoothing, was used to calculate the landslide susceptibility index. For building the DT model, the selection of the MNI per leaf and the CF largely affects the accuracy of the model. In this study, the best decision tree model was found with an MNI per leaf of 6 and a CF of 0.35. The relative importance of the landslide conditioning factors is as follows: distance to roads, slope angle, land use, slope aspect, rainfall, relief amplitude, distance to rivers, distance to faults, lithology, and soil type. In the case of NB, the application to landslide modeling is relatively robust. The method is not time-consuming, and the techniques required are simple. The results of this study show that NB gives relatively good prediction capability.

Qualitative interpretation of the high landslide susceptibility classes of the four maps shows that they agree quite well with field evidence and assumptions. High landslide probability is concentrated in areas with active fault zones and road-cut sections. The landslide susceptibility maps were validated against the existing landslide locations using the success-rate and prediction-rate methods. The quantitative results show that all the landslide models have good prediction capability. The highest area under the success-rate curve (AUC) is for RBF-SVM (0.961), followed by PL-SVM (0.956), DT (0.938), and NB (0.935). The highest prediction-rate results are for RBF-SVM and PL-SVM, with areas under the prediction-rate curves (AUC) of 0.954 and 0.955, respectively. They are followed by NB (0.932) and DT (0.903). When compared with the results obtained from the logistic regression (Figure 8), the prediction capabilities of the two SVM models are slightly better. In contrast, the DT and NB models have lower prediction capability. The quantitative results of this study are comparable to those obtained in other studies, such as Brenning [99] and Yilmaz [35]. The findings of this study agree with Yao et al. [100], who state that SVM possesses better prediction efficiency than the logistic regression. Additionally, the findings agree with Marjanović et al. [101], who reported that SVM outperformed the logistic regression and DT. Similarly, the results agree with Ballabio and Sterlacchini [102], who concluded that SVM outperformed the logistic regression, linear discriminant, and NB.

The reliability of the landslide models was assessed using the Cohen kappa index (κ). In this study, the kappa indexes are 0.822, 0.823, and 0.860 for RBF-SVM, PL-SVM, and DT, respectively, indicating almost perfect agreement between the observed and the predicted values. The Cohen kappa index for NB is 0.722, indicating substantial agreement between the observed and the predicted values. The reliability analysis results are satisfactory compared with other works such as Guzzetti et al. [91] and Saito et al. [44].

Landslide susceptibility maps are considered to be a useful tool for territorial planning, disaster management, and natural hazard mitigation. This study shows that SVM can be considered a powerful tool for landslide susceptibility assessment with high accuracy. As a final conclusion, the results obtained from this study can provide very useful information for decision making and policy planning in landslide-prone areas.

Acknowledgments

This research was funded by the Norwegian Quota scholarship program. The data analysis and write-up were carried out as a part of the first author’s Ph.D. studies at the Geomatics Section, Department of Mathematical Sciences and Technology, Norwegian University of Life Sciences, Norway.

Sassa

Canuti

Landslides-Disaster Risk Reduction 2008

New York, NY, USA

Springer

Tien Bui

Lofman

Revhaug

Dick

Landslide susceptibility analysis in the Hoa Binh province of Vietnam using statistical index and logistic regression

Natural Hazards 2011 59 1413 1444

Tien Bui

Pradhan

Lofman

Revhaug

Dick

O. B.

Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS

Computers & Geosciences. In press

Tien Bui

Pradhan

Lofman

Revhaug

Dick

O. B.

Spatial prediction of landslide hazards in Hoa Binh province (Vietnam): a comparative assessment of the efficacy of evidential belief functions and fuzzy logic models

CATENA 2012 96 28 40

10.1016/j.catena.2012.04.001

Lee

Dan

Probabilistic landslide susceptibility mapping on the Lai Chau province of Vietnam: focus on the relationship between tectonic fractures and landslides

Environmental Geology 2005 48 6 778 787

Lee

Landslide susceptibility mapping using an artificial neural network in the Gangneung area, Korea

International Journal of Remote Sensing 2007 28 21 4763 4783

2-s2.0-41249092382

10.1080/01431160701264227

Pradhan

Use of GIS-based fuzzy logic relations and its cross application to produce landslide susceptibility maps in three test areas in Malaysia

Environmental Earth Sciences 2011 63 2 329 349

2-s2.0-79955068561

10.1007/s12665-010-0705-1

Pradhan

Lee

Regional landslide susceptibility analysis using back-propagation neural network model at Cameron Highland, Malaysia

Landslides 2010 7 1 13 30

2-s2.0-77149157821

10.1007/s10346-009-0183-2

Guzzetti

Reichenbach

Cardinali

Galli

Ardizzone

Probabilistic landslide hazard assessment at the basin scale

Geomorphology 2005 72 1–4 272 299

2-s2.0-28744438107

10.1016/j.geomorph.2005.06.002

Guzzetti

Carrara

Cardinali

Reichenbach

Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, central Italy

Geomorphology 1999 31 1–4 181 216

2-s2.0-0343069785

10.1016/S0169-555X(99)00078-1

Wang

Gangjun

Weiya

Gonghui

GIS-based landslide hazard assessment: an overview

Progress in Physical Geography 2005 29 4 548 567

2-s2.0-28144448823

10.1191/0309133305pp462ra

Chacón

Irigaray

Fernández

El Hamdouni

Engineering geology maps: landslides and geographical information systems

Bulletin of Engineering Geology and the Environment 2006 65 4 341 411

2-s2.0-33750974154

10.1007/s10064-006-0064-z

Ercanoglu

Gokceoglu

Assessment of landslide susceptibility for a landslide-prone area (north of Yenice, NW Turkey) by fuzzy approach

Environmental Geology 2002 41 6 720 730

2-s2.0-0036487950

10.1007/s00254-001-0454-2

Ercanoglu

Gokceoglu

Use of fuzzy relations to produce landslide susceptibility map of a landslide prone area (West Black Sea region, Turkey)

Engineering Geology 2004 75 3-4 229 250

2-s2.0-11344253904

10.1016/j.enggeo.2004.06.001

Pradhan

Sezer

E. A.

Gokceoglu

Buchroithner

M. F.

Landslide susceptibility mapping by neuro-fuzzy approach in a landslide-prone area (Cameron Highlands, Malaysia)

IEEE Transactions on Geoscience and Remote Sensing 2010 48 12 4164 4177

2-s2.0-78649318558

10.1109/TGRS.2010.2050328

Lee

Application and verification of fuzzy algebraic operators to landslide susceptibility mapping

Environmental Geology 2007 52 4 615 623

2-s2.0-34247618785

10.1007/s00254-006-0491-y

Akgun

Sezer

E. A.

Nefeslioglu

H. A.

Gokceoglu

Pradhan

An easy-to-use MATLAB program (MamLand) for the assessment of landslide susceptibility using a Mamdani fuzzy algorithm

Computers and Geosciences 2011 38 1 23 34

2-s2.0-79959888218

10.1016/j.cageo.2011.04.012

Pradhan

Application of an advanced fuzzy logic model for landslide susceptibility analysis

International Journal of Computational Intelligence Systems 2010 3 3 370 381

2-s2.0-78650712774

10.2991/ijcis.2010.3.3.12

Pradhan

Landslide susceptibility mapping of a catchment area using frequency ratio, fuzzy logic and multivariate logistic regression approaches

Journal of the Indian Society of Remote Sensing 2010 38 2 301 320

2-s2.0-78650613649

10.1007/s12524-010-0020-z

Pradhan

Manifestation of an advanced fuzzy logic model coupled with Geo-information techniques to landslide susceptibility mapping and their comparison with logistic regression modelling

Environmental and Ecological Statistics 2011 18 3 471 493

2-s2.0-77953823711

10.1007/s10651-010-0147-7

H. J.

Pradhan

Application of a neuro-fuzzy model to landslide-susceptibility mapping for shallow landslides in a tropical hilly area

Computers and Geosciences 2011 37 9 1264 1276

2-s2.0-79951551776

10.1016/j.cageo.2010.10.012

Vahidnia

M. H.

Alesheikh

A. A.

Alimohammadi

Hosseinali

A GIS-based neuro-fuzzy procedure for integrating knowledge and data in landslide susceptibility mapping

Computers and Geosciences 2010 36 9 1101 1114

2-s2.0-77955925179

10.1016/j.cageo.2010.04.004

Lee

Ryu

J. H.

Min

Won

J. S.

Landslide susceptibility analysis using GIS and artificial neural network

Earth Surface Processes and Landforms 2003 28 12 1361 1376

2-s2.0-0345566244

10.1002/esp.593

Lee

Ryu

J. H.

Won

J. S.

Park

H. J.

Determination and application of the weights for landslide susceptibility mapping using an artificial neural network

Engineering Geology 2004 71 3-4 289 302

2-s2.0-1642505737

10.1016/S0013-7952(03)00142-X

Catani

Casagli

Ermini

Righini

Menduni

Landslide hazard and risk mapping at catchment scale in the Arno River basin

Landslides 2005 2 4 329 342

2-s2.0-29144505329

10.1007/s10346-005-0021-0

Ermini

Catani

Casagli

Artificial neural networks applied to landslide susceptibility assessment

Geomorphology 2005 66 1–4 327 343

2-s2.0-14844322351

10.1016/j.geomorph.2004.09.025

Pradhan

Lee

Buchroithner

M. F.

A GIS-based back-propagation neural network model and its cross-application and validation for landslide susceptibility analyses

Computers, Environment and Urban Systems 2010 34 3 216 235

2-s2.0-77952010906

10.1016/j.compenvurbsys.2009.12.004

Yilmaz

A case study from Koyulhisar (Sivas-Turkey) for landslide susceptibility mapping by artificial neural networks

Bulletin of Engineering Geology and the Environment 2009 68 3 297 306

2-s2.0-68249146110

10.1007/s10064-009-0185-2

Pradhan

Buchroithner

M. F.

Comparison and validation of landslide susceptibility maps using an artificial neural network model for three test areas in Malaysia

Environmental and Engineering Geoscience 2010 16 2 107 126

2-s2.0-79955066106

10.2113/gseegeosci.16.2.107

Yilmaz

The effect of the sampling strategies on the landslide susceptibility mapping by conditional probability and artificial neural networks

Environmental Earth Sciences 2010 60 3 505 519

2-s2.0-77954162270

10.1007/s12665-009-0191-5

Yilmaz

Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: a case study from Kat landslides (Tokat-Turkey)

Computers and Geosciences 2009 35 6 1125 1138

2-s2.0-64949164773

10.1016/j.cageo.2008.08.007

Pradhan

Lee

Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling

Environmental Modelling and Software 2010 25 6 747 759

2-s2.0-76749088419

10.1016/j.envsoft.2009.10.016

Yesilnacar

Topal

Landslide susceptibility mapping: a comparison of logistic regression and neural networks methods in a medium scale study, Hendek region (Turkey)

Engineering Geology 2005 79 3-4 251 266

2-s2.0-21044454864

10.1016/j.enggeo.2005.02.002

Nefeslioglu

H. A.

Gokceoglu

Sonmez

An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps

Engineering Geology 2008 97 3-4 171 191

2-s2.0-40649084527

10.1016/j.enggeo.2008.01.004

Yilmaz

Comparison of landslide susceptibility mapping methodologies for Koyulhisar, Turkey: conditional probability, logistic regression, artificial neural networks, and Support Vector Machine

Environmental Earth Sciences 2010 61 4 821 836

2-s2.0-77955087655

10.1007/s12665-009-0394-9

Poudyal

C. P.

Chang

H. J.

Lee

Landslide susceptibility maps comparing frequency ratio and artificial neural networks: a case study from the Nepal Himalaya

Environmental Earth Sciences 2010 61 5 1049 1064

2-s2.0-77955709296

10.1007/s12665-009-0426-5

Pradhan

Remote sensing and GIS-based landslide hazard analysis and cross-validation using multivariate logistic regression model on three test areas in Malaysia

Advances in Space Research 2010 45 10 1244 1256

2-s2.0-77950518811

10.1016/j.asr.2010.01.006

Miner

A. S.

Vamplew

Windle

D. J.

Flentje

Warner

Williams

A. L.

Pinches

G. M.

Chin

C. Y.

McMorran

T. J.

A comparative study of various data mining techniques as applied to the modeling of landslide susceptibility on the Bellarine Peninsula, Victoria, Australia

Geologically Active 2010

New York, NY, USA

CRC Press

352

Wan

Lei

T. C.

A knowledge-based decision support system to analyze the debris-flow problems at Chen-Yu-Lan River, Taiwan

Knowledge-Based Systems 2009 22 8 580 588

2-s2.0-70349781654

10.1016/j.knosys.2009.07.008

Kumar

Ross

Q. J.

Ghosh

Yang

Motoda

McLachlan

G. J.

Liu

P. S.

Zhou

Z. H.

Steinbach

Hand

D. J.

Steinberg

Top 10 algorithms in data mining

Knowledge and Information Systems 2008 14 1 1 37

2-s2.0-37549018049

10.1007/s10115-007-0114-2

Bai

S. B.

Wang

G. N.

Kanevski

Pozdnoukhov

GIS-Based landslide susceptibility mapping with comparisons of results from machine learning methods versus logistic regression in basin scale

Geophysical Research Abstracts, EGU 2008 10,A-06367

Micheletti

Foresti

Kanevski

Pedrazzini

Jaboyedoff

Landslide susceptibility mapping using adaptive Support Vector Machines and feature selection

Geophysical Research Abstracts, EGU 2011 13

Yeon

Y. K.

Han

J. G.

Ryu

K. H.

Landslide susceptibility mapping in Injae, Korea, using a decision tree

Engineering Geology 2010 116 3-4 274 283

2-s2.0-78149413054

10.1016/j.enggeo.2010.09.009

Saito

Nakayama

Matsuyama

Comparison of landslide susceptibility based on a decision-tree model and actual landslide occurrence: the Akaishi mountains, Japan

Geomorphology 2009 109 3-4 108 121

2-s2.0-67349208776

10.1016/j.geomorph.2009.02.026

Nefeslioglu

H. A.

Sezer

Gokceoglu

Bozkir

A. S.

Duman

T. Y.

Assessment of landslide susceptibility by decision trees in the metropolitan area of Istanbul, Turkey

Mathematical Problems in Engineering 2010 2010

2-s2.0-79953278155

10.1155/2010/901095

901095

Ratanamahatana

C. A.

Gunopulos

Feature selection for the naive Bayesian classifier using decision trees

Applied Artificial Intelligence 2003 17 5-6 475 487

2-s2.0-0242498509

Tzu-Tsung

A hybrid discretization method for naïve Bayesian classifiers

Pattern Recognition 2012 45 6 2321 2325

10.1016/j.patcog.2011.12.014

Soria

Garibaldi

J. M.

Ambrogi

Biganzoli

E. M.

Ellis

I. O.

A “non-parametric” version of the naive Bayes classifier

Knowledge-Based Systems 2011 24 6 775 784

2-s2.0-79957522106

10.1016/j.knosys.2011.02.014

Kazmierska

Malicki

Application of the naïve Bayesian classifier to optimize treatment decisions

Radiotherapy and Oncology 2008 86 2 211 216

2-s2.0-38949131864

10.1016/j.radonc.2007.10.019

Chang

C.-C.

Lin

C.-J.

LIBSVM : a Library for Support Vector Machines 2011

New York, NY, USA

ACM Transactions on Intelligent Systems and Technology

Malamud

B. D.

Turcotte

D. L.

Guzzetti

Reichenbach

Landslide inventories and their statistical properties

Earth Surface Processes and Landforms 2004 29 6 687 711

2-s2.0-3042702986

10.1002/esp.1064

Vergari

Della Seta

Del Monte

Fredi

Lupia Palmieri

Landslide susceptibility assessment in the Upper Orcia Valley (Southern Tuscany, Italy) through conditional analysis: a contribution to the unbiased selection of causal factors

Natural Hazards and Earth System Science 2011 11 5 1475 1497

2-s2.0-79957806054

10.5194/nhess-11-1475-2011

Van

T. T.

Anh

D. T.

Hieu

H. H.

Investigation and Assessment of the Current Status and Potential of Landslide in Some Sections of the Ho Chi Minh Road, National Road 1A and Proposed Remedial Measures to Prevent Landslide from Threat of Safety of People, Property, and Infrastructure 2006

Hanoi, Vietnam

Vietnam Institute of Geoscience and Mineral Resources

Arikan

Ulusay

Aydin

Characterization of weathered acidic volcanic rocks and a weathering classification based on a rating system

Bulletin of Engineering Geology and the Environment 2007 66 4 415 430

2-s2.0-34548745235

10.1007/s10064-007-0087-0

Vapnik

V. N.

Statistical Learning Theory 1998

New York, NY, USA

Wiley-Interscience

1641250

Abe

Support Vector Machines for Pattern Classification 2010

London, UK

Springer

2841444

Cortes

Vapnik

Support-vector networks

Machine Learning 1995 20 3 273 297

2-s2.0-34249753618

10.1007/BF00994018

Samui

Slope stability analysis: a Support Vector Machine approach

Environmental Geology 2008 56 2 255 267

2-s2.0-54949157726

10.1007/s00254-007-1161-4

Damaševičius

Optimization of SVM parameters for recognition of regulatory DNA sequences

Top 2011 18 2 339 353

2-s2.0-78650310572

10.1007/s11750-010-0152-x

Song

Zhan

Long

Zhang

Yao

Comparative study of SVM methods combined with voxel selection for object category classification on fMRI data

PLoS ONE 2011 6 2

2-s2.0-79951893920

10.1371/journal.pone.0017191

e17191

Keerthi

S. S.

Lin

C. J.

Asymptotic behaviors of Support Vector Machines with gaussian kernel

Neural Computation 2003 15 7 1667 1689

2-s2.0-0037822222

10.1162/089976603321891855

Lin

H.-T.

Lin

C.-J.

A study on sigmoid kernels for SVM and the training of non-PSD kernels by SMO-type methods

2003

Taipei, Taiwan

National Taiwan University

Zhu

Zhang

Jin

Zhang

Missing value estimation for mixed-attribute data sets

IEEE Transactions on Knowledge and Data Engineering 2011 23 1 110 121

2-s2.0-78649402552

10.1109/TKDE.2010.99

Damaševičius

Structural analysis of regulatory DNA sequences using grammar inference and Support Vector Machine

Neurocomputing 2010 73 4–6 633 638

2-s2.0-75749156538

10.1016/j.neucom.2009.09.018

Ali

Smith

K. A.

Automatic parameter selection for polynomial kernel

Proceedings of the IEEE International Conference on Information Reuse and Integration (IRI '03)

October 2003

243 249

Mattera

Haykin

Support Vector Machines for dynamic reconstruction of a chaotic system

Advances in Kernel Methods 1999

Cambridge, Mass, USA

MIT Press

211 241

Chapelle

Vapnik

Bousquet

Mukherjee

Choosing multiple parameters for Support Vector Machines

Machine Learning 2002 46 1–3 131 159

2-s2.0-0036161011

10.1023/A:1012450327387

Platt

Probabilistic Outputs for Support Vector Machines and Comparison to Regularized Likelihood Methods 2000

Cambridge, Mass, USA

MIT Press

Cherkassky

Mulier

Learning from Data: Concepts, Theory and Methods 2007

New York, NY, USA

John Wiley and Sons

2334401

Zhuang

Dai

Parameter optimization of kernel-based one-class classifier on imbalance text learning

PRICAI 2006: Trends in Artificial Intelligence, Proceedings 2006 4099 434 443

Nandi

A. K.

Breast cancer detection from FNA using SVM with different parameter tuning systems and SOM-RBF classifier

Journal of the Franklin Institute 2007 344 3-4 285 311

2-s2.0-33947492207

10.1016/j.jfranklin.2006.09.005

Myles

A. J.

Feudale

R. N.

Liu

Woody

N. A.

Brown

S. D.

An introduction to decision tree modeling

Journal of Chemometrics 2004 18 6 275 285

2-s2.0-9444289262

10.1002/cem.873

Debeljak

Džeroski

Jopp

Reuter

Breckling

Decision trees in ecological modelling

Modelling Complex Ecological Dynamics 2011

Berlin, Germany

Springer

197 209

Murthy

S. K.

Automatic construction of decision trees from data: a multi-disciplinary survey

Data Mining and Knowledge Discovery 1998 2 4 345 389

2-s2.0-0002431740

Bou Kheir

Greve

M. H.

Abdallah

Dalgaard

Spatial soil zinc content distribution from terrain parameters: a GIS-based decision-tree model in Lebanon

Environmental Pollution 2010 158 2 520 528

2-s2.0-74249083880

10.1016/j.envpol.2009.08.009

Tso

G. K. F.

Yau

K. K. W.

Predicting electricity energy consumption: a comparison of regression analysis, decision tree and neural networks

Energy 2007 32 9 1761 1768

2-s2.0-34250170125

10.1016/j.energy.2006.11.010

Zhao

Zhang

Comparison of decision tree methods for finding active objects

Advances in Space Research 2008 41 12 1955 1959

2-s2.0-43049157582

10.1016/j.asr.2007.07.020

Breiman

Friedman

J. H.

Olshen

R. A.

Stone

C. J.

Classification and Regression Trees 1984

Belmont, Calif, USA

Wadsworth

726392

Berry

M. J. A.

Linoff

G. S.

Data Mining Techniques: For Marketing, Sales, and Customer Support 1997

New York, NY, USA

Wiley

Quinlan

J. R.

Induction of decision trees

Machine Learning 1986 1 1 81 106

2-s2.0-33744584654

10.1007/BF00116251

Quinlan

J. R.

C4.5: Programs for Machine Learning 1993

San Mateo, Calif, USA

Morgan Kaufmann

Witten

I. H.

Frank

Data Mining: Practical Machine Learning Tools and Techniques 2005 2nd

Los Altos, Calif, USA

Morgan Kaufmann

Lim

T. S.

Loh

W. Y.

Shih

Y. S.

Comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms

Machine Learning 2000 40 3 203 228

2-s2.0-0034274591

10.1023/A:1007608224229

Cho

J. H.

Kurup

P. U.

Decision tree approach for classification and dimensionality reduction of electronic nose data

Sensors and Actuators B 2011 160 1 542 548

Tran

V. T.

Yang

B. S.

M. S.

Tan

A. C. C.

Fault diagnosis of induction motor based on decision trees and adaptive neuro-fuzzy inference

Expert Systems with Applications 2009 36 2 1840 1849

2-s2.0-56349137294

10.1016/j.eswa.2007.12.010

Provost

Domingos

Tree induction for probability-based ranking

Machine Learning 2003 52 3 199 215

2-s2.0-0042346121

10.1023/A:1024099825458

Xie

Zhang

Hsu

Lee

Zhou

Ooi

Meng

Enhancing SNNB with local accuracy estimation and ensemble techniques

3453

Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA '05)

April 2005

Beijing, China

Springer

983

Murakami

Mizuguchi

Applying the Naïve Bayes classifier with kernel density estimation to the prediction of protein-protein interaction sites

Bioinformatics 2010 26 15 1841 1848

2-s2.0-77955036815

10.1093/bioinformatics/btq302

Cohen

A coefficient of agreement for nominal scales

Educational and Psychological Measurement 1960 20 1 37 46

Hoehler

F. K.

Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity

Journal of Clinical Epidemiology 2000 53 5 499 503

2-s2.0-0034095523

10.1016/S0895-4356(99)00174-2

Guzzetti

Reichenbach

Ardizzone

Cardinali

Galli

Estimating the quality of landslide susceptibility models

Geomorphology 2006 81 1-2 166 184

2-s2.0-33749586744

10.1016/j.geomorph.2006.04.007

Landis

J. R.

Koch

G. G.

The measurement of observer agreement for categorical data

Biometrics 1977 33 1 159 174

2-s2.0-0017360990

Pradhan

Lee

Delineation of landslide hazard areas on Penang island, Malaysia, by using frequency ratio, logistic regression, and artificial neural network models

Environmental Earth Sciences 2010 60 5 1037 1054

2-s2.0-77954084069

10.1007/s12665-009-0245-8

Wang

C. M.

Huang

Y. F.

Evolutionary-based feature selection approaches with new criteria for data mining: a case study of credit approval data

Expert Systems with Applications 2009 36 3 5900 5908

2-s2.0-58349092287

10.1016/j.eswa.2008.07.026

Chung

C. J. F.

Fabbri

A. G.

Validation of spatial prediction models for landslide hazard mapping

Natural Hazards 2003 30 3 451 472

2-s2.0-0742288013

10.1023/B:NHAZ.0000007172.62651.2b

Lee

Ryu

J. H.

Kim

I. S.

Landslide susceptibility analysis and its verification using likelihood ratio, logistic regression, and artificial neural network models: case study of Youngin, Korea

Landslides 2007 4 4 327 338

2-s2.0-36849021068

10.1007/s10346-007-0088-x

Sarkar

Kanungo

D. P.

Patra

A. K.

Kumar

GIS based spatial data analysis for landslide susceptibility mapping

Journal of Mountain Science 2008 5 1 52 62

2-s2.0-41849148998

10.1007/s11629-008-0052-9

Can

Nefeslioglu

H. A.

Gokceoglu

Sonmez

Duman

T. Y.

Susceptibility assessments of shallow earthflows triggered by heavy rainfall at three catchments by logistic regression analyses

Geomorphology 2005 72 1–4 250 271

2-s2.0-28744433717

10.1016/j.geomorph.2005.05.011

Brenning

Spatial prediction models for landslide hazards: review, comparison and evaluation

Natural Hazards and Earth System Science 2005 5 6 853 862

2-s2.0-30844446670

Yao

Tham

L. G.

Dai

F. C.

Landslide susceptibility mapping based on Support Vector Machine: a case study on natural slopes of Hong Kong, China

Geomorphology 2008 101 4 572 582

2-s2.0-52949147068

10.1016/j.geomorph.2008.02.011

Marjanović

Kovačević

Bajat

Voženílek

Landslide susceptibility assessment using SVM machine learning algorithm

Engineering Geology 2011 123 3 225 234

Ballabio

Sterlacchini

Support Vector Machines for landslide susceptibility mapping: the Staffora River Basin case study, Italy

Mathematical Geosciences 2012 44 1 47 70