Spatial Approach of Artificial Neural Network for Solar Radiation Forecasting: Modeling Issues

Design of neural networks architecture has been done on setting up the number of neurons, delays, and activation functions. The expected model was initiated and tested with Indian solar horizontal irradiation (GHI) metrological data. The results are assessed using the effect of different statistical errors. The effort is made to verify simulation capability of ANN architecture accurately, on hourly radiation data. ANN model is a well-organized technique to estimate the radiation using different meteorological database. In this paper, we have used nine spatial neighbour locations and 10 years of data for assessment of neural network. Hence, overall 90 different inputs are compared, on customized ANN model. Results show the flexibility with respect to spatial orientation of model inputs.


Introduction
Utility working in the field of solar energy production is compulsory to develop its forecast ability in diverse climatic situation [1,2]. Unusual fluctuations occur in direct and diffuse incident irradiation (up to 100%), due to existence of clouds [3,4].
In recent years, artificial neural networks (ANN) have been used for forecasting and regression of solar radiation in different latitudes and climate conditions. Since its development, no accurate method has been found to solve stability and uncertainty of delayed networks. Therefore, delayed network stability study has gain additional importance. In recent years, awareness of delayed neural networks has increased. It is necessary for the ANN to appear in globally stable region. A numbers of method have been developed to determine optimum numeral of delays, especially, by means of cross validation [5][6][7][8][9][10][11].
Most important works has been done towards using of activation functions. Jordan obtain the logistic function which is a standard design of the next prospect in a binary classification convolution [17]. Yao and Liu enhanced the structure of complete networks with two strange activation function, those are sigmoid and Gaussian basis [18]. Sopena et al. available number of test appearance on multilayer feed forward networks used a sine activation function [19,20]. However, the difficulty along with transfer functions is not a theoretical condition for their selection [21].
The literature assessment demonstrates that the ANNs have not been used for spatial domain analysis. In this paper, consider such special characteristics of spatial domain with solar radiation time series data. Associated study considered source of 3×3 spatial matrixes and 10 years of time series. Spatiotemporal aspect has been used for ANN analysis. Results

Data Collection.
The default value of the radiation area is about 1360 Wm −2 and carries diffuse horizontal radiation (GHI) and direct normal irradiation (DNI). These two terms are used to calculate the global horizontal irradiation (GHI) as follows: where Φ is the solar zenith angle [22]. Longitudinal data for India in the form of 10 kilometers (0.1 ∘ × 0.1 ∘ resolution) points in space [23] are available. Sunny satellites are used every hour from January 2003 to June 2012. This work includes the latitude and longitude 31.85 ∘ -33.65 ∘ and 74.65 ∘ -78.45 ∘ accumulated in GHI in northern India, as shown in Figure 1. The data are in a rectangular grid 400 × 400 km 2 placed. The GHI exists in a particular time zone 5.5 to a central location of the proposed area in Figure 2 for 5,000 hours of 2008.

Data
Processing. Important approach of data mining is used to scale the input and target in the ANN. Thus, the normalization is used with the standard deviation and the mean of the training data set. Therefore, for the data of solar radiation with a zero mean and standard deviation unit of the next equation: where , mean , std , , mean , and std data sets are data, mean, and slandered deviation of target and data and mean and slandered deviation of training set, respectively. The data sets are in form of time series and require interpreting into spatial domain. The input data sets are prepared primarily in physical position allocating to topology function. Using a rectangular grid function may also be used similar to hexagon or random topology. It begins with input data in a rectangular grid similar to that shown in Figure 3 for an instance. Assume that spatial data are in 3 × 3 arrays of nine different locations. The input 1 has position (1, 1), input 2 has the position (1, 2), input 3 has the positions (1, 3) and (2, 1), and so forth. Another three-dimensional topology of spatial data set is shown in Figure 4. The center input has neighbourhoods of increasing diameter nearby it. A neighbourhood of diameter 1 includes the center and its instant neighbours. The neighbourhood of diameter 2 consists of the diameter 1 and its immediate neighbours. The rectangular topology function and all the neighbourhoods for a multiple input map are characterized by an -by-matrix of distance. 3   through information [24]. Figure 2 shows a classic demonstration for a neuron architecture where 1 , 2 , 3 , . . . , , 1 , 2 , 3 , . . . , , , , , and (⋅) are the signals, weights, bias, activation potential, output signal, and activation function, respectively. Afterwards, one can supervise that the neuron efficiency is given by

Artificial Neural Network (ANN)
Such usual network architecture is usually referred to as a multilayer neural network [25]. It is based on its topology and the amount of the weights in the input layer. The simplification of an artificial neural network is the capacity of replicating preferred signals for different input signals and the capacity of holding the dynamics of the system [26]. However, to determine number of neurons in each layer is not trivial. Research articles have explained cases where underfitting and overfitting might occur when smaller and larger numbers of neurons are used in the network [16].
Journal of Solar Energy     Numerous approaches have been used without success to find the appropriate method for computing the number of neurons in each layer [27]. However, due to their simplicity, these methods have been extensively utilized in time series analysis [28]. The ultimate outcome of ANN is group of weights and input variables for linear and nonlinear process [29,30]. In this paper, we have used one-output nodes in the outer layer for forecasting of GHI. The accuracy of the various ANN models is compared to the most correct model of hourly solar radiation, as described below.
Custom Network. Start network designing using toolbox offer special choice. To construct custom arrangements, start with an empty network and set its properties as special as shown in Figure 6. The network used numerous function properties that have been set in many ways, as desired for network architecture. In this section use the slight normal network and different spatial inputs. The input network recognizes normalized value range from −1 to +1 of radiation. The number of layers used for this network is two, initialized with the Nguyen Widrow layer initialization method and trained with the Levenberg-Marquardt as described below.
To train greater number of layers it needs additional time.
While there are simply 2 layers in solar radiation network for default custom models that can be trained for 1000 epochs, the literature described in terms of accuracy, when using 2 layers in ANN, is much higher than that of higher layers. Hence, we can consider the network with 2 hidden layers, which provides the highest precision value, as the most appropriate network for this problem. Any output vectors in output layer will learn to associate the connected target vectors with minimal mean squared error including weights and biases [29][30][31][32].

Neural Network Training.
The LMA deliver enhanced performance as soon as being connected with classic back propagation processes. Due to Newton's technique the networks modernized law is where , , , , , , and are the network weight matrix, number of repetitions, the Hessian matrix, the gradient matrix, the Jacobian matrix, the identity matrix, and a scalar, respectively [25]. The projected networks were trained by 70% of the delivered data, whereas the continuing 15% was used for validation and remaining 15% was for testing the trained network. Thereafter, the trained networks were used for forecast using last 100 days of data. The suggested ANNs forecast the solar radiation at center point of position 5 from Figure 3 (in terms of 2 × 2 matrix) from 2012 year of data, and at that time the forecast effects were equated through the measured data. Individually inputs of 9 spatial locations and 10 year of data (total 90) were verified separately by the hourly radiation values, and all of the suggested inputs were matched collectively by the RMSE values of solar radiation. These special networks were compared with training RMSE error.

Model Evaluation Criteria
Finally, the model has been selected based on the lowest forecasting error. The estimation of error can be many forms such as root mean square error (RMSE) and MAE (mean absolute error): In (5) there are different expressions for error estimation where represents measured value at forecasted horizon and̂is forecasted value. Also represents the total number of test samples. This validation process defines the model accuracy and stop iteration process for ANN model [33].

Model Behavior with Delays.
In this section, we used one input at a time with delay out of 90 different radiation data. The hidden layer required was often (10) ten neurons; the input delay varies from one to thirty (30). For GHI, the wrong alarm frequency was constantly low while being trained on the maximum amount of past data [34]. All configuration models are tested for 30 times at continuous time delay of 1 to 30. The network has to optimize at minimum root mean square (RMSE) on the training data set [28]. Figure 7 and Table 1 illustrate an evaluation of measured and forecasted values by the suggested ANNs; this evaluation was built on hourly averages of global radiations. Established radiation on input numbers (from the   applied in different ANN inputs. The delay decides the convergence property of ANN inputs. Table 1 shows that some of the input converges very fast with minimum error as IN-51 with 0 time delays and around 23.89 percent of testing RMSE error. Similarly other comparable methods of IN-16 show the 23.29 percent of testing RMSE error but long time delay (29). The hourly correlation factor between two clear skies days is almost one day, which is perfect for time series prediction considering the same interval. In terms of spatial analysis top best five testing results show percentage in RMSE 19.63,20.07,20.17,20.66, and 20.71 of neighbour positions 11(1), 13(7), 33(9), 12(4), and 21 (2)    Journal of Solar Energy a n a n a n a n a n a n a n a a n n Figure 9: ANN models used with different transfer function. of delay is random, some inputs perform well with higher delay and some perform at lower delay, it all depends on property of data with respect to time. Table 2 demonstrates that the projected neuron model offers superior results for evaluation. The current technique is used to control the number of neurons based on trial-and-error. It starts with minimum number of neurons and increased neuron to its maximum limit. The drawback is that it is time consuming and there is no surety of setting the neuron. The particular measures for 90 inputs used range of 10-300 neurons at interval of 10 neurons and target for a minimized MAE value. Since the smallest RMSE demonstrates the estimation method accuracy at local level or small number of data sets, MAE indicates global accuracy. In this case previous obtained outcome delays are used and standard transfer functions, hyperbolic tangent sigmoid (tansig), for respective numbers of inputs, are used as well.

Model Behavior with Neurons.
The accuracy was tested by using the training sets results in each case. Figure 8 and Table 2 show the results with all errors, performance coefficients, delays, and different neurons for each input. It is preferred that simplification capability increased while the number of neurons is increased. In solar radiation estimation problem, the precision degree of the productivity was 3.61, 4.63, and 3.98% in training, validation, and testing, respectively, while 190 neurons are

Model Behavior with Multiple Transfer Functions (Activation Functions).
In this study, different activation functions are used as shown in Figure 9 that depend on different number of iterations for comparing their performances on radiation data. For every standard activation function, we used the number of neurons in the hidden layer as mentioned in neuron section for different inputs shown in Figure 8. After presenting in Table 3 different performance parameters, 3.08 * * 3.18 * * 17.83 * * 0.73 * * 24.51 * * 12 * * 3 * * 130 * * compet * * 3 * * 1 * * 2 * * 6 * * * 3.12 * * * 3.41 * * * 3.13 * * * 3.18 * * * 17.83 * * * 0.63 * * * 22.2 * * * 12 * * * 1 * * * 10 * * * hardlims * * * 2 * * * 3 * * * 3 * * *  interpretations of their results will follow here. We used the different number of delays in the first section and different neurons in the second section and to compare different inputs with differences activation functions are described here. According to the first graph from Figure 10 generated by the all inputs are compare with the different activation functions. However, each input performs significantly different behavior with its one kind of function. Satlins activation function results perform the most successful one, for all the test parameters. However, the accuracy of the inputs was totally different in training and testing cases; higher accurate result (lowest MAE value) shows the inverse effect during testing case. This may be the case of over-or underfitting of model with data. The rest of the functions used in this study were not successful and accurate enough in group. Table 3 shows logsig function with IN-69 reporting the best activation function for training. However, testing results show that the accuracy of satlins activation function with IN-11 was much better. This situation explains that total mean absolute error (MAE) according to iterations cannot determine the network accuracy. Hence, we obtain the real accuracy based on testing results. Table 3 shows the error for each activation function for different numbers of inputs, which vary from 1 to 72 after removing the last two years (2003 and 2004) of data due to  its poor performance. In terms of spatial analysis top best five percentage MAE testing results show 3.04, 3.08, 3.13, 3.25, and 3.29% of positions 12(4), 31(3), 23(8), 33 (9), and 23(8) with transfer functions "satlins, " "compet, " "hardlims, " "hardlim, " and "purelin, " respectively. However, compare with the worse results, which are MAE test error 6.68, 6.69, 6.76, 6.9, and 7.31% of position 32(5), 31(7), 11(1), 22 (5), and 33(9) with transfer function "tansig, " "logsig, " "hardlim, " "poslin, " and "compet, " respectively. This result shows no special pattern related to spatial position; even then the target position is 22 (5) not directly related to the same position 22 (5) of input in terms of performance; therefore it is important to consider the neighbor position in the modeling of ANN. As it is not a direct relation between input and output irrespective of position, the modeling needs expert supervision. On the other hand temporal analysis shows clear pattern between input and output with respect to different years of data. In the

Conclusion
This document includes the modeling characteristic of artificial neural networks based on spatial feature. The estimated model was initiated and tested on solar radiation data. The results are evaluated with different statistical error. This document certifies the ability of ANN to accurately reproduce hour's global radiation forecast. The estimation accuracy of the hourly solar radiation can be achieved by using conventional meteorological data of ten years. This model has been used extensively for the specific application; due to the dynamic nature, modeling needs professional advice. This section gives more value to the parameter that shows the progress of the ANN architecture, the delay, neurons, and the corresponding transfer function of spatial position ( Figure 5). The results show a high degree of flexibility in the choice of different inputs and connected parameters for comparative accuracy.