Determining Bal J kesir ’ s Energy Potential Using a Regression Analysis Computer Program

Solar power and wind energy are used concurrently during specific periods, while at other times only themore efficient is used, and hybrid systems make this possible. When establishing a hybrid system, the extent to which these two energy sources support each other needs to be taken into account. This paper is a study of the effects of wind speed, insolation levels, and the meteorological parameters of temperature and humidity on the energy potential in Balıkesir, in the Marmara region of Turkey. The relationship between the parameters was studied using a multiple linear regression method. Using a designed-for-purpose computer program, two different regression equations were derived, with wind speed being the dependent variable in the first and insolation levels in the second. The regression equations yielded accurate results. The computer program allowed for the rapid calculation of different acceptance rates.The results of the statistical analysis proved the reliability of the equations. An estimate of identifiedmeteorological parameters and unknown parameters could be produced with a specified precision by using the regression analysis method. The regression equations also worked for the evaluation of energy potential.


Introduction
The growing global energy problem highlights the increasing need for new and renewable energy sources on a daily basis, with wind and solar energy being the most obvious.According to the Global Wind Energy Council the total wind energy capacity of Turkey was 1329 MW in 2010 [1].The annual average wind speed in the Marmara region for heights of 30, 50, 70, and 100 meters is 5.5-9.5 m/sec, as revealed in a study of Turkey's wind energy potential and energy profile [2].The annual average wind speed and solar energy of the Marmara region are 3.29 m/sec and 51.91 W/m 2 , respectively, which is considerably higher than most of the other regions of Turkey [3].Turkey has the potential to generate electricity from both wind and solar sources.Wind can be unpredictable throughout the year, season, or even during a single day and varies by geographical region [4].According to the European Commission Photovoltaic Geographical Information System's analysis of Turkey, the yearly insolation per meter square is 2,040 kwh/m 2 , demonstrating the country's considerable solar potential [5].Considering the unreliability of solar energy due to variances in insolation levels, recent research has stated that the reliability of the system can be increased through the use of two or more complementary alternative energy sources [6].
Studies into wind-solar hybrid systems have been based on the establishment of different systems, and analyses of their potential have been carried out to identify their efficiency.Systems that incorporate two or more renewable energy sources are called hybrid power systems, which may be used to resolve the variability issues related to renewable energy sources [7].According to the forecast by Pernick et al. (2012), the market share of renewable energy will reach a total of $400 billion in the next 10 years.For 2011, the values of power produced from wind energy and solar batteries, which are considered the most important renewable energy sources of the future, were $71.5 billion and $91.6 billion, respectively.Support for wind-solar industries and their complementary characteristics has led to the development of hybrid energy systems in which wind-solar and wind-micro hydroelectric generators are used together.The use of more than one renewable energy source in electricity production increases initial start-up costs, as well as making the structure and design of the system complex [8].Although there is enough energy for the system to produce electricity from a solar-only source between April and November in the western and the southern regions of Turkey, a supplementary source is necessary for the winter months, which can be provided by wind energy in areas with adequate wind potential [9].El-Shatter et al. carried out an analysis of the movements of the sun in order to gain maximum power from the hybrid system [10].Rivera, using his established hybrid system, calculated the effect of system elements on electrical energy production, ascertaining that 25 percent of the energy produced was lost by the invertors, the regulator, the turbine, and the accumulators [11].Although there are a number of statistical studies of meteorological data, they tend to be specific studies of particular regions in the countries, where the researchers were based.Ge and Xiang carried out a statistical analysis to calculate the average wind speed in Shanghai, China.In the study, data related to wind speed and direction were used (the greatest value was recorded every 2 hours) [12].Anagreh et al. examined the distribution of the data using Skew-Normal (SN), Weibull, Chi-Square, and Kolmogorov-Smirnov (KS) statistical tests in order to evaluate the renewable energy potential with wind speed and solar radiation measurements.The data showed that Aqaba had significant potential to support hybrid renewable energy systems [13].Sanchez et al. in their study reviewed numerous works that have developed different models with the objective of detecting hail precipitation using meteorological radar.For kinetic energy "methodology based on ingredients" has enabled us to explain the results derived from a meteorological perspective, while using exclusively statistical criteria when constructing the logistic prediction model.Once the necessary ingredients were identified, the stepwise method for the introduction of variables was used to construct a logistic prediction model for kinetic energy [14].Ocampo and Rivas, in their study, highlight that knowledge of daily net radiation (Rn) is a basic requirement for quantifying energy used in various processes occurring at the surface levels such as evapotranspiration.This study applies a multiple linear regression model (MRLM) for the estimation of Rn in a subhumid-humid zone of Argentina.In the model they used weather data of solar radiation, temperature, relative humidity, Rn (measured with a Kipp & Zonen net radiometer), and inverse relative earthsun distances.As a result, eight estimation equations of Rn were derived.The results showed a good adjustment and a low error on a daily scale, highlighting those equations involving solar radiation, temperature, relative humidity, and inverse earth-sun distance, allowing calculation of Rn with errors less than 19 W/m 2 [15].In the study by Almorox et al., for locations where measured values are not available, several models have been developed to estimate solar radiation.All models were based on easily available meteorological variables, without using hours of sunshine as input, and were used to estimate the daily solar radiation at Canada de Luque (Cordoba, Argentina).The results showed that all the analyzed models were robust and accurate ( 2 values between 0.87 and 0.89), so global radiation can be estimated accurately with easily available meteorological variables when only temperature data is available.The daily global solar radiation values produced by these models can be used in hydrologic or agricultural applications to estimate missing daily values, when only temperature data is available [16].
The purpose of this study is to identify the wind and solar energy potential of the Balıkesir University C ¸agıs ¸region through the use of equations obtained from statistical studies and to make accurate forecasts.A computer program was developed for the rapid calculation of data.Wind speed, insolation levels, biannual temperatures, and humidity parameter values were used as data in the study.The wind speed and insolation levels were taken as the main parameters, while temperature and humidity were incorporated as secondary variables.

Materials and Methods
Monthly climatological data and daily meteorological tables supplied by the Balıkesir Meteorology Home Office were used in the study.Monthly averages from a 24-month period, from May 2010 to April 2012 (Table 1), were calculated [17].The maximum 10-minute average wind speed (, m/s), total insolation level (, cal/cm 2 ), average temperature (, ∘ C), and average relative humidity (, %) were used as meteorological data.Wind speed at 10-meter heights was calculated, along with the insolation level, temperature, and humidity at 2 meters.The precise location of the measurement was latitude 39 ∘ 37  , longitude 27 ∘ 55  .
Meteorological measurements were detailed and varied.Wind speed was taken as an average of data measured every minute.The insolation level was calculated as the average of the daily measured total insolation level.The average temperature and average relative humidity were taken as the average values for a single day.
In order to derive an equation to analyse regression, different mathematical methods can be used.The distribution of data can be the basis for the formation of the equation.At the beginning of the study, it can be decided which method to use.The margin of error of calculated data from the regression equation should be minimal.Previous literature has shown that the multiple linear regression is the preferred method for statistical calculation due to the accuracy of its results.For this reason, the multiple linear regression method was chosen to analyse the data in this study [18].
Using the multiple linear regression method, two equations were derived from the data in Table 1, in which the dependent variables were, respectively, wind speed and insolation levels, with the other three data sets being independent variables.In these equations, the meteorological data was defined as independent variables, and wind speed and insolation levels were calculated as dependent variables.Monthly average values of meteorological data were calculated using a regression equation.The differences between the observed and calculated regression equations were determined.
Table 2 shows the average values observed for the four months from May to August 2012 [17].The data in Table 2 was not used to derive the regression equations, but rather to forecast meteorological parameters.
In statistical studies it is well known that analysis of large quantities of data is more significant than those made with simple operations; however, large data sets make finding a solution difficult and time consuming.The preparation of a computer program to analyze large amounts of data and matrix solutions brings flexibility to the process, as has been the case in this study.The flexibility in this case comes from the number of replaceable dependent and independent variables, and in addition, the amount of data can also be changed.To this end, in order to write the computer program to arrive at a solution, MS-Quick Basic version 4.5 was used as the compiler, as it is compatible with many operating systems, including MS-DOS and Windows 95, 98, ME, and XP [19].
In this study, code has been written to derive the regression equation.In addition, the standard deviation, the mean, the standard error, probability, and  2 need to be calculated.Furthermore, regression equations of the wind speed and the total insolation were calculated by using statistics software.The regression equations derived using the statistical program (the subject of this study) were compared with the regression equations (the IBM SPSS statistics software v.19 [20]).In this way, evaluation of the regression equations from this study could be made more efficiently.
The SPSS software was used to carry out a linear regression analysis of the observation values from Table 1, and wind and insolation regression equations were derived.The results are displayed in Table 3.In the first calculated regression equation (using SPSS software), wind () is the dependent variable, and temperature ( =  1 ), humidity ( =  2 ), and insolation ( =  3 ) are the independent variables ((i) in Table 3).In the second regression equation (using SPSS software), the insolation level () is the dependent variable, and wind ( =  1 ), temperature ( =  2 ), and humidity ( =  3 ) are the independent variables ((ii) in Table 3).In (i) the wind speed subindex is shown with WS.In (ii) the insolation level subindex is shown with QS.
In this study, the results from the SPSS software have been displayed with the results of the prepared regression equation in order to allow easy comparison of the two.

Experimental Study
The statistical method used was a multiple linear regression method, which aims to find the best equations to represent data using the smallest squares and a matrix approach (1).Consider In (1),   is the dependent variable,  0 is the constant,   is the constant of the independent variable,   is the independent variable,   is the error,  is the arbitrary state, and  is the arbitrary heap.Equation ( 2) is an expression of dependent and independent variables and their constants in a matrix form.In (2),  is the matrix of the dependent variable,  is the matrix of the independent variable,   is the transposition of the independent variable matrix, and β is the matrix form of the independent variable constants.Equation (3) shows the calculated regression equation after the matrix equations have been calculated according to the dependent and independent variables and constants, using all the data.In (3), ŷ is the dependent variable of the calculated regression equation, β0 is the constant of the regression equation, β is the independent variable constant of the regression equation,   is the independent variable of the regression equation, and  and  are the state and heap arbitraries.Equation ( 3) is stated in a general form, and meteorological data is captured in the observation regression equation, and the dependent variable is calculated.This new value is a dependent variable that is calculated according to the regression equation, and so differences with the observed values may occur, although the ideal difference is zero, or at least an insignificant value.Mathematical differences between the meteorological observation data and the data from the regression equation were calculated using (4).In (4),   is the dependent variable observation value, ŷ is the calculated value of the dependent variable regression equation, and   is the difference between the two.
The objective is to write software to calculate the regression equation.The algorithm of the software showing the operations necessary for determining the regression equation is shown in Figure 1.
The SPSS program contains an explanatory menu which describes the operations that are possible.The   dependent and   independent variable values and the observation data values are entered.The sum of the dependent and independent variables ∑   , ∑   and the arithmetic averages of   ,   are calculated from the observed data values.The sum and arithmetic average values (in matrix form) of the dependent and independent variables are calculated to derive the dependent  and independent  variable matrix.
Transposition of the independent variable  matrix results with the transposition   matrix is then carried out.By applying this data to (2), the constants of β1 , β2 , . . ., β , the independent variables, are calculated.By using the constants and arithmetic averages of the calculated independent variables, the β0 constant number is calculated.Entering all the values into (3) allows the regression equation to be obtained, which defines the most suitable curve, representing all of the data of the dependent and independent variables.By applying the independent variable observation values to the regression equation, a new dependent variable is obtained for each data set.This value is the result of the regression equation, and the difference between the observation dependent variables   is found by using (4).In this way the analysis is completed.
To test the result, three sample data sets that were manually prepared were used.Having tested the equations, the output can be transferred to an Excel file, which allows easy manipulation by an MS-DOS or XP operating system.The standard deviation, mean, mean standard of error, probability, and limits of the monthly data in the regression equations are calculated to evaluate the usability of regression equations.By inserting the wind speed and insolation levels into the regression equations in the observation data fields in Table 2, regression data related to wind speed and insolation levels can be obtained and compared to the observation data.

Results and Discussion
4.1.The Prepared Regression Software Solution.Continuity of wind speed and insolation levels lead to a high energy potential, and it is accepted that temperature and humidity have an indirect effect on these two parameters.Generally, a high temperature equates to a high level of solar energy, whereas when wind speed is high, humidity is low.In this paper, in which the aim is to identify the energy generation potential of the wind speed and insolation levels found in Balıkesir, these two parameters are rudimentary.In the first calculated regression equation, wind speed () is the dependent variable, and  temperature ( =  1 ), humidity ( =  2 ), and insolation levels ( =  3 ) are the independent variables (5).In the second regression equation, the insolation level () is the dependent variable, and wind speed ( =  1 ), temperature ( =  2 ), and humidity ( =  3 ) are the independent variables (6).In (6) the insolation level subindex is .In (5) the wind speed subindex is shown with .Consider The changes in wind speed and insolation levels in the months that were calculated using data obtained from the regression equations (( 5) and ( 6)) are shown in Figures 2 and 3, respectively.The months are listed horizontally from 1 to 12, with May being the first month.The data for each month is a two-year average.The vertical axis represents wind speed (m/s) in Figure 2 and insolation levels (cal/cm 2 ) in Figure 3.As can be seen from Figure 2, wind observation and regression values follow a similar trend.The highest wind speed average was in August (9.38 m/s), and the lowest was in December (5.67 m/s).The highest and the lowest wind speeds are good proxies for wind potential.By using this data, the wind turbine can be used at the highest efficiency throughout the year.
As can be seen from Figure 3, the insolation level observation and regression values are similar.The highest insolation level was in July (647.70 cal/cm 2 ), and the lowest was in December (121.68 cal/cm 2 ).In the five months from May to September, an insolation level of 500 cal/cm 2 or more is a significant value.The insolation level values obtained in this study are compatible with the values of the data observed in Balıkesir over the last thirty years [21].
In Table 4, the wind speed dependent variable observation ( İ  ) values and the regression equation results (  ) and their differences ( İ  ) are shown.The  İ  values represent the level of difference from the observation values.An examination of the  İ  values reveals that the difference between the highest and the lowest ranges is from −0.478263931 to 0.579755821.When the observation data and calculated differences are compared, it can be seen that the level of difference is low (at a decimal level).Thus, it can be concluded that the wind regression equations can be used with independent variable parameters.
The standard deviation () 1.078600886, mean () 7.249164621, standard error of mean 0.311365256, and limits −2 <  < 2 of the data that was calculated using the wind speed regression equation ( 5) are shown in Table 5  a standard error of mean that is smaller than the standard deviation demonstrates that the calculated wind regression equation can be employed to evaluate energy potential.The wind speed observations and calculated values are very close to each other.At 95.43%, the probability displayed in Table 5 shows that the data distribution is reliable, and the calculated limits −2 <  < 2 are acceptable in engineering applications.
The insolation level dependent variable observation ( İ  ) values and the regression equation (  ) results and their differences ( İ  ) are shown in Table 6.An examination of the  İ  values reveals that the smallest and largest differences range between −76.057890625 and 81.341855469.A comparison of the observation values and the calculated differences shows that the insolation level regression equations can be used with independent variable parameters.
According to the insolation level regression equation ( 6), the standard deviation of the calculated data () is 181.791332535, the mean is () 373.376475016, the standard error of mean is 52.478637387, and the limit is −2 <  < 2 (Table 7).The standard error of the mean is smaller than the standard deviation.Therefore the regression equation of the insolation levels can be used to determine energy potential.Using the current values, it can be seen that by taking values close to the insolation level in the calculation of the insolation level regression equation.The probability shown in Table 7 indicates that the data distribution is reliable, since the probability is 95.43 percent and the calculated limits of −2 <  < 2 are acceptable in engineering applications.
For future estimates of regression equations of wind speed and insolation levels, the data is inserted into Table 2 in place of the observed data, and a comparison of the respective months is made (Table 8).A comparison of four months of wind and insolation levels reveals that the levels in one month were higher (May 2012) than the standard deviation, but lower in the other three (June, July, and August 2012).These results were important.The insolation levels and the wind speeds were successful for predicting future energy potential.The data in Table 2 was not used in the calculation of the

4.2.
The SPSS Software Solution.The prepared software and SPSS software results show a high level of compatibility.In Figures 2 and 3, the regression curves have very similar slopes for both wind speed and insolation level.In addition, the SPSS-calculated  2 values for wind and insolation are 0.915 and 0.937, respectively.The  2 values obtained were very close to 1 which shows that the software developed can be used to predict meteorological data.In this study, the main objective is to derive the regression equation; other required statistical parameters had not been added to the program.As part of a future study, these statistical parameters could be created.In addition an objectoriented interface could be developed.
The meteorological parameters can be used to design a hybrid system.The wind and insolation parameters were very important for this type of energy potential project.The parameters of wind speed and insolation level were priority parameters for regression equations.
It has been found that it is possible to predict the energy capacity of a given geographical region by using the regression equations derived in this study in conjunction with commercial software such as SPSS.

Conclusion
The compatibility of the method of analysis is important for any examined system, and the method that represents the system must be selected.Moreover, as unexpected natural events may occur, a complete evaluation cannot be made without taking these factors into consideration.
In this study, the wind and insolation level observation curves over a period of 24 months were as expected; and a close relationship was identified between the regression equation curves of the wind speed and insolation levels obtained via the multiple correlation method.In particular, the wind regression equation shows a close correlation with the ideal.
Although the dependent variable is wind speed ( ŷ ), when the difference between the results (  ) of the observation (  ) values and the regression equation ( ŷ ) are compared, very small, almost indiscernible, differences occur, which shows the applicability of the regression equation to the quantification of energy potential in future forecasts.The other indicator of the applicability of the wind regression equation is the comparison of the standard deviation and the standard error of mean.The standard error of mean is calculated from the standard deviation for the equation.The limits of the wind regression equation data are in the range −2 <  < 2 and the probability is 95.43%.
The insolation level regression equation can be used for energy potential evaluations or future forecasts.The standard error of mean is lower than the standard deviation.The insolation level regression equation can be used to predict energy levels with known differences.The limits of the insolation level equation data are in the range −2 <  < 2 and the probability is 95.43%.
The mathematical operations followed in the multiple correlation methods adopted in this study require many matrix operations, where there is an increase in the parameters or independent variables.In these cases, manual calculation would be time consuming, and the recalculation of each operation would take a great deal of effort.For this reason, the use of a flexible algorithm program that can solve these problems quickly is important.When the results of the statistical evaluation of the wind speed and insolation level regression equations are taken into account, the software that has been developed can be used in other studies to calculate the insolation levels and the wind speed.
Furthermore, in this study, the results from the developed software overlap almost completely with the results from the SPSS program taken as a reference, further strengthening the case for using the developed software to predict wind speed and isolation levels.
The regression equations can also be used for future forecasts (except for unexpected seasonal meteorological events) with little margin of error.

Figure 1 :
Figure 1: The designed algorithm for the regression equation.

Figure 2 :
Figure 2: The monthly change of wind speed.

Figure 3 :
Figure 3: The monthly change of insolation level.

Table 1 :
The monthly average meteorological parameters from May 2010 to April 2012.

Table 2 :
The monthly average meteorological parameters from May, June, July, and August 2012.

Table 3 :
Descriptive statistics,  2 , and regression equations from SPSS software.

Table 4 :
Observation values, results, and difference of regression equation when there is a dependent variable wind speed.

Table 5 :
The standard deviation, mean, standard error of mean, probability, and limits of regression equation results for a dependent variable wind speed.

Table 6 :
The results and differences of the regression equation with isolation level as the dependent variable.

Table 7 :
The standard deviation, mean, standard error of mean, probability, and limits of regression equation results with the isolation level as the dependent variable.

Table 8 :
The future estimation results obtained using the dependent variables, regression equations of wind speed, and insolation level data.The data from May which has a deviation from observed data reveals acceptable average values in insolation levels and wind speed, whereas the temperature and humidity are both low.Unexpected observation data at certain times of the year is important, in that it may result in a difference in the standard deviation.