Machine Learning-Based Relative Performance Analysis of Monocrystalline and Polycrystalline Grid-Tied PV Systems

In this research study, the design and performance evaluation of grid-tied photovoltaic systems has been carried out through experimentation, HelioScope simulation, and black-box machine learning methods for data-driven arti ﬁ cial intelligence system performance assessment and validation. The proposed systems are based on 15kWp of monoperk and polyperk, which are separately installed in the industrial sector of Faisalabad, Pakistan. The experimental evaluation of the installed PV modules was performed from November 2020 to August 2021. The performance of the PV modules was evaluated by determining the annual average daily ﬁ nal yield (If), performance ratio (PR), and capacity factor (CF). The study showed that the annual average of daily ﬁ nal yield, performance ratio, and capacity factor for 15kW polyperk was estimated to be 61.94kWh, 84.17%, and 19.12, respectively. The annual average of daily ﬁ nal yield, performance ratio, and capacity factor for 15kW monoperk was estimated to be 58.32 kWh, 81.42%, and 18.13, respectively. A comparison of ﬁ nal yield is obtained from simulation and real-time systems obtained from polyperk PV and monoperk. A signi ﬁ cant mean error exists between the experimentation and simulation results which lie within the range of 1250 to 1470kWh and 1600 to 1950 kWh, respectively. Substantial di ﬀ erences between both aforementioned results were initially tested and highlighted by statistical values; i.e., the standard error lies in-between 5 and 45% in polyperk crystalline and 5 and 25% in monocrystalline PV grid-connected module. Machine learning logistical regression evaluated that monoperk crystalline grid-connected system, experimental work was found to be more reliable with error di ﬀ erence reduces in o ﬀ -peak months as compared to corresponding simulation study and vice versa for polyperk crystalline grid-connected system. Model accuracy after training and testing produced resulted up to 99.5% accuracy for either grid-connected experimentation or simulation outcomes with validation.


Introduction
In any country, energy is one of the key factors for the smooth and faster upgradation of the socioeconomic activities, and it has become a universal fact now. One of the major drivers of economic growth is energy as identified by the government of Pakistan. Therefore, Pakistan requires ade-quate supplies of energy for the generation of healthy economic activities. The country currently is passing through the worst energy crisis in over 70 years. The energy crisis has led to the hindrance of socioeconomic progress below the level of critical sustainability and tolerance of the people. Pakistan has a land area of 881,913 km 2 consisting of a population of 211.117 million [1]. In 2019, Pakistan's electricity generation was recorded to be 87. 3 TWh [1]. The energy mix of Pakistan's energy scenario depicts that thermal energy is the widest energy resource upon which Pakistan has been relying in recent years. To warrant adequate, secure, and cheap energy supply to industries, transportation, domestic needs, and agriculture availing available resources in such a manner that it minimizes its losses should be the main objectives of the energy sector. For the proper growth of renewable energy sectors, they must be given the chance in the present energy mix.
A photovoltaic system is not a new phenomenon as a plethora of literature is available on the topic. It has been in practice for the past few decades in our everyday life producing renewable energy, which is not only efficient, but it is carbon emission-free which is beneficial for the environment. Today, our technology has shifted to renewable resources, such as biomass, wind, and solar. In the development of countries, renewable energy plays a contentious role. Nowadays, the resource in trend is solar power. There are various kinds of photovoltaic (PV) power plants that can be integrated with existing power systems. PV energy plants generally use various types of PV engineering such as monocrystalline, polycrystalline, and amorphous silicon. They come as thin-film PV panels using copper indium di-selenide, copper indium gallium selenide, and cadmium telluride.
Various studies have evaluated the performance parameters of installed PV power plants in different locations, with the varying climatic condition, such as Europe [2][3][4][5], tropical regions [6,7], Malaysia [8,9], Oman [10], and Asian countries, India, Korea [11][12][13][14][15], and China [16][17][18][19][20][21][22]. It was observed that the performance analysis is affected by system losses, module quality, inverters, shading effect, losses in wiring, array tilt angle, and type of grid connections. The effect of these factors was confirmed by literature where a 190 kWp grid-connected PV power plant in India was evaluated [16]. In another study, it was found that these factors played a crucial role in reducing the annual average daily final yield. The efficiency of the PV module, inverter, and system was found to be 14.9%, 89.2%, and 12.6%, respectively. The capacity factor and performance ratio were found to be 10.1% and 81.5%, respectively [2]. The impact of dust is one of the significant factors which reduces the efficiency of PV modules. As compared to previously mentioned factors, the effect of dust is inferior to the performance of the PV systems [20]. Research about the performance evaluation of PV systems installed in Serbia was carried out to investigate the effects of ambient temperature and compare it with similar studies around the world. The study reports an analysis of 2 kW monocrystalline silicon (mc-Si) PV power plant tied to the grid, which was built on the rooftop in Niš, Republic of Serbia. The values of PR and CF of the reported PV power plants were articulated to be 93.6% and 12.88%, respectively [23]. Performance evaluation of polycrystalline silicon-(p-Si-) based PV systems was carried out at various locations, namely, Singapore, Turkey, and Greece, with installed capacities of 142.5 kWp, 2.73 kWp, and 171.36 kWp, respectively. From these studies, it can be observed that local varying parameters such as CF, PR, and final yield affect the performance of PV systems. PV systems have also been installed to meet the energy demands of local office buildings in Singapore. The first zero-energy office building used a 142.5 kWp p-Si grid-connected system to meet its energy demands by using inverter factors such as efficiency and losses of heat. The first performance evaluation of the PV system was carried out over 18 months, which showed a good overall PR of 81% [24]. A similar study was published where performance analysis of a 2.73 kWp mc-Si PV power station located in the Muglia climatic conditions in Turkey was performed wherein winter thy efficiency of mono PV module is increased. According to a study carried out in Greece, a performance analysis of a fully monitored 171.36 kWp p-Si grid-connected PV system was done on an hourly, daily, and monthly basis [4].
The claim regarding the change in the tilt angle of PV modules and the change in efficiency can be supported by a report coming from Italy. This article focused on the influence of climatic conditions on a 960 kWp mc-Si-based PV power plant. The installed systems were divided into two subfields with varying tilt angles and nominal powers (i.e., 606.6 kWp and 353.3 kWp). The results revealed that the performance ratio varied between 79% and 86.5% from March to October 2012 [24]. Another study from Italy (northern), articulated the functioning of two grid-tied PV systems (i.e., 17.94 kWp and 15.9 kWp) with the same factories but possessing varying PV technology, their rated power, and overall efficiency. The performance ratio was observed, over a year, to be 89.1% for the first mc-Si wafer-based PV system and 82.7% for the second mc-Si-based PV system [25].
Although PV systems have been around for quite some time but there is no articulated study from this region that evaluates the performance of installed systems according to the region's environmental conditions, to the best of our knowledge, there has not been a study conducted that takes advantage of machine learning algorithms for the difference in performance evaluation between installed and simulated PV systems. This study has been carried out to evaluate the performance of installed PV systems located in the outskirts of Faisalabad on the same rooftop of an industry. The objectives of this study were to investigate the difference between power obtained from installed PV systems and via simulation software. Further, a machine learning algorithm was applied to analyze the performance of installed systems for cost and timesaving in the future. The PV systems (15 kWp) have been analyzed over a time of seven months. The renewable systems were based on polyperk and monoperk types of PV modules. Moreover, to classify hyperparameter results of categorical division between experimental and simulation results, machine learning-based logistical regression has been introduced. Machine learning algorithm contains the error to a minimum and provides useful insight, corelation, and evaluation with class distinction according to subjective weather and time horizon.

Designing of Solar Photovoltaic System
The PV systems are located at a latitude of 31.47°N, a longitude of 73.22°E, and an altitude of 186 m above sea level in the outskirts of Faisalabad city in Pakistan ( Figure 1).

2
International Journal of Photoenergy The monthly average daily values of solar radiation for the region range from 5.5 to 5.8 kWhm -2 . Figure 2(a) displays the aerial view of the installed PV systems. The PV systems are installed on the rooftop of a switchgear manufacturing firm, where the top row is designed with monoperk PV modules, and the lower row is designed with polyperk PV modules as depicted in Figure 2(b). These PV plates are supported by a steel stand having tilt angles of 30°. To mitigate the effects of dust accumulation, it is necessary to regularly clean the plates at least once a week [12].  3 International Journal of Photoenergy 2.1. Types of PV Modules. The capacity of the installed gridtied PV system is 15 kWp which is composed of two independent monocrystalline and polycrystalline segments. Each segment is installed in a series parallel and has a capacity of 15 kWp. The systems are installed facing south having an optimum monthly tilt angle to enhance their efficiency. In this study, thirty-eight monocrystalline modules (model CS3400WP) and thirty-four polycrystalline modules (model CS3400WP) of similar maximum power output and efficiency ratings are used. Properties of the installed modules are summarized in Table 1. There are four mounting structures of galvanized steel frames holding six panels for each type of PV module. The design of the support structure is to keep a variable inclination with the horizontal plane with the reinforced concrete foundation to withstand the weight of modules and maximum wind speeds at the site as shown in Figure 2(b).
The following formulae can be used for calculating parameters related to the designing of a PV system.
Equations (1) and (2) can be used to calculate the required numbers of PV panels.
where PV max is the peak power of PV array (kWp), L e is the electrical load (kWh/day), I P is the peak solar intensity, H Avg is the average available radiation in kWh/m 2 /d, and T CF is the temperature correction factor (0.4-0.5%°C for crystalline silicon) and the efficiency of inverter.
where P m is the rated power of the selected panel and PV N is the number of PV panels.

Size of inverter = Total
where A f is the additional further load expansion (20%), I e is the efficiency of inverter, and V A is the volt-ampere.
where V M is the voltage at maximum power point, I M is the current at maximum power point, and P M is the maximum power. Soiling ratio: where Pm ðsolid moduleÞ is the measured maximum power of the solid module, Pm ðClean moduleÞ is the measured maximum power of the clean module, kmismatch quantifies the disparity in Pm between the clean and the soiled mod-ules, and SRatiomeasured compares the measured maximum power (Pm) of the soiled and the clean module.

Three-Phase Inverters.
Since the PV modules are semiconductors in nature, they fundamentally generate direct current (DC). The transmission and distribution systems are designed for AC which is obtained by converting it from DC via inverters and feeds the utility of the electrical grid by synchronized frequency normally around 50 or 60 Hz. The selection of the inverter is based on the maximum output power from each type of PV module array. All inverters are of SolisMax, model Solis-15 K Max; the technical features of the inverters are provided in Table 2.

Parameters Considered for the Performance Analysis of Installed PV Systems
Effective and credible assessment of any system's performance is crucial for the designing of an efficient power system. The PV performance is a vital indicator to analyze the quality of its design and reliable combination of PV equipment. According to the International Electrotechnical Commission (IEC) standard 61724, the standard performance parameters for PV-based power systems play a crucial role in the efficiency of a system.

Final Yield.
The definition of the final yield revolves around the portion of total energy generated (E EA ) by the whole PV power plant to the rated output power (P PV ) per kilowatt of installed PV array during a specified period (i.e., day, month, or year). It can be calculated by where E AC is the total energy generated and P PV is the rated output power.

Reference Yield.
Referenced yield can be defined as the total irradiance (H t ) divided by irradiance (G STC ), which, primarily, is 1 kW/m 2 (Equation (7)). This constitutes the solar energy that is available for a specifically mentioned time frame at any place where the PV power system has been deployed. It constitutes the hours per day that are necessary to obtain solar radiation to be at reference irradiance level to produce the same incident energy as received by the sun [25].
where H t is the total irradiance and G stc is the irradiance.
3.3. Performance Ratio. Performance ratio (PR) is a significant parameter that depends on the PV power plant's geographical location. It is represented by the ratio of energy outputs obtained actually and theoretically over the time of a month or a year as depicted by Equation (8). A high PR value for a grid-connected PV is an indication that the system is efficient and reliable. The overall losses from various 4 International Journal of Photoenergy factors such as inverter losses, module mismatch, wiring, temperature, and dust or snow on the rated output power are less. The overall efficiency of the solar photovoltaic power system is closely related to the value of the performance ratio, i.e., closer to 100% [25].
where Y f is the actual energy output and Y r is the theoretical energy output.

Capacity Factor.
The capacity factor is mainly the ratio of produced electricity capacity factor. Geothermal energy and biomass among the clean and renewable energies perform at higher capacity factors compared to intermittent renewable sources which tend to have lower capacity factors. For PV plants, the capacity factor ranges from 10% to 30%, as their outputs fluctuate resulting from weather conditions [25]. It can be calculated by using where Y f is the actual energy output.

Meteorological Data.
To examine the performance of the PV system, it is required to study the weather data recorded by the weather station. This may include data set comprising temperature and wind speed. The ambient and module temperature fluctuated during the nominated period from November 2020 to August 2021 individually. Undesirable module temperature was usually observed during nighttime. During the measured period, wind velocity varied from 1 ms _1 in January and 2 ms _1 in June on average, whereas relative humidity reached almost 87% level during the monsoon season (i.e., July and August), and the maximum value of global-tilt solar irradiation of 1225 Wm -2 was recorded for Faisalabad. It has been observed that on any sunny day, the module performance is affected by three meteorological parameters. These factors include temperature, solar irradiation, and wind speed, whereas other parameters are shown in Table 3.

Performance Analysis and Result Discussion
This comparative study visualizes that in winter, the efficiency of monoperk is better as compared to other types of modules. In summer, polyperk performed with better efficiency because it is highly efficient in the hightemperature range. The maximum production of kWh in winters was more than 2,100 kWh for polyperk. The data obtained displayed that the annual average of daily final yield, performance ratio, and capacity factor for 15 kW polyperk was estimated to be 61.94 kWh, 84.17%, and 19.12, respectively. The annual average of daily final yield, performance ratio, and capacity factor for 15 kW monoperk was estimated to be 58.32 kWh, 81.42%, and 18.13, respectively. The reported results are comparatively better than a similar study conducted in Iran [26]. On the other hand, a maximum yield of around 1,700 kWh was obtained from monoperk as summer ambient temperature is high leading to reduced efficiency. At the start of November, the ambient temperature began to decrease, and thus, the efficiency of monoperk also increased, whereas the efficiency of polyperk started decreasing till December as the highest temperature of the season was monitored this month. In March, the yield of monoperk plummeted, whereas for the polyperk modules, the yield improved. In April, May, June, and July, the ambient temperature stayed at a peak level. Therefore, the efficiency of polyperk increased, and consequently, the efficiency of monoperk decreased. In August, temperature variation was observed due to the monsoon season. On warm days, the efficiency of polyperk increased whereas, on the other hand, the efficiency of monoperk increased on relatively cool days as depicted by simulation results in Figure 3(b). So, it can be concluded that polyperk modules outperform monoperk type modules when the ambient temperature increases. The common material used in solar cells, crystalline silicon, does not help to prevent them from getting hot either. As a great conductor of heat, silicon actually speeds up the heat building up in solar cells on hot summer days. Mono PV modules are made from a single crystal of silicon whereas poly is made from several fragments of silicon melted together which prevents it from heating up.
As it can be seen in Figure 3(a), HelioScope-based system generated higher power in comparison to a real-time system as the software considers ideal conditions and ignores various factors that affect the real system output performance such as dust losses and shadow factor.
Scatter plot in Figure 4 displays two hotspots and distinct regions highlighted by two circles. The upper-end circle has a wide range of experimental mono-and polycrystalline systems and polyperk simulation yield results. The lower circle has a keen distribution of yield results in experimental monocrystalline systems and polyperk simulated systems. Only peculiar results in raw data for the distinctive monocrystalline solar energy material produce oblique and relatively lesser in midseason.
Quantified data and behavior in tail line circles need to be categorized using learning, training, and testing data sets for relative comparison, which is a new theme in solar energy system classification; further details are defined in Section 4.
Ideal conditions and real-time working systems classify two different data sets and elucidated graphs. Statistical graphs show the relatively higher difference in total yield over the time scale length in real time due to undistinguished and random errors in calculations and measurement. Figure 4 shows the hotspot regions using the estimated data varying from lower tail to high end. Figures 5 and 6 depict raw data sets in peak slope form. Figures 4 and 5 show distinct groups and variations in estimated and calculated values. In polyperk system experimentation, Figure 6 shows the error doping in mean values for polyperk crystalline modules (simulation and experiment result comparison). A greater mean error exists in the experimentation results with the range lying in 1250 to 1470 kWh in comparison to simulation results with the least mean error in the upper bound of 1950 kWh. Significance of new advance predictive and artificial intelligence tools in the realm of statistical-based machine learning produces a counterproductive approach using both real-time and ideal simulation patterns and data sets for classification and decision-making for applied applications while using monoperk and polyperk solar systems. The apparent results are statistically plotted for heat mapping to understand the relationship between the distinctive classes of monoperk and polyperk crystalline modules of simulation and experimental data sets. Figures 7 and 8 display futuristic parameters that are independent and dependent in nature on each other. Plasma-type heat map of monoperk crystalline modules shows maximum nonlinearity in the experimental and simulation results up to -0.86. The corresponding increase in module yield is inferred from the heat map but linearity is minimum in experimental and simulation results with values as low as 0.14 and 0.16. In polyperk crystalline modules, greater amount of synergy is accounted for in the experimental and simulation results. The corresponding results are accounted for and validated with heat maplinearity values greater than monoperk crystalline modules up to~0.98. However, in relationship with months and    International Journal of Photoenergy algorithms that over the period of using artificial intelligence learning update their sets and make the logical decision of either using experimental means or simulation for time and cost-saving.

Machine Learning-Based Relative Performance
Investigation of Mono-and Polyperk Modules. As previously defined, the nonlinearity and class distinction in the available raw data sets are classically classified into the categorical division between experimental and simulation results. Parametric results of machine learning-based logistical regression have been introduced as a strategic machine learning algorithm due to its robust nature and easiness with mixed data set performance. Using this technique, the algorithm will     International Journal of Photoenergy evaluate the reasons behind the performance gap between simulation and real-time system outputs as shown in Figure 3 and help to contain the error to a minimum; it also provides useful insight, corelation, and evaluation with class distinction according to subjective weather and time horizon.

Machine Learning-Based Logistic Regression Modeling.
Machine learning-(ML-) based regressions are keenly a new method for inference of error and hyperparameter deduction. For efficient future predictive system installation, result synchronization, and error reduction between the simulation model and real-time system, machine learning-based logistic regression modeling has been performed in this sec-tion. ML classification has been done on the following three algorithms: (1) Supervised learning (2) Unsupervised learning

(3) Reinforcement learning
Flowchart classification of machine classes, input and output data set nature, and algorithms are presented in Figure 9.
The categorical data in ML lies mainly in the domain of logistic regression. Linear and polynomial regressions are commonly used for conventional regressive weight Helio-scope simulation results Figure 11: Pair plot of polyperk module data sets in grid-connected formation for correlation between each independent and dependent variable.

12
International Journal of Photoenergy significant differences between the simulation results and experimental results performed earlier and as highlighted by statistical values, i.e., the standard error lies inbetween 5 and 45% for polyperk crystalline and 5 and 25% for Monocrystalline PV grid-connected module. MLbased logistic regression modeling has been carried out. A pair plot drawn in Python using the Seaborn library assimilates the finest set of attributes and features for a relative explanation of relationships between two and more variables. The plot displays clusters and separation of offset data points along with hyperparameters that results in decision-making and classification of either supervised or unsupervised algorithm for machine learning. Pair plots in Figures 10 and 11 show class representations in monoperk and polyperk module data sets with bar plots showing energy yield produced over a time series. Monoperk crystalline grid-connected system shows the sigmoidal plot and two hotspot regions mainly featuring yield and timeline variable, i.e., month. Figures 10 and 11 also depict method validity for the experimentation and simulation results. The pair plot of monoperk has data sets more aligned statistically for simulation data and interpretations, while the polyperk features data more viable within experimentation data points. The pair plot consists of scatter plots and bar plots. Experimentation data points in the pair plot depict a higher yield for polyperk as compared to the monoperk crystalline grid-connected system.
The under discussion ML-based case study for monoperk and polyperk crystalline module-based grid-tied systems is being conducted first time as no previous literature is available regarding the AI-based performance analysis. This method highlights the significance of implications between simulated and experimental methods with error reduction, consolidation, and the performance evaluation of grid-tied PV solar modules. Anaconda-Python 3.8 libraries including NumPy, Matplotlib, Seaborn, pandas, and scikit-learn have been utilized for data analysis and machine learning-based decision modeling between experimental and simulation results. The initial case study for the polyperk crystalline module data set shows a higher error doping in mean values during experimental evaluation in comparison to simulation results as shown in Figure 5. Figure 5 displays the comparison based on error occurrence between simulation and experimental results in the polyperk crystalline module where class 0 represents experimental and class 1 represents simulation results. In both simulated and experimental environments, per month yield maximum output has been calculated. The descriptive statistics of the polyperk and monoperk crystalline PV module performance can be seen in Tables 4 and 5.
It can be seen from the extracted features represented in Tables 4 and 5 that there is a significant difference between simulation and experimental data. This difference is due to the ideal scenario of the software as there is no consideration of temperature variance and its effect on the system performance as well as other factors such as dust factor and shadow factor. Based on the abovementioned features, a machine learning algorithm has been used to signify the error reduction and predictability module.

Logistical Regression Mathematical Representation.
Logistical regression is generally used for two-class problems and can be extended for implications in the multiclass problems but becomes highly unstable in calculations when classes are well separated; the working diagram is represented in Figure 12. The baseline for the mathematical formulation of logistical regression is simple linear regression.  where y is the continuous output result while the b o is the intercept of the line, b 1 is the slope of the line, and x is the input or independent variable. In continuous data, we can utilize linear regression, but in distinct classes, we cannot draw the best-fit curve. We introduce a classifier for whether an action/event shall occur or not. Figure 13 shows that the output in logistical regression is binomial. LR has a sigmoid function based on the probability of an event (P) and ranges from 0 to 1 and is given as Introducing P in the LR equation and computing at the natural log scale gives the following mathematical expression: Weighted output is in the range of either 0 or 1; if the output is <0.5, then it corresponds to 0th class if not then 1st class. For training and testing data sets after incorporation into machine learning the module, accuracy is defined by  14 International Journal of Photoenergy   The accuracy of a system based on the prediction of classification problems is either correct or can be incorrect so a matrix, namely, a confusion matrix, is formed for a relational summary of actual and predicted values of classification problems.

Performance Analysis of Mono-and Polyperk
Crystalline Modules. The total reading count of simulation and experimental work is divided into 80% training and 20% testing data as shown in Figure 14. The classifier is introduced for encoding purposes to have the binary distinction between greater probability events. The timeline horizon consists of 10 months; no data was conceived for the month of September and October.
where P is the probability of a discrete event defined earlier.
The training data set selects a total of 80% values from raw data. The algorithm reads the pattern according to the problem statement and improves the probability values. In each run, a class check is run for appointing the new red and green distribution as highlighted in Figure 14. An increase in the iterations increases the width of class boundaries. Based on the extracted features and training/testing of both modules, the data set collected from simulation and experimental work of mono-and polyperk crystalline module for a one-year cycle (except September and October) can be seen in Tables 6 and 7 which depict the difference between them ranging from 1% to 60.5% in monoperk and 6% to 27% in polyperk crystalline.
In logistical regression training, the data set values determine the probable event in advance. The most likelihood and unlikelihood are distinguished for experimental and simulation results. For the monoperk crystalline gridconnected system in the experimental scenario, readings are likelihood, and the probability after training and testing the data set lies about 99% accuracy. The confusion matrix produces an absolute validated summary result for the method and material to be incorporated in each particular scenario. A 20% test data set values in most actual and predicted values are in harmony with each other using the ML module. In further classification, as shown in the heat map and pair plot in Figures 8-11, the class distinction reflects that those major values classified in the experimental work are authentic.
To determine the evaluation of the monoperk crystalline grid-connected system, experimental work is more reliable than compared to a simulation study. The error difference reduces in off-peak months of the data cycle, i.e., January, February, and March, whereas in peak months, the   16 International Journal of Photoenergy evaluation results are likely the same and coherent. In the case of polyperk crystalline grid-connected studies, a greater significance and accuracy lies within the simulation and experimental studies, and the likelihood of probability is significant as seen in Tables 6 and 7. Model accuracy after training and testing produces results of up to 99.5%. Simulation results are likely most favorable in predicting the performance evaluation of maximum yield in grid-connected systems. During off-peak (i.e., winter season), the error difference increases due to various factors such as module angle, line losses, cell dimension, and temperature invariability.

Conclusions
In this research, a comparative study was conducted for two solar systems of 15 kW. The systems primarily included monoperk and polyperk crystalline. From the study, it was observed that maximum yield was produced, in winters, by monoperk, while during summers, polyperk displayed enhanced efficiency. In Faisalabad, polyperk has proven to be efficient considering the high rates of temperature whereas monoperk, consequently, reduced the efficiency of the solar system. The annual average of daily final yield, performance ratio, and capacity factor for 15 kW polyperk was estimated to be 61.94 kWh, 84.17%, and 19.12, respectively. The annual average of daily final yield, performance ratio, and capacity factor for 15 kW monoperk was estimated to be 58.32 kWh, 81.42%, and 18.13, respectively. The efficiency can also be attributed to the quality of the system. For example, good quality inverter requires quality plates for enhanced efficiency. Additionally, the production of a 430watt panel was noted as 400 watts. Furthermore, according to the applied machine learning module, a significant difference was observed in the considered PV module types (i.e., mono-and polyperk). A machine learning modeling study using logistical regression was applied to determine the greater significance of experimentation results in monoperk crystalline with an accuracy of 99.5%, while the results in polyperk using simulation studies are more accurate and recommended in the evaluation of PV-connected grid. Depending upon observed parameters and frequency period, the model elucidated a better understanding of the performed real-time analysis leading to both cost and time saving for the installation of similar projects in the region. Furthermore, better performance can be achieved from installed systems if factors such as dust, tilt angle, and shadow effects are considered before the installation of PV systems.

Data Availability
Data will be available on request. For data-related queries, kindly contact Faisal Mahmood (faisal.mahmood@uaf.edu.pk).

Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this research article.