A Comparative Analysis of Machine Learning Algorithms for Detection of Organic and Nonorganic Cotton Diseases

Sreyas Institute of Engineering & Technology, Hyderabad, India Teerthanker Mahaveer University, Moradabad, U.P, India KIET Group of Institutions, Gaziabad, India BML Munjal University, Gurugram, India Central University, Bilaspur, Chhattisgarh, India Neil Gogte Institute of Technology, Hyderabad, India Department of Information Technology, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia


Introduction
Day by day, all over the world, agriculture land is going to be reduced because the population is increasing rapidly and lack of water resources. Disease in the plant is one of those hazards that have to be examined at this stage. In contrast, the isolation of plants from their natural environment is being happened, and they are grown in unusual conditions. Many valuable crops and plants are very vulnerable to disease. ey would have a great struggle to survive in nature without human involvement. Yield loss in harvests is regularly connected with plant illness or factors, such as climate, water availability, and supplement accessibility [1]. To improve the productivity of the crop, environmental factors or product resources such as temperature and humidity are important [2]. An important role is played by the root exudates of the plant, which helps in improving the nutrients of the soil [3]. Compared to their wild relations, cultivated plants are always more flexible to diseases. is is the large numbers of the same species or different kinds, having a similar gene grown together, sometimes over many kilometers of distance. e microorganisms which cause diseases may spread rapidly under these conditions. Plant illnesses influence the plants' development and yield, making a socio-biological and monetary effect on agriculture [4]. e past writings reveal that the various kinds of leaf diseases, such as spots, mildews, rusts, and blight, cause scarcity and many other extreme changes in nations' economies. Plant diseases are one of the ecological factors that facilitate keeping living plants and animals in balance with each other. Plant cells mainly enhance their defenses in opposition to the animals, insects, and also pathogens through signaling pathways that are contained in them. For thousands of years, humans have selected and cultivated plants for food, medicine, clothing, shelter, fiber, and beauty with proper care. erefore, the monitoring of regional crop diseases is important to enhance food security [5].

Motivation and Contribution.
e growth of most developing countries depends on agriculture. Agriculture plays an essential role in India's economic growth. Most of the north Indian states grow cotton crops in 95% of the agricultural land. Various diseases affect the productivity of cotton crops. It is mandatory to detect and classify these diseases in the early stage for the better productivity of cotton crops. Considering the same, this review paper summarized the various cotton diseases and symptoms of the diseases. is paper analyzes various machine learning algorithms of segmentation, detection, and classification techniques to identify cotton diseases. is study would be helpful for future research in this field. e main contributions of this paper are as follows: (i) e various cotton diseases are classified, and their symptoms are also presented (ii) e existing cotton disease prediction techniques are studied, and these are categorized according to the technologies used (iii) e comparative analyses of the existing techniques are performed to assess their advantages and disadvantages (iv) e various databases that have been used to conduct these studies are also discussed. (v) e various performance metrics that can be used to evaluate the efficiency of disease prediction techniques are also presented. e rest of the paper is organized as follows: Section 2 describes the background of the cotton crop. Section 3 discusses the existing literature related to cotton diseases. Section 4 presents the discussion of the existing techniques in terms of advantages, disadvantages, databases used, and performance evaluation. Section 5 concludes the paper and provides the future directions.

Background of Cotton Crop
Cotton is a fiber crop that fills in as a wellspring of feed, staple, biofuel production, and oilseed harvest, which provides 35% globally of the total fiber and raw material for producing textiles [6][7][8]. It is the most important and principal cash crop that affects India's economy in many ways [9,10]. It is always considered the most economically main crop, but various disease pathogens affect its production [11][12][13]. e largest cotton yielding country is India, as compared to other countries in the entire world, as shown in Figure 1. All these together provide most of the yield all over the world. Area-wise, India is the first among them. Production-wise, it has a third place.
Cotton plays a major role in the agrarian economy and industrial activities of India [14]. We have nine states all over India that grow cotton which is partitioned into zones, namely, north, central, and south. Haryana, Punjab, Western Uttar Pradesh, and Rajasthan are northern zone states. Maharashtra, Gujarat, and Madhya Pradesh are the central zone, whereas Andhra Pradesh, Tamil Nadu, and Karnataka fall under the south zone, as shown in Table 1 and Figure 2. All of the states in each of the zones extend to 90 percent of the land to cultivate cotton crops and produce 95% of the total cotton production in India. e other 5% of cotton production is contributed by small states such as Bihar, Meghalaya, Tripura, Orissa, and Assam. e zones in which cotton is produced differ depending upon the soil type, sources available for better crop protection, level of production, and geography.
Plant illnesses have perceptually been a challenge to plant development and harvest creation in numerous global segments. Plant infections will influence plants by numerous cycles such as the absorbance and movement of water and supplements, chemical change, flowering, development of the fruit, plant growth, cell division, and extension. Collecting, acquiring, and analyzing cotton diseases that are being processed manually leads to a lack of efficiency, making the identification difficult [15]. Cotton plant diseases may be caused due to the different types of microorganisms, nematodes, and other agents. erefore, efforts should be directed towards avoiding disease epidemics just like blight diseases and developing effective disease management methods.

Types of Cotton.
Cotton is the cumulative name given according to perpetual shrubs in the family Malvaceae grown as the soft fiber. e four different kinds of plants in the genus are shown in Figure 3, which preserve the plant seeds. Gossypium hirsutum is estimated at roughly 90% in the world. ese plants have a central main stem which gives various branches upwards. e plant leaves are set out around those branches, which possess 3 to 5 triangular lobes and long petioles. e plant gives a single bud that can be red or purple, yellowish-white in color on each auxiliary branch, and forms a leathery, oval seed capsule known as "boll," generally 2 to 6 cm long. e cotton plant can reach heights of 1 to 1.5 meters (3.3-4.9 ft).

Diseases of Cotton and eir Symptoms.
e natural and most frequently occurring cotton disorders are due to the loss of nutrition, environmental stress, and chemical factors that cause imbalances. ese factors affect the production of cotton during the growth of the crop. ere is a contrasting difference between the modification of the plant and the harmful environments, including the disorders. Verticillium wilt and cotton leaf curl diseases are major limiting factors for cotton production [16,17]. Verticillium wilt is a highly hazardous disease that is strongly epidemic [18]. e abundance of wilt pathogens is one of the many factors which results in disease occurrence [19]. e cotton boll is also a chief component of fiber production [20]. Boll injury and symptoms of cotton boll with leaf-footed bugs were similar to those of plant bugs [21,22]. Early detection of bollworm and pink bollworm with the help of field evolved resistance was also discussed in [23]. Fungicides were frequently applied during the planting stage to control cotton seedling disease by spraying [24]. Tobacco Streak Virus (TSV) belongs to the genus Ilavirus of the family of Bromoviridae. Identification of the symptoms of TSV infection by visual observation of the plants often results in misdiagnosis as this virus matches those reflecting nutritional disorders affecting cotton [25]. A sample example is shown in Figure 4.

Mathematical Problems in Engineering
Symptoms.
e disease can occur in many stages. However, the most severe one occurs in the leaves after 45 to 60 days of planting. Tiny, pale brown, irregular round spots, measuring 0.5 to 6 mm diameter, may be present on the leaves. Each spot contains a central lesion surrounded by concentric rings. Several spots merge to form blighted areas. e affected leaves become weak and wither. Stem lesions are observed rarely. But in extreme cases, these spots may appear on bracts and bolls. Best Methods Used for Detection. Image processing and support vector machine.

Organic Cotton.
Organic cotton is created by the utilization of techniques and materials that lowly affect the climate. Organic production structures top off and hold soil ripeness, lessen the utilization of hurtful pesticides and composts, and develop organically unique agribusiness. Outsider confirmation partnerships affirm that regular makers use exclusively techniques and materials permitted in organic production. Organic cotton is developed without the utilization of harmful and consistent pesticides or manufactured composts. Moreover, government rules confine the utilization of hereditarily designed seeds for natural cultivating. Organic cotton still solely occupies a tiny position of far much less than 1% of international production of cotton. However, the quantity of farms changing to organic cotton and the range of tasks for the growth are continuously increasing. ere are several motives to grow cotton organically. e poor effects of farming traditional cotton on the environment and fitness are apparent. It is necessary to adopt a range of integrated measures in a device approach, ensuring that the interplay amongst the soil, plants, surroundings, and people is nicely balanced to get satisfactory yields and income in natural cotton farming. e "elements for success" all need to be used together: (1) Suitable measures to improve and hold soil fruitfulness (2) Establishment of yield revolution and harvest variety; cultivating characteristic equilibrium (3) Selection of assortments appropriate to the conditions (soil, accessibility of water system, and market necessities) (4) Appropriate sorts and amounts of excrements at the correct time (5) Timely harvest the board, for example, intercultural activities, weeding, and water system (6) Careful checking of the yield and enough assurance towards bothers as per the idea of the money-related limit level (7) Timely and alluring picking of the cotton (8) Sufficient documentation for examination and affirmation (9) Capacity building and testing for constant improvement

Diseases in Organic Cotton.
In most of the semiarid tropical regions, illnesses are not a big problem in organic cotton. However, diseases that from time to time happen and methods for stopping or treating them are given below: (i) Bacterial blight: leaves show slick dark spots; stems flip dark; defoliation and boll shedding if the invasion is high. A strategic distance can be followed by methods for the use of safe assortments. (ii) Root rot and boll rot: generally brought about through fungi and bacteria. (iii) Fusarium wilt: crop rotation practice; getting rid of cotton stalks after harvesting.

Literature Survey
Computer technology has been analyzed and had been made into use broadly in agricultural applications to disclose and recognize the infection of the plant and classify them. A computer vision-based algorithm (SLIC-Simple Linear Iterative Clustering) was developed to detect the infected part of the disease and classify the disease using the support vector machine to the type which it belongs to (fungal or bacterial). Using this algorithm, three types, namely, alternate-fungal, bacterial blight, and whitefly diseases, were detected [26]. e emphasis on recent studies that reflect image processing techniques and their benefaction in detecting the plants and their diseases followed by the classification techniques under several circumstances is presented in this study. e structure of methodology for leaf disease identification and the classification techniques are shown in Figures 5 and 6, respectively. e main focus of the pathologists is to identify the diseases that may occur in other various parts such as roots, branches, stem, and leaf of the plant. As mentioned before, this paper focuses distinctly only on the leaves of a cotton plant. e information related to the identification of disease and its classification is available in previous research. Most of them usually involve a preprocessing stage and methods of segmentation accompanied by identifying the feature and level of classification using appropriate measures. All the techniques have applied some sort of preprocessing to eliminate the noise and segmentation procedure for selecting the part of the disease that occurred. e extension of this paper is based on the classification of diseases which is also an important aspect of proper detection. Classification is a procedure that is related to the division where assets are categorized according to the field. It helps in recognition of ideas and objects that are further differentiated and understood.

Disease Detection Analysis with Image Segmentation and
Feature Extraction. Rothe and Kshirsagar [27] discussed the identification of diseases utilizing picture preparing strategies that extracted the highlights, for example, region, major and minor axes, direction, and so on, from pictures of influenced ailment leaves. e paper has two strategies, in particular, image enhancement and image segmentation. 6 Mathematical Problems in Engineering For image enhancement, a low pass channel is utilized to evacuate the commotion present in the picture. e Gaussian channel is utilized as the edge that compares to high recurrence is to be held. It does not have a sharp cut-off; however, it comprises rich and regular stop band reaction that permits adequate higher recurrence parts and subsequently shows little league transfer speed item. e element extraction process includes shading and shape-based extraction.
e essential nature of the visual substance is shading; along these lines, it is utilized to relate and speak to a picture. e shape-based highlights that are chosen to distinguish the sicknesses involve region, sharpness, edge, Eigen esteems, and viewpoint proportion. e quantity of pixels speaks to the territory of the article; along these lines, it tends to be outlined by checking the number of item pixels. e proportion of the length of the significant tomahawks to minor tomahawks is called angle proportion which produces data regarding the flat stretch of the infected spot. By computing normal separation between two limit focuses and the significant hub, the sharpness of the state of the spot is resolved, as shown in Table 2.
Revathi and Hemalatha [28] introduced an image processing procedure alongside edge detection. At first, they took the computerized pictures utilizing an advanced portable camera. At that point, picture handling strategies are applied to the obtained pictures to remove RGB highlights, which are important for additional examination. Some expository strategies were utilized to group the pictures as per the particular issue from that point onward. Users take pictures of the leaves utilizing sensors and then forward them to the PC framework where the info pictures are dissected utilizing the homogenous edge recognition calculations, and the pixels call work rationale utilized for the location of the influenced pieces of the leaf disease-wise. e homogeneous-based edge detector gains the outcome and  Step 1: segmentation  is partition takes out the effect of nonlevel lighting in the picture. RGB picture changes over into the dark scale picture, utilizing edge identification procedures to recognize the leaf spot picture malady, influenced part. e use of shading channel checking and RGB highlights aids in recognizing different parts of images that yield in distinguishing the infected part. Kirthi Pilli et al. [29] discussed the eAGRO technique for early crop disease detection. e agro advisory system is used between farmers and agricultural experts by bridging a gap between them [30].
Comparative analysis of various image processing techniques has been carried out, and it is shown in Table 3.

Disease Detection Analysis Using K-Means and KNN.
Revathi and Hemalatha [31] described the recognition of cotton diseases. e following related examination on quick and exact recognition and arrangement of plant infections utilized by K-means and KNN result is 94%, demonstrating that the mix of K-means may give preferred outcomes over other related research considerations.
Parikh et al. [32] examined the recognition of the severity of the grey mildew with white spots blend that creates white spots that are huge in size on the cotton leaf. After the extreme contamination, the shrinking of leaves occurs. Stage one and two offer comparable infection attributes (white spots) yet can be isolated depending on the white spots' size. e second and third stages have distinctive infection side effects (white spots versus dried dark colored leaf ) and are isolated dependent on the side effects. Stage one utilizes the K-nearest neighbor (KNN) classification method with a series of 15 highlights that are measurable. So the stage two classification utilizes the KNN classifier with two essential highlights: tone and luminance. e classification and the general component extraction are outlined. Narmadha and Arulvadivu [33] presented the image processing technique along with the color and shape features extraction and detected leaf blasts, brown spots, and narrow brown spots with the help of the methods mentioned. Schuster et al. [34] utilized a two-advance k-implies bunching calculation to distinguish the zones of the executives. One of the basic components while utilizing K-means is recognizing the best characteristics or factors for the executive's zone portrayal. Previously, scientists have investigated utilizing yield or dirt properties such as incline, rise, and electrical conductivity. e k-implies calculation is a, for the most part, utilized bunching technique pertinent broadly for unaided learning and distinguishing structure in the dataset. e comparative analysis of K-means and KNN techniques is shown in Table 4. e algorithm implements clustering by limiting a target work in a registering way, generally the total over the square of separation of each point from the comparing group hub. While the AI approach laid out right now presents the administration zones for cotton, a significant issue keeps on existing. It includes the presentation of the progress rate into the machine learning calculation directly. Such progress might help in exhibiting machine learning to coordinate the current operational conditions. Recognizing the cool air territories through temperature detecting innovation and machine learning calculations speaks to another new application for the examination right now.

Disease Detection Analysis Using Principal Component
Analysis. Gulhane and Kolekar [35] predicted distinguishing the diseases on cotton leaves utilizing machine learning techniques; it is conceivable to expand the degree for the location of different diseases inside obvious just, and wavelength regions are invisible. In actualizing multivariable KNN and PCA systems, dissecting the factual information got practical, identified with the Green (G) channel of RGB picture. In a large portion, sicknesses that appear on the plant leaves are specific, leaf necrosis, grey mold, Alternaria, and magnesium deficiency. PCA is the direct technique that licenses for perceiving the uncorrelated parts. In the PCA technique, each element in the group highlights is considered an irregular component space plotted in the space, structuring a haze of highlights. Here, Figure 7 signifies the bend between the recurrence of sicknesses (%) over the 110 test samples. Table 5 shows the result of detection using the PCA technique.

Disease Detection Analysis Using Ensemble Methods (Random Forest) and Decision Trees.
Mehta et al. [36] presented the utilization of random forests with decision trees faring nearly the same all through the machine learning models. MLP classifier is utilized for the characterization, yet it did not get the sickness arrangement properly. e multiyield regressor performed all right with a random forest system as utilized. Random forest is an outfit technique where an example is taken from the preparation set with the trade for building a tree in the gathering. During the tree development, when a hub is split, the split that is best among the highlights will be picked, which is taken randomly. Chopda et al. [37] gave the execution of this project depending on the small scale. When all the functionalities are executed, the project would be extended for enormous scope.
Decision tree classifier is a straightforward and generally utilized characterization technique. e advantage of utilizing this calculation is simple as it bids a direct plan for the classification problem to be solved. It infers many inquiries concerning the characteristics of the test record. e Boa constrictor programming alongside the Jupyter Notepad is utilized to foresee the reasonable yield. e primary strides of the undertaking include the following: get cultivating information utilizing sensors and human contribution via portable applications, driving the gathered information into ingspeak server, examining and anticipating the gathered information utilizing the decision tree classifier, and building up the versatile application for ranchers for crop-related forecasts and cautions. Steps engaged with the decision tree classifier preparing information are an option of a certain level of all-out database alongside testing    Mathematical Problems in Engineering information utilized for training the model. As a result, more proficiency of the model is obtained, which depends on the training data. Testing information is utilized to allocate names for the sort of infection in the decision tree classifier. e comparative analysis of various decision tree and random forest techniques and their results are shown in Table 6.

Disease Detection Analysis Using Fuzzy Logic Analysis.
To detect a common cotton disease in the plants, various parameters can be used, and a list of these parameters is shown in Table 7. Rothe and Kshirsagar [38] implemented the detection of cotton diseases such as Alternaria, Myrothecium, and bacterial blight. Initially, the picture was taken through the digital camera, and then, the implementation of preprocessing image methodology takes place to smoothen the image. e region-based segmentation algorithm and the neural network were utilized for the benchmark datasets' classification. Testing data estimate the overall accuracy of the system. Zhao et al. [39] displayed the picture acknowledgment after-effects of an example. e locales of the creepy-crawly nuisance of sugarcane cotton aphids have been separated viably to give the condition to the programmed ID and analyze bug irritation of sugarcane cotton aphids. e overall evaluation results have proven 85% segmented correctly. Zhang et al. [40] introduced the identification of cotton diseases with the assistance of a fuzzy inference system. In recognizing and diagnosing the cotton malady utilizing PC vision in farming, determination of proper methods influences the classifier's plan and execution. e fuzzy component choice methodology, which incorporates fuzzy surfaces (FS) and fuzzy curves (FC) techniques, is currently the preferred choice for classifying cotton infected leaf images [41]. To get the best data for recognizing, a subclass of noteworthy autonomous highlights is distinguished, misusing the methodology of the fuzzy element. Firstly, use fuzzy curves to confine a little arrangement of critical highlights rapidly from the arrangement of unique highlights as per their need, take out the misleading element, and afterward utilize fuzzy surfaces to dispose of the highlights subject to the enormous highlights. Brewer and Glover [21] discussed the study that describes that root rot of cotton occurs in similar areas in the fields in repeating years even if there is a difference in the diseased pattern above these years. e authors of [42] present the capability of evaluation of the airborne multispectral detection of cotton plants. e comparative analysis of fuzzy system techniques is shown in Table 8.

Disease Detection Analysis Using Support Vector Machine.
Bhimte and ool [44] initiated the work to identify leaf spot disease of cotton using SVM classification and image processing. e objective of this work confers the utilization of a straightforward picture handling perspective for the programmed analysis of the leaf diseases of the cotton plant. e method of classification depends upon choosing the fitting features, for example, shading, and the surface of pictures is finished by using the SVM classifier. Different diseased leaf features dependent on the surface are misused by the Grey Level Coefficient Matrix (GLCM) for the identification of diseases. GLCM technique for learning the second request factual highlights the extraction framework for checking on the surface that thinks about the spatial relationship of pixel dim-level dispersal [45]. Recognizable proof of disease pursues the means such as staking the picture, differentiate upgrade, changing RGB to HSV, and extracting highlights [46]. e HSV algorithm for leaf disease can be determined [47].
Prashar et al. [48] presented the paper with a solution for the problem identified with farming, which includes recognizing leaf disease by utilizing the visible features. e multilayered perceptron (MLP) comes with overlapping pooling, an adaptable layering system to characterize the plant leaves to identify contaminated leaves. MLP is combined with ANN to forecast the content of chlorophyll from leaves [49]. For the overlapping arrangement of the layer to limit the curves by two-fold surfaced representation, KNN and SVM are used. Different procedures such as morphological division, design coordinating, and tint coordinating are consolidated which restrict the infection area with over 96% accuracy. Figure 8 shows the pictures of the cotton leaf at the beginning phase of improvement.
Prajapati et al. [50] proposed the strategy which utilizes the SVM Classifier. To distinguish the cotton leaf infections precisely, the need for machine learning and image processing is beneficial. e background images are handled through the thresholding process of Otsu. Distinctive partitioned images will help remove the highlights, such as color, shape, and surface, from the images. ese removed highlights are needed as the contributions for the classifier. Various authors used SVM to classify the cotton disease, and the comparative analysis is shown in Table 9.

Disease Detection Analysis Using Neural Networks.
Shah and Jain [52] actualized the use of ANN to detect cotton leaf diseases. e layman approach to distinguish infection from the cotton plant was made by examining it outwardly, which brought about a high measure of mistake because of various visual observations along with lighting; however, with the assistance of the artificial neural system, it was simple to recognize the nature of the leaf of the cotton plant. For the testing, 18 leaf pictures were utilized to recognize 6 unique kinds of diseases. e outcomes were "1" and "0" for the leaf with disease and leaf without the disease, respectively. A comparison of ANN and the actual result is shown in Figure 9.
Pujari et al. [53] proposed the work where the center has been around the early recognizable proof of infectious contamination from visual signs. is supportive work, in actuality, involves errands such as image acquisition, image preprocessing, feature selection, and headway of approaches for the proof of transmissible disease manifestations that impact several farming or growing crops, leading towards the CVS design improvement. e correlation of image processing systems claimed to distinguish the parasitic disease arrangement and proof that has been influenced on various farming or cultivation crops is condensed. e database is made to store the yields of the element extractions. e database is utilized to monitor parasitic disease manifestations that influenced natural product crops, business harvests, and grain and vegetable crops. Kashinath Kamble [54] predicted that identifying the disease is the primary need of this system. Comparative analysis of neural network techniques is shown in Table 10.

Various Cotton Disease Segmentation and Classification
Techniques. Many authors have used traditional and machine learning techniques for segmentation and classification purpose. In this paper, a comparative analysis has been performed for the better analysis of segmentation and classification techniques.
is study is shown in Table 11.

Advantages and Disadvantages of Existing Techniques.
Tables 12 and 13 represent the advantages and disadvantages of existing techniques on segmentation and classification techniques. e review on foundation end and segment methodology was moreover examined. rough this examination, they contemplated that the foundation expulsion shading space change from RGB to HSV is important and discovered that the thresholding framework gives extraordinary results that appeared differently concerning other foundation end Output value Figure 9: Comparison between actual output and ANN [52].    us, at long last, SVM gives extraordinary results, similar to precision, for the course of action of diseases.

Various Benchmark Datasets Used.
A dataset is an assortment of information. We as a whole perceive that to make up a machine learning approach; we need a dataset. In general, these machine learning datasets are used for analyzing the information. A dataset is the collection of homogeneous information. Dataset is utilized to mentor and worth the machine learning model. It has a major role in the system that has efficiency and reliability. If the database is commonplace and noise-free, it will give better outcomes. However, we are advanced with various datasets. It may frequently be business-related data or regular clinical data and much additional information. However, the actual problem is to find out the relevant ones according to the system requirements.  Figure 10 shows the confusion matrix.
(i) Confusion matrix (ii) ROC curve e ROC curve (Receiver Operating Characteristics Curve) is another measurement to Figure 11. e introduction of a classifier model, the ROC curve, depicts the locus of certified positives' locus concerning the pace of genuine positives.
is highlights the affectability of the classifier model. e ideal classifier will have a ROC where the outline would hit a certifiable positive pace of 100% with zero enhancements of x-center point (genuine positives). Since that is impossible when in doubt, we measure what numbers of good positive characterizations are being picked up with an increase in the rate of true positives. An example case of a ROC curve is referred to as follows.
Various parameters can be used to predict the outcome of different models, and a list of parameters is shown in Table 14. Most of the authors have used these parameters for the evaluating the performance of the algorithms. Table 15 gives a summarized view of theses parameters used by various authors.

Gaps and Findings.
After surveying the available research paper on a cotton disease, we have identified some of the gaps in the currently available work. ere are some gaps and findings which are discussed below: (i) Some of the techniques are time-consuming by late identification of the disease, and the process for the detection may be lengthy in few cases [57][58][59]. (ii) In most countries, no proper availability of detected disease management after the crop being affected affects the productivity of the cotton crops. It affects the economic growth of the country [55,60]. (iii) Most of the cotton disease detection methods need basic knowledge of operating the mobile application. In India, most of the farmers have lack of knowledge of such application and their usage. (iv) e available traditional and machine learning algorithms are giving accurate results, but it would be better if these algorithms could detect the diseases in the early stage [56,61]. (v) Researchers need more standard online datasets; therefore, research can be done at a faster pace. (vi) Most of the researchers applied the algorithms on standalone datasets; therefore, there is a need to build a model which can be used for multiple datasets. Linear separation of data cannot be performed; it is complex to obtain the data of the covariance matrix precisely Precision Precision � true positive/(true positives + false positives) 3 Recall Recall � true positive/(true positives + false negatives) 4 Sensitivity Sensitivity � true positive/(true positives + false negative) 5 Specificity Specificity � true negative/(true negatives + false positives) 6 True positive rate TPR � TP/(TP + TN) 7 True negative rate TNR � TN/(TN + FP) 8 F-1 F1 � 2TP/(2TP + FP + FN) 9 Error rate 1 − accuracy

Conclusion
Accurate recognition and disease classification help us improve growth, leading to the development of several advanced techniques to be used in agriculture. Due to rapid growth in various diseases and adequate knowledge, identification and detection have become a major challenge. By monitoring the stages continuously, one can notice the disease with the naked eye, but the only drawback would be the time-consuming and increasing cost. It shows that the image processing had substantiated itself as a viable instrument for recognizable proof and arrangement of plant diseases, while the computerized camera capacities are viewed as a superior alternative for the human eye subbed by a learning calculation. To conquer the troubles of the procedure physically, a few strategies dependent on computer technology vision are created lately to perceive the disease and recognizable proof of agribusiness and agriculture crops.
e quality analysis becomes difficult in captured images as they contain noise. Another restriction is that the images are to be captured under the environment in the conditions of lightning controlled. e methods of image processing execution in genuine use might be difficult because of a portion of the issues. To reduce the impact of light, point of the camera, catching gadget, and separation between objects, powerful adjustment is required. Different issues may emerge because of the changeability of shading under characteristic conditions. e absence of specialized information or image processing methods is critical for successfully executing any computer framework. e greater parts of the strategies proposed by authors are equipped for managing just a single species group among a few diseases in the plants. us, to discover plant infections of various types, a global system has to be built. A significant number of the creators do not give their data of testing and processing information, which is the principal perspective during the assessment of the approval of results. erefore, it is essential to give specific solid measures. For better sicknesses' discovery and their acknowledgment, the dataset of great pictures is suggested. e lighting arrangement should likewise be in the correct situation as it can influence the image captured. As provided by some of the authors of what tools they had used clearly, others are suggested to dedicate more time to the proper implementation of the techniques and thorough understanding of the tools to be used.
In the future, we will collect several images from enriching the cotton plant database and develop a deep neural network-based architecture to improve the accuracy. We will be developing a complete module to detect the diseases of cotton plants with a trained model, and information access will be through smart mobiles. It will also involve the recognition of cotton diseases based on different lands.

Data Availability
e data used to support the findings of the study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare that they have no conflicts of interest regarding the publication of this paper.