Prediction of Diabetes through Retinal Images Using Deep Neural Network

Microvascular problems of diabetes, such as diabetic retinopathy and macular edema, can be seen in the eye's retina, and the retinal images are being used to screen for and diagnose the illness manually. Using deep learning to automate this time-consuming process might be quite beneficial. In this paper, a deep neural network, i.e., convolutional neural network, has been proposed for predicting diabetes through retinal images. Before applying the deep neural network, the dataset is preprocessed and normalised for classification. Deep neural network is constructed by using 7 layers, 5 kernels, and ReLU activation function, and MaxPooling is implemented to combine important features. Finally, the model is implemented to classify whether the retinal image belongs to a diabetic or nondiabetic class. The parameters used for evaluating the model are accuracy, precision, recall, and F1 score. The implemented model has achieved a training accuracy of more than 95%, which is much better than the other states of the art algorithms.


Introduction
Diabetes is a condition in which the body's ability to process sugar (glucose) is impaired [1]. Because of this, glucose levels in the blood shoot through the roof. Hyperglycemia is the medical term for this condition [2]. e body is unable to create enough insulin when this occurs.
ere is also the potential that the body cannot respond to the produced insulin. Diabetes cannot be cured, but it can be managed. Nerve damage, heart attacks, kidney failure, and stroke are possible outcomes for diabetes. Diabetes affects an estimated 8.8% of the world's population, according to statistics from 2017 [3]. By 2045, this figure is expected to rise to 9.9 percent.
Type 1 diabetes (T1D) and type 2 diabetes are the two types of diabetes (T2D) [4]. Most people diagnosed with type 1 diabetes are in their teens or early twenties. High blood glucose levels and increased thirst and urination are the most common symptoms. Oral drugs alone are ineffective in treating this kind of diabetes; hence, insulin therapy is essential. Obesity, hypertension, dyslipidemia, arteriosclerosis, and other disorders are all more common in older adults and the elderly regarding type 2 diabetes [5].
Diabetes is becoming more and more widespread as people's standard of living rises. Diabetes diagnosis and analysis should be studied because of the importance of speed and accuracy. Glucose tolerance, fasting blood glucose levels, and random blood glucose levels are all used to diagnose diabetes in the medical community [6]. e sooner we get a diagnosis, the easier it will be to treat it. Based on a person's daily physical examination data, machine learning can assist humans in making a preliminary diagnosis of diabetes mellitus. e most critical issues in machine learning are identifying useful features and the correct classifier [7]. e standard machine learning methods, such as the support vector machine (SVM), the decision tree (DT), the logistic regression, and others, have recently been applied to predict diabetes [8]. PCA and fuzzy neural inference were used to separate patients with diabetes from those who were not. As a result of the QPSO method and weighted least squares support vector machine (WLS-SVM) developed by Chi et al. [9], type 2 diabetes can be predicted. Diabetes can be predicted using a model developed by Çalişir and Doǧantekin, known as LDA-MWSVM [10]. Linear discriminant analysis (LDA) was utilized to reduce dimensionality and extract features in this system [11]. High-dimensional datasets necessitated logistic regression to build prediction models for diverse onsets of type 2 diabetes. SVR (support vector regression) was utilized by Georga et al. to predict diabetes, a multivariate regression problem [12]. To further enhance the accuracy of the results, a growing number of studies are using ensemble approaches [13]. Combining 30 machine learning algorithms, Ozcift and Gulten developed an ensemble approach known as rotation forest [14].
Diabetes can be accurately predicted using AI-based technologies [15]. Deciduous categorization power is one of the advantages of using decision trees in the medical industry. In addition, a random forest produces a large number of decision trees. Recently, neural networks have emerged as a well-known machine learning technique because of their superior overall performance. In this article, deep neural networks will be used to predict the onset of diabetes.
e proposed work will use the convolutional neural network to indicate diabetes [16]. In addition, the proposed work has used the dataset consisting of retinal images, and a deep neural network will be implemented on this retinal image dataset to predict the disease [17]. e organisation of the paper is as follows: Section 1 describes the introduction of the paper, whereas Section 2 discusses about the background study. In Section 3, proposed methodology is explained with results and discussion in Section 5. Finally, the conclusion in Section 5 followed by the reference section.

Related Work
When it comes to saving a person's life, early diabetes diagnosis is critical. In the last several years, a number of new diagnostic methods have been developed based on various models and methodologies. Neural networks, deep learning, and machine learning are just a few of the methods that can be used to improve facial recognition [18]. Other methods include decision making, KNN, retinal pictures, and face images for diagnosis [19].
Joshi and Borse [20] developed a neural network called back propagation (BPNN). MathWorks (MATLAB) was used to create the user-interface. Researchers utilise the Pima Indian Diabetes Dataset to test their proposed methods. Parsing is conducted after the dataset has been loaded. ANNs were trained using back propagation neural networks after reading the values one by one. During the feature extraction phase, values were grouped together based on shared characteristics, and the groups were then arranged in a table. In the proposed method, the following step was to normalise the data. e data were encoded as a binary number between 0 and 1 [21]. Data redundancy is eliminated, and data relationships are ensured, as a result of normalisation. e final phase in the proposed method was training. e proposed system underwent up to nine iterations of training. e third iteration yielded the lowest level of error. At lower epoch values, the best results were achieved. Regression and validation plots were used to generate the results but accuracy is not. e computational speed and efficiency of feed forward ANN (FFANN) make it popular in today's society. Diagnosis of diabetes can be improved by using the Small World FANN model, according Erkaymaz and Ozer [22]. Researchers considered a four-layered FFANN in their investigation. e network included one output neuron and eight inputs. In FFANN, they made use of two hidden layers. FFANN used two alternative network topologies. Scientists rely on a bipolar-sigmoid function as their activation function in developing the new approach. e SW-FFANN training algorithm was based on a backpropagation learning algorithm. e PIDD dataset from the University of California, Irvine (UCI) repository was used in this study. e drawback is that optimum regular topology for SW-network development had been used in the rewiring procedure, for which DGlobal and DLocal parameters were determined.
A technique based on artificial neural networks was described in detail. Input, hidden, and output layers make up the three main components of an artificial neural network. Raw data are sent to the input layer. Inputs and weights assigned to them determine how hidden layers work. e data were entered into a JNN tool that calculates the attributes' values. Training, testing, and validation of data were then carried out-Binary numbers were the output of the suggested system. As a diabetic, I scores 0 points, while a healthy one scores 1. e proposed system had an average error rate of 0.010. e dataset underwent a total of 158,000 iterations. ere were 767 training samples and 237 validation samples. e limitation of this technique is that its computationally complex.
Aliberti et al. [23] examined the prediction algorithms trained on glucose signals from a large and heterogeneous cohort of patients and then applied them to estimate future glucose levels on a brand-new patient. Based on nonlinear autoregressive (NAR) and long short-term memory (LSTM) neural networks, the authors have developed and compared two different types of solutions that have been successful in numerous time series prediction situations [24]. 2 Computational Intelligence and Neuroscience A deep neural network framework based on stacked autoencoders was presented by Kannadasan et al. [25] to classify the diabetes data. First, stacked autoencoders are used to extract features from the dataset, and the dataset is then categorised using a softmax layer. Finally, the network is fine tuned using the training dataset using supervised backpropagation. Pradhan et al. [26] used skin impedance and heart rate variability to identify diabetes. Classification was accomplished with the usage of artificial neural networks. Six females and five males with diabetes, an average age of 8 to 40 years, had been studied for skin impedance data. In addition, data from eight normal people, five females, and three males, with an average age of 3 to 24 years, were gathered for the study. When it came to determining signal strength at various frequencies, the Welch method was employed. Data on the electrocardiograms of 20 healthy volunteers, 14 men, and 6 women, with an average age of 22 years and 7 months, were gathered. Additionally, information was gathered on 20 diabetes patients, eight of whom were female and twelve of whom were male, with an average age of 40 years and eight months. In order to remove baseline drift from the resulting signal, median filtering was employed during the initial stages of signal preprocessing. Butter worth a lowpass filter was also used to reduce the high-frequency noise.
e Savitzky-Golay filter was then used to smooth the ECG signal [27].
A deep neural network screening model was created by Ryu et al. [28] for patients with undetected diabetes mellitus (DM). Data from the Korean National Health and Nutrition Examination b (KNHANES) from 2013-2016 were used in our cross-sectional investigation. Only 11,456 people were included in the study after removing those diagnosed with DM, those under the age of 20, and those with incomplete data. KNHANES 2013-2015 was utilised as a training dataset and evaluated to generate a DLM for undiagnosed diabetes mellitus. e DLM was tested on a sample of 4,444 people who completed the 2016 KNHANES survey. Age, waist circumference, BMI, gender, smoking status, hypertension, and family history of diabetes (FH) were used to build the DLM. e area under the curve (AUC) of the model was 80.11, which is in line with previous screening models' performance [29].

Materials and Methods
e proposed methodology has been discussed in this section. Multilayer neural networks have been employed in the suggested research as a deep NN [30]. Convolutional neural networks are gaining popularity as data are structured as an image. Normalization is a key part of this procedure, as it is used for most of the data. Before beginning any work, it is highly advised to preprocess the images from dataset. As a result, preprocessed data will help in improved accuracy.
is dataset has been fed into proposed deep convolutional neural network after preprocessing and normalization. Deep neural networks (DNNs) are then used to run and fit our data, resulting in the output. e following sections will provide a high-level overview of the completed work. Figure 1 displays the flowchart of the proposed model.

Dataset.
e images used for performing the analysis is downloaded from the Github https://github.com/ deepdrdoc/Deep-Diabetic-Retinopathy-Image-Dataset-DeepDRiD. In this work, 410 images based on retina are used for performing the analysis and these images are classified for predicting the images as diabetic or nondiabetic.

Preprocessing Data.
As previously stated, preprocessing is critical to this work. Image processing techniques are used to perform preprocessing on this dataset. To do this, the approaches described above have been employed to locate and bold the intensity of aberrant locations and parts. As a result, an unusual structure can be seen in some images. e optic disc and vessels, as an example, are not typical. e findings of previous tests using multichannel images were not very dependable; this was the case even before attempting a solution based on grayscaled data. As a result, grayscale graphics were chosen as the preferred method of presentation. e next step is to normalize the photos after converting them to grayscale.
In the preprocessing stage, data may now be normalised easily by dividing image intensities to 255 (image converted to greyscale previously). e data must first be normalised using a label to form the network. Each picture name contains a class label in the first substring. e preprocessed data have been standardised as of this moment.

Creating Deep Neural Network Model.
is section discusses deep neural networks with seven layers using different activation functions. For example, the first layer of the convolutional1D network uses the ReLU activation function with a kernel size of 5. After preprocessing, the data are reduced to a 256 * 256 grayscale picture [31]. e input shape of size (256, 256, 1) will be utilized as an input. After that, MaxPooling combines the most important features, then flattens the image, and finally, the classification will be done. e dense layer should be utilized because we have  Computational Intelligence and Neuroscience binary categories (diabetic vs. nondiabetic). Because our class labels are binary, our loss function is binary crossentropy. Finally, Adam's batch size is set to 10, and it is used as an optimizer in batch mode. is optimizer will help to prevent the overfitting of the proposed model [32]. Figure 2 has shown the architecture of the proposed neural network with the size representation of the images. e Figure 3 has also shown the use of activation function and the max pooling layer for performing the classification.

Results and Discussion
e proposed work is implemented using the anaconda framework for executing the Python codes. Anaconda framework is rich in libraries based on deep learning due to which TensorFlow and Keras were easily imported in the code. Different aspects of the image taken for the analysis are given below: Figures 3-6 has shown the different shades of the images. Figure 3 has represented the normal image, whereas Figure 4 has illustrated the gray scaled image. Figure 5 has shown the vessels detected by canny edge detection filter, and at last, Figure 6 has shown the generated preprocessed image which is further classified by the deep neural network.
Above Tables 1 and 2 has shown the comparison chart and confusion matrix of the proposed work, respectively. In Table 1, the proposed deep neural network is compared with the existing machine learning algorithms and the results have indicated that the proposed work has performed better than the existing machine learning algorithms.

Conclusion
Diagnosing diabetes at an early stage is critical to finding an effective treatment. Diabetes classification is implemented using a deep neural network, i.e., convolutional neural network, in the current work. e dataset contains more than 410 images based on the retina for diabetes classification. e number of training epochs was kept short of ensuring that the approach could be quickly used on any mobile device.
e experimental results suggest that the proposed deep learning model is effective and accurate. e model has achieved an accuracy greater than 95%. e model for determining all probable complications, including an orderly sequence in terms of the percentage of complications that can occur, will be improved in a future study as well. Additionally, deep learning algorithms and methodologies can be incorporated to enhance the work for an automated diabetes analysis.

Conflicts of Interest
e authors declare that there are no conflicts of interest regarding the publication of this paper.