Emerging Feature Extraction Techniques for Machine Learning-Based Classification of Carotid Artery Ultrasound Images

Plaque deposits in the carotid artery are the major cause of stroke and atherosclerosis. Ultrasound imaging is used as an early indicator of disease progression. Classification of the images to identify plaque presence and intima-media thickness (IMT) by machine learning algorithms requires features extracted from the images. A total of 361 images were used for feature extraction, which will assist in further classification of the carotid artery. This study presents the extraction of 65 features, which constitute of shape, texture, histogram, correlogram, and morphology features. Principal component analysis (PCA)-based feature selection is performed, and the 22 most significant features, which will improve the classification accuracy, are selected. Naive Bayes algorithm and dynamic learning vector quantization (DLVQ)-based machine learning classifications are performed with the extracted and selected features, and analysis is performed.


Introduction
Cardiovascular illnesses are the chief reason for death as per the World Health Organization. Accurate measurement and analysis of the carotid artery provide early identification of stroke and atherosclerosis diseases, which are caused by plaque deposit. In clinical diagnosis, plaque morphology and stenosis severity characterization may be diagnosed using noninvasive ultrasound imaging [1]. Experienced radiologists outline the layer thicknesses and plaque deposition buy may vary depending on the person. Automatic segmentation and classification of the carotid artery ultrasound images are necessary for accurate and early assessment of the disease.
Biochemical risk factors and vital signs include high C-reactive protein, glycohemoglobin, low-density lipoprotein cholesterol, serum creatinine, factor alpha, platelet-to-lymphocyte ratio, and neutrophil-to-lymphocyte ratio. Biomechanical risk factors include increased plaque area, low stenosis grade, increase in IMT, presence of in-plane curvature, abnormal shear stress, C-shaped ICA, low wall shear stress, and reversal flow. e related diseases because of plaque formation in the carotid artery are diabetes, hypertension, plaque hemorrhage, head and neck radiotherapy, chemotherapy, coronary artery diseases, peripheral artery diseases, and low ankle-brachial index [2,3].
Feature extraction reduces the data dimensionality by removing the redundant features, hence improving the training and inference speed. Nonredundant significant features from the ultrasound images of the carotid artery are extracted and used by machine learning (ML) algorithms to classify the image as abnormal or normal. Using additional features might cause overfitting, and less count features may make the data underfit. A good machine learning model may be built with relevant extracted and selected features from the image. Blood flow features like end-diastolic height, resistivity index (RI), maximum systolic height, pulsatility index (PI), spectral broadening index, and height width index help in plaque identification. Texture, grayscale median (GSM), entropy, coarseness, Hurst coefficient, etc. are some of the useful features [4,5]. e feature mapping to the classes in a computer-assisted diagnostic (CAD) system is called classification.
Loizou et al. proposed a texture feature variation in carotid artery ultrasound video to recognize the biomarkers of the plaque [6]. Followed by denoising and segmentation, the systole and diastole features were extracted from the video sequence. For the normal, symptomatic, and asymptomatic groups, a higher grayscale median was observed in systole and diastole cardiac cycles. Plaque texture features and grayscale median (GSM) were considerably diverse for the two classes. A multiresolution texture classification technique was proposed by Nikolaos et al. to classify the B-mode carotid artery ultrasound image as with or without plaque deposit [7]. Wavelet packet, stationary wavelet, discrete wavelet, Gabor transform, and other basis function decomposition schemes showed effectiveness in identifying the disease. Appropriate features were selected using deviation in values of the features and using a nonlinear correlation coefficient for thresholding. He proved that biomechanical forces affected texture analysis, and they showed horizontal directionality.
Prostate cancer inventions in transrectal ultrasound images by an automatic segmentation approach were proposed by Wang et al. [8]. e low layers of the convolutional neural network (CNN) are suppressed, and the prostate features in the deep layers are increased to identify, select, and influence the multilevel features from the various layers so that each layer features are enhanced. Layer-wise attention is used for enhancing the features in individual layers. e steps are generating attentive maps in layers, concatenate single-layer feature map to multilayer feature maps, and kernel arrangement in the three convolutional layers. e kernels used in the first two convolutional layers are 3 × 3 × 3, and the last kernel has 1 × 1 × 1 layer. From the learned samples in the folds, hyperparameters were identified by cross validation.
A feature-guided denoising CNN was proposed by Dong et al. [9] for removing noise and improving the image quality of portable ultrasound images. A feature masking layer was proposed for hierarchical noise removal in layers. Feature extraction was performed by an explainable artificial intelligence procedure. e highest important features were observed, and the other less important features were also partially preserved by guided backpropagation to map the neural network's choice. e feature images were combined with the Laplacian pyramid fusion method. BI-RADS feature extraction for characterizing the breast cancer lesion, which is continuously varying during the ultrasound sequence, was proposed by Sellami et al. [10]. e analysis was performed for the varying images with morphological and texture features. ey concluded that the feature extraction should be performed in many slices of similar mass to get the optimal segmentation results. Slice selection depends on the radiologist, and another slice may have the necessary useful information, which should not be missed for diagnosis.
Siavash et al. proposed a method to quantify morphological features, which helped in classifying tumor cells and disease identification [11]. By conserving the bigger vessel trunks, morphological parameters like tortuosity were measured.
ey developed an approach to augment the tumor microvessels with ultrafast ultrasound imaging. Wang et al. used a pretrained deep convolutional neural network (DCNN) and developed a multinetwork feature extraction model, which also performed dimensionality reduction and trained an ensemble support-vector machine [12]. Followed by preprocessing of the histological data by scale variation and color improvement, features in many levels were extracted by four trained DCNN. Using dualnetwork orthogonal low-rank learning, feature selection is performed for overfitting modifications and performance boosting of the method.
Zhou et al. proposed an involuntary measurement with phase images for spine curvature using three-dimensional ultrasound volume projection images [13]. Bony feature extraction by twofold thresholding strategy with asymmetric and symmetric measures information using phase congruency was performed. When the signal Fourier components are hugely in-phase, they can extract Mach bands, roof, line edges, and step edges.
Li and Liu proposed a genetic algorithm using information flow for the Naive Bayes classifier. From the view of correlation and angle of causality, a new weight measure criterion is produced by information flow. e approach provided improved classification accuracy compared with other attribute-weighting approaches with Bayes classifiers. e first, third quartile, and median values were higher for the proposed classifier [14]. Zhang et al. proposed a LVQ with a modular reconfigurable pipelining architecture for object recognition and image compression [15]. e method was verified in a 65 nm CMOS prototype chip. Reduced computation time and integration density efficiency were achieved by the approach.
Feature extraction and selection of the carotid artery ultrasound images are performed to assist ML-based classification.
e best significant features are selected by a divergence approach and principal component analysis (PCA). e images are classified as with or without plaque deposit, for early identification of the presence of stroke. is study's layout is as follows. Section 2 gives the methodology of feature extraction, selection, and classification, Section 3 explains the results and discussions, and Section 4 concludes the study.

Methodology
is segment describes the feature extraction of the carotid artery ultrasound images. Appropriate features are selected after extracting the image features. e selected features are classified using Naive Bayes and DLVQ machine learning algorithms, and the classification performances are analyzed.

2
Computational Intelligence and Neuroscience

Carotid Artery Ultrasound Images.
Ultrasound imaging is economically affordable, noninvasive, and continuously improving in image quality that will provide early diagnosis of arterial disease. Intima-media layer thickness, lumen diameter, and texture characterization are necessary for the assessment of cardiovascular illnesses. e ultrasound images are affected by speckle noise and should be effectively denoised, so that image quality is preserved and the useful features are not lost. Curvelet decomposition-based denoising is performed for noise removal. e carotid artery is an artery in the neck region, which transports pure blood from the heart to the brain and the front of the face. It contains the common carotid artery (CCA), which splits into external carotid artery (ECA) and internal carotid artery (ICA). Figure 1(a) presents the carotid artery structure, where ICA provides blood to face and ECA to the brain. e bifurcation is a more plaque-prone area. Figure 1 presents the sample carotid artery ultrasound image with and without plaque deposit. e deepest smooth layer is intima, muscular middle layer media, and the outer layer is adventitia. Lumen is the inner core of the artery. Atherosclerosis is the narrowing of the artery lumen because of fat and calcium deposit in the artery wall, which may lead to thinning of blood flow as indicated. e local thickening in the artery because of fat deposition is called plaque formation.
e intima-media thickness (IMT), lumen-intima (LI) interface thickness, and media-adventitia (MA) interface thickness influence the carotid plaque thickening. IMT measurements for a healthy person are from 0.2 to 0.25 mm.
Carotid artery disease identification includes plaque characterization and layer thickness measurements on age, gender, environmental conditions, and other habits of the person. e increased plaque thickness in the carotid is considered as an early indicator of future cardiovascular risks like stroke and atherosclerosis. e artery is at distance of 1 to 3 cm from the skin and so can be analyzed with imaging techniques. ree hundred and sixty-one carotid artery ultrasound images were obtained, out of which 202 are normal images without plaque deposit and 159 images are with plaque deposit in the artery layers. About 75% (270) of data are used for training and 25% (91) for testing the machine learning algorithms.

Feature Extraction.
Feature extraction and selection prevent overfitting of the training data and poor generalization of the new samples as given in Figure 2. Effective model construction for machine learning depends on the suitable optimized features. For a given feature set . . , f n , the features to be selected should maximize the learner algorithms' ability to classify the carotid image as symptomatic or asymptomatic. F should be a function maximizing the scoring function. e features are normalized or translated in the range [0 1]. us, the data are aligned with similar distributions and help in model fit. Z score normalization procedure is used, which is a standard score technique. Population mean is deducted from the individual data and uniformly divided by the population standard deviation.
Despite variations in class, a useful feature should remain unchanged. Feature extraction categories are based on nontransformed and transformed structural characteristics, and structural and graph descriptors. For high-dimensional feature space, computational complexity increases [16]. A kernel function is made to record the nonlinear data into a higher dimensional space. Kernel is a dot product in highdimensional space. us, all computations are made in the low space dimension. e kernel function is given by K is the inner product of the vectors x and x′ in the lowdimensional feature space for a function ϑ. At an instant y, with slope w 0 , the input's hyperplane is given by Feature selection is to identify the relevant properties of the image for data reduction and reduction in feature set, and to improve accuracy for performance improvement and proper understanding of the data for image analysis. Features may be subdivided based on their relevance. e extracted features may belong to any of the following categories, which assist in deciding whether it is a significant feature.
(i) A feature X i is irrelevant for all the subsets belonging to S i .
ese are the features with very less or zero probability calculated by Kullback-Leibler divergence.
For μ � 0, the feature is surely irrelevant.
(iv) A feature i is possibly nearly inappropriate based on D index computed with n number of examples, with approximation ϕ ≥ 0 and risk δ ≥ 0.
Computational Intelligence and Neuroscience 3 (v) A subgroup V of features is certainly adequate if it also belongs to the complementary of the subset.
(vi) A subgroup V of features is roughly adequate with approximation level ϕ ≥ 0 when If ϕ � 0, the subset is surely sufficient. (vii) A subset V of features is minimally approximately sufficient with approximation level ϕ ≥ 0, and other smaller subsets of ϕ does not exist.
From the carotid artery dataset, more artery characteristic-related features along with the standard features are extracted to analyze the image and identify whether the person has plaque deposit. e dataset was enhanced and labeled by radiologists for accuracy [17], and 63 shape, texture, morphology, histogram, and correlogram-based features are extracted and analyzed with the machine learning techniques.
Along with the existing features, plaque morphologybased features extracted are plaque power spectra for all the three intensity variations (low, medium, and high), namely, shape, connectivity, convexity, plaque size, lipid core, presence of intraplaque hemorrhage, smooth lumen surface indicating no risk, and rough lumen, which may lead to stroke and plaque volume. Figure 3 gives the procedure followed for feature extraction, selection, and classification of the carotid artery ultrasound images.

Extracted Features.
Features are extracted from the carotid artery images in the database following preprocessing followed by segmentation steps. e image is noise-free and contrast enhanced, assisting in the quantization of the image features. e below sixty-three features are extracted from the carotid artery ultrasound image database. Table 1 gives the list of the extracted features. irtythree texture features, five shape features, ten histogram-and     (33) Gray-level co-occurrence matrix (GLCM)-inertia, energy, correlation, contrast, entropy, and homogeneity Gabor wavelet Statistical features-mean, median, kurtosis, and skewness Local binary pattern (LBP)-textures spatial structure Gray-level difference statistics (GLDS)-mean, entropy, contrast, angular second moment, and homogeneity Fractal dimension texture analysis (FDTA) Radial and angular sum of discrete Fourier transform for Fourier power spectrum Neighborhood gray-tone difference matrix (NGTDM), given by strength, complexity, coarseness, and contrast Absolute gradient-mean, variance 2 Shape features (5) Sharpness Complexity Length irregularity Aspect ratio and circularity 3 Histogram and correlogram features (10) Gray-level histogram of segmented ROI of the carotid image-for 32 same measurements, bins were computed Plaque histogram represents plaque characterization Multiregion histogram-to check whether plaque outer region signifies disease progression Grayscale median (GSM) derived from the histograms first-order statistics, and entropy represents echogenicity Histogram of oriented gradient (HOG)-gradient magnitude and orientation Correlogram-statistics and spatial distribution of the features Texture and shape features-normalized; histogram and correlogram features-used without normalization 4 Morphology features (15) Mean probability density functions (PDFs), mean cumulative distribution functions (CDFs) Plaque power spectra for all the three intensity variations (low, medium, and high) Shape, connectivity, and convexity Plaque size, lipid core, and presence of intraplaque hemorrhage Smooth lumen surface-no risk; rough lumen-leads to stroke Plaque volume Computational Intelligence and Neuroscience 5 correlogram-based features are extracted. Fifteen plaque morphology-based features were extracted from the carotid artery ultrasound image database. A detailed explanation of the extracted image features is given below.

Texture Features.
Texture in the image is computed from the gray-level co-occurrence matrix (GLCM), which uses spatial difference among the pixels, which are in pairs with a certain distance, relation, and direction [18,19]. Inertia, energy, correlation, entropy, and homogeneity information are extracted from the matrix. It is proved that the directions 0°, 45°, 90°, and 135°are less dominant for GLCM in medical images, requiring less computation at these angles. For image I(k, k), the co-occurrence matrix is given by where D x � D. cos(θ), and D y � D. sin(θ), θ being the offset defining matrix direction from the central pixel, and D, the distance from the central pixel. For the given angle θ, the parameters computed are as follows: Gabor wavelet texture features are computed in precise directions and frequencies. Statistical features like mean, median, kurtosis, and skewness were computed. LBP gives the characteristics of the texture's spatial structure. GLCM analyzes the pixel pair's gray-level distribution and is known as a second-order histogram. Gray-level difference statistics (GLDS) extracts the texture features mean, entropy, angular second moment, contrast, and homogeneity representing the difference between the pairs of average gray levels. For d � 1, the mean displacement values were computed. e following are the relation between texture and statistical characteristics.
(i) For dis-similar pixel pair in coarse texture region, contrast is high, representing considerable variation in gray level. (ii) A high-energy feature value represents the textural uniformity of the image.
(iii) Uniform GLCM is represented by a large entropy value, which represents nonhomogeneity or degree of disorder. (iv) Homogeneity of the pixel pair is small for different gray-level pixel pairs. (v) Pixel's neighborhood influence represents the correlation.
e pixel statistical properties given by the statistical feature matrix (SFM) are coarseness, roughness, periodicity, and contrast. Pixel's self-similarity is represented by fractals in Euclidean space for a bounded set A for N overlapping copies of itself. For a scaling ratio s, the fractal dimension is given by A rough texture has less fractal dimension texture analysis (FDTA) at image resolution 4. Radial and angular sum of discrete Fourier transform for Fourier power spectrum textural feature is computed [20]. e texture's visual properties are projected in the neighborhood graytone difference matrix (NGTDM), given by strength, complexity, coarseness, and contrast. e absolute gradient value is more when the pixel intensity moves from black to white, and it is less when it moves from dark gray to a lighter shade. Dark to light shade movement is indicated by a positive gradient, and light to dark shade is represented by a negative gradient [21]. Mean and variance are computed from the absolute gradient texture feature. It measures the mean gray-level changes through the image and how far the pixels are from the mean deviation. Pixels of the same gray level in a certain route are given by the run-length matrix.

Shape Features.
Shape-based similarity among the pixels is extracted by shape features. It is a content description based on and performed by feature extraction and likeness measurement among the extracted features. Regioncentered shape feature projects the contour characteristics in the entire image region, and contour-based shape feature gives the shape in the contour alone. Contour-based shape features extracted are sharpness, complexity, length irregularity, aspect ratio, and circularity. Contour detection from the edges is possible with these rotation, translation, and scale-invariant shape features [22][23][24].
For n > 3, K � 2P and for n � 3, K � P, which is the length of the polygon segment and the next segment. 6 Computational Intelligence and Neuroscience where the segment boundary enclosed polygon sides n, discontinuity angle θ, the perimeter of the polygon bounded by segment border P, and the most significant perpendicular distances to the boundary p 1 and p 2 . X and Y coordinates of the picture frame, area of ROI, perimeter, and perimeter 2 / area of ROI were computed to find whether the complexity and size of the carotid image shape help in identifying plaque deposit.

Histogram and Correlogram
Features. For the continuous pixel data, the histogram is its frequency distribution representation. Gray-level histogram of segmented ROI for the carotid image for thirty-two same width bins was computed. e plaque histogram feature effectively represents the plaque characterization. e multiregion histogram is computed for equidistant ROI to check whether the plaque outer region signifies disease progression. Grayscale median (GSM) derived from the histograms first-order statistics and entropy represents the echogenicity and arterial wall ROI's randomness. For the image array Y, T 2 histogram is computed for a known mean scalar A. Image data are obtained from the first-order histogram with n th feature y n (i, j) in the local region M. Let us consider y n constitute discrete values only. e distance vector, which is the gray-level difference between the two pixels, is given by for integers Δ x and Δ y . At distance d, the gray-level difference is given by e HOG feature derived from DCT gives the details about corners and edges in the image. Gradient and orientation information of the artery layer localized region is extracted after separating the image into smaller regions. e histogram of each region is separately computed from the gradients and orientations of pixel values. For directions G x and G y , the gradient magnitude is given by e orientation of each pixel is given by e number of HOG features is dynamic based on the size of each region. Based on the gradient and direction, the weighted HOG is generated. To prevent aliasing problems, in orientation and position centers, bilinear vote interpolation is performed.
Statistics and spatial distribution of the features are represented by a type of histogram called correlogram [25]. Pixel gray intensity from the center is computed to get the correlogram of the image. e distance of each pixel from the center is identified from which equidistant pixels histograms were computed. e calculated distance to the maximum distance difference is divided by the normalization factor, which gives the correlogram comparison. e texture and shape features are normalized, whereas histogram and correlogram features are used without normalization.

Morphology Features.
e presence of structural elements at different scales is identified by morphological features.
e near-isotropic detection structural elements are represented by + in the structural pattern of the plaque. At the two morphological sets, mean cumulative distribution functions (CDFs) and the mean probability density functions (PDFs) are computed. Plaque component-based multilevel decomposition describes the morphology of the image [26,27]. e normalized image is thresholded at low, medium, and high intensities. e unstable plaque is indicated by low dark components, and the bright region indicates stable plaque. e plaque power spectra for all the three intensity variations are used as a feature for plaque classification. e hypergeometric function used to capture the morphology of the image by the n th -order Krawtchouk classical polynomial k is given by For a given x, with m � 0 to N, N > 0, p ∈ (0, 1) if the hyperbolic function F 1 .
is represents the geometrical structure of the artery and the carotid plaque with details of shape, connectivity, and convexity. Clinical diagnosis depends on the artery wall morphology as a weighted map of the weighted original image. If the segmentation results are poor, the contour morphology may miss some details, leading to improper diagnosis. e morphological stream of feature extraction selects all the morphological features required for learning and classification like plaque size, lipid core, and presence of intraplaque hemorrhage [28]. Plaque destabilization leads to plaque rupture, and the disrupted plaque constitutes a thrombus in the distal artery layers. e volume of plaque in the carotid artery is not a clear indicator of the degree of stenosis. Ischemic symptoms can be identified by the plaque morphological features. In spite of any stenosis degree, the features may cause stroke risk. us, the morphological features and the degree of stenosis together give the plaque vulnerability characteristics. A smooth lumen surface indicates no risk, and a rough lumen is considered to lead to stroke.
Blood pressure and plaque rupture cause intraplaque hemorrhage, and it is closely related to cerebrovascular events. e susceptible plaque has a thin fibrous cap, which covers a lipid-rich necrotic core containing inflammatory cells.
e artery attempts to maintain a uniform plaque Computational Intelligence and Neuroscience 7 diameter concerning the lumen. Positive remodeling tends to increase the diameter, and adverse remodeling tends to decrease the diameter, which is called stenosis [29,30]. Plaque volume is proved to be a more significant feature to identify the disease progression and vulnerability than the degree of stenosis. Echolucent plaque, which contains big white blobs, is considered to be dangerous. Calcified and collagen-stuffed plaque is stable and has very less chance of rupture with a tiny high-intensity image. A lipid with scattered components in the low-intensity image can be characterized as asymptomatic. us, unstable plaque is an effect of the thin fibrous cap. From the morphological features extracted, it is understood that plaque instability may not lead to stenosis but reduces blood flow in the artery.

Feature Selection.
A robust subset of the extracted features is selected to reduce computational complexity and improve classification accuracy. Feature selection aims at dimensionality reduction, retaining the discriminatory information from the images. e maximum discriminant features from the extracted features are carefully chosen depending on the subsequent method. e distance among two classes for all the features are generated as below for mean m 1 , m 2 and standard deviation Significant features are identified using the distance measure, and increased distance means more the significance. Sixty-five features are extracted, from which twentytwo most relevant features are considered for the classification process. Along with this method, the principal component analysis (PCA) method for the selection of features is performed. Eigenvalues are used for deriving the principal components. Uncorrelated feature set and principal components are achieved from the correlated ones, orthogonal transformation. e steps in PCA feature reduction are as follows: (1) e mean value of the dataset D is computed.
(2) A new matrix S is formed by subtracting the mean value from D. (3) From S, covariance is estimated at C � AAT. For data 1 to l, (4) Eigenvalues derived from the covariance matrix are [V 1 , V 2 , . . . , V n ]. (5) Eigenvectors are found from the covariance matrix. (6) Vector D − D, a linear function of eigenvectors, is given by e symmetric covariance matrix is For a normally distributed dataset, the principal components are independent. e PCA depends on the individual variable screening. e PCA reduces the dimensionality of the dataset and increases interpretability with a reduction in loss of information. e variances are maximized one after the other by developing variables, which are uncorrelated.

Classification.
e extracted features were given to machine learning classification algorithms, namely, the Naive Bayes algorithm and dynamic learning vector quantization (DLVQ), for a more accurate classification of the carotid artery ultrasound images.

Naive Bayes Algorithm.
Bayes theorem considers an event probability based on prior knowledge of the related conditions with statistical interference.
e Naive Bayes classification algorithm is a supervised approach, which depends on the Bayes theorem and assumes that the occurrence of a feature is not dependent on other features in the image. Frequency tables are generated from the dataset, followed by probability-based likelihood table generation and posterior probability calculation based on the Bayes theorem that are performed. Since the dataset followed a normal distribution, Gaussian-based Naive Bayes classification is performed for the dataset images. Weighting the features and structure extension is used to overcome the drawback of the complete feature independence assumption. A kernel function is used to estimate the data distribution instead of following a normal distribution.
For the two-class problem C 1 , C 2 , samples are indicated by n dimensional vectors.
→ needs to be found with the highest a posteriori probability. Y lies inside the class C based on the following condition.
e numerator value should be maximized to get a higher probability, since the probability of getting the Y vector is a constant.
e assumption that the classes are conditional independent is performed [31,32].
e Y 1 and Y 2 probabilities can be obtained from the training set.
Since Naive Bayes assumes that all the dimensions are independent, the classification accuracy is minimum compared to the LVG algorithm.

Dynamic Learning Vector Quantization (DLVQ).
e LVQ is a winner take all Hebbian learning-based supervised learning approach.
e algorithm classifies the sample with the same label as the codebook prototype, similar to the nearest neighbor classification. In the training phase, LVQ iteratively goes through the training sample y j , shifted to the nearest available prototype n m(j) nearer or farther from y j . is depends on if the n m(j) is in the same class as y j .
ρ is between 0 and 1, and it is a step size, which is monotonically reducing [33,34]. An adaptive version of LVQ is DLVQ, which can acclimate the neuron count to the dataset. It is a computationally less cost approach that has two layers of the fully connected network. e output parallels to one of the classes and matches to a reference vector with weights assigned. e output value is the consolidation of the outcomes of all the output neurons, which signifies the distance between the input sample and the reference vector. e distance helps in weight updating during the training phase and assigning the neuron to the corresponding output during the classification [33]. e frequency-sensitive competitive learning approach is used for training the network. DLVQ is used with global average pooling so that the model parameter count is minimized, thus reducing overfitting.

Results and Discussions
For the carotid artery ultrasound images, feature extraction, selection, and classification are performed. e results are analyzed with the classification performance measures.

Feature Extraction and Selection.
e features extracted from the carotid artery ultrasound images are normalized and plotted for normal images without abnormalities and abnormal images with plaque deposit. From the bar plot, if a particular feature records significant variation for normal and abnormal images, it is considered as a significant feature. If a feature does not show variations for normal and abnormal images, it is considered not a significant feature and is omitted. Figure 4 shows sample discriminant features such as skewness, kurtosis, histogram, HOG, and correlogram. Features with more variance help in machine learning classification since the values are far apart for normal and abnormal cases. For sample features such as correlation, contrast, entropy, and variance, the normalized feature values are indiscriminate and are hence not useful for machine learning classification. Table 2 gives the selected features based on discriminant and PCA feature reduction technique for the carotid artery ultrasound images. e statistical PCA model identifies the discrete pattern in the data and thus selects only the required features. e input neurons of the classification algorithms are powered by the potential features.

Classification Performance Measures.
e selection of appropriate performance measures for evaluating the machine learning classification algorithms provides faith for its real-time application. Since the data are imbalanced, applying more measures provides a clear insight into the algorithm usage. Classification of the carotid artery ultrasound images is a binary class problem, correct classification as fitting to the class being true positive (TP), correct classification as not fit in the class as true negative (TN), wrongly categorized to a class false positive (FP), and misclassified to a class as false negative (FN). e general efficiency of the model is given by Agreement of the labels with positives is given by Sensitivity and recall give the efficiency of identifying positive labels. e data out of the classifier and the positive labeled ones' relationships are measured by F score. Negative label identification performance is measured by specificity. Recall is given by TPR, and probability is given by FPR. Area under the ROC curve (AUC) gives false classification identification presentation of the model, where 1 depicts a n ideal model [35,36]   probable cutoff values in all classification thresholds. e ordinal curve gives the contribution of the features and their significance. It helps in the data to rely on one final side. Figure 5 gives the ROC curve and gain chart of the Naive Bayes classification. It has recorded an area under the curve training of 61.46% and 47.50% for testing. e gain chart gives the whole positive rate percentage to the total count percentage. Figure 6 gives the Box-Cox plot for the plaque diameter, which is a linearity plot giving the correlation between the transformed StDev and the given lambda values. Figure 7 gives the pure ordinal curve for sample features meangl (mean gray level), kurtosis, contrast, and skewness. Table 3 provides the confusion matrix of the applied ML techniques, namely, Naive Bayes and DLVQ, performed in 361 carotid artery ultrasound images. Table 4 gives the comparison of the Naive Bayes and dynamic learning vector quantization approaches using the classification performance measures. Table 5 separately gives the accuracy of the Naive Bayes and DLVQ algorithms with the extracted sixty-three features and the selected twenty-two features. Improved accuracy is found in both the machine learning algorithms on using only the selected features.
Classification of hard and soft plaques will aid in diagnosing dangerous plaque types and may be implemented as a   Computational Intelligence and Neuroscience future work. A patient-specific model with more measurements from a single patient including boundary conditions, pressure, geometry, 3D images, and thermal images may be implemented to improve diagnostic performance.

Conclusions
Sixty-three features consisting of texture, shape, histogram, correlogram, and morphology were extracted from the carotid artery ultrasound images. Principal component analysis (PCA) and feature analysis by feature significancebased feature reduction were performed to minimize overfit and underfit problems. e robust subset from the extracted features reduces model complexity and computation and improves the efficiency of the classification task. Classification of Naive Bayes and DLVQ algorithms is performed using the extracted and selected features. DLVQ with the selected features recorded an accuracy of 91.68%, specificity 98.51%, sensitivity 83.01%, precision 97.77, F score 89.78%, and AUC 98.14%. DLVQ has proven to give improved     Data Availability e data are not from any public database. So the authors will not be able to share the data.

Ethical Approval
Ethical clearances were obtained from SRM Medical College Hospital and Research Centre, India, number: 1736/IEC/ 2019.

Conflicts of Interest
e authors declare no potential conflicts of interest.