Semantic Analysis of Public Health Medical Issues Based on Convolution Neural Networks

Text mining and semantic analysis of medical public health issues are the main points for intelligent medical interaction, but less relevant research has been done on them. is article conceives a convolutional neural network for the semantic classication of public health medical issues.e dual convolution layer is used to further reduce the dimension of the data, extract more in-depth information from the data, and map the features. Each convolution layer includes several convolution nuclei to extract semantic characteristics, and then, the complete connection layer is input to the classier to obtain the results of the classication. To check the classication eect, the dictionary articial construction and the double hidden-layers neuronal network are used for semantic classication, and the three methods are compared and tested on the six real datasets.e experimental results show that when the quality of the dataset is high, the convolution neural network method proposed in this paper exceeds the last two methods. e proposed method is higher than the construction of the articial dictionary and the double hidden-layers neural network in the recall rate: 0.153 and 0.037, and greater than 0.07 and 0.01 for the F1 measure rate, respectively. When the quality of the dataset is general, the models of the three methods do not give good classication results. Finally, it is concluded that the convolutional neural network method conceived has a good semantic recognition performance in public health medical issues.


Introduction
In recent years, the application of arti cial intelligence in the eld of medical science has developed rapidly. In smart hospitals such as Internet hospitals and telemedicine, if arti cial intelligence technology can be used to intelligently identify and classify the medical and health issues frequently asked by patients during patient consultation and patient telephone consultation, the e ciency and intelligence of the hospital will be greatly improved [1,2].
Semantic recognition of medical health problems, essentially text semantic analysis, rst converts the patient's question speech into text, then performs the semantic analysis. However, due to the more specialized and complex knowledge involved in the eld of medical health, there are fewer studies on semantic analysis of medical health issues [3]. Deep learning is the hottest direction in the eld of machine learning in recent years; because of its strong feature extraction ability and learning ability, it has been widely used in image recognition, speech recognition, and natural language processing. Traditional text classi cation algorithms, such as the k nearest-neighbor (KNN) algorithm [4], decision tree [5], and support vector machine (SVM) [6], have limited t of complex functions in the case of limited datasets and computational units, which restricts their ability to deal with complex problems, and their learning ability and generalization are not strong [7]. Deep learning transforms the original data into abstract representation layer by layer through multilayer representation learning, automatically learns features from the data, uses its powerful computing and learning capabilities to discover complex structures in high-dimensional data, and the extracted feature information is better used for classi cation and prediction [8,9].
Deep neural network models that are widely used in the construction of text classi cation models include convolutional neural networks (CNN) [10], recurrent neural networks (RNN) [11], and long short-term memory (LSTM) [12]. As an important network model in deep learning, CNN can classify large-scale text data, and the relevant research results show that CNN has great application value in the field of text classification [13]. For example, the text classification models based on deep convolutional neural networks constructed by Shuai et al. can accurately classify rice knowledge texts with different sample sizes and different complexities [14]. Zhou et al. built a convolutional neural network model to learn and classify medical-related papers based on deep learning text classification methods that can improve the classification accuracy of biomedical texts [15].
Although the above results have achieved good experimental results, few scholars have studied this in the direction of medical public health issue analysis [16][17][18]. erefore, to explore the effect and performance of deep learning on public health issues, this paper studies the semantic analysis of medical public health issue texts using improved convolutional neural networks based on the data analysis needs of real medical public health problems.

Related Issues and Datasets
Medical health issue semantic recognition, essentially text semantic analysis, analyzes the core meaning of the question according to the text asked by the patient, and then performs subsequent matching answers and processing. e data used in this paper are derived from the Medical Data Mining Algorithm Evaluation Contest sponsored by the Medical Information Branch of the Chinese Medical Association and co-organized by the Medical Big Data and Artificial Intelligence Group of the Medical Information Branch of the Chinese Medical Association.
Specific task: Classification of public health issues. e training set data are shown in Table 1, where question sentence is the patient's question and consists of a few complete sentences. e question may have six label conclusions, each representing six possible core semantics. e six core semantic labels are as follows: label represents diagnosis; label2 stands for treatment; label3 stands for anatomy/physiology; label4 stands for epidemiology; label5 stands for a healthy lifestyle; and label6 stands for medical choice. If the question sentence contains information about a certain label, the label is set to 1, and if it is not included, it is set to 0, which is a typical multilabel classification problem. Multilabel classification task is that an entity can have multiple labels or be divided into multiple classes at the same time. For example, as shown in Table 1, label 1 and label 2 of the question are set as 1 simultaneously.
In the classification task, a model based on the 4000 training data is built by learning. en, the model is used to perform semantic analysis on the newly arrived data, and verify indicators such as the accuracy of the analysis. e training set has a total of 4000 statements, and the frequency statistics of the semantic labels contained in it are shown in Table 2. In Table 2, the frequency of label 1 is 1500, the frequency of label 2 is 2558, the frequency of label 3 is 1, and so on.

Design of Convolutional Neural Networks for Semantic Analysis
A convolutional neural network is a multilayer supervised learning neural network that contains an input layer, a hidden layer, and an output layer, which uses the gradient descent method to minimize the loss function to reverse adjust the weight parameters in the network layer by layer and improve the final fit degree through continuous iterative training [19]. e hidden layer generally contains a convolutional layer and a pooling layer, where the function of the convolutional layer is to extract features from the input data, obtain feature mapping, and extract deep features in the data by superimposing multiple convolutional layers. e pooling layer is responsible for compressing features and extracting the main features. e output layer takes advantage of the Softmax classifier. For the problems to be studied, this section uses the dictionary method to represent the question sentence text, and then builds a convolutional neural network model to extract features, learn, and semantically classify the text.

Data Preprocessing.
When training convolutional neural network models to process text, it needs to convert the issue sentence into a vector first. e specific steps: (1) remove the non-Chinese components from the question; (2) look for the question with the longest word count and use its word count as the dimension of the vector, assuming N; and (3) all words that appear in all questions are assigned in the order in which they appear, thus converting the question into a vector dictionary, i.e., key -> value, as shown in Figure 1. After obtaining the dictionary, the words of the question sentence are converted one by one according to the dictionary to obtain a training matrix; if the length of a question sentence is less than N, it is all filled with a value of 0 after the last word, filling the length N. As shown in Figure 2, the first 20 dimensions of data are shown. e data preprocessing algorithm is given in Algorithm 1.

Design of Convolutional Neural Networks for Health Issues
Semantic Analysis. For the question sentences in test set, they are converted into a matrix numerical vector form according to the dictionary in first, and then and the vector is analyzed by the convolutional neural network. e convolutional neural network model is designed using a double convolutional layer, and each convolutional layer includes multiple convolutional kernels to help extract features.
In the first convolutional layer, multiple convolutional kernels perform feature extraction on the data, where each element that makes up the convolutional kernel corresponds to a weight coefficient and a paranoid amount. Each neuron within the convolutional layer is connected to multiple neurons that are close to the region in the previous layer, and the size of the region depends on the size of the convolutional nucleus. Convolutional kernels regularly scan the input features in their work, multiply the matrix elements in the sensory field, and finally, superimpose the paranoids.

Mobile Information Systems
After the test set data is processed by the first convolutional layer, it enters the excitation layer.
In the excitation layer, the output result of the convolutional layer is nonlinearly mapped with the Relu() function to improve the fitting degree of the nonlinear case, and after nonlinear processing, the data are fed into the pooling layer. e pooling layer uses maximum pooling to extract the maximum value of a location and its adjacent areas, then uses the maximum value as the value of the location to reduce the dimensionality without changing the depth of the data.
After the above three layers of processing, the output data are again fed into the second convolutional layer, the excitation layer, and the pooling layer. After two convolutions, the data dimension is further reduced, the number of parameters and weights are reduced, and the deeper information of the data can be extracted to obtain feature mapping. e data are then expanded into one-dimensional data, which is fed into the fully connected layer. e fully connected layer nonlinearly combines the features extracted from the convolutional and pooled layers to output them to the output layer. See the description of Algorithm 2 for details. e algorithm uses the binary cross-entropy loss function BCEWithLogitsLoss(), which integrates the sigmoid of the output layer with the loss function BCELoss for multilabel classification problems.

Manually Constructing the Dictionary Method and Double Hidden-Layers Neural Network
To verify the effectiveness of the proposed method, this section uses manual dictionary construction and neural network methods to solve this problem. e manual construction dictionary method extracts the words in each category manually, which removes the words with weak medical relevance and saves the words with strong medical relevance. en, a thesaurus database is established, and the test data can be compared with the lexical database. A dual hidden-layer neural network consists of the input layer, the hidden layer, and the output layer. e input dimension is the dimension of the data, the output [1, 2, 3, 4, 5, 6,7,8,9,10,11,12,13,5,14,15,16,17,18,19,20    dimension is the category number of classification, and the output layer uses the Softmax classifier [14].

e Manual Construction Dictionary Method.
ere are 5000 public health issues, 4000 of which are used to build dictionaries, and the remaining 1000 are used to verify the method's performance.
When constructing a dictionary, establish a dictionary of stop words to process the question sentence first. e dictionary of stop words contains commonly used modal words and medically weak words that may have a negative effect on model performance. So, when we process questions, we remove all the words from the question sentence that are in the dictionary of stop words. Figures 3 and 4 are the results before and after processing, respectively.
Manually constructed dictionary analysis methods mainly include cleaning and parting words for each public health issue in the dataset first. en, take the thesaurus set of six labels: consider a label, if the label conclusion value is 1, then the corresponded question data after processing is proposed, and all the extracted data are combined and summarized to obtain the thesaurus set of the label, a total of six thesauruses are obtained.
Finally, comparing the classification, the test data to be classified are compared with the six thesauruses, and if one of the words of the data exists in the thesaurus, the corresponding label is set to 1. e algorithm framework is as follows:

e Double Hidden-Layers Neural Network.
e double hidden-layers neural network method designed in this paper contains the following steps. First, the regular matching of all the problems is to remove non-Chinese characters, count all the different words, and build a dictionary. e training data Input the data X into the convolution layer (the number of input channels is 1, the number of output channels is 10, convolution core size is 3, and convolution step is 1; after the data are filled, the convolutional kernel scans the data and extracts depth characteristics) (2): e function Relu() is used for the nonlinear processing of the data (3): Features of the data are extracted using maximum pooling, where the kernel_size is 2 and the step size is 2 Step2. Sequential 2(conv2): (1): e data obtained by Sequential1 is input (the input channel number is 10, the output channel number is 20, the convolutional kernel size is 3, and the convolutional step is 1, and the data's feature is extracted again); (2): e function Relu() is used for the nonlinear processing of the data (3): Max Pooling is used to extract the features of the data, where the kernel_size is 2 and the step size is 2 (4): e data obtained from Sequential2 is expanded and fed into the fully connected layer end for Step3. According to the above results, the loss function BCEWithLogitsLoss is used for multilabel classification of test data to obtain the classification results. End ALGORITHM 2: e convolutional neural network analysis algorithm. are then converted into a training matrix based on the dictionary. en, build a double hidden-layer neural network model, and train the model to complete the prediction. Finally, get the training metrics.
A detailed description of the double hidden-layers neural network method is shown in Algorithm 4. ere are two hidden layers. e input dimension of the data is 1622, which is the dimension of the longest public health issue. e first hidden layer has 1622 neurons, which are consistent with the input dimension so as to extract depth features under the condition that the dimension remains unchanged. e number of neurons in the second hidden layer is 811, which is set as half of the input dimension to help it converge and complete the prediction of public health issues. e specific framework is shown in Algorithm 4.

Input:
Training dataset X and test dataset Output: e results are predicted based on the trained model Begin Step1. Hidden1: (1) Enter the data with a dimension of 1622 (2) e function Relu() is used for the nonlinear processing of the data (3) e output data dimension is 1622 Step2. Hidden2: (1) Enter the data obtained by Hidden1 (2) e function Relu() is used for the nonlinear processing of the data (3) e output data dimension is 811 Step3. Out: (1) Enter the data obtained by Hidden2 (2) e function Relu() is used for the nonlinear processing of the data (3) e output data dimension is 6 end for Step4. Based on the above results, the test data are multilabeled classification using the loss function BCEWithLogitsLoss, obtaining the classification results. End Description: e patient was a typical "three high", wanting to take aspirin as a preventive medication, but developed allergic symptoms. Once treated situation and effect: before high blood pressure, three years ago checked out diabetes. Long-term use of metformin and blood pressure medication. How to GET Help: e patient is typically "three high", wanting to take aspirin as a prophylactic but developing allergy symptoms. Can you use Xuesaitong dispersible tablets instead of white aspirin as preventive medication for a long time? Are there any other drugs recommended? '

Experimental Comparative Analysis
e experiment was conducted under the Windows 10 system with the processor: Intel(R) Core(TM) i5-8300H CPU @2.30 GHz GPU: NVIDIA GeForce GTX 1050Ti 4G. e memory size is 16 GB. e programming language is Python 3.8.3, the development tool is Jupyter Notebook, the deep learning framework is PyTorch 1.6.0, and the Cuda version is 10.2.

Text Training.
ere are a total of 5,000 pieces of data in the public health issue dataset, of which 4,000 pieces of data are used as training sets and the remaining 1,000 pieces of data are used as test sets. Train a double hidden-layer neural network model and a convolutional neural network model on the training set and Figure 5 adjusts the relevant parameters. e method of manually constructing dictionaries is established and learned by humans. Finally, 1000 pieces of test sets are used in Figure 6 to test the performance (Figure 7) of the model. e loss and accuracy of Figure 8 training Figure 9 two types Figure 10 of neural networks on Figure 11 a training set of 4000 pieces of data from public health issues vary with the number of iterations as shown in and the training set contains 4000 pieces of data.
In Figure 5, as the number of iterations increases, the loss value of the double hidden-layer neural network model tends to be 0 and the loss value of the convolutional neural network model tends to be 0.32.
In terms of the changing trend of accuracy, with the increase in the number of trainings in Figure 6, the accuracy of the double hidden-layer neural network tends to be 1 and the accuracy of the convolutional neural network tends to be 0.68. In Figure 7, the accuracy of the hidden-layer neural network tends to be 1 and the accuracy of the convolutional neural network tends to be 0.65. In Figure 8, the accuracy of both types of neural networks rapidly tends to be 1. Figure 9 shows that the accuracy of the double hidden-layer neural network tends to be 1 and the accuracy of the convolutional neural network tends to be 0.88. Figure 10 shows that the accuracy of the double hidden-layer neural network tends to be 1 and the accuracy of the convolutional neural network tends to be 0.91. Figure 11 shows that the accuracy of the double hidden-layers neural network tends to be 1 and the accuracy of the convolutional neural network tends to be 0.84. e two curves in Figure 8 coincide and tend to be 0.96.
According to the above results, for the hidden-layer neural network, it can be seen from Figures 6 to 11 that the accuracy rate tends to be 1. At the same time, combined with the loss curve in Figure 10, the loss of the double hiddenlayer neural network tends to be stable slowly, and the loss is close to 0. After analysis, the double hidden-layer neural network structure is relatively simple, and the data can be fitted quickly. However, the double hidden-layer may have the risk of overfitting. Regarding convolutional neural networks, the accuracy of Figures 6 and 7 is finally located at 0.65, indicating that the generalization ability of the model still has a large room for improvement on label1-2. e accuracy of Figures 9 to 11 is around 0.9, manifesting that the convolutional neural network can well learn the potential relationship between data on the label4-6. e reason for the phenomenon that the accuracy of the convolutional neural network in Figures 8 to 11 is significantly higher than that of Figures 6 to 7 is that the label4-6 of data as 1 in the dataset is less than the label1-2 of data.
After analysis, the reason for the curve coincidence phenomenon of label3 can be seen: in the training set of 4000 data, only the label3 of one data is 1 and the label3 of the other 3999 data is 0, resulting in the learning of the two types of neural network models too fast, so the accuracy curve coincides.

Effect Assessment.
After the above model and manual dictionary method are trained, the classification performance of the model is tested on the test set. e classification performance of each model is evaluated using precision,  In Figure 12, the three models are closer to the precision of label1, near 0.36. Label2 is also closer, at around 0.66. e precision of label3-6 is lower. Overall, the classification effect of the three models on label2 is better.
In Figure 13, the recall of the manually constructed dictionary on label1 reaches 0.56, while alternative models are less effective. e model of the double hidden-layers neural network and the convolutional neural network on label2 is better, in which the convolutional neural network is slightly better than the double hidden-layers neural network. On label3, the indicator is all 0. On label3-6, the effect of two types of neural network is poor, while the method of artificial dictionary construction is stable, and the index is located at about 0.50.
In Figure 14, the F1 score of the manually constructed dictionary on label1 reaches 0.44, the double hidden-layers       neural network reaches 0.42, and CNN reaches 0.26. e index of convolutional neural networks on label2 is up to 0.688. All three models perform poorly on the label456.
An analysis of Figures 12 to 14 shows that only on label2, the performance of the three models is relatively stable, and the performance of convolutional neural networks is slightly better than that of double hidden-layer neural networks. e study found that label2 in the test set had a large amount of 1 data. In label3, there is no 1 data in the test set, resulting in a performance indicator of label3 that is all 0, and the model cannot predict it correctly. e data with a 1 in label1 is less than label2 and more than label3-6, resulting in the phenomenon that label2 has the best classification effect, label1 is weak, and label3-6 is very poor. For the method of artificial dictionary creation, the key lies in the creation and optimization of deactivated dictionaries, which is more limited by the medical literacy of the screener.

Conclusion
Experimental results show that in the case of excellent datasets, convolutional neural networks have better performance in the classification of public health issues than double hidden-layer neural networks and manually constructed dictionaries. However, there are still shortcomings in this experiment, such as the preprocessing of text and the establishment of the training matrix, without considering the weight of specific words or words and the relationship between words and words, the stability of each trained model is poor, and the good model that has not yet been trained can achieve a more stable classification effect in label1 to label6. For future experiments, we can study the prprocessing of text and the construction of the training matrix deeply. is experiment helped the author realize the research and application of the convolutional neural network in the field of public health issue classification and medicine, and the convolutional neural network can greatly improve the efficiency and intelligence of hospitals to some extent. And the model obtained by training can assist doctors to make medical judgments and promote the better integration of medicine and computer technology.

Data Availability
e data used to support the findings of this study are available from the corresponding author upon request.  Mobile Information Systems

Conflicts of Interest
e authors declare that they have no conflicts of interest.