Traffic Flow Prediction and Application of Smart City Based on Industry 4.0 and Big Data Analysis

For smart city trac ow prediction in the period of big data and industry 4.0, the prediction accuracy is low, the prediction is dicult, and the prediction eect is dierent in dierent geographical locations. is paper proposes a smart city trac communication forecast based on Industry 4.0 and big data analysis application. Firstly, this paper theoretically explains the application scenario of urban trac fault text big data and analyzes the characteristics of related problems, especially the fault problems. Secondly, the AC trac prediction algorithm is studied, and the application analysis of PVHH, IDT, and Ford–Fulkerson algorithms is applied, respectively. Finally, the above three algorithms are used to predict and analyze trac ow.


Introduction
With the advent of the industry 4.0 era, arti cial intelligence and big data analysis play an important role in China's information construction. In order to understand the occurrence, development, diagnosis, and treatment of diseases more accurately, it is necessary to analyze the whole molecular measurement in multiple groups and obtain more abundant information resources from the analysis, so it is necessary to evaluate more data. At this time, the above problems can be e ectively solved by using arti cial intelligence [1]. Depression is a common psychological disease in today's society. ere are many people su ering from this disease, which seriously a ects everyone's health and social function [2]. Depression alone a ects 11% of the world's population, where mental health has caused great pain and damage. In order to e ectively treat this disease, arti cial intelligence and big data technology are increasingly used in depression, providing new methods for clinical diagnosis and treatment. Of course, there are not many markers to prove mental health, so that it depends on the questionnaire data of patients and doctors for explanation [3]. Nowadays, microbiology is one of the important disciplines in biology, which includes a wide range of bacteria, viruses, and fungi, and all belong to microorganisms, and it is closely related to human beings [4][5][6]. It has been identi ed as one of the causes of many cancers, such as Helicobacter pylori. Man is its only host, and it is almost impossible to heal after being infected. Helicobacter pylori plays a very important role in the treatment of gastric cancer. With the development of sequencing technology, a large number of complex data has been generated. However, there are still obstacles in the analysis of these data, which is not conducive to making correct decisions. However, the emergence of arti cial intelligence helps and properly solves the doctors' processing of these data [7]. e rapid development of arti cial intelligence (AI) and big data has stimulated the tide of various social networks and produced a lot of social data worth analyzing [8].
Mining the relationship among social organizations, networks, and media is a key point of social computing. e large increase of these data makes it more di cult to mine large-scale social data. Now, the combination of human intelligence and arti cial intelligence is applied to social computing, which provides more methods for the analysis and detection of social data, and is a new direction of arti cial intelligence and big data research [9]. Today's electromagnetic environment is still not optimistic, and spectrum resources are relatively few. In actual division, there will be uneven distribution, and the existing monitoring level is not enough, so there is no way to fully grasp the frequency usage. In order to solve this problem, an electromagnetic spectrum monitoring scheme combining big data and artificial intelligence is proposed, which mainly aims at various applications and related businesses and strengthens the construction of handheld monitoring systems, big data analysis systems, and electromagnetic spectrum monitoring system [10].
In teaching, the effect of online education is far less than that of actual classroom teaching. In order to improve the learning effect of online teaching combined with the actual needs of online education, the evaluation technology based on artificial intelligence big data technology is established with evaluation as the center of teaching, Model analysis is carried out through actual teaching, and various functional modules are established based on learning objectives, all of which are aimed at developmental evaluation. e results show that some models have good performance [11]. e safety of urban traffic is a permanent theme. Bus, as an important national facility and a means of transportation with high frequency, has a great responsibility for ensuring the safety of people's lives and property. In recent years, with the rapid development of expressways in China, expressway undertakes more important transportation tasks. However, the occurrence of various disasters and other unexpected events also bring great hidden dangers to highway transportation safety.
is paper focuses on the research and application of big data analysis technology for urban traffic accidents, as well as the establishment of cloud service network for urban traffic emergency management and the proposal of highway cloud resource scheduling based on cloud computer and double-layer particle swarm optimization.
e integrated management of high-speed emergency big data and the optimization of the emergency scheme were studied, and good results were achieved [12][13][14][15].

Analysis and Application of Urban Traffic Fault Text Big
Data. Urban transportation has entered the era of big data. e analysis of urban traffic faults should be composed of safety supervision report, accident database, and other parts. Big data analysis is used to realize the functions of retrieval, extraction, intelligent classification, and related analysis of urban traffic faults [16,17].
China's urban traffic safety monitoring system is composed of monitoring object layer, monitoring layer, and management layer. Because of the different monitoring objects, it can be divided into three types: people, equipment, and environment. Its dataset has four characteristics, namely, scale, diversity, rapidity, and value.
(1) Scale generally refers to the amount of data.
(2) Diversity: It means that its data comes from many kinds of sources, which is beyond the data range previously included, including semistructured data and unstructured data. In addition, it also analyzes various data such as weather, earthquake, and ministry of public security, shown as follows: Figure 1 describes the classification of traffic big data and whether the internal data and external data mainly come from the transportation department are being judged. In internal data, structured data can be stored directly while Figure 1unstructured data cannot be stored directly, so it needs to be converted into structured data for storage by technical means. (3) Fast speed: big data mining lies in the fast processing speed, that is to say, data streams are mostly highspeed and need fast and continuous real-time processing by processing all kinds of data in time to ensure the safety of urban road driving. (4) Value: the value of road safety lies in the use of data, statistical analysis, and classification algorithm to analyze big data, so as to find correlation and knowledge, predict accident failure safety problems, and provide basis for ensuring driving safety.
Urban traffic is a complex transportation system. Many experts analyze accidents and faults around safety evaluation, which provides favorable decisions for the prevention of safety accidents and faults. Experts use the accident fault data accumulated over the years to analyze the development rules of accident fault from the perspective of data analysis.
is paper uses text big data analysis technology for statistical analysis to promote the application of urban traffic safety big data. e application of fault analysis to accidents includes the following functions: feature extraction, accident-prone areas, fault analysis, full-text search, association analysis, and system management ( Figure 2).

Full-Text Retrieval of Urban Traffic Faults.
In the era of big data, it is of great significance to realize full-text retrieval through urban traffic big data technology. In this paper, through the establishment of full-text retrieval, combined with the actual traffic situation, the storage of traffic unstructured accident fault text, index building, Chinese word segmentation, and full-text retrieval is realized to find important messages in accident fault text.
Failure text retrieval is about indexing documents, queries, and the relationship between the users by using TF-ID to retrieve and text analysis.
TF denotes word frequency, and the formula is as follows: In the above formula, n i,j represents the number of occurrences, k n k,j represents the second sum of occurrences, and the denominator is added with 1 to prevent the denominator from being 0.
IDF denotes the reverse document frequency as follows: where k i represents the number of documents and N represents the size of D. e denominator is added by 1 to prevent the denominator from being 0. Combine TF with IDF to get the weight: Document D j is reorganized into vectors with word weights:

Mathematical Problems in Engineering
e cosine distance is calculated as follows: Finally, according to the results, the documents can be arranged to select the most suitable document for the user.

Technical Research and Analysis of Urban Traffic Emergency Management.
e achievements of China's urban transportation can be said to attract worldwide attention, and it has won worldwide recognition for its characteristics of "high efficiency, high safety, and high quality service." It is very important to establish a perfect safety early warning and emergency management model. Because emergencies are unpredictable, an emergency management mode derived from cloud computing can be formed, a cloud service network can be established, and technologies such as Internet of ings and big data can be used to improve the efficiency of dealing with emergencies. Figure 3 illustrates the main functions of the urban traffic emergency platform (CEP), which is mainly divided into 9 functions, each of which can achieve different target requirements. It has certain intelligent control value for traffic control and realizes the goal of intelligent transportation in Industry 4.0 ( Figure 3).

e Main Functions of the Urban Traffic Emergency Platform (CEP).
By establishing an "emergency cloud" to realize the application and deployment of network resources, the utilization rate of resources is greatly improved.

Emergency Cloud Service Virtualization Modeling.
In the field of cloud computing, virtualization technology is a very important key technology.
For virtualized storage resources, the formula is as follows: where D ij is the Jth virtual machine virtualized from the I-th storage server and s i is the number of virtual machines in the storage. For computational virtualization, the formula is as follows: where C ij has similar functionality to D ij . For virtualized rescue services, the formula is as follows: where T ij is the virtual rescue vehicle service from the i-th rescue vehicle server and 4S is the virtual service number in the rescue vehicle service.

Cloud Computing Model.
e resources used by the emergency platform are virtual cloud services and they are shared, but there are also constraints as follows.
All rescue system resources should be less than the total storage pool resources, and the formula is In the above formula, y n is the number of emergency rescue systems and DG max is the total number of virtual resources in the storage pool.
In the above formula, CG max is the total number of virtual resources in the calculation pool.
In the above formula, NG max is the total number of virtual resources in the network pool.
In the above formula, TG max is the total number of virtual resources in the rescue vehicle service pool.
In the above formula, PG max is the total number of virtual resources in the rescue team service pool.
In the above formula, RG max is the total number of virtual resources in the emergency materials service pool.
rough the above establishment and application research of emergency cloud, cloud computing, and big data technologies, the application and deployment of emergency platform network resources are completed, and a doublelayer particle swarm optimization algorithm is proposed to establish constraints and effectively determine the number of emergency cloud resource scheduling.

Urban Traffic Operation Model.
With the rapid development of urban transportation in China, the demand for passenger transport is also growing day by day, and various capacity scheduling problems in passenger transport are obvious. In order to solve the capacity problems, the adjustment of operation scheme and operation diagram has become very frequent. Passenger flow is the basic basis for determining the operation plan. Short-term passenger flow forecasting method and gradient lifting decision tree method are used for comparative analysis.
(1) e classification of short-term passenger flow forecasting methods is as follows (Figure 4): Passenger flow forecasting is based on the time characteristics of historical passenger flow and predicts the total amount and distribution of future passenger flow. Measurement, such as the prediction of future lines and related traffic at each station. (2) CART decision tree and gradient lifting algorithm.
Decision tree model is a nonparametric classifier, which is composed of regression tree and classification tree, and the two tree types are different in essence.
e CART decision tree formula is as follows: F(j, s) � arg min min In the above formula, C m is the mean value generated after division.
J.H. Friedman, a professor at Stanford, invented gradient lifting method, which is one of the ensemble algorithms. As an iterative decision tree algorithm, it has fast training speed and can reduce prediction deviation, so it is one of the most effective methods in machine learning algorithms. Its basic origin is as follows: In the above formula, h(x; a m ) is the subtree, a m is the parameter, and β m is the weight in the prediction function. e first regression tree: Negative gradient of loss function: Update prediction function: Mathematical Problems in Engineering 5

PVHH Prediction Model and IDT Prediction Algorithm.
Combining the model analysis of urban traffic in the previous chapter with the passenger data of Tianjin network car and taxi, this paper puts forward the PVHH prediction model and IDT prediction algorithm of passenger capacity. e general framework of its passenger hotspots is shown in Figure 5. e PVHH prediction model is based on the collected data of taxis and network cars and then analyzes the popular passenger points in the past, extracts the flow and distribution of passengers, and understands the trend of mobile personnel in the whole city.
e IDT algorithm mainly uses the information gain theory in decision tree to analyze the influence of different data at each moment on the predicted value. e labels of prediction models are original features (hotspotsi, m) and original labels (hotspotsi, m), respectively. In order to be suitable for machine learning classification algorithms, it is especially necessary to quantify data and weigh them. e    specific characteristics of prediction model labels are as follows: Adaboost algorithm constantly updates the sample weight value to achieve the correct sample weight value reduction. e wrong sample weight value increases this classification purpose. Because taxis and network cars need to update the data of popular passenger points in real time, the Adaboost algorithm can update the prediction model. e IDT algorithm is shown in Algorithm 1.

Ford-Fulkerson Algorithm.
Nowadays, urban traffic congestion is becoming more and more serious, so it is very important to analyze and evaluate traffic bottlenecks. Aiming at the three problems of node bottleneck, road bottleneck, and regional bottleneck, taking Chaoyang road for 4 days as an example, the congestion situation of this road section is studied, as shown in Figure 6. It is of new reference significance to analyze traffic flow through data statistics in different time periods, and the traffic state in different time periods is different. erefore, it is necessary to adopt the most reasonable and scientific forecasting methods for traffic conditions in different time periods. Generally, the traffic pressure is large in the morning and evening, and there are many geographical locations for analysis, so the global analysis scope is large. It can effectively reflect the advantages of the algorithm, can be analyzed for different time periods, and has the advantage of fast convergence speed.
Algorithm is the core model to solve the bottleneck of traffic flow. It uses labeling method to search the augmented path continuously until there is no path to search, and then get the maximum feasible flow. In order to identify the bottlenecks that lead to traffic obstruction and minimize the traffic flow, a bottleneck identification model of urban traffic network can be established by combining the road network framework, as shown in Figure 7. e bottleneck identification model of urban traffic network is shown as follows:

Experimental Simulation Comparison
To establish the Ford-Fulkerson traffic flow prediction and analysis model, it is necessary to analyze and apply the whole traffic flow effectively. is paper analyzes the traffic flow data of taxis to analyze the prediction effect and application of traffic flow in different places and different time periods. e four intelligent algorithms used in this paper have the effect of traffic prediction. ey have a good application prospect in the field of transportation, especially when there is an optimal problem between global and local. erefore, Input: import trainset (hotspotsi, m, w)--dataset R; Number of times--K; Scheme-single-level decision tree; Output: composite model.
Mathematical Problems in Engineering traffic flow forecasting is a global optimal problem, and intelligent algorithms can quickly realize the global minimum cost to achieve traffic forecasting and guide traffic planning.
Among the traffic flow forecasting methods under different algorithms in Figure 8, Ford-Fulkerson has certain advantages, with an average error of 0.31. Other algorithms have large errors and are unstable in prediction accuracy.
In order to forecast the traffic flow at different locations, the paper selects 10 traffic centers to forecast the traffic flow. e prediction effect is shown in Figure 9. en, analyze the error comparison of experiments under different algorithms, as shown in Figure 10.
In Figure 10, the prediction error comparison of different algorithms in different time periods every day is based on the deviation of relevant positions after 10 positions are    predicted as the position error comparison. Ford-Fulkerson has lower error and better stability ( Figure 10). Next, t he prediction effect is analyzed from the prediction accuracy, as shown in Figure 11.
In Figure 11, the prediction accuracy of the above four algorithms is compared for different days in one month. e prediction results of the four algorithms are not ideal in the early stage, and the highest prediction results of Ford-Fulkerson are 0.62. With the increase of days, the accuracy of the four algorithms is also improving. Because there are many early learning data, it can provide some reference for later prediction, and the accuracy is constantly improving. Finally, the Ford-Fulkerson accuracy rate is above 0.96, and the prediction effect is good.

Conclusion
With the in-depth development and technology application of smart cities, intelligent transportation technology has become a new technical means of urban development. is paper puts forward the prediction and analysis of traffic flow supported by Industry 4.0 and big data technology, which can play a key role in traffic development in smart cities.
ere are still some problems in the proposed algorithms, such as prediction accuracy, time, and other factors. Future work focuses on time delay and unpredictable factors in traffic forecasting and then puts forward corresponding models and solutions.

Data Availability
e experimental data used to support the findings of this study are available from the corresponding author upon request.