A Study on the Health Output Effect of Chinese Medical Service Industry Agglomeration Based on Big Data Analysis

The agglomeration health output eﬀect of the medical service industry in the era of big data is an important part of the agglomeration innovation of medical resources. This paper used the regression model of data mining to set up the ﬁxed eﬀect model and system GMM model to study the relationship between the agglomeration of medical service industry and resident’s health level, based on the panel data of 31 provinces of China from 2003 to 2017. The results show that the health outcome of the medical industrial agglomeration is positive and diﬀerent in provinces. The inﬂuence of medical service cluster on residents’ health level in the eastern region fails the signiﬁcance test, while the medical service cluster in the central and western regions can signiﬁcantly improve residents’ health level. And, this eﬀect is also related to the characteristics of medical resources, economic development, demographic characteristics, and other heterogeneous factors. On this basis, the paper puts forward policy suggestions to promote the market structure of the medical industry from the aspects of strengthening synergies and policy guidance.


Introduction
In 2018, Paul Zane Pilzer, an American economist, proposed that the healthcare industry is a star industry with a global scale of trillions after the electronic information industry. From the development practice of all countries in the world, the health care industry has become a strong driving force for the growth of national economy. In OECD countries, the added value of the health care industry accounts for 10% of GDP, while in the United States it has exceeded 17%, while in China it is only 5% [1]. With the advancement of a new round of health care reform, the development of health service in China has risen as a national strategy. In October 2016, the "healthy China 2030" planning outline, issued by the State Council, explicitly pointed out that the total scale of China's health service industry will reach 16 trillion yuan by 2030, and the medical and health industry will become the pillar industry of China's national economy. With the development of big data, the most important innovative resources in the healthcare industry will focus on the allocation and utilization of information resources, so the economic benefits of big data in healthcare are becoming increasingly prominent. McKinsey's (2013) e Big Data Revolution in Health Care estimates that big data has reduced health spending in the United States by $300 billion to $450 billion, equivalent to 12 percent to 17 percent of US health spending in 2011 [2].
As an important branch of medical and health industry, the health output effect of medical service industry agglomeration is an important content of the innovation of health and medical resources agglomeration. ere has been fast development of medical services in China since the reform and opening up. According to statistics, the total number of national medical and health institutions increased from 1978 in 170000 to 2017 in 987000, practicing physician per thousand population rose from 1.08 in 1978 to 2.44 in 2017, and medical and health institutions of beds per thousand population from 1.94 in 1978 rose to 5.72 in 2017. However, because the regional imbalance in the development of China's medical service industry still exists, the phenomenon of cross-regional medical treatment is increasing, and the medical industry cluster in different regions is significantly different. Will the level of medical industry cluster affect the health level of residents? Does it promote or inhibit the health of residents? What factors influence its performance? e regression analysis method based on data mining technology to study the health output effect of the agglomeration of China's medical service industry will be helpful to promote the agglomeration innovation of China's medical and health resources, which effectively solve the heterogeneous differences in the distribution of regional health resources.

Literature Review
Industrial cluster is a geographical phenomenon in the dynamic evolution of industry, so scholars at home and abroad have conducted many fruitful researches on this economic phenomenon. However, there are few studies on cluster of medical service industry, especially the literature on the influence of cluster of medical service industry on health. After reviewing the existing literature, it can be summarized into the following aspects.

Research on the Essence and Development Mode of
Medical Service Industry. Scholars mainly study the connotation of medical service industry from the aspects of economic impact and medical reform policy. In terms of economic impact, Relman proposed that the medical service industry was a high-profit industry, which not only affected the implementation effect of the national health policy, but also promoted economic growth [3]. DeVol and Koepp measured the cluster degree of health services in the US, and the empirical evidence showed that health services in the US played a significant role in promoting economic development [4]. Zhou and Liu put forward that the medical service industry is an important channel to achieve economic transformation and upgrading and put forward suggestions on developing the medical service industry from the aspects of top-level design, reform, and innovation [5]. Gao pointed out that, in the stage of high-quality economic development, promoting the supply-side reform of the medical service industry is the key to achieving the sick having access to medical care [6].
In terms of medical reform policies, Serrano Ibis analyzed the current situation and characteristics of American medical service industry and discussed the impact of American medical reform on the pharmaceutical industry [7]. Devlin-Foltz et al. pointed out that the social security system plays a leading role in the income level of most American retirees based on the perspective of distribution [8]. Schoen et al. proposed policy recommendations to improve access to medical services and reduce the economic burden of high cost of health services, regarding low-income and disadvantaged groups as the research subjects [9]. Domestic scholars have conducted extensive studies on medical reform policy. For example, Xing focused on medical system reform on public hospitals and proposed that public hospitals are the core carrier of the development of medical service industry [10]. Song et al. studied the allocation of public resources between different hierarchies and regions and pointed out the importance and urgency of hierarchical medical reform [11].

Research on the Mechanism of Service Industry Cluster.
Scholars at home and abroad have studied the cluster mechanism of service industry mainly from endogenous mechanism and exogenous mechanism. In endogenous mechanism, firstly, the cluster of service industry reduces the information cost and transaction cost of enterprises, thus gaining advantages in competitive cost (Keeble & Nachum) [12]. Secondly, the complementary symbiosis is the driving force of service industry cluster; that is, agglomeration between industries within the service industry and between the service industry and other industries can effectively achieve its ultimate goal (Pandit et al.; Bathelt) [13,14]. irdly, the service industry cluster aims to acquire tacit knowledge. Tacit knowledge cannot be transmitted by traditional means and can only be acquired through close interaction between industries (Zhao et al.) [15]. Fourthly, the aggregation motivation of service industry is the demand for industrial innovation.
rough agglomeration, a large-scale innovation network is formed, and the communication between industries can stimulate the application of new technologies and methods (Keeble & Nachum; Han) [12,16]. In exogenous mechanism, most scholars believe that the main reason for service industry cluster is externality, which means that, due to external economies of scale, enterprises can reduce transaction costs by concentrating in geographical regions to improve their competitiveness ( [13,[17][18][19][20]. In addition, some scholars hold that the motivation of aggregation is the reservoir effect of human resources. e high-quality talents can reduce the search cost of human capital of enterprises and achieve common benefits in the area.

Research on the Influence of Medical Service Industry
Cluster on Health. Health was originally a medical problem, but since the 1980s, scholars at home and abroad began to study the relationship between health and economy. Most studies have focused on the impact on health such as medical insurance and health expenditure. Literature studies have focused on the impact of health insurance and health expenditure on the resident health.
In terms of the impact of medical insurance on health, most scholars believe that medical insurance lowers the relative price of medical services to a certain extent and improves the utilization rate of medical services, thus reducing the incidence of some diseases and significantly improving the health conditions of residents (Goldman et al.; Card et al.; Huang and Gan) [21][22][23]. However, some scholars believe that medical insurance significantly reduces the marginal cost of medical services, but the moral hazard will lead to the waste of medical resources, and the insured's health conditions do not improve significantly ( [24][25][26]. In terms of the impact of health expenditure on health, scholars believe that the allocation of health resources is highly dependent on the government's health expenditure, which can not only guarantee the public welfare of health services, but also improve the effective of health services and the health level of residents (Grossman; Farag et al.; Sun et al.) [27][28][29]. In the context of rapid supply-side reform of health service industry, social health expenditure develops rapidly and improves the effective supply of health service. As an important part of health financing, social health expenditure, like government health expenditure, has significant health output effect (Hu) [30].

Research on the Impact of Big Data on the Medical Service
Industry. At present, domestic and foreign researches on the effect of big data on the health output of the medical service industry are rarely involved, and existing relevant researches mainly focus on the following two aspects. e first one is about the impact of big data on the medical industry. Kayyali et al. studied the impact of big data on the medical industry in the United States and pointed out that many potential values of medical big data are gradually being discovered [31]. Kaggal et al. proposed that the lack of big data analysis technology and the openness of medical big data will seriously hinder the development of medical service industry [32]. Wang et al., taking Jiangsu Province as an example, pointed out that the establishment of big data sharing mechanism for health care is the key [33].
e other one is about the processing technology and risk of medical big data. Raghupathi and Raghupathi pointed out that the biggest difference between the application architecture of medical big data and the traditional medical information architecture lies in distributed computing and information storage [34]. Guo et al. proposed that the service based on medical big data involves the benefit distribution and responsibility sharing of all participants, which may lead to new problems such as the definition of medical responsibility and the prevention of medical accidents [35]. Huang studied the impact of big data technology on medical service system and expounded the innovation and risks that big data technology may bring from the aspects of knowledge, organization, rules, and culture [36].
Scholars at home and abroad mainly focus on the connotation, development model, agglomeration mechanism of the service industry, and the application of big data in the medical field and have made many fruitful research results. However, there are few literatures on the effect of big data on health output of medical service industry agglomeration. Based on the theory of agglomeration economy, this paper uses provincial panel data of China's medical service industry to estimate the degree of agglomeration of medical service industry and conducts an empirical study on the influence mechanism of agglomeration of medical service industry on residents' health under the background of big data.

Influence of Medical Service Industry Cluster on Residents'
Health. With accelerating trend of urbanization and population aging, the health of residents is threatened of infectious and chronic diseases, so the demand for medical and health services is increasing. Due to the particularity of the medical service market, information asymmetry, and the contradiction between the supply and demand of medical service, the Chinese government implements a relatively strict supervision policy on the medical service market. In the stage of high-quality economic development, the supply of medical services is difficult to meet the diversified and multilevel medical needs. It is too difficult and expensive to see a doctor in China. With the deepening of marketization and reform in the property rights of public hospital, the development of China's medical service industry is characterized by insufficiency and imbalance, and urban-rural gap and regional imbalance are increasingly significant. At the same time, China's medical service industry shows agglomeration phenomenon, which is reflected in medical funds that continue to gather in the eastern coastal areas. e medical service industry cluster has brought about lots of high-quality medical resources, which is conducive to exerting scale effect, improving the accessibility of highquality medical services for residents, and thus improving the health level of residents.
H1: medical service industry cluster has a positive impact on residents' health level.

Influence of Medical Service Industry Cluster on
Residents' Health

Influence of Medical Resources on the Agglomeration of the Medical Service Industry.
According to the theory of industrial cluster, industrial economic activities tend to be concentrated in areas with rich sensitive resources. e medical service industry cluster is less affected by natural resource endowment but more affected by high-quality talents. If the medical service industry enterprises in a region attract more high-quality medical technical talents, the transaction cost of the enterprises will decrease and the market scale will expand rapidly, which will further promote the medical service industry cluster and form the accumulation cycle of agglomeration. At the same time, highquality medical technical personnel represent the innovation ability of the region and can produce significant knowledge spillover effect, which will further promote the agglomeration of medical service industry. In China's first-tier cities, medical service enterprises often have the country's firstclass medical technical personnel and the most advanced medical service facilities. Lv pointed out that China's medical resources show a polarization, on the one hand, between rural and urban areas and, on the other hand, between major hospitals in cities and grass-roots health organizations [37]. Major hospitals in big cities have far more medical resources than those in second-tier cities and other regions.

Mathematical Problems in Engineering 3
H2: medical resources have a positive impact on the medical service industry cluster and the health of residents.

Influence of Resident Income on the Medical Service
Industry Cluster. Residents' income directly determines their material living standard. In general, the higher the disposable income is, the more the residents spend on nutrition, exercise, physical examination, and health care, which means that the increase of disposable income can bring more expenditure on medical service and more demand for medical service, thus affecting the agglomeration of medical service industry. Zhang proposed the income health effect and the income effect [38]. Income health effect means the health status of high-income people may be relatively good, and the medical expenditure of high-income people is less when other conditions are similar, while income effect means medical consumption is a normal, and the higher the income is, the more medical services and the more medical expenditure they spend. e income gap also affects the medical services cluster. On the one hand, the income gap leads to the difference in the demand for medical services between the rich and the poor. e decrease in the investment and expenditure of public goods affects the medical service industry cluster. Krugman pointed out that large differences in people's preferences would lead to the undervaluation of public goods, which would lead to the reduction of public expenditure and increase the difficulty in the implementation of public policies [39]. On the other hand, according to the demand theorem, the increase in income leads to the increase in the demand for medical services, but high-income people take up more medical resources, which will not only reduce the accessibility of medical services for low-income people, but also allocate medical resources to the place with the greatest profit in the market price system. Wei and Gustafsson pointed out that the insufficient public medical expenditure would lead to the higher-income class finding it easier to access medical services. e widening income gap eventually leads to better medical services concentrate in big cities [40].
H3: residents' income has a positive impact on medical service industry cluster and has a positive impact on residents' health.

Empirical Model.
According to Grossman's theory of healthy production, health is a commodity jointly produced by a series of factors such as education, income, medical service, and lifestyle. In order to examine the impact of the market structure of the medical service industry on health, this paper takes the agglomeration degree of the medical service industry as an input factor and sets the static panel model as follows: where Health i,t , an explanatory variable, is the health level of population. LQ i,t , a core explanatory variable, is the degree of agglomeration of medical services. X i,t is a control variable. α is constant term. μ i is region effect. ε i,t is random perturbed variable. e relationship between residents' health and the concentration of medical service industry is complex, and the current health level of residents may be affected by the health level of the previous period. In this paper, residents' health level with a lag of one stage is taken as an explanatory variable, and the dynamic panel model is established as follows: where, Health i,t−1 is 1 period lag in the health level of population. e other variables are the same.

Explained Variable.
e explained variable is the level of health of residents (Health i,t ). e UN's Millennium Development Goals are to reduce child mortality, improve maternal health, and combat HIV and other diseases. Due to the availability of data, Maternal Mortality Rate (Mmr) is chosen to reflect the health level of the residents. e maternal mortality rate is the number of maternal deaths per 100,000 live births during the year. e higher the value of this negative indicator is, the lower the health level of residents will be.

Explaining Variable.
We considered the degree of medical service industry cluster (LQ i,t ). is paper applies the location entropy method to measure the agglomeration degree of medical service industry in China. Location entropy was first proposed by Hargate, which is used in location analysis. Also known as the regional concentration index of production, it is the ratio of ratios. e calculation formula is as follows: where. LQ i,j,t represents the concentration degree of industry i in region j during period t. q i,j,t represents the employment number of industry i in region j during period t. q i,t represents the number of people employed in nationwide i industry during t period. q t represents the number of employed people in the whole country during period t. In this paper, i represents the employment in medical service industry, which consists of professional physicians, registered nurses, pharmacist, rural doctors, licensed (assistant) physicians, licensed physicians, registered nurses, pharmacists, and health workers. e characteristic variable of medical resources is represented by the number of beds in medical institutions per 1,000 population (Bper1000), which reflects the level of medical facilities and services in different regions.
Economic development variables include per capita disposable income and urbanization rate. e population proportion index method is generally adopted to measure the urbanization level, that is, the proportion of urban population in the total population. e demographic characteristics variable refers to the level of education. In this paper, the length of schooling (Edu) is used to indicate that illiteracy is 0 years, primary school education is 6 years, junior high school education is 9 years, senior high school education is 12 years, and junior college education and above are 16 years. In order to weaken the heteroscedasticity of the data, all variables were logarithmically processed. e balance panel data of 31 provincial regions in China from 2003 to 2017 were adopted in this paper. e data were obtained from China Statistical Yearbook and China Statistical Yearbook of Health and Family Planning. Table 1 shows the variables and descriptive statistical results in the empirical model. e minimum value and maximum value of Mmr of residents' health level are 1.1 and 399.1, indicating that there are large differences in residents' health level in the sample. e minimum value of LQ, the indicator variable of the agglomeration degree of medical service industry, is 0.53 and the maximum value is 3.19, indicating that there are significant differences in the agglomeration degree of medical service industry in various provinces of China. At the same time, per capita disposable income varies in a larger range, and the standard deviation is as high as 8226, which shows the regional economic development and its imbalance in China. e above problems also exist in the urbanization rate and the years of schooling (Edu), and the gap between the minimum and maximum values is also large.

Full Sample Regression of the Influence of the Medical Service Industry Cluster on the Health Level of Residents.
Regression methods of panel data generally include mixed regression, fixed effect regression, and random effect regression. Generally, F test is used to judge whether the empirical model should use mixed regression or fixed effect regression, while Hausman test is used to judge whether the empirical model should use fixed effect regression or random effect regression. e empirical test of the sample data shows that both the F value and the Hausman value show that the static panel model (1) should adopt the fixed effect regression method.
In order to avoid the result bias caused by the possible endogeneity problems between variables, instrumental variable method is further introduced for estimation. e estimation methods commonly used in the instrumental variable method are generalized moment estimation (GMM) and two-stage least square method (2SLS). e assumptions of generalized moment estimation (GMM) are more lenient, so it is not necessary to assume the same variance, but to use instrumental variables to estimate parameters, which can effectively solve the endogenous problem. System-GMM estimation method introduces the horizontal equation on the difference-GMM estimation method and takes the lag difference variable as the instrumental variable, which improves the effectiveness of the estimation results. In this paper, residents' health level lagged by one stage was added into the static panel model, and the dynamic panel model (2) above was constructed, and the robustness test was carried out by applying the system GMM estimation method. Specific regression results are shown in Table 2: e validity test of system GMM estimation mainly includes constraint test of overidentification (Sargan test or Hansen test) and sequence correlation test (AR(1) and AR (2)). e former is used to test the validity of instrumental variables in sample estimation. e latter is used to verify the existence of sequence correlation. e system GMM estimation method allows the existence of first-order difference sequence correlation, but it does not allow the existence of second-order difference sequence correlation. e regression results in Table 2 show that the Sargan test P value of the dynamic panel model (2) is 0.669, indicating that the selection of instrumental variables is effective, and the P value of the AR(2) test is 0.269, indicating that there is no second-order sequence correlation between the residuals after the first-order difference. erefore, it is reasonable to judge that the system GMM estimation method is adopted. e fixed effect regression in Table 2 is consistent with the sign of each coefficient in the system GMM regression, and individual coefficient estimates become more significant, which also indicates the robustness of the estimated results. According to the GMM estimation results of the system, the estimated coefficient of the agglomeration degree of the medical service industry is -0.056, which is significant at the 10% level. is result confirms hypothesis H1; the higher the agglomeration degree of the medical service industry, the lower the maternal mortality rate, indicating the higher the health level of residents. is is because the agglomeration of medical service industry has brought a large number of high-quality medical resources and advanced medical equipment, which contributes to a more effective allocation of medical resources and the realization of scale effect. e estimated coefficient of the number of beds in medical institutions per 1,000 population is −0.168, which is significant at the 1% level, indicating that the higher the number of beds in medical institutions per 1,000 population, the lower the maternal mortality rate. is result confirms hypothesis H2. e reasons are as follows. e more medical resources, the better the agglomeration of medical service industry and the better the accessibility of medical services and the lower the probability of residents not seeking medical treatment due to illness, which is conducive to improving the health level of residents. e estimated coefficient of per capita disposable income is −0.200, which is significant at the level of 5%, indicating that the higher the per capita disposable income is, the higher the residents' health level is, confirming hypothesis H3. is result shows that the higher the level of disposable income of residents, the greater the demand for medical services and the greater the expenditure on medical services, which promotes the agglomeration of medical services to a Mathematical Problems in Engineering certain extent, which confirms the above income effect. Urbanization rate has a positive influence on residents' health level, but it fails to pass the significance test. is may be due to the fact that, on the one hand, the acceleration of urbanization has brought an increase in the number of urban residents, and medical resources are gradually inclined to populated areas. e improvement of residents' access to medical services contributes to the improvement of their health level. On the other hand, urban work pressure and life pressure are relatively high, and residents may have a significantly higher incidence of disease and a lower level of health due to higher stress. e improvement of education level has significantly improved the health level of residents.
is is because the higher the education level of residents, the more initiative to obtain health information. At the same time, the improvement of health awareness is conducive to increasing the number of health checks, early detection, and early treatment of diseases. erefore, the higher the level of education, the greater the effect of healthy production function.

Regional Difference Analysis of the Influence of Medical
Service Cluster on Residents' Health Level. To investigate whether there are regional differences in the health output effect of medical service agglomeration, the whole sample was divided into eastern, central, and western regions. Table 3 shows the regression results of the influence effect of the agglomeration of medical service industry on the health level of residents in the subregion. F test value and Hausman test say the eastern, central, and western regions are suitable for the fixed effects regression model; to estimate the robustness of results, further to the eastern, central, and western regions, respectively, system GMM estimation, the sample data of AR (1) and AR (2), and Sargan test P values show that a systematic GMM estimation method is effective and feasible.
It can be seen from Table 3 that the agglomeration degree of medical service industry in eastern China has a positive impact on the health level of residents, but it fails the significance test. e agglomeration degree of medical service industry in central and western China significantly improves the health level of residents. As the economic level of eastern region is more developed, with leading medical services and diminishing the marginal effect of health, medical service industry cluster is not the key to improve the health level of residents. e economic level of central and western region is less developed, which the medical service level is relatively backward, so the medical service industry cluster brings an increasing marginal effect in health. e effect of the number of beds per 1,000 people in the eastern region on the health of residents did not pass the significance test, but in the central and western regions, the more the beds per 1,000 people in the medical institutions, the better the health of residents. e reason is that the eastern region has abundant medical resources, while the central and western regions are in short supply. e agglomeration of medical service industry in the central and western regions brings a larger scale effect of medical resources and plays a more important role in improving the health level of residents. e impact of per capita disposable income on residents' health in eastern, central, and western  Note: (1) FE represents fixed effect model, FE represents random effect model, and GMM represents system GMM estimation of dynamic panel model; (2) FE estimation and GMM estimation are both T value in brackets, and Re estimation is Z value in brackets; (2) symbols * , * * , and * * * , respectively, represent the significance level of 10%, 5%, and 1%. regions has a significant promoting effect, and high income contributes to the improvement of residents' health level, for the same reasons as in the full sample test. e promotion effect of urbanization rate on the residents' health in eastern regions is more significant than the central and western regions because in the eastern region, urbanization rate is higher, attracting a large number of population and increasing higher medical service demand. At the same time, the improvement of urbanization rate has brought more medical resources and the increase of the medical service accessibility to improve the health level of the residents. In eastern, central, and western regions, years of education have a significant effect on the health of residents, which is consistent with the full sample regression.

Conclusion
An outline for the "Healthy China 2030" initiative proposed that health is the basic condition for economic and social development. Previous studies have focused on the health output effect and comparative analysis of medical input, but few have discussed the health output effect of medical industrial structure. Based on Grossman's healthy production function, using China's provincial data from 2003 to 2017, this paper studies the health output effect of medical service industry agglomeration with full sample regression and regional difference, drawing the following conclusions: (1) using fixed effects, random effects, and system GMM method to analyze the sample, the result showed that medical service significantly promoted the residents to improve the level of health. However, there are significant regional differences in the health output effect of medical service industry cluster. e medical industry cluster in central and western regions has significantly improved the health level of residents, while the effect in eastern regions has not passed the significance test.
(2) Considering other control variables, this paper found that the medical institution beds per thousand population to promote the health level of residents in the sample, central, and western regions of the country's samples passed the test of significance; personal disposable income and education level of the improvement of residents' health passed the test of significance in all samples and regional samples. While the health effects of urbanization rate in the national sample, central, and western regions sample are not by significance test, the eastern part of the effect is significant. Based on the empirical research results, this paper proposes the following policy recommendations.
First, strengthen the construction of medical and health big data application system and promote the aggregation of advantageous resources in the medical service industry. To remove data barriers, government should promote the data sharing, data mining, and application of health big data based on the regional population health information platform. Standardize the management of medical and health big data, give full play to the resource optimization function of big data technology, improve the coordination and comprehensiveness of the medical service industry, enhance the accessibility of high-quality medical resources, and promote the aggregation of advantageous resources of the medical service industry.
Second, use regional synergy and promote structure optimization of the medical industry. In central and western regions where the development of medical service industry is (2) symbols * , * * , and * * * , respectively, represent the significance level of 10%, 5%, and 1%. not comprehensive, it is necessary to play to the ability of medical industry cluster to optimize the allocation of medical resources and improve the efficiency, so as to strengthen the scale effect of medical service industry cluster. In eastern region, where medical resources are relatively abundant, the technical level should be improved, and the regional gradient transfer and radiation range should be promoted, so as to improve the health of the residents. ird, the government needs to strengthen policy guidance to adapt to local conditions and to intensify policy support in central and western areas, through preferential tax credit and financial policies to strengthen the supervision function of government, based on the goal of balanced development of medical resources among regions in China. At the same time, aided by private capital to improve the construction of medical facilities, we can improve the health of the residents and gradually narrow the development gap with the eastern region.

Data Availability
e data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare there are no conflicts of interest regarding the publication of this paper.