Knowledge Graph Construction for Intelligent Media Based on Mobile Internet

Mobile Internet-based intelligent media has become a popular academic topic. This study uses the CiteSpace visualisation tool and Scientific Citation Index Expanded database to comb the existing research in the field of intelligent media from a quantitative perspective. A total of 7248 English papers were published on the topic of “intelligent media” from 2012 to 2021, and 145 highly cited papers refined were analysed. Scientific knowledge graphs were analysed from six dimensions: annual publication quantity, country of publication, institution of publication, author, keywords, and cited references. In the last 10 years, the research literature on intelligent media has been found to increase annually. Presently, the People’s Republic of China and the United States of America have a high proportion of documents in this field. Chinese universities and institutions have achieved significantly in terms of the quantity and quality of documents. From the perspective of the whole intelligent media discipline, the high-yield author group has not been formed, and there is minimal cooperation amongst authors. Popular intelligent media topics include film, social media, machine learning, swarming motility, data mining, and artificial intelligence. Subject words of the main research directions are event recognition, fake news, Cable News Network model, reconfigurable intelligent surface, comprehensive survey, microblog message, strain sensor, and traffic event. Combined with popular topics and time zone maps, the future research frontier in the field of smart media is identified.


Introduction
Presently, the development of mobile communication, artificial intelligence, intelligent terminal technology, and social media has resulted in some technologies penetrating the media industry, such as Internet of things (IoT), cloud computing, virtual augmented reality, intelligent hardware, and big data. The transformation of new media has led to the evolution and change of media forms, content production, distribution channels, user experience, and other fields. The media space dominated by people in the past has been substantially expanded, and intelligent media has gradually developed. With the support of mobile Internet and big data technology, meeting the personalised needs and customised requirements of users has become a possibility. The emer-gence of various media forms has resulted in intelligent media becoming an important development direction for media organisations to win against competitors. Additionally, changes in user behaviour have promoted the intelligent development of media.
In the last decade, the number of intelligent media research has been increasing, and the distribution of intelligent media research in different disciplines has been expanding. The analysis of the development trend of intelligent technology, influence of intelligent technology on intelligent media, media market environment, and future development trend of intelligent media has an important reference value for regulators to formulate intelligent media policies and for enterprises to choose the development path of intelligent media. Although the development of intelligent media has provided substantial convenience, it has also brought numerous social problems. This study conducts the necessary examination and reflection on previous academic research.

Literature Review
According to the search of scholars and Scientific Citation Index Expanded (SCIE) databases, research on intelligent media has mainly focused on such aspects as intelligent technology, personalised recommendation system, user and algorithm credibility, diversity of information environment, participation of intelligent technology in Internet information governance, information values, transparency, and algorithm accountability mechanism. This study selects highly cited articles in the aforementioned fields for analysis.
In the intelligent technology research, Pollack surveyed new technologies that incorporate artificial intelligence techniques to support older adults and help them cope with the changes of aging, particularly with cognitive decline [1]. Brooks et al. provided a detailed description of dataassisted approaches to building intelligent technologyenhanced learning systems, which focus on helping instructional experts discover insights into the teaching and learning process, and leveraged these insights as instructional interventions [2]. Marres evaluated an emerging paradigm for testing intelligent technology in society through the analysis of recent street trials of self-driving cars [3]. Jin et al. proposed the method of combining reward network and cross-entropy loss to guide the training of generators and produce high-quality music works [4]. Zhang et al. provided a computational intelligence method for different system levels, described the development direction of intelligent systems, and introduced the basic state-of-the-art concepts of data mining [5]. For intelligent systems, Liu et al. presented tactile sensors with excellent application prospects in numerous fields, such as wearable devices, medical treatment, artificial limbs, and robotics [6]. Intelligent technology has been widely used in the Industry 4.0 era, in which manufacturing will be via the Internet, to achieve internal and external network integration towards the intelligent direction [7]. Industry 4.0 strategy revolutionises manufacturing systems and processes and also leads to the formation of an intelligent supply chain [8].
In the aspect of a personalised recommendation system, numerous studies have discussed users' preference between personalised and nonpersonalised news. Sang et al. developed a recommendation system that enables bidirectional communication between users and systems using a utility range-based product recommendation algorithm to provide markedly dynamic and personalised recommendations [9]. The traditional recommendation system disregards the internal relationship between user preference and time. To address this issue, Cui et al. proposed a novel recommendation model based on time correlation coefficient and an improved K-means with cuckoo search (CSK-means) [10]. When the number of books provided by libraries is relatively high, users experience difficulty selecting the appropriate books from numerous candidate books. For this case, Tian et al. designed a personalised recommendation system for college libraries based on a hybrid recommendation algorithm [11]. According to the characteristics of each user group, Yan et al. proposed an environment prediction algorithm based on mobile user classification, and the algorithm has great application prospects in news recommendation [12]. According to statistics of users in 26 countries, although there are individual differences, users generally believe that the algorithm based on historical consumption behaviour is a better way of news recommendation than manual editing. Such variables as age, trust in news, attention to privacy, and use of mobile news have an impact on this preference [13]. Readers' attitude towards personalised news depends on the diversity of news recommendation systems and degree of care in the field of public news, rather than privacy and other factors [14]. Moreover, users' attitudes towards personalised news are heterogeneous. Young and poorly educated users favour personalised news and have minimal contact with diversified information services provided by public media.
With the help of computer technology, researchers have designed random, manual, and multiple personalised recommendation systems based on the principles of popularity, semantic filtering, and collaborative filtering to measure the impact of recommendation algorithms on content diversity [15]. Accordingly, the diversity of the algorithm recommendation has not been reduced compared with the nonpersonalised recommendation, and content topics recommended by personalised collaborative filtering systems are the most abundant. Nevertheless, the recommendation system continue to face the risk of gradual marginalisation of long-tail content, which should be countered by weighting minority content.
Scholars have determined that the algorithm formulated by dominant organisations can cause the entire industry to follow and reflect the power in the process of controlling the production and circulation of information and also demonstrate dominance by formulating industry norms [16]. Researchers are convinced that apart from emphasising the role of algorithms in constructing reality, people should also realise the function of algorithms as a relationship management mechanism between organisations and individuals [17]. As part of this organisational structure, algorithm systems can automate the management mechanism of leading organisations and also provide a common language or structure to redefine value and affect the legitimacy of other organisations and individuals [18].
In the research of transparency and algorithm accounting mechanisms, fake news remains a difficult problem for Internet users and scholars. The fake news research published by Science magazine has observed the proliferation of all verified real and fake news published on Twitter from 2006 to 2017 and found that the dissemination range of false information substantially exceeds that of real news regardless of theme [19]. Additionally, fake news has proven to have a powerful agenda-setting function [20]. Given the flood of false news and overstretched manual verification 2 Wireless Communications and Mobile Computing in the Internet information environment, intelligent technology has been widely used in identifying and communicating false news [21]. In 2018, Facebook announced changes to its algorithm, prioritising reliable news in response to questions on whether or not its algorithm promotes the social spread of spam [22]. To date, no systematic review has been undertaken of studies conducted in the field of "algorithmic accountability." Wieringa obtained a perspective from critical algorithm research and also provided an interdisciplinary overview using materials from data research to law and from computer science to governance research [23].

Materials, Methods, and Tools
3.1. Research Tools. This study is based on the research methods of "cocitation analysis" and "cooccurrence analysis" of bibliometrics and conducts visual analysis using Cite-Space software (version 5.8. R3). Mapping knowledge domain is drawn for the cited reference and citing articles after sorting. CiteSpace, which was developed by Chaomei Chen, is a freely available Java application for visualising and analysing trends and patterns in the scientific literature. The software measures the literature in specific fields to explore the key path and knowledge turning point of discipline evolution. Thereafter, this research draws a series of visual maps, analyses the potential dynamic mechanism of discipline evolution, and explores the frontier of discipline development. The design of the tool is based on the basic assumption that actual scientific knowledge is constantly changing, similar to the gradual obsolescence and emergence of old and new literature, respectively. The so-called scientific domain refers to any boundary range that can be customised at different levels of clustering, including a scientific topic, research field, or discipline. Knowledge domain is represented by scientific literature sample data sources from different fields. This domain can be used to analyse the potential knowledge contained in massive literature data and present its structure, law, and distribution in the form of a knowledge graph. CiteSpace can effectively help readers acquire an improved understanding of their research fields. It can not show the overall situation of a research field and also highlight some specific important documents in the development process. Specifically, CiteSpace helps answer the following questions: "Which literature is pioneering and iconic in a research field?," "Which literature plays a key role in the development of a research field?," "Which topics occupy the mainstream position in the whole research field?," "How are different research fields related to each other?," and "How did the research frontier based on a certain knowledge base evolve?". CiteSpace provides a new research perspective and a valuable scientific research tool.
This research mainly studies the development status and trend of intelligent media in the recent 10 years by analysing the six dimensions of annual publication quantity, country of publication, institution of publication, author, keywords, and cited references. CiteSpace map has two most direct foundations, namely, (1) disciplinary basis based on citation analysis and (2) technical basis based on information visualisation, thereby intuitively expressing the research results of this study. The current research reveals the main forces, evolution process, research hotspots and topic categories of intelligent media research in the past decade. Through scientific cooperation network, keyword symbiosis network, and keyword clustering analyses, this study reveals the main forces, evolution process, research hotspots, and theme categories of intelligent media research in the recent 10 years. This study uses the results of the knowledge map in predicting the possible development trend of intelligent media in the future.

Research Object and Data
Source. This study selects the core collection of Web of Science (WOS) as a data source.

Wireless Communications and Mobile Computing
According to the interpretation and analysis of the research literature in the field of intelligent media mentioned in the literature review, the relevant keywords in intelligent media have been preliminarily determined. This study searches keywords in the WOS database and attempts to unlock the keywords that can reflect the field in the WOS database. "Intelligent media" and "smart media" are used as search words for the trial check, and keywords related to the two fields of "artificial intelligence" and "media" are sorted through trial search. Combined with the interpretation of intelligent media by wireless communications and mobile computing, Li et al. [24] and Qian et al. [25] formulated a retrieval formula for subsequent trial retrieval for the research of key terms in the field of artificial intelligence and common media and enterprises using intelligent technology in social and economic activities. On this basis, the current study repeatedly attempts and adjusts the retrieval formula and browse and compare the retrieval results after each adjustment, excluding the content not related to the subject. Lastly, this study determines the retrieval formula as [TS = ((("adaptive system") OR ("machine intelligen * ") OR ("intelligent network") OR ("intelligent systhesis") OR     Wireless Communications and Mobile Computing ("ambient intelligen * ") OR ("brain model * ") OR ("virtual reality") OR ("augmented reality") OR ("cognitive system") OR ("deep learn * ") OR ("algorithm") OR ("expert system") OR ("formal logic * ") OR ("heuristic program") OR ("intelligent agent * ") OR ("local search * ") OR ("prediction model * ") OR ("logic program") OR ("reinforcement learning") OR (robot * ) OR ("image recogn * ") OR ("speech recogn * ") OR ("distributed comput * ") OR ("system Theor * ") OR ("cloud computing") OR ("fuzzy control * ") OR ("knowledge engineering") OR ("ubiquitous comput * ") OR ("intelligent system") OR ("agent-based model * ") OR ("multi-agent system") OR ("neural network") OR ("problem solv * ") OR (swarm * ) OR ("data proces * ") OR ("artificial intelligence") OR ("text minning") OR ("feature recogn * ") OR ("pattern analysis") OR ("intelligent creation") OR ("resource allocat * ") OR ("immersive experience")) AND (((((smart) OR (intell * )) AND ((film) OR (television) OR (audio) OR (radio)))) OR ("social media") OR ("mass media") OR ("news media") OR ("short video") OR (Youtube) OR (Blog) OR (facebook) OR (Twitter) OR (QQ) OR (wechat) OR (instagram) OR (TikTok) OR (vimeo) OR (matecafe) OR (dailymotion) OR (liveleak) OR (hulu) OR (viewster) OR (twitch) OR (netflix) OR (vube) OR (weibo) OR (zhihu) OR (bilibili)))]. The data set is limited to "SCIE," time is limited to "2012-2021," language is limited to "English," document types are limited to "article" and "review," retrieval time is October 10, 2021, and a total of 7248 records are obtained. Additionally, 145 highly cited papers are obtained through refining for comparison.

Results and Discussion
4.1. Analysis of Annual Publications. Statistics on annual publication quantity can show the research attention of scholars in intelligent media at different periods. This study used the method of metrological visualisation analysis to calculate the annual number of 7248 retrieved articles (see Figure 1). The result indicated that the number of relevant

Analysis of the Publication of Paper from a National
Perspective. The visual analysis of the author's issuing country by CiteSpace software indicates that the node type is country. Figure 2 shows that countries with over 100 starting documents are obtained. The national cooperation perspective indicates 132 nodes, 508 connections between nodes, and network density of 0.0588. Note that the current national cooperation in intelligent media research is relatively close. Figure 2 shows that the larger a country's name and the thicker the node ring, the more the total number of articles published by the country. The number of documents issued can also reflect a country's research strength and level in this field. The thicker the connection between nodes, the closer the cooperation between countries.
The People's Republic of China ranks first in the number of published papers, with 2473 papers. The United States of America (the USA) and India are second and third with 1529 and 554 articles, respectively (see Figure 3). Note that the People's Republic of China and the USA have high proportions of documents in this field. Presently, various countries are closely connected in intelligent media research.

Analysis of Productive Institutions.
The visual analysis of the document issuing organisation by CiteSpace software indicates that the node type is organisation. Organisations with over 40 published papers are obtained (see Figure 4). The cooccurrence graph of institutional cooperation shows 452 nodes, 921 connections between nodes, and network density of 0.009. Note that there is cooperation amongst institutions in intelligent media research, but such cooperation is insufficient. Figure 4 shows that the larger the name of organisations and the thicker the annual rings of the nodes representing the total number of documents issued by organisations reflect the research strength and level of these organisations in this field. The thicker the connection between nodes, the closer the cooperation between representative organisations.
The top three institutions are Chinese Academy of Sciences (CAS), Tsinghua University, and Beijing University of Posts and Telecommunications. A total of 28 institutions have a volume of at least 40 articles, 21 of which are Chinese universities and scientific research institutions and 4 are American universities (see Table 1). This result is consistent with the cooccurrence graph of the national document volume in the second part. Note that China's document volume is dominant in this field, and the cooperation between domestic institutions is relatively close.
A total of 145 highly cited documents obtained from refining were visually analysed based on CiteSpace software (see Figure 5). Amongst the highly cited documents, CAS has the most number of documents and has close cooperation with major universities and institutions locally and overseas.
CAS has the most number of papers, with a total of 12. There are 13 institutions with at least 3 articles, of which 7 are Chinese universities and scientific research institutions  Wireless Communications and Mobile Computing and 3 are American universities (see Table 2). Figure 5 and Table 2 show that CAS ranks first in intelligent media in terms of the number of publications and number of highly cited literature. The overall proportion of Chinese universities is also high. This result shows that China's research in intelligent media is at the forefront and has achieved good results in terms of quantity and quality.

Author
Analysis. The author's cooccurrence analysis can reflect the cooperative relationship between the core authors and researchers in this field. Price law points out that the author's calculation formula is as follows: where N max is the number of articles published by the author with the largest number of articles in a certain field. If the number of articles published by an author is not below N min , then the author is called a high-yield author. Table 3 shows that if the value N max is 21, then N min ≈ 3:432 is obtained and the integer is 4. Therefore, if there are at least 4 published papers, then the author is called a high-yield author.
According to the price law, only when the high-yield author group accounts for 50% of the total can the discipline high-yield author group be formed. After calculation, the number of high-yield authors in this field accounts for only 7.19% of the total number of papers, indicating that researchers in intelligent media are relatively scattered and have not formed a high-yield author group. Table 3 shows that the author with the most number of articles (21) is Wei Wang of Sichuan University, followed by Yingchang Liang (with 14) of the University of Electronic Science and Technology. Note that the high-yield authors began to publish more papers in intelligent media in 2016.
The graph shows 529 nodes, 422 connections between nodes, and network density of 0.003. Cooperation of researchers in this field is not close (see Figure 6). Particularly, teams with relatively more papers and outstanding cooperation have two cooperative relationships (1)   CiteSpace software was used in conducting cooccurrence analysis of the keywords of 7248 retrieved literature and removing words irrelevant to the research field (see Figure 7). Particularly, there are 526 nodes 689 connections  Nanyang Technol Univ 6 6 Dalian Univ Technol 5 7 Peking Univ 3 8 Arizona State Univ 3 9 Huawei France R&D 3 10 Univ Houston 3 11 Carleton Univ 3 12 Tsinghua Univ 3 13 Beijing Univ Chem Technol 3 7 Wireless Communications and Mobile Computing between nodes and network density is 0.005. The larger the node and font size the higher the keyword usage frequency. The thicker the connection the closer the connection between the two keywords. The greater the centrality the more important the keywords. The general belief is that keywords with centrality above 0.1 are important and keywords with high frequency and great centrality can be used as research hotspots.
Keywords with frequency of over 40 are analysed (see Table 4). The frequency and centrality of film and communication are very high, indicating that the two keywords are more important. Related keywords include social media, twitter, neural network, 5G, sentient analysis, radio, and big data. CiteSpace conducts cluster analysis on high-frequency keywords in intelligent media. It sets the cluster number to 6 and obtains 6 clustering points about intelligent media (see Figure 8): film, social media, machine learning, swarming motility, data mining, and artificial intelligence. Each cluster is composed of closely related keywords. When the cluster S value is above 0.5, clustering is reasonable. When the clustering S value is above 0.7, clustering results are convincing. Given that the S values of the six clusters shown in Figure 8 are above 0.7, the clustering results of this time have a high degree of credibility.     Figure 9). The topic path map can reflect the cooccurrence relationship of keywords in a specific time range and the development context and research trend of a certain field. Figure 9 shows that the location of each circle represents the earliest publication year of an article in which the keyword is located, the thickness of the growth ring represents the heat of the research, the line between keywords represents the cooccurrence relationship, and the trend of the line represents the development of this research direction. Figure 9 shows a trend in the development of highfrequency keywords in intelligent media in the past decade from 2012 to 2021. The keyword with high frequency in 2013 is social media, which has continued to date.

Keyword Burst
Analysis. Emergent words are key terms in which keywords suddenly increase or their use frequency has increased significantly in a certain period. Mutation words can show some cutting-edge research fields of intelligent media research. Figure 10 shows that a total of 15 mutation words are obtained. Amongst them, the strongest is swarming mobility, with mutation intensity of 13.25. The first year of its emergence was 2012 and ended in 2017. Swarming motility has a longer time of existence, which was also the area of interest in the early years and   4.6. Document Cocitation Analysis. Cocitation analysis of references primarily is aimed at studying key documents and research hotspots in this field. Cocitation clustering reflects the popular topics and research directions of common concern amongst clustered documents. The cocitation time zone graph can reflect the evolution process of this field in the time dimension. CiteSpace conducted cooccurrence analysis and cluster analysis on literature cocitation (see Table 5 and Figure 11). References with citation frequency of at least 40 are concentrated in 2015-2016 from the time of publication, which is relatively early. In terms of content, it is mainly aimed at artistic intelligence, news, Cable News Network (CNN), and other fields. These cited references lay the foundation for the subsequent study of intelligent media.
Cluster analysis was conducted on the cited literature. The number of clusters in this cluster was set to 9, of which

12
Wireless Communications and Mobile Computing the No. 0 cluster point was electronic skin, which is inconsistent with the theme of "intelligent media" studied in this research. Therefore, it was removed. Lastly, eight cluster points on intelligent media references were obtained. Subject words of the main research directions in the recent 10 years are event recognition, fake news, CNN model, reconfigurable intelligent surface, comprehensive survey, microblog message, strain sensor, and traffic event (see Figure 12). According to CiteSpace, highly cited literature was analysed. If there is less literature in a time zone (see Figure 13), then there are less influential achievements in that time zone. By contrast, if there is more literature in a time zone, then numerous influential achievements have accumulated in that time zone. The connection of nodes between time zones indicates the inheritance of the research. Figure 13 shows that in the past 10 years, there were numerous literature in the three-year time zone from 2015 to 2017, which accumulated many influential achievements in this field and laid a certain research foundation for the subsequent research in intelligent media. Given that highly cited literature also involves time delay, papers in recent years have not formed a highly cited effect. With the passage of time, the results accumulated in recent years will continue for follow-up research, and high-quality papers will also be highly cited.

Conclusion
This research conducted a bibliometric analysis of SCIE retrieved journals from 2012 to 2021 and provided beneficial insights into document types, distribution of countries, regions and institutions, geographical distribution of authors, co-author network, and content analysis. The following conclusions can be drawn according to the development and analysis of research: (1) In the past decade, the number of papers in intelligent media has been increasing annually. Since 2017, the speed has accelerated. From the perspective of countries that issue papers, the People's Republic of China and the USA have high proportions of documents in this field. Presently, the research of various countries in intelligent media is closely connected (2) From the perspective of institutions that issue papers, CAS has published the most number of papers. However, in intelligent media research, cooperation amongst institutions is not close because intelligent media is a research topic that has only emerged in recent years. From the perspective of authors, researchers in intelligent media are relatively scattered and have yet to form a highyielding discipline writer group (3) In intelligent media research, a high-frequency word in the last decade is social media, which shows that intelligent media is closely related to social media. Sharing of social media is essential to meet social needs but not necessarily the needs of the target object. Intelligent media uses algorithm push, tap the needs of the audience, and push the information that meets the needs. Social media will eventually evolve into intelligent media (4) Judging from the increasing number of papers annually, the general trend is to promote the entry of intelligent technology to the media industry. The key is how to integrate with the existing media business forms or develop new media forms and platforms. According to the law of technological innovation, the best organisational form conducive to the promotion of innovative business is to establish new institutions, adopt new technologies and operate new businesses (5) Presently, the focus should be research on social issues caused by intelligent media. The privacy, addiction, and risk caused by intelligent media are also important factors affecting the development of intelligent media. Only by studying in advance and preparing scientifically can we prevent problems before they happen The research on intelligent media presents an interdisciplinary trend. The academic field of intelligent media comprises computer and communication scholars, as well as numerous scholars from communication, journalism, management, and philosophy. With the help of their professional knowledge and research perspective, they discuss the operation mechanism of algorithms, legal supervision, ethical risks and other issues. This aspect shows that systematic problems brought by intelligent media to information communication and even the entire society go beyond the scope of a single discipline and require the joint participation of multiple disciplines.

Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest
The authors declare no conflicts of interest.