Properties Exploring and Information Mining in Consumer Community Network : A Case of Huawei Pollen

Substantial changes took place in the role of consumers in the supply chainwith the development of practices.They became creators from consumers of product values. More and more consumers express their consumption experiences by posting in network community. Consumer community network is an important place for feedback of product experiences and facilitating product innovation in future. Manufacturers can promote improvement and innovation of products by exploring effective information on the consumer community network, thus improving the experience level of consumers. Therefore, how to explore information in topics (posts) and their relationships becomes very important. Is it possible to describe the structure of consumer community network by complex network and explore information about products and consumers?There is important and positive significance to study the collaborative innovation in the supply chain in which consumers participate. In this paper, the consumer community network was constructed by Boolean retrieve programming and discussed in the methodology and empirical way based on the community data of Huawei P10/P10 Plus. In methodology, interaction difference and uniformity within consumer community were explored by the density of isolated nodes and generalized variance of degree of network. In empirical studies, community network users were divided into ordinary user group, intermediary user group, and enterprise user group according to empirical data, and corresponding interaction networks were constructed. A contrastive analysis on the interaction of these three groups was carried out by combining the existing properties and innovative properties. Topics in each network were put in the order according to significance. Research conclusions have important significance to enrich the network analysis methods, explore the effective information in consumer community network, facilitate product improvement and innovations, and improve the experience level of consumers.


Introduction
The social, biological, physical, and technological networks often contain some interactive individuals, which make the complex network, the extension of graph theory, an edged tool to analyze internal structure and dynamic involutions of these networks [1][2][3].For example, Boolean network is the combination of the Boolean operation with network structure to solve difficult problems in biological area [4][5][6].However, interaction of individuals in the research system of social network services (SNS) [7] has become an important component for rapid high-efficiency propagation of information and discovery of key nodes in the studying networks [8].The academic circles often abstract the corresponding "nodes" and "edges" from the network data [9,10] and then construct the network model to analyze its topological properties, including average degree [11], density of graph [12], diameter of graph [13], eigenvector centrality [14], average clustering coefficient [15], etc.This network model not only is conducive to explore deep-layer information like key information propagation [16,17] and community structure [18,19] but also helps enterprises in consumer service management [20,21].
Nevertheless, with the rapid network development, the "Internet +" technologies that combine information technologies arouse the significant impacts of consumers on the market [22,23].More and more consumers are active Complexity in expressing their experiences of some bought products or giving purchase suggestions to others [24,25] in the consumer community formed by different media, including cell phone, PC, or PAD [26].Manufacturers explore these opinions or suggestions deeply for the purpose of product updating or improvement [27].Consequently, the role of consumers in the supply chain is changed substantially.They shifted from the consumer of product values into creators [28].In structure, these consumer communities are more like the derived structures of social network [29].Therefore, network community has become the important way for communication between enterprises and consumers and information mining [30].Hence, enterprises shall understand the immediate opinions of consumers in the consumer community, which is very important to develop potential products.
Among consumer electronics, cell phone has become the mobile computer in people's daily life and it is related to various living aspects of users [31].Moreover, the lifecycle of cell phone is going to be shortened to less than 2 years, which is attributed to the high replacement rate and frequent use [32].With the progress in informationalization, the brand community and consumer community are developed accordingly.Consumer experience and other information in these consumer communities facilitate the continuous improvement of cell phone in view of some perspective [33].Currently, Huawei is the leader in the Chinese smartphone market, followed by Xiaomi and OPPO.The market shares of these brands in the fourth quarter in 2017 reached 10.2%, 7.2%, and 6.9%, respectively [34].They all established their own official consumer communities to exhibit their product design philosophies and accept suggestions from consumers.For example, by May 2018, the number of the published posts on the Huawei Pollen Club about P10/P10 Plus reached 1,556,433 [35], the number of published posts on Huawei P20 reached 261,691 [36], and the number of published posts on Huawei Mate10 reached 1,293,712 [37].These posts covered tremendous product information and experiences of consumers [38].
Considering the extreme importance of consumers to manufacturers, what is the structure of consumers in the web community?What cell phone topics are different users concerned with in the community?Which connections are there between different topics and how strong such connection is?What characteristics are there in the changes of topics as time goes on?
On this basis, this paper is going to explore data from the Huawei P10/P10 Plus community to address above problems mainly from methodologies and empirical studies.In methodology, interaction difference and uniformity among different consumer community networks as well as key time points of the network dynamics were explored by the density of isolated nodes, generalized variance of degree of network and node sequential emergence determination coefficient.In empirical studies, community network users were divided into Ordinary User Group (OUG), Intermediary User Group (IUG), and Enterprise User Group (EUG) according to empirical data, and corresponding interaction networks were constructed.A contrastive analysis on the interaction of these three groups was carried out by combining the existing properties and innovative properties.Topics in each network were put in the order according to significance.Secondly, the emergence law of cell phone topic lifecycle was analyzed by combining the theory of product lifecycle with node sequential emergence determination coefficient.
The remainder of this paper is organized as follows.Section 2 is literature review on existing research methods.Section 3 extracts topics and classifies users according to posting data of Huawei P10/P10 Plus users.Section 4 constructs the complex network models for three types of users, respectively.Some new properties, such as density of isolated nodes, generalized variance of degree of network, and node sequential emergence determination coefficient, are proposed.A statistical analysis is carried out by combining these new properties with the traditional statistical properties.Section 5 further analyzes "leaders" in networks and explores information like closeness and significance of topics.Section 6 elaborates enlightenments to enterprise management which are gained from empirical analysis.Finally, the corresponding sketch of methods is displayed in Figure 1.

Literature Review
There are rather more literature reports about the consumer interaction from 4 ways to explore the law behind it, which are stated below.

Consumer Interaction.
Many studies on consumer interaction have been reported worldwide.Georgi and Mink (2013) explored the positive impacts of interaction of electronic (online) consumers on performance of innovative enterprises [39].Smaliukiene et al. (2015) analyzed consumers' discussions of network construction in the online forum provided by suppliers when they studied the online tourism service, finding that consumer interaction was conducive to analyze procedures in global online tourism service departments effectively [40].Bruhn M et al. (2014) performed the online investigation of three virtual B2B brand communities and verified the positive effect of consumer interaction on brand loyalty by an empirical study [41].Millán et al. (2016) analyzed the impacts of consumer interaction on satisfaction to vocation by the fuzzy qualitative method, finding that quality, strength, value, and influence of consumer interaction are important conditions of vocation experience [42].Based on 821 samples, Wei et al. (2017) discussed the fundamental mechanism of influences of consumer interaction on experiences of participants.They reported that specialized knowledge communication and social emotional support during the consumer interaction are vital to the implementation of activities of service providers [43].Chen et al. (2011) discussed the influence of customer interaction on the relationship quality between service companies and customers by constructing a conceptual model and found that such relationship quality can be improved by improving the consumer interaction methods [44].However, it is easy to know that all above studies are mainly macroscopic analyses on consumers' behavior based on survey questionnaire but neglect the difference among different consumer community networks.Few scholars have discussed differences of interaction contents in the community brought by changes of product lifecycles.

Research Methods of Consumer Online Community.
From the view of methodology, there are mainly four methods in research of customer interactive behavior containing statistical methods, structure equation modelling, experiment and case study and complex network analysis, which are stated as below.
In method of statics, Oh., et al. (2015), classified the test subjects of 315 university students as three groups and conducted two-way ANOVA to test the hypotheses of the research model [45].Zollet and Back (2015) collected data from 138 firms in Switzerland and Germany and analyzed with multiple regression analysis [46].Khan et al. (2016) analyzed 1,922 brand posts from five different brands of a single product category in three different countries and used ordinary least square and hierarchical moderation regression to test the hypotheses [47].Nourikhah and Akbari (2016) used Bayesian data analysis with a generalized linear model (GLM) to estimate the overall satisfaction of the users in the form of the posterior distribution of opinions [48].Wan et al. (2016) introduced least squares support vector machine (LS-SVM) innovatively into the study on consumer electronics supply chains [23].These studies took consumers as a whole, then from the perspective of the supply chain or enterprises, analyzing consumers' interaction impact on supply chain or their features.However, consumer network is not a simple whole, but a complex structure, which meets the structure of the general complex network and has its own characteristics at the same time.
Many scholars introduce structure equation modelling method to study consumer behavior in online community.Shobeiri et al. (2014) used structural equation modelling based on EQS 6.1 to assess the measurement and structural models [49].Liou et al. (2015) adopted structural equation modeling to investigate the factors that influence users' use intentions regarding broadband television [50].Islam and Rahman (2017) analyzed the data using structure equation modelling through a questionnaire survey of 430 Facebook users [51].The structure equation modelling can explain features in customers interactive network; however it also ignores the structure of the customers communities which would leave out some detailed information like the important topics and customers.
In terms of experiment and case study method, Kilgour et al. (2015) employed depth interviews initially, followed by questionnaires, and then computer assisted content analysis was performed on 723 online media articles relating to social media marketing to identify semantic and conceptual relationships [52].McKechnie and Nath (2016) explored this issue in an online experiment with 273 subjects browsing 4 websites offering identical products but with variable levels of interactivity and personalization features [53].Chu et al. (2017) conducted two experiments to identify an effective communication strategy to facilitate social media marketing using a combination of communication facets such as frequency, direction, formality, and content [54].Firstly, in these papers, experiment and case studies were conducted within a confined condition, which means that the participants are easy to be interrupted by some other reasons.Secondly, participants and case study could not represent the whole interactive network to some extent.
In complex network analysis method, Chiang and Wang (2015) extended research on the interactive features of product-review networks by considering the out degree centralization, density, and microstructure of product-review networks [55].Li and Gu (2015) proposed an OSN link formation model from the perspective of user behavior, which reproduced degree distribution, clustering and degree correlation of OSN [56].Andersen and Mørch (2016) classified user types through social network statistical analysis and constructed "user-topic" hybrid network with user interaction analysis of user posts [57].Baumgartner and Peiper (2017) extended a novel method called stochastic block modeling to derive communities of cannabis consumers as part of a complex social network on Twitter [58].Liu et al. (2017) proposed a complex network model with reviews as nodes by calculating reviews topics with latent Dirichlet allocation model and topic similarities among reviews with Pearson similarity [59].These studies consider it from a complex network view, ignoring the statistics characteristics between the same type networks in different periods.In our paper, data were scrawled from club.huawei.com,which enable us to avoid interview effects [60] and some other possible negative influence accompanying survey research [61,62].Later we will clean the data and build complex model.

Data Crawling and Preprocessing
3.1.Data Source.For topic type, Huawei community, Xiaomi community, and OPPO community emphasize on different topics.For example, the OPPO community focuses on camera performance of the phone.Huawei P10/P10 Plus community has relatively more topics, covering hardware, software, system, appearance design, and even price.More importantly, the community has stronger data integrity and accessibility.Although the Xiaomi community has many topics, it only displays the latest data, which were not as comprehensive as that of Huawei P10/P10 Plus community since February 2017.Hence, post data in the Huawei P10/P10 Plus community were collected in this paper for information mining by complex network.
In this paper, post data in the Huawei Pollen Club (HPC), a consumer community formed by Huawei P10/P10 Plus, from February 8 th , 2017, to November 4 th , 2017, were collected [35].Members of the club participated in communication of relevant products after registration.In this club, consumers can raise questions and interact with others by replies, thus increasing understanding on Huawei products.On the other hand, Huawei can make responses in time, help them to solve ticklish questions, and explore problems that consumers are highly concerned according to consumers' information feedback in the community, thus enabling improvement of products during upgrading and increasing profits of the enterprise.

Initial Data Screening.
A total of 125,163 data were collected initially, covering titles and contents of posts (excluding replies), user name, user level, and publishing time.Since user browsing or reply was updating dynamically and generated continuously during data acquisition, it was inevitable to generate some repeated data.In this paper, the latest state of the same post was applied.After selected operation, 78,320 data were retained.Later, 824 invalid data of banned to post, banned to login and shielded data because of advertisement and unrelated information were further eliminated.Finally, 77,496 valid data were kept.

Data Analysis and Processing.
In this section, data were analyzed from extraction of hot topics and user classification, which prepares for the construction of weighted network model in Section 4.

Extraction of Topics.
Firstly, the core topics are extracted from what the users consider.Most of the data presented by users on the website are in the form of posts.It is necessary to extract the topics from the post in order to learn the needs of the users.
Through calculating the frequency of the topic combined with the features of phone via programming with Boolean operation to judge whether the topics occur in the post or not, 100 topics are selected (see Appendix A).After sorting the higher frequency ones, they are divided into three parts including system, software, and hardware, according to their feature, showed in Table 1.

Classification of Users.
In order to specify interaction and different topic focus within community, users of the HPC can be divided into three groups according to functions and roles [63][64][65][66][67][68], namely, OUG, IUG, and EUG.The OUG refers to users who bought Huawei products and registered in the HPC.The IUG refers to users who have received official training of Huawei and are willing to answer questions of other users.The EUG refers to the official enterprise employees, covering technicians, salesmen, and publicists.Level labels and meaning of each group are listed in Table 2.
A statistical calculation on posting frequencies of all users of each level showed in Table 2 was made (see Appendix B), getting proportions of posts of three user groups in Figure 2.
It can be seen from Figure 2 that 99% posts were published by ordinary users, indicating that OUG is the main force.However, it still cannot replace the key role of the rest two groups in the community.Hence, different models were constructed to the OUG, IUG, and EUG, respectively.

Weighted Network Analysis
In this part, this paper introduces complex network analysis method.The nodes denote 61 topics in Table 1, and if a user mentions two topics   and   , in a post title and text at the same time, it suggests that there is a close relationship between these two topics, which corresponds with an edge between nodes  and .This step is achieved by Boolean retrieve in programming.The weight of edges denotes the number of users.That forms undirected weighted network.Because different groups of users have different positions in the network, they play different roles.Therefore, this paper establishes networks according to users' groups.
4.1.Modeling of Networks.By using Gephi software, we get three groups of users' interaction networks, respectively, in Figures 3, 4, and 5.In the graphs, the nodes of the same color represent the same kind of community [69,70].The size of the nodes represents the eigenvector centrality, that is, the power to control other topics.The color of the edge represents the number of people who focus on two topics at the same time.The deeper the color (purple) is, the more the people who are concerned about the two topics are, which shows that these two topics have strong correlation.
The interaction network of OUG is shown in Figure 3 Topics are divided into five communities: "Taking pictures," "System update and battery," "Fingerprint unlock," "APP," and "Internet speed," which reflects the system problems, software problems, and hardware problems users are concerning.However, the connection between "System" and "Update" has the deepest color among all topics' edges, indicating the high frequency of simultaneous mention of these two topics by users.This implies that the cell phone problems might be brought by system updating.In addition, "WeChat" is closely related to the "APP" community and topics of other communities, indicating that "WeChat" is the core application of OUG.
It is easy to note that edges in the network have relatively uniform color, which implies that users concern extensive problems.Besides, the OUG often proposes their questions by posting in the community and make partial or complete effective answers to problems of other users.They have strong uncertainty.
The interaction network of IUG is shown in Figure 4.The network is divided into two communities: "System applications" and "Hardware."Although problems still involve system, software, and hardware of cell phone, the system and applications are divided into one community, indicating that the IUG can classify topics effectively.Compared with the OUG, the IUG is aware of problems that the OUG has not noticed.For instance, "Pattern" is just a periphery topic in the interaction network of OUG, but it is a core topic in the interaction network of IUG and highly related to other topics.
Compared with the interaction network of OUG, edge color in the interaction network of IUG is not uniform.Many edges have deep color, especially in the "System applications" community.The IUG associates key topics that users are discussing effectively according to users' questions and offer corresponding answers.They fulfill the responsibility of answering questions authorized by the Huawei community.
In Figure 5, the interaction network of EUG is also divided into 3 communities: "System updating," "Taking pictures," and "Software applications."In the "Taking pictures" community, edges between any two topics have relatively deeper color, indicating that the Huawei officials pay attentions to propagation of the camera performance of cell phones.This is because Huawei officials regularly encourage OUG to exhibit their own pictures.Moreover, the topic "System" is strongly correlated with other topics.
Obviously, the IUG answers questions of users and summarizes topics.Based on the IUG, the Huawei officials answer questions related to "System," "Upgrade," and "Update."They also answered the "WeChat" problems that users are concerned.In other words, the EUG can not only guide the discussion themes in the community by observing the OUG and IUG but also answer problems of the OUG accurately.
By comparing these three networks, three characteristics are recognized: (1) The number of hotspots of core topics increases gradually.The node size in networks represents the significance.Node size in the interaction network of OUG is more uniform than that in the interaction network of IUG, indicating that the OUG has more questions in both quantity and complexity.However, the IUG and EUG with experiences can explain topics specifically, thus increasing the number of core topics relatively.The concerned problems also present targeted variation.
(2) There are significant differences among different communities.The difference among different user groups is manifested by the number and members of communities.Just as definitions of IUG, it is mainly to classify problems of the OUG and give specific answers.Therefore, it only involves two communities.The EUG will cooperate with concerned points of the OUG and make corresponding guidance.Therefore, these two groups have similar number of communities.However, these two groups have certain differences in communities' members, which is caused by their different cognition degree on correlation degree of problems.
(3) The correlations of topics are significantly different.The OUG concerns all aspects of cell phone, because they have poor knowledge on roots of cell phone problems.Therefore, edges have relatively uniform color.In contrast, the IUG understands relevant problems of cell phone well.It highlights connections of different types of problems during reasonable standardizing of problems.The EUG is mainly to answer most questions of the OUG and propagate the system and unique camera performance of cell phone.Hence, only edges within these two communities are relatively deep.

Statistical Analysis of Networks.
From the former descriptive analysis of three networks, the difference between them will be quantified by using complex network properties: let G(V,E,W) be a nonempty weighted graph with || =  and|| = .= (  ) × is the adjacency matrix of G, in which   is 1 if node i and node j are connected and 0 otherwise.Similarly,   = (   ) × , is the weighted adjacency matrix of G, in which    denotes the weight of the edge between node i and node j.   represents the sum of the weight of the edges in G. Through comparing the statistical properties between the constructed networks and relative null models, which includes average degree, density of graph, average clustering coefficient, diameter of graph, modularity, and initiative ones, containing density of isolated nodes, generalized variance of degree of network, we can specify the information value of networks, in which the null model denotes (  ,   ,   ) with |  | = || and   =    .  is the he sum of the weight of the edges of null model.This paper use following statistical properties [69]: (1) Average degree: average degree, denoted ⟨⟩, describes the mean of all nodes in the network.In this paper it represents the average of topics' relative topics.(2) Density of graph: density of graph, , is the ratio of the existing number of the edges m to its maximum possible number of edges.We use it to detect the density of topics network.to the largest of all distance in the graph.The smaller   is, the more stable network would be.This property can describe the closeness of the topics.(4) Average clustering coefficient: if there are edges between each two of nodes i, j, and k, then it forms a triangle.Thus, average clustering coefficient C is defined as the ratio of such triangles in graph G.It can describe the local stability of network.(5) Modularity is a measure of the level or degree to which a network's communities may be separated and recombined, which is a commonly used criterion for determining the quality of network partitions.It can classify the topics according to their associations.Although they can describe the general features of different networks, it is still necessary to measure the following features of networks.The quantitative description of isolated topics and judgment rule of "leaders" network in weighted networks, which has nodes with special important status called "leaders" in it.The OUG network has no isolated topics; however, different circumstance occurs in the IUG and EUG, which means topics have different status among three groups.What is more, there are some researches on the judgment rule of "leader" network in unweighted networks but no one in weighted network like topics networks in this paper.Mining the "leaders" can guide EUG focus on important topics.If company can solve the timely, other problems would be modified.
Due to these demand for research, this paper proposes the following properties to dig out the properties of the networks.
Therefore, density of isolated nodes   is defined as the ratio of the number of isolated nodes   to n.That is, measures the connectivity of network, which means that the more connective network is, the smaller   it would be.Traditional network analysis based on good connectivity of graph, lacks of this property.It can explore the difference due to interaction level between user groups; as a result some topics become isolated ones.

Generalized Variance of Degree of Network.
Considering the nodes in undirected network, the generalized degree of each node is the sum of the weight of links between their neighbors.So the variance of all generalized degree in graph is the generalized variance of degree of network.It can judge whether G has "leaders" compared to the null model.We can use it to detect whether the network has "leaders" or not, as well as the uniformity of weight distribution.The thought of its definition comes from variance of degree of undirected unweighted graph [71]: if the degree of node i denotes   , However, if G is an undirected weighted graph, variance of degree ignores the weight of edges' impact on the uniformity of G.As a result, this paper defines generalized variance of degree: if   neighbors of node  denotes    , where    is the weight of link between node i and its  ℎ neighbor node, similarly the expectation of average of generalized degree is   () = ∑  =1    /, and generalized variance of degree in network can be computed as follows: This property is the general form of variance of degree, if G is undirected unweighted graph generalized variance of degree degenerates as variance of degree, through calculating ( 3) and ( 5).
Moreover generalized variance of degree of its relative null model (  ,   ,   ) is Var()  and the standard deviation of (  ,   ,   ) is [Var()  ].Since generalized variance of degree of (  ,   ,   ) matches the Z distribution, in Appendix C Z distribution is proved to approximate normal distribution.In this paper, "leader" network is defined as follows: if the generalized variance of degree of G is bigger than the "3 − " margin of null model (  ,   ,   ); that is, Var() > Mar() = [Var()  ]+ 3[Var()  ], or "autonomy" network otherwise.From this definition, "leader" network has significant nodes named "leaders" controlling other nodes and influencing the generalized variance of degree, however the importance of "autonomy" network is relatively even.
Var()  may change every time along with the different result of random construction.If the generalized variance of degree of  ℎ random result is Var()   , and there are N random graphs; then below forms From this equation, it shows that a large number of generalized variance of random networks can be computed 10 Complexity

Comparison Analysis.
The numerical result of 5 traditional properties showed in Section 4.2 (1)-( 5) and 2 initiative ones are displayed in Table 3: From OUG to IUG and EUG, in terms of the average degree, the relationships among topics decrease gradually.The density of them changes from great density to sparseness.Especially the proportion of from isolated topics and Figures 3-5, there is no isolated topic in OUG but an isolated topic "GPS" in IUG and three in EUG including "King Glory," "Anti-fingerprint oleophobic coating," and "Flash back."This shows that the IUG solves three problems of ordinary users through interaction, so three topics become isolated in the EUG network.However, IUG does not solve "GPS," so EUG makes relevant interpretation.In terms of diameter of graph, the OUG is more compact compared with other two networks, suggesting that users in OUG equally focus on topics not having a clear mind on their relation.The average clustering coefficient illustrates microstructure of three networks, because the IUG and EUG have content knowledge, reducing the number of unnecessary contact between topics.Modularity shows the rationality of the division of the communities of three networks.Generalized variance of degree of network indicates that all the three groups have some provocation opinions on the relations between the topics, some topics holding more attention compared with others.These topics with significant status are mined in Section 5.

Information Mining of (Leaders)
Fu et al. (2016) suggested that nodes, which hold great importance, having strong relationship with others in the network, are called "leaders" [72].This paper also judges whether there are "leaders" in three networks by computing properties results with their corresponding null models respectively.Moreover, "leaders" and closeness of topics are analyzed via eigenvector centrality method.

Existence of "
Leaders".Firstly, whether networks have "leaders" that are judged: 1000 random networks are established by Matlab programming according to each null model structure separately.The 6 properties of the 1000 random networks are as follows.
As it shows in Table 4, since the characteristics of the networks built in this article are:   > ( − 1)/2, which results in the mean average degree of all random networks corresponding to three groups are 60.By comparing the results between Tables 3 and 4, it is found that their properties have a significant difference.
If the generalized variance of degree of OUG network, IUG network, and EUG network is denoted, respectively, by Var  , Var  , Var  and the margin value of their corresponding null models is denoted by Mar  , Mar  , Mar  , the following results are obtained from computing: Var  = 7620144.613≫ Mar  = 4652.730,Var  = 265766.736≫ Mar  = 857.300,Var  = 15014.359≫ Mar  = 211.174.It shows that the generalized variance of degree of three networks is greater than "3 − " boundary of that of their null models.So the OUG network, IUG network, and EUG network are "leader" networks with significant "leaders."The "leaders" in these networks will be explored below.

Finding "Leaders" in Certain
Network.After certifying networks with "leaders," this section will dig them out and analyze the closeness of topics by eigenvector knowledge.
Iranzo (2016) analyzed the financial ability of village [73], so the importance of topics is also calculated by this method in this section.The concept of eigenvector centrality is that the importance of every node in network is associated with the number and quality (importance) of its neighbor nodes.
The results of eigenvector of maximum eigenvalue of three networks are calculated and normalized by Matlab, denoting    =   / ∑  =1   , shown in Appendix D. Fu et al. (2016) proposed that the top 10 percent of importance of all nodes are "leaders" [72] and there are 61 topics in this research, so the "leaders" of three networks are in Table 5.
From Appendix D and Table 5, it is obvious that there is a certain difference among the importance ranking of three networks topics and "leaders", which means that the interaction levels of users on website are different and cause the difference in core topics.Kendall coefficient test in nonparametric statistics are computed in Table 6 to see whether the rankings of topics importance of three groups are different or not.
From the results of Kendall coefficient, the overall consistency is relatively low.Moreover, in the comparisons between two among them, the consistency of ranking "OUG vs IUG" is lowest compared with other two pairs.And the highest consistency pair is "OUG vs EUG." From the results of P value in Table 6, the Kendall coefficient is reliable under 5% significant level.
As is analyzed in the former part, EUG need to combine the feedback of IUG, for example, system version test and solution to the problems, with taste of OUG to issue content, so it has a relatively high consistence with OUG in ranking.However, the classification of topics of IUG makes it different from others.
In computing eigenvectors centrality progress, we can also get the maximum eigenvalue  1 of corresponding networks, which means the interaction intensity of the network.If the maximum eigenvalue of the OUG network, IUG network, and EUG, respectively, is denoted by   ,   ,   , the values of them are calculated:   = 5651.53≫   = 1045.26>   = 267.81.From the maximum eigenvalues of the three networks, it is shown that the maximum eigenvalue gradually declines from the OUG to the IUG and then to the EUG.Because the OUG has 99% valid data, the maximum eigenvalue of the network is naturally large.However, the interaction effect of IUG network is better than that of the EUG network.This also proves that intermediary users have reduced the pressure of the EUG as the backbone.

Ranking Topics Based on Multiplex
Network.Importance of topics in all groups is analyzed.However, three networks have overlapping topics.To get all topics that users concerned, that is, to discover the overall "leaders," three networks are overlapped effectively in this section and analyzed from the perspective of multiplex.
Firstly, the multiplex network is designed as follows.The bottom layer is the interaction network of OUG, the middle layer is the interaction network of IUG, and the top layer is the interaction network of EUG.Same topics in two adjacent layers are connected, getting Figure 6.
In Figure 6, the pink nodes represent isolated topics in the corresponding layer and the grey nodes are interactive topics.Larger nodes (topic name) reflect higher degree.Clearly, the IUG fails to solve "GPS" and thereby generates the isolated topics.However, these topics are absent in the isolated topics of interaction network of EUG, indicating that EUG solves problems beyond the competence of OUG and IUG.Similarly, three isolated topics in the interaction network of EUG are absent in the interaction network of IUG, which reflects that these three topics are solved by the IUG and the EUG does not need to explain them.In brief, the IUG, the bridge between the OUG and the EUG serves as the "problem filter" well.They enhance the ability of users to solve problems through interaction and relieve pressure of customer service of enterprises.This also reflects that stimulating the interaction between the IUG and the OUG can bring consumers fast updating in product experience.Boccaletti et al. (2014) introduced the method of ranking of node importance based on multiplex network.If normalized eigenvector centrality of OUG network, IUG network, and EUG network is ( () 1 ,  () 2 , . . .,  ()  )  ( = 1, 2, 3), then the  ℎ ( = 1, 2, . . ., ) node importance is defined below [74]: By combining results in Appendix D and ( 7), the importance and its ranking of topics based on multiplex network are shown in Appendix E. "Leaders" in the multiplex network are "system," "memory," "camera," "data," "update," and "pixel."However, "Anti-fingerprint oleophobic coating," "King Glory," and "Fingerprint" are less correlated with other topics, indicating their less importance.These topics have certain difference in importance.

Conclusions
The consumer community network is explored in this paper by methodology and empirical study based on the data in Huawei P10/P10 Plus community.In methodology, interaction difference and uniformity within consumer community are explored by the density of isolated nodes and generalized variance of degree of network.In empirical studies, community network users are divided into OUG, IUG, and EUG according to empirical data and corresponding interaction networks are constructed.A contrastive analysis on these three interaction networks is carried out by combining the existing properties and innovative properties.Topics in each network are put in the order according to significance.
Based on above studies, we conclude that consumer community network is the important place that reflects product experiences and facilitates product innovation in future.Manufacturers can promote improvement and innovation of products by exploring effective information on the consumer community network, thus improving the experience level of consumers.On this basis, three strategies to improve information mining in consumer community networks are proposed: (1) Problems that users concerned are recognized by deep exploring and full understanding of post contents and themes as well as characteristics of cell phone.Problems could be classified reasonably (community division in the network) and core problems could be recognized by multiplex network, thus enabling to solve and guide users' problems in time.
(2) The IUG shall be encouraged and guided to improve the overall interaction performance in the community network.By analyzing the member structure of consumer community, the role of IUG as the bridge between OUG and EUG deserves attention.Enterprises encourage the IUG to interact with OUG and help them to solve problems.This can not only relieve pressure of enterprises in early counseling and late after-sale services but also guide users to improve self-management.Moreover, enterprise group users shall make use of the key role of IUG in development and test of new products, collecting effective feedbacks quickly and shortening the launch time of new products.

A. The Frequency of Initial Tocpics
See Table 7.

B. The Statistics of Valid Posts of Corresponding User and Group
See Table 8.

C. Norm Feature of Z Distribution
Here, we, respectively, build null models of OUG network, IUG network, and EUG network, according to which 1000 random graphs are generated.And Z statistic is built for general variance of degrees of network, which is approximated to normal distribution proved by Kolmogorov-Smirnov test method, so that the "leader" network is determined by "3−" boundary Mar.Firstly, Z statistics of general variance of degrees of network is defined as follows:  where Var()  is a random variable of general variance of degrees of random network, with [Var()  ] standard deviation of Var()  and   () the average value of Var()  .
We calculate the general variance of degrees of 1000 random graphs of the OUG network, IUG network, and EUG network.Figures 7-9 show the frequency distribution histogram.
In Figures 7-9, lines represent the normal distribution curve fitting according to the mean and standard deviation of frequency of general variance of degrees of network.And histograms of the three groups of corresponding null models are bell shaped.In order to test whether the distribution of general variance of degrees of null models conform to normal distribution, the K-S test method results are as in Table 9.
From Table 9, we can see that considering norm curve fitting of general variance of degrees' frequency, their    Based on the "3" principle in statistics, there is a significant difference between G and the corresponding null model, if Var() > Mar().As a result, the network G is a "leader" network with uneven importance nodes.

D. The Importance of Topics in Three Networks
See Table 10.

E. The Importance of Topics in Multiplex Netwok
See Table 11.

Data Availability
The Huawei P10/P10 Plus data used to support the findings of this study are available from the corresponding author upon request.

Figure 1 :
Figure 1: Sketch of the exploration study.

Figure 2 :
Figure 2: The ratio of posts of different groups.

Figure 6 :
Figure 6: Consumer network based on multiplex network.

Figure 7 :
Figure 7: Generalized variance of degree of network of null model of OUG.

Figure 8 :
Figure 8: Generalized variance of degree of network of null model of IUG.

Table 1 :
Classification of topics.
IUGHot fansActivating area atmosphere and eager to answer the questions of other usersExpert fansWilling to experience the latest products and ROM, positive feedback problems during use with good language organization, having enough time to participate in product evaluation, and enjoying taking pictures and reading experience Female fans Special female members dedicated to women's topicsInternal managerOn the basis of all Pollen member, an independent special user group with management authority Internal expert Application for internal test, an independent special user group, with members of internal test core groupPollen director of cityThe core link of regional Pollen fans and participating in Huawei's deep marketing decision in the region

Table 3 :
Statistical results of three networks.

Table 4 :
The results of null models.

Table 8 :
Related statistics of valid posts of corresponding user.

Table 10 :
The importance ranking and eigenvector corresponding to maximum eigenvalue of three networks.

Table 11 :
The importance and its ranking of topics based on multiplex network.