Online social networks appear to enrich our social life, which raises the question whether they remove cognitive constraints on human communication and improve human social capabilities. In this paper, we analyze the users' following and followed relationships based on the data of Sina Microblogging and reveal several structural properties of Sina Microblogging. Compared with real-life social networks, our results confirm some similar features. However, Sina Microblogging also shows its own specialties, such as hierarchical structure and degree disassortativity, which all mark a deviation from real-life social networks. The low cost of the online network forms a broader perspective, and the one-way link relationships make it easy to spread information, but the online social network does not make too much difference in the creation of strong interpersonal relationships. Finally, we describe the mechanisms for the formation of these characteristics and discuss the implications of these structural properties for the real-life social networks.
In the past decade, the online social network has made new opportunities for communication and has revolutionized many aspects of our lives. As a new kind of online social networks, the Microblogging, such as Twitter and Sina Microblogging, has been developing rapidly in recent years. The online social network provides people with a public network platform, meets needs of computer-mediated communications, and rebuilds social connections. The structure and evolution of online social network have attracted the attention of researchers in different disciplines, including sociology, physics, and computer information technology [
As for the mapping and expansion of the actual social networks on the Internet, online social networks have broken the spatiotemporal limitations and decreased communication cost. Will the online social networks change the rules of actual social networks? The following studies reach fundamentally different conclusions: Backstrom and Boldi [
Has online social network changed our real-life social pattern? Why do such discrepancies exist between the studies? The differences are due to the different analytical content and analysis approach. The low cost of the online network forms a broader perspective, and the one-way link relationships make it easy to spread information, but the online social networks do not make too much difference in the creation of strong interpersonal relationships. These online social relationships, which may reflect the real-life social networks, create an unprecedented field to understand the characteristics of human networks [
In this paper, we study the topological characteristics of Sina Microblogging and try to explain its characteristics formation mechanism by comparing it with the real-life social networks. This paper is organized as follows. Section
In August 2009, Sina Microblogging began its trial service and became the most popular Microblogging service in China, with more than 500 million users as of 2013. Sina blog and Sina news provide good capital bases and natural advantages for the success of Sina Microblogging. Due to these advantages, Sina Microblogging developed the characteristics of We Media that spread news faster than any other media. In order to study the structure of Sina Microblogging, we develop a spider with Python for Sina Microblogging. The snowball [
We begin our analysis of Sina Microblogging from the following aspects. In Section
In Table
Summary of degree distribution.
|
Number of nodes | Proportion |
|
---|---|---|---|
|
783 | 93.3% |
|
|
50 | 6.00% | |
|
6 | 0.07% |
|
We begin the analysis of Sina Microblogging topology by looking at its degree distributions. Networks of a power-law degree distribution,
The degree distribution of Sina Microblogging is shown as in Figure
Degree distribution for nodes in Sina Microblogging.
The networks which meet a power-law distribution have scale-free property, and such networks are called scale-free networks. Therefore, Sina Microblogging is a scale-free network. The real-life social networks are usually scale-free networks. The scale-free network is caused by the growing and preferential mechanisms that the new nodes trend to connect with hub nodes, and this phenomenon is called the Matthew effect.
The study of “small-world” networks has become a key to understanding the societal structure, ever since Stanley Milgram’s famous “six degrees of separation” experiment. In his work, he reports that any two people could be connected on average within six hops from each other. Watts and Strogatz [
The concept of average path length for undirected networks is well known; the measures are related by
The average path length is closely connected to the characteristics of the network, such as connectivity, reachability, and transferring latency. A short average path length facilitates the quick transfer of information and reduces costs.
Another property of “small-world” networks is the clustering coefficient. It is the measure of the extent to which one's friends are also friends of each other. The local clustering coefficient for a node is then given by the proportion of links between the nodes within its neighborhood divided by the number of links that could possibly exist between them. The local clustering coefficient for undirected graphs can be defined as
The clustering coefficient for the whole network is given by Watts and Strogatz [
The networks with the largest possible average clustering coefficient are found to have a modular structure.
The results concerning average path length and clustering coefficient are displayed in Table
Small-World property compared with different social networks.
Social network | Average path length | Clustering coefficient |
---|---|---|
Sina Microblogging | 2.794 | 0.374 |
Random network | 4.162 | 0.006 |
Cyworld [ |
3.2 | 0.16 |
Renren [ |
3.48 | 0.2 |
Mixi [ |
5.53 | 0.1215 |
Co-author network [ |
4.6 | 0.066 |
Renren, Cyworld, and Mixi are the largest and oldest online social networking services in China, South Korea, and Japan, respectively. Coauthor network is a kind of real-life social network. By contrast, the average path length of Sina Microblogging has a shorter average path length and a greater clustering coefficient. It is revealed that Sina Microblogging’s small-world property is the most obvious. The striking small-world phenomenon indicates that there are rich local connections in Sina Microblogging network, the nodes are linked closely, and information dissemination is more efficient.
We analyze the mechanisms for the formation of small-world property from the following two aspects. First is the user’s behavior aspect. In 2011, for many users, the main reasons they use social network out there are the latest developments of friends (73.4%), keeping in touch with old friends (73.1%), and documentation of life and feeling (67.5%), and the main purposes of Microblogging users are getting information (58.1%), following celebrities (57.6%), discussing the hot topics, and personal experience (52.3%), according to the data of “the user behavior research of SNS and microblogging in China.” The Microblogging users tend to use the Microblogging to record personal feelings, share the news, and find groups with similar interests, and the traditional SNS users show themselves, contact with friends in real life, and expand social circle on SNS. Consequently, the former are oriented to information exchange, but the latter stress interpersonal communication.
Second is the aspect of users’ link mode. The main difference between the Renren, Cyworld, and Mixi networks and Microblogging is the directed nature of Microblogging relationship. In Renren, Cyworld, and Mixi, a link represents a mutual agreement of a relationship, while on Sina Microblogging a user is not obligated to reciprocate followers by following them. Thus, a path from a user to another may follow different hops or not exist in the reverse direction. Both the internal links (strong ties) and one-way following links (weak ties) are in Microblogging. And the Microblogging’s distinction between different types of interactions allows us to get more information: personal interactions are more likely to occur on internal links (strong ties) and events transmitting new information rely more on one-way following links (weak ties) [
In conclusion, Microblogging is easier to form the user centric We Media than traditional SNS; therefore, it possesses small-world property that can facilitate the flow of information.
In order to gauge the correlation between clustering coefficient and the degree, we plot the clustering coefficient of nodes
Correlations between clustering coefficients and degrees.
The structure is caused by Sina Microblogging’s fans mechanism. Basing on the user interest and the celebrity effect, Sina Microblogging builds many kinds of fans groups. Celebrities and professionals in a certain area who have more resources and influence will get more attention, and these users only keep internal links with peers or slightly lower level users and so forth until the hierarchical structure is formed.
Mixing patterns refer to systematic tendencies of one type of nodes in a network to connect to another type. There are three types of mixing patterns: assortative network, disassortative network, and neutral network. Similar vertices tend to connect to each other in assortativity network, and nodes of low degree are more likely to connect with nodes of high degree in disassortative network. A lot of empirical studies [
We cannot understand the mixing pattern intuitively, so we introduce the concept of excess average degree. The excess degree is the number of edges leaving the vertex other than the one we arrived along. This number is one less than the degrees of the vertices themselves. The excess average degree is defined as
The excess average degree is positive in assortative network and negative otherwise. Figure
Correlations between excess average degrees and degrees.
To better understand the meaning of disassortative network, we illustrate the network connections of the node ID 975 in Figure
The network connections of the node ID 975.
Social networks are discrete systems with a large amount of heterogeneity among nodes. Measures of centrality direct at a quantification of nodes' importance for structure and function. The most direct measure of centrality is the degree centrality; that is, the node with greater degree is the most important one. In addition, betweenness centrality is also a measure of node’s centrality in a network and can be used to measure the influence a node has over the spread of information through the network. It is equal to the number of the shortest paths from all vertices to all others that pass through that node.
In Figure
Correlations between node betweenness centralities and degrees.
In this paper, we have studied the structural properties of Microblogging ever created (839 active Sina Microblogging users and their 2112 social relations) from several viewpoints.
First of all, we have found a power-law distribution, a short average length, and a high clustering coefficient in its topology analysis, which are all compatible with known characteristics of other online social networks and real-life social networks. In order to illuminate the mechanisms for the formation of small-world property, we have studied the difference between Sina Microblogging and traditional social networks from the aspect of users’ behavior and the users’ link mode and found that Sina Microblogging can easily form the user centric We Media. Therefore, it possesses small-world property that can facilitate the flow of information. Then, we calculated the correlation between clustering coefficients and degrees and showed that Sina Microblogging has apparent hierarchical structure, and we have found that Sina Microblogging trends to be disassortative network, which all mark a deviation from real-life social networks. Moreover, we analyzed the betweenness centralities of intermediary nodes and confirmed that the intermediary nodes can control the spread of information. Last but not least, our work is only the first step towards exploring the difference between the online and real-life social networks. Much work still remains.
The authors declare that there is no conflict of interests regarding the publication of this paper.
This work was partly supported by the National Science Foundation of China under Grant no. 70903016 and the National Science & Technology Support Program under Grant no. 2012BAH81F03.