The online chat group is a small-scale multiuser social networking platform, in which users participate in the discussions and send and receive information. Online chat group service providers are concerned about the number of active members because more active members means more advertising revenues. For the group owners and members, efficiency of information acquisition is the concern. So it is of great value to model these two indicators’ impacting factors. This paper deduces the mathematical models of the number of active members and efficiency of information acquisition and then conducts numerical experiment. The experimental results provide evidences about how to improve the number of active members and efficiency of information acquisition.
1. Introduction
With the development of online social networks, a kind of small-scale platform, the online chat group, appears. Online chat group is an extension of the social group on online social networks. Therefore, they have similarities. For example, scale of online chat group accords with “Dunbar’s Number” [1]; it gathers people by region, interest, or task, and there is a certain degree of social interaction or social relation among group members.
A central theme of researches on group is group effectiveness, which has been studied in the literatures. For instance, Reagans and Zuckerman’s research [2] claimed that high network density improves group effectiveness by enhancing coordination and trust among their members; Oh et al. [3] analyzed such influencing factors as group closure, bridging relationships, and current and past relationships on the social capital. They argued that greater group social capital leads to greater group effectiveness. As to the realistic interaction group, similar to online chat group, Dong et al. [4] discovered that higher turn-taking rate and backchannel rate increase the group effectiveness while higher turn competition rate and uneven turn transferring decrease the group effectiveness. Obviously, the yardsticks are as varied as the type of groups.
And distinctions exist between online chat groups and other types of groups. In contrast to groups in the physical offline world, online chat groups are organized in the virtual world, other than the form of sitting face-to-face. And some advantages are listed as follows [5]. (1) Members with various backgrounds can communicate without the geographical restrictions. (2) It is easy to communicate for information. (3) It eases the process of archiving and retrieving historical chats. (4) Chats are in public form and everybody gets the equality of speaking. Moreover, online chat group service providers are concerned about the number of active members because more active members means more advertising revenues. For the group owners and members, the information acquisition is the concern, whose importance for members of online groups has been underlined in literatures [5–7] as well. Finally, the online chat groups we are studying are the synchronous ones, while most related work about online groups, like Usenet newsgroups [8], Yahoo groups [9], Google groups [10], and so forth, referred to the asynchronous manner.
From the perspective of means, most studies of online group mainly focus on information mining, including group detection and topic detection. For group detection, Mutton [11], utilizing the timing characteristics of the chat data, proposed three heuristic rules to tap the communication relationship between users; Acar et al. [12] found smaller discussion groups from the chat room by tensor (user × keyword × time) analysis techniques, and in a recent study, an approach called multilayered edge clustering coefficient to detect groups from multilayered social network is developed [13]. As for topic detection, most technologies are text-based analysis such as Butterfly system [14] and ChatTrack system [15]. In fact, question of topic detection in group is nearly the same as that in other text-based environment. So the means in [16, 17] are suitable for online chat group as well.
In general, these studies employ methods based on user-generated data combined with data mining techniques, to develop some algorithms or systems. They are the basis of application. However, they do not directly reveal any related laws or characteristics of the effectiveness of the online chat group.
So this paper studies the effectiveness of online chat groups, and it proceeds as follows. Firstly, we give a description of the online chat group and make some assumptions. Secondly, we model group members’ information acquisition efficiency and the number of active members. At last, we conduct the numerical experiment to reveal the effect of the related parameters on the online chat group.
2. Basic Concepts, Assumptions, and Symbol Definitions
Because users join the newly created group with a purpose of participating in the group chat, all the group members are assumed to be active at the beginning. Each member has a certain number of topics of interest. As time goes, some members may be annoyed with the continuous useless topics and thus shield (i.e., ignore) messages from the group and become inactive. However, inactive members may convert from inactive state to active state after a length of time. Thus, every member is in either inactive or active state.
Before deducing the mathematical models, we give the following assumptions.
Assumption 1.
The total number of online members (including active members and inactive members) in a group keeps constant for the duration when we analyze the group. In other words, neither new members join the group nor old members drop out.
Assumption 2.
At any moment, the group has only one topic being discussed. That is, one topic must be stopped before another begins.
Assumption 3.
Each topic appears with the same probability.
Assumption 4.
Either active or inactive state duration of each member per cycle, experiencing two complete states as a cycle, is subjected to the exponential distribution.
Definitions of symbols used in the model are shown in the Tables 1 and 2.
Symbols of independent variables.
Symbol
Definition
N
Number of group members
mi
Number of topics that the ith member is interested in
ri
Number of uninteresting topics that the ith member can tolerate
1/β
Average duration of each group member’s inactive state per cycle
1/δ
Average duration of each topic
w
Average number of repeated interesting topics per member
Symbols of dependent variables.
Symbol
Definition
Na
Number of active members in a steady group
m
Average number of interesting topics per member
M
Number of topics in a group
1/α
Average duration of each group member’s active state per cycle
r
Average number of uninteresting topics that each member can tolerate
P
Appearance probability of interesting topics
s
Average number of interesting topics acquired within unit time by a member per cycle
si
Average number of interesting topics acquired by a member per cycle
su
Average number of uninteresting topics acquired by a member per cycle
μ
The ratio of interesting topics acquired by a member
3. Model of Information Acquisition Efficiency
Number of topics that the ith member is interested in is mi; so, average number of interesting topics per member is
(1)m=∑i=1NmiN.
Between any two members, they may have a common part of topics which they are interested in. For example, member A is interested in football and basketball, while member B is interested in table tennis and basketball. Thus, the repeated interesting topic to them is basketball.
Topics in a group are the union set of topics of interest to every member. In our model, the average number of repeated topics of interest to each member is expressed as w. N members group has (N-1)w repeated topics. So, excluding repeated interesting topics of each member, total number of topics in a group is
(2)M={Nm-(N-1)w,N≥10,N=0.
According to Assumption 3, the appearance probability of each topic is 1/M. So, to a member, the ratio of interesting topics among all topics is equivalent to the appearance probability of interesting topics. Consider
(3)P=mM=mNm-(N-1)w.
Average duration of each topic obeys exponential distribution. Assuming its parameter is δ, average duration of each topic is 1/δ.
Similar to the average number of interesting topics per member, average number of uninteresting topics that each member can tolerate is
(4)r=∑i=1NriN.
It is easy to derive the number of topics experienced by a member throughout one cycle, during which there are mx topics. During the cycle, any topic can be repeatedly discussed. The cycle will not be finished unless the number of uninteresting topics is more than ri. Therefore, mx can be positive infinity. In detail, the range of mx is [r,+∞). The probability distributions of the number of topics experienced by a member throughout one cycle are shown as follows:
(5)viPr=P(mx=r)=Cr-10(1-P)r,Pr+1=P(mx=r+1)=Cr-1P(1-P)r,Pr+2=P(mx=r+2)=Cr+12P2(1-P)r,⋮viPk=P(mx=k)=Ck-1k-rPk-r(1-P)r⋮
As shown above, average duration of each topic is 1/δ. If a member experiences i topics during one active state, the duration of the active state is i·1/δ, and the number of interesting topics experienced is i-r. Therefore, average number of interesting topics acquired within unit time by a member per cycle is
(6)s=∑i=r+∞Pii-ri(1/δ).
Then average number of interesting topics acquired by a member per cycle and uninteresting topics acquired by a member per cycle are
(7)si=∑i=r+∞Pi(i-r),su=r.
Finally, information acquisition efficiency can be defined as the ratio of interesting topics among topics acquired by a member, that is, average number of interesting topics acquired by a member per cycle/(average number of interesting topics acquired by a member per cycle + average number of uninteresting topics acquired by a member per cycle); namely,
(8)μ=sisi+su.
4. Model of Number of Active Members
In our model, average duration of each member’s active state per cycle is
(9)1α=∑i=r+∞Pi·i1δ.
According to Assumption 4, either active or inactive state duration of each member per cycle follows exponential distribution, parameters of which are α and β, respectively. Then the active state probability of each member is e-αt, and inactive state probability is e-βt. Let active state be represented by T, and let inactive state be represented by S. Its state changing process can be described by the Ehrenfest model [18], a birth-death process to explain the second law of thermodynamics. The model considers N particles in two containers. Particles independently change container at a rate λ. We consider particles to be the members of a group, and two containers represent two states. But in this case, birth rate and death rate are not necessarily equal. Thus, state transition diagram of each member can be shown as Figure 1.
State transition diagram of each member.
And stationary distribution of active member number follows binomial distribution, B(N,β/(α+β)) [19].
Therefore, when the group is steady, the expected number of active members is
(10)Na=Nβα+β.
5. Numerical Experiments and Results Analysis
We carried out comprehensive numerical experiments to disclose the effect on the number of active members and efficiency of information acquisition.
Let us set 1/δ=0.1 hr, 1/β=3 hr, r=10, and m=20. Figures 2 and 3 show the relationships of Na versus w and μ versus w, respectively.
Na versus w.
μ versus w.
When the average number of repeated interesting topics per member is small, both the number of active members in a steady group and the ratio of interesting topics acquired by a member are less affected. When the average number of repeated interesting topics per member is close to the average number of interesting topics per member, it greatly affects the number of active members of a group and the ratio of interesting topics acquired by a member.
Let us set N=50, r=10, m=20, and 1/δ=0.1 hr. Figure 4 shows the relationship of Na versus 1/β.
Na versus 1/β.
With the increase of the average duration of each group member’s inactive state per cycle, the number of active members in a group becomes less (as shown in Figure 4).
The ratio of interesting topics acquired by a member is unrelated to the average duration of each group member’s inactive state per cycle.
Let us set m=20, r=10, w=10, and 1/β=3 hr. Figures 5 and 6 show the relationships of Na versus 1/δ and s versus 1/δ, respectively.
Na versus 1/δ.
s versus 1/δ.
Longer average duration of each topic means less topics that the group members are not interested in. Then there will be more group members staying in active state. If the average duration of each topic was long enough, all the group members are active (as shown in Figure 5).
The ratio of interesting topics acquired by a member is unrelated to the average duration of each topic.
Longer average duration of each topic means less interesting topics acquired within unit time by a member per cycle (as shown in Figure 6).
Let us set 1/β=3 hr, m=20, 1/δ=0.1 hr, and w=10. Figures 7 and 8 show the relationships of Na versus r and s versus r, respectively.
Na versus r.
s versus r.
More uninteresting topics that each member can tolerate means longer inactive time of the group members. Then there will be more group members staying in the active state. If the average number of uninteresting topics that each member can tolerate was big enough, all the group members are active (as shown in Figure 7).
The ratio of interesting topics acquired by a member is unrelated to the average number of uninteresting topics that each member can tolerate.
The variation of the average number of interesting topics acquired within unit time by a member per cycle has the similar trend with the number of active members in a group (as shown in Figure 8).
Let us set 1/β=3 hr, w=10, 1/δ=0.1 hr, and r=10. Figures 9 and 10 show the relationships of Na versus m and μ versus m, respectively.
Na versus m.
μ versus m.
More interesting topics per member (while the number of the repeated topics is kept the same) means more diversity of interesting topics among members. Then the number of uninteresting topics will increase. It will cause that fewer group members remain in the active state. The ratio of interesting topics acquired by a member will also decrease. When the average number of interesting topics per member is large enough, the number of active members in a group and the ratio of interesting topics acquired by a member will be less affected.
The variation of the average number of interesting topics acquired within unit time by a member per cycle is the same with the ratio of interesting topics acquired by a member.
Let’s set 1/β=3, m=20, 1/δ=0.1, r=10. Figures 11, 12 and 13 show the relationships of 1/α versus N, Na versus N and μ versus N, respectively.
1/α versus N.
Na versus N.
μ versus N.
More group members means more active members in a group. More group members will give more topics and increase the ratio of uninteresting topics among all topics. That will make the average duration of each group member’s active state shorter (as shown in Figure 11).
Figure 12 shows that the number of active members of a group and the number of group members have incremental linear relationship.
To each group member, the number of his interesting topics being discussed is reduced. So the ratio of interesting topics acquired by a member is smaller (as shown in Figure 13).
The variation of the average number of interesting topics acquired within unit time by a member per cycle is the same with the ratio of interesting topics acquired by a member.
6. Conclusions
The main contribution of this paper is modeling of the number of active group members and efficiency of information acquisition. According to the models and their numerical experiments results, we have the following conclusions. (1) In the case that other conditions remain unchanged, more group members means more active members, but there will be a sharp decline in duration of each group member’s active state and members’ efficiency of information acquisition. Effectiveness of the group deteriorates instead, so it is not so good that too many members stay in a group. (2) For group service providers, to shorten the inactive duration can significantly improve the effectiveness of the group. Therefore, we recommend that the group service providers should routinely publish attractive contents to attract inactive members to return to the active state. (3) In order to ensure that the group has a higher activity, the group administrators should absorb members with more common topic. (4) In order to get more information, group members should improve their tolerance. As a result, group will be more active. (5) Too many topics of interest will lead the proposed topics to be more dispersed so that some group members lose interest in the group. Therefore, we recommend that the interest scope of a group should be set narrow.
This study may provide a reference value for the development of the group’s management and related studies. Besides, this study is only for the homogenous case. Namely, we assume that each group member has the same parameter or average value. Heterogeneous case is our future study.
Acknowledgments
This work was supported by a grant from the National Natural Science Foundation of China (61170215 and 61102074).
GonçalvesB.PerraN.VespignaniA.Modeling users' activity on Twitter networks: validation of Dunbar's numberhttp://arxiv.org/pdf/1105.5170v2.pdfReagansR.ZuckermanE. W.Networks, diversity, and productivity: the social capital of corporate R&D teams20011245025172-s2.0-0035620125OhH.LabiancaG.ChungM.-H.A multilevel model of group social capital20063135695822-s2.0-33745714911DongW.LepriB.KimT.PianesiF.PentlandA. P.Modeling conversational dynamics and performance in a social dilemma taskProceedings of the 5th International Symposium on Communications, Control and Signal Processing2012Rome, Italy14BudakC.AgrawalR.On participation in group chats on twitterProceedings of the International World Wide Web Conference2013Rio, BrazilBatemanP. J.GrayP. H.ButlerB. S.Community commitment: how affect, obligation, and necessity drive online behaviorsProceedings of the 6th International Conference on Intelligent Computing and Intelligent Systems2006Madison, Wis, USA63BrandtzægP. B.HeimJ.User loyalty and online communities: why members of online communities are not faithfulProceedings of the 2nd International Conference on Intelligent Technologies for Interactive Entertainment2008Cancun, MexicoArguelloJ.ButlerB.JoyceE.KrautR.LingK. S.WangX.Talk to me: foundations for successful individual-group interactions in online communitiesProceedings of the Conference on Computer-Human InteractionApril 2006Montreal, Canada9599682-s2.0-33745852290BackstromL.KumarR.MarlowC.NovakJ.TomkinsA.Preferential behavior in online groupsProceedings of the International Conference on Web Search and Data Mining (WSDM '08)February 2008Stanford, Calif, USA1171282-s2.0-4254909614510.1145/1341531.1341549HuffakerD. A.TengC.MatthewP. S.GongL.AdamicL. A.The impact of group attributes on communication activity and shared language in online communities20111642-s2.0-79958763407MuttonP.Inferring and visualizing social networks on internet relay chatProceedings of the 8th International Conference on Information Visualisation (IV '04)July 2004London, UK35432-s2.0-4644285038AcarE.̧amtepeS. A. C.KrishnamoorthyM. S.YenerB.Modeling and multiway analysis of chatroom tensorsProceedings of the IEEE International Conference on Intelligence and Security Informatics2005Atlanta, Ga, USA256268BródkaP.FilipowskiT.KazienkoP.An introduction to community detection in multi-layered social network201327818519010.1007/978-3-642-35879-1_23van DykeN. W.LiebermanH.MaesP.Butterfly: a conversation-finding agent for Internet relay chatProceedings of the 4th International Conference on Intelligent User Interfaces (UIST '99)November 1998Los Angeles, Calif, USA39412-s2.0-0032628729BengelJ.GauchS.MitturE.VijayaraghavanR.ChatTrack: chat room topic detection using text classificationProceedings of the 2nd Symposium on Intelligence and Security Informatics2004Tucson, Ariz, USA266277GuoX.XiangY.ChenQ.HuangZ.HaoY.LDA-based online topic detection using tensor factorization2013392111LiuS.ChangH.A topic detection and tracking system with TF-Density2013115120KarlinS.McgregorJ.Ehrenfest urn models19652235237610.2307/3212199KratO.SchaeferM.Mean passage times for tridiagonal transition matrices and a two-parameter Ehrenfest urn model199330496497010.2307/3214525