|
Journal of Zhejiang University SCIENCE C
ISSN 1869-1951(Print), 1869-196x(Online), Monthly
2014 Vol.15 No.2 P.81-90
Querying dynamic communities in online social networks
Abstract: Online social networks (OSNs) offer people the opportunity to join communities where they share a common interest or objective. This kind of community is useful for studying the human behavior, diffusion of information, and dynamics of groups. As the members of a community are always changing, an efficient solution is needed to query information in real time. This paper introduces the Follow Model to present the basic relationship between users in OSNs, and combines it with the MapReduce solution to develop new algorithms with parallel paradigms for querying. Two models for reverse relation and high-order relation of the users were implemented in the Hadoop system. Based on 75 GB message data and 26 GB relation network data from Twitter, a case study was realized using two dynamic discussion communities: #musicmonday and #beatcancer. The querying performance demonstrates that the new solution with the implementation in Hadoop significantly improves the ability to find useful information from OSNs.
Key words: Follow Model, Hadoop, MapReduce, Querying, Twitter
创新要点:在线社交网络的研究缺乏使用和方便的基础理论模型,粉丝模型(Follow Model)的建立,为研究动态群组查询和微博转发预测等提供有效的元模型。结合映射和化简(MapReduce)理念,本文算法为在线社交网络动态群组的查询,即大数据的动态查询,提供并行计算的实用性算法。
方法提亮:组成粉丝模型(Follow Model)的各类函数把微博用户关系简洁和准确地描述出来,同时具备以下三个特点:反对称与对称性、可扩展性和可组合性。这些特性的灵活应用,形成本文提出的两大类查询算法:反对称关系查询算法(reverse relation)和高阶关系查询算法(high-order relation)。
重要结论:本文研究在线社交网络,特别是Twitter和新浪微博平台的动态群组形成机理,提出描述用户间关系的逻辑模型,即粉丝模型。将此模型结合映射和化简理念,提出对这些动态群组信息查询的并行算法。特别是通过对Twitter平台内两个群组信息查询的实际检验,展示大数据环境下本文算法的有效性。
关键词组:
References:
Open peer comments: Debate/Discuss/Question/Opinion
<1>
DOI:
10.1631/jzus.C1300281
CLC number:
TP393.09
Download Full Text:
Downloaded:
3219
Download summary:
<Click Here>Downloaded:
2393Clicked:
8055
Cited:
2
On-line Access:
2024-08-27
Received:
2023-10-17
Revision Accepted:
2024-05-08
Crosschecked:
2014-01-15