Journal of Zhejiang University SCIENCE C

ISSN 1869-1951 (Print), 1869-196X (Online), Monthly

Querying dynamic communities in online social networks

Abstract: Online social networks (OSNs) offer people the opportunity to join communities in which they share a common interest or objective. Such communities are useful for studying human behavior, the diffusion of information, and group dynamics. Because the membership of a community changes constantly, an efficient solution is needed to query its information in real time. This paper introduces the Follow Model to represent the basic relationships between users in OSNs, and combines it with MapReduce to develop new parallel querying algorithms. Two models, one for the reverse relation and one for the high-order relation between users, were implemented in the Hadoop system. A case study was carried out on 75 GB of message data and 26 GB of relation network data from Twitter, using two dynamic discussion communities: #musicmonday and #beatcancer. The querying performance demonstrates that the new solution, as implemented in Hadoop, significantly improves the ability to find useful information in OSNs.

Key words: Follow Model, Hadoop, MapReduce, Querying, Twitter
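
As an illustration of the reverse relation mentioned in the abstract, the following is a minimal sketch, not the paper's implementation. It assumes the follow relation is stored as (follower, followee) pairs and that the reverse relation means listing the followers of each followee. The map, shuffle, and reduce phases are simulated in memory with plain Python rather than run on Hadoop, and the function names and sample users are hypothetical.

```python
from collections import defaultdict


def map_followee(edge):
    """Map step: turn a (follower, followee) pair into a (followee, follower) record."""
    follower, followee = edge
    yield followee, follower


def reduce_follower(followee, followers):
    """Reduce step: collect the sorted, de-duplicated followers of one followee."""
    return followee, sorted(set(followers))


def run_reverse_relation(edges):
    """Simulate map, shuffle, and reduce in memory (no Hadoop involved)."""
    groups = defaultdict(list)
    for edge in edges:
        for key, value in map_followee(edge):
            groups[key].append(value)  # shuffle: group values by key
    return dict(reduce_follower(k, v) for k, v in groups.items())


# Hypothetical follow edges: (follower, followee)
edges = [
    ("alice", "carol"),
    ("bob", "carol"),
    ("carol", "dave"),
    ("alice", "dave"),
]
followers_of = run_reverse_relation(edges)
print(followers_of)
```

Keying the intermediate records by followee lets each reducer work on one user independently, which is what makes this kind of query easy to parallelize.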

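Chinese Summary (translated): Querying dynamic communities in online social networks

Research objective: Dynamic communities in online social networks form online and in real time, with bursts of information and rapid propagation. Finding useful information inside these communities promptly in a big-data environment is a challenging task in this field. This paper adopts a logical model that describes user relationships (the Follow Model) and, combining it with the MapReduce concept, explores algorithms that implement the MapFollowee & ReduceFollower mechanism online in the Hadoop system.
Innovation: Research on online social networks lacks practical and convenient foundational theoretical models. The Follow Model provides an effective meta-model for studying problems such as dynamic community querying and microblog retweet prediction. Combined with the MapReduce concept, the algorithms in this paper offer practical parallel-computing algorithms for querying dynamic communities in online social networks, that is, for dynamic querying of big data.
Method highlights: The functions that make up the Follow Model describe the relationships between microblog users concisely and accurately, and have three properties: antisymmetry and symmetry, extensibility, and composability. Flexible application of these properties yields the two classes of query algorithms proposed in this paper: reverse relation query algorithms and high-order relation query algorithms.
Main conclusions: This paper studies how dynamic communities form in online social networks, particularly on the Twitter and Sina Weibo platforms, and proposes a logical model describing the relationships between users, namely the Follow Model. Combining this model with the MapReduce concept, it proposes parallel algorithms for querying information in these dynamic communities. A practical test that queries information from two communities on Twitter demonstrates the effectiveness of the algorithms in a big-data environment.

Key words (translated): Follow Model, Hadoop, MapReduce, Information querying, Twitter microblog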
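
The composability property and the high-order relation described in the summary can be illustrated with a small sketch. It is written under stated assumptions and is not the paper's algorithm: it takes "high-order" to mean composing the follow relation with itself (second order: the followees of a user's followees), and it simulates a reduce-side join in memory with plain Python instead of Hadoop. All function names and sample users are hypothetical.

```python
from collections import defaultdict


def map_edge(edge):
    """Map step: emit each (follower, followee) edge under both endpoints, tagged by direction."""
    follower, followee = edge
    yield followee, ("in", follower)   # follower points into the key user
    yield follower, ("out", followee)  # key user points out to followee


def reduce_join(middle, tagged):
    """Reduce step: join edges that meet at `middle`, giving source -> middle -> target paths."""
    sources = {user for tag, user in tagged if tag == "in"}
    targets = {user for tag, user in tagged if tag == "out"}
    for source in sources:
        for target in targets:
            if source != target:  # skip paths that lead back to the source
                yield source, target


def second_order_followees(edges):
    """Simulate map, shuffle, and reduce in memory (no Hadoop involved)."""
    groups = defaultdict(list)
    for edge in edges:
        for key, value in map_edge(edge):
            groups[key].append(value)
    result = defaultdict(set)
    for middle, tagged in groups.items():
        for source, target in reduce_join(middle, tagged):
            result[source].add(target)
    return {user: sorted(found) for user, found in result.items()}


# Hypothetical follow edges: (follower, followee)
edges = [
    ("alice", "bob"),
    ("bob", "carol"),
    ("carol", "dave"),
    ("bob", "dave"),
]
two_hop = second_order_followees(edges)
print(two_hop)
```

Under this reading, higher orders could be reached by feeding the output pairs back in as one side of the join, at the cost of one extra pass per order.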



DOI: 10.1631/jzus.C1300281

CLC number: TP393.09


On-line Access: 2014-01-29

Received: 2013-10-08

Revision Accepted: 2013-12-22

Crosschecked: 2014-01-15

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE