Journal of Zhejiang University SCIENCE C 2012 Vol.13 No.11 P.828-839


Overlapping community detection combining content and link

Author(s):  Zhou-zhou He, Zhong-fei (Mark) Zhang, Philip S. Yu

Affiliation(s):  Zhejiang Provincial Key Laboratory of Information Network Technology, Department of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China

Corresponding email(s):  zju_hzz@zju.edu.cn, zhongfei@zju.edu.cn, psyu@uic.edu

Key Words:  Overlapping, Content, Link, Community detection

Zhou-zhou He, Zhong-fei (Mark) Zhang, Philip S. Yu. Overlapping community detection combining content and link[J]. Journal of Zhejiang University Science C, 2012, 13(11): 828-839.

Classic community detection assumes that communities are exclusive, in the sense of either soft clustering or hard clustering. Recent literature has recognized that many real-world problems violate this assumption, and overlapping community detection has therefore become an active research topic. Existing work on this topic uses either content or link information, but not both. In this paper, we address overlapping community detection by combining content and link information. We develop an effective solution called subgraph overlapping clustering (SOC) and evaluate it against several peer methods from the literature that use either content or link information alone. The evaluations demonstrate the effectiveness and promise of SOC on large-scale real datasets.

