Journal of Zhejiang University

Frontiers of Information Technology & Electronic Engineering 2015 Vol.16 No.4 P.293-300

Overlap maximum matching ratio (OMMR): a new measure to evaluate overlaps of essential modules

Author(s): Xiao-xia Zhang, Qiang-hua Xiao, Bin Li, Sai Hu, Hui-jun Xiong, Bi-hai Zhao
Affiliation(s): 1. Department of Mathematics and Computer Science, Changsha University, Changsha 410003, China more
Corresponding email(s): zhangxx111@yeah.net, bihaizhao@163.com
Key Words: Protein-protein interaction network, Essential protein modules, Overlap, Overlap maximum matching ratio

Share this article to： More <<< Previous Article \|Next Article >>>

Xiao-xia Zhang, Qiang-hua Xiao, Bin Li, Sai Hu, Hui-jun Xiong, Bi-hai Zhao. Overlap maximum matching ratio (OMMR): a new measure to evaluate overlaps of essential modules[J]. Frontiers of Information Technology & Electronic Engineering, 2015, 16(4): 293-300.

@article{title="Overlap maximum matching ratio (OMMR): a new measure to evaluate overlaps of essential modules",
author="Xiao-xia Zhang, Qiang-hua Xiao, Bin Li, Sai Hu, Hui-jun Xiong, Bi-hai Zhao",
journal="Frontiers of Information Technology & Electronic Engineering",
volume="16",
number="4",
pages="293-300",
year="2015",
publisher="Zhejiang University Press & Springer",
doi="10.1631/FITEE.1400282"
}

%0 Journal Article
%T Overlap maximum matching ratio (OMMR): a new measure to evaluate overlaps of essential modules
%A Xiao-xia Zhang
%A Qiang-hua Xiao
%A Bin Li
%A Sai Hu
%A Hui-jun Xiong
%A Bi-hai Zhao
%J Frontiers of Information Technology & Electronic Engineering
%V 16
%N 4
%P 293-300
%@ 2095-9184
%D 2015
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.1400282

TY - JOUR
T1 - Overlap maximum matching ratio (OMMR): a new measure to evaluate overlaps of essential modules
A1 - Xiao-xia Zhang
A1 - Qiang-hua Xiao
A1 - Bin Li
A1 - Sai Hu
A1 - Hui-jun Xiong
A1 - Bi-hai Zhao
J0 - Frontiers of Information Technology & Electronic Engineering
VL - 16
IS - 4
SP - 293
EP - 300
%@ 2095-9184
Y1 - 2015
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/FITEE.1400282

Abstract
Chinese Summary
Academic Network
Reviewer Comment

Abstract: Protein complexes are the basic units of macro-molecular organizations and help us to understand the cell’s mechanism. The development of the yeast two-hybrid, tandem affinity purification, and mass spectrometry high-throughput proteomic techniques supplies a large amount of protein-protein interaction data, which make it possible to predict overlapping complexes through computational methods. Research shows that overlapping complexes can contribute to identifying essential proteins, which are necessary for the organism to survive and reproduce, and for life’s activities. Scholars pay more attention to the evaluation of protein complexes. However, few of them focus on predicted overlaps. In this paper, an evaluation criterion called overlap maximum matching ratio (OMMR) is proposed to analyze the similarity between the identified overlaps and the benchmark overlap modules. Comparison of essential proteins and gene ontology (GO) analysis are also used to assess the quality of overlaps. We perform a comprehensive comparison of serveral overlapping complexes prediction approaches, using three yeast protein-protein interaction (PPI) networks. We focus on the analysis of overlaps identified by these algorithms. Experimental results indicate the important of overlaps and reveal the relationship between overlaps and identification of essential proteins.

In this paper, authors propose a new measure, namely OMMR, to evaluate the quality of overlaps of protein complexes identified. In general, unlike previous works that only focus on the overlapping between two protein complexes, this new measure targets to compute the overall overlapping score for all protein complexes identified.

OMMR:一种关键模块重叠部分评价指标

目的：设计蛋白质复合物或功能模块重叠部分评价指标。
创新点：考虑到蛋白质复合物重叠部分与关键蛋白质识别之间存在紧密联系，首次提出关键模块评价指标—重叠最大匹配率(OMMR)，可用于评价挖掘的具有重叠部分的功能模块算法优劣，从而进一步服务于关键蛋白质的识别。
方法：首先，通过Benchmark集分析，得到参考Overlap集合。然后得到功能模块预测算法得到的复合物集合的overlap集合，利用公式（5）得到该预测算法的OMMR值。
结论：重叠蛋白复合物，尤其是它们的重叠部分，在识别必要性蛋白中发挥重要作用。本文提出名为OMMR的方法来评估必要性模块的重叠部分。实验结果表明重叠部分的重要性，并揭示重叠部分与关键性蛋白识别之间关系。

关键词：蛋白质相互作用网络；关键模块；重叠部分；OMMR

Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article

Reference

[1]Adamcsek, B., Palla, G., Farkas, I.J., et al., 2006. CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics, 22(8):1021-1023.

[2]Bader, G.D., Hogue, C.W.V., 2003. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform., 4:2.1-2.27.

[3]Boyle, E.I., Weng, S., Gollub, J., et al., 2004. GO::TermFinder—open source software for accessing gene ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes. Bioinformatics, 20(18):3710-3715.

[4]Chen, B., Shi, J., Zhang, S., et al., 2013. Identifying protein complexes in protein-protein interaction networks by using clique seeds and graph entropy. Proteomics, 13(2):269-277.

[5]Cherry, J.M., Adler, C., Ball, C., et al., 1998. SGD: Saccharomyces Genome Database. Nucl. Acids Res., 26(1):73-79.

[6]Dezső, Z., Oltvai, Z.N., Barabási, A.L., 2003. Bioinformatics analysis of experimentally determined protein complexes in the yeast Saccharomyces cerevisiae. Genome Res., 13:2450-2454.

[7]Enright, A.J., van Dongen, S., Ouzounis, C.A., 2002. An efficient algorithm for large-scale detection of protein families. Nucl. Acids Res., 30(7):1575-1584.

[8]Gavin, A.C., Aloy, P., Grandi, P., et al., 2006. Proteome survey reveals modularity of the yeast cell machinery. Nature, 440:631-636.

[9]Han, J.D., Bertin, N., Hao, T., et al., 2004. Evidence for dynamically organized modularity in the yeast protein– protein interaction network. Nature, 430:88-93.

[10]Hart, G.T., Lee, I., Marcotte, E.M., 2007. A high-accuracy consensus map of yeast protein complexes reveals modular nature of gene essentiality. BMC Bioinform., 8:236.1-236.11.

[11]Hu, H., Yan, X., Huang, Y., et al., 2005. Mining coherent dense subgraphs across massive biological networks for functional discovery. Bioinformatics, 21(suppl 1):i213-i221.

[12]Jiang, P., Singh, M., 2010. SPICi: a fast clustering algorithm for large biological networks. Bioinformatics, 26(8):1105-1111.

[13]Krogan, N., Cagney, G., Yu, H., et al., 2006. Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature, 440:637-643.

[14]Lei, X., Wu, S., Ge, L., et al., 2013. Clustering and overlapping modules detection in PPI network based on IBFO. Proteomics, 13(2):278-290.

[15]Leung, H.C.M., Xiang, Q., Yiu, S.M., et al., 2009. Predicting protein complexes from PPI data: a core-attachment approach. J. Comput. Biol., 16(2):133-144.

[16]Liu, G., Wong, L., Chua, H.N., 2009. Complex discovery from weighted PPI networks. Bioinformatics, 25(15):1891-1897.

[17]Macropol, K., Can, T., Singh, A.K., 2009. RRW: repeated random walks on genome-scale protein networks for local cluster discovery. BMC Bioinform., 10:283.1-283.10.

[18]Maraziotis, I.A., Dimitrakopoulou, K., Bezerianos, A., 2007. Growing functional modules from a seed protein via integration of protein interaction and gene expression data. BMC Bioinform., 8:408.1-408.15.

[19]Mewes, H.W., Frishman, D., Mayer, K.F.X., et al., 2006. MIPS: analysis and annotation of proteins from whole genomes in 2005. Nucl. Acids Res., 34(suppl 1):D169-D172.

[20]Nepusz, T., Yu, H., Paccanaro, A., 2012. Detecting overlapping protein complexes in protein-protein interaction networks. Nat. Methods, 9(5):471-472.

[21]Ni, W.Y., Xiong, H.J., Zhao, B.H., et al., 2013. Predicting overlapping protein complexes in weighted interactome networks. J. Zhejiang Univ.-Sci. C (Comput. & Electron.), 14(10):756-765.

[22]Palla, G., Derényi, I., Farkas, I., et al., 2005. Uncovering the overlapping community structure of complex networks in nature and society. Nature, 435:814-818.

[23]Pu, S., Wong, J., Turner, B., et al., 2009. Up-to-date catalogues of yeast protein complexes. Nucl. Acids Res., 37(3):825-831.

[24]Shih, Y.K., Parthasarathy, S., 2012. Identifying functional modules in interaction networks through overlapping Markov clustering. Bioinformatics, 28(18):i473-i479.

[25]Stark, C., Breitkreutz, B.J., Reguly, T., et al., 2006. BioGRID: a general repository for interaction datasets. Nucl. Acids Res., 34(suppl 1):D535-D539.

[26]Wu, M., Li, X., Kwoh, C.K., et al., 2009. A core-attachment based method to detect protein complexes in PPI networks. BMC Bioinform., 10:169.1-169.16.

[27]Zhang, R., Lin, Y., 2009. DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes. Nucl. Acids Res., 37(suppl 1):D455-D458.

[28]Zhao, B., Wang, J., Li, M., et al., 2014. Prediction of essential proteins based on overlapping essential modules. IEEE Trans. NanoBiosci., 13(4):415-424.

Open peer comments: Debate/Discuss/Question/Opinion

<1>