CLC number: TP391
On-line Access: 2020-03-18
Received: 2018-06-08
Revision Accepted: 2018-12-08
Crosschecked: 2019-08-14
Cited: 0
Clicked: 5422
Wei-ming Lu, Jia-hui Liu, Wei Xu, Peng Wang, Bao-gang Wei. EncyCatalogRec: catalog recommendation for encyclopedia article completion[J]. Frontiers of Information Technology & Electronic Engineering, 2020, 21(3): 436-447.
@article{title="EncyCatalogRec: catalog recommendation for encyclopedia article completion",
author="Wei-ming Lu, Jia-hui Liu, Wei Xu, Peng Wang, Bao-gang Wei",
journal="Frontiers of Information Technology & Electronic Engineering",
volume="21",
number="3",
pages="436-447",
year="2020",
publisher="Zhejiang University Press & Springer",
doi="10.1631/FITEE.1800363"
}
%0 Journal Article
%T EncyCatalogRec: catalog recommendation for encyclopedia article completion
%A Wei-ming Lu
%A Jia-hui Liu
%A Wei Xu
%A Peng Wang
%A Bao-gang Wei
%J Frontiers of Information Technology & Electronic Engineering
%V 21
%N 3
%P 436-447
%@ 2095-9184
%D 2020
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.1800363
TY - JOUR
T1 - EncyCatalogRec: catalog recommendation for encyclopedia article completion
A1 - Wei-ming Lu
A1 - Jia-hui Liu
A1 - Wei Xu
A1 - Peng Wang
A1 - Bao-gang Wei
J0 - Frontiers of Information Technology & Electronic Engineering
VL - 21
IS - 3
SP - 436
EP - 447
%@ 2095-9184
Y1 - 2020
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/FITEE.1800363
Abstract: Online encyclopedias such as Wikipedia provide a large and growing number of articles on many topics. However, the content of many articles is still far from complete. In this paper, we propose EncyCatalogRec, a system to help generate a more comprehensive article by recommending catalogs. First, we represent articles and catalog items as embedding vectors, and obtain similar articles via the locality sensitive hashing technology, where the items of these articles are considered as the candidate items. Then a relation graph is built from the articles and the candidate items. This is further transformed into a product graph. So, the recommendation problem is changed to a transductive learning problem in the product graph. Finally, the recommended items are sorted by the learning-to-rank technology. Experimental results demonstrate that our approach achieves state-of-the-art performance on catalog recommendation in both warm- and cold-start scenarios. We have validated our approach by a case study.
[1]Banerjee S, Mitra P, 2015a. Filling the gaps: improving Wikipedia stubs. Proc ACM Symp on Document Engineering, p.117-120.
[2]Banerjee S, Mitra P, 2015b. WikiKreator: improving Wikipedia stubs automatically. Proc 53rd> Annual Meeting of the Association for Computational Linguistics and the 7$^rm th$ Int Joint Conf on Natural Language Processing, p.867-877.
[3]Banerjee S, Mitra P, 2016. WikiWrite: generating Wikipedia articles automatically. Proc 25th> Int Joint Conf on Artificial Intelligence, p.2740-2746.
[4]Bizer C, Lehmann J, Kobilarov G, et al., 2009. DBpedia—a crystallization point for the web of data. it J Web Semant, 7(3):154-165.
[5]Datar M, Immorlica N, Indyk P, et al., 2004. Locality-sensitive hashing scheme based on $p$-stable distributions. Proc 20th> Annual Symp on Computational Geometry, p.253-262.
[6]Fetahu B, Markert K, Anand A, 2015. Automated news suggestions for populating Wikipedia entity pages. Proc 24th> ACM Int Conf on Information and Knowledge Management, p.323-332.
[7]Gambhir M, Gupta V, 2017. Recent automatic text summarization techniques: a survey. it Artif Intell Rev, 47(1):1-66.
[8]Haveliwala TH, 2002. Topic-sensitive PageRank. Proc 11th> Int Conf on World Wide Web, p.517-526.
[9]He XN, Liao LZ, Zhang HW, et al., 2017. Neural collaborative filtering. Proc 26th> Int Conf on World Wide Web, p.173-182.
[10]Hoffart J, Suchanek FM, Berberich K, et al., 2013. YAGO2: a spatially and temporally enhanced knowledge base from Wikipedia. it Artif Intell, 194:28-61.
[11]Joachims T, 2002. Optimizing search engines using clickthrough data. Proc 8th> ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining, p.133-142.
[12]Joachims T, 2006. Training linear SVMs in linear time. Proc 12th> ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining, p.217-226.
[13]Koren Y, Bell R, Volinsky C, 2009. Matrix factorization techniques for recommender systems. it Computer, 42(8):30-37.
[14]Le QV, Mikolov T, 2014. Distributed representations of sentences and documents. Proc 31th> Int Conf on Machine Learning, p.1188-1196.
[15]Liu HX, Yang YM, 2015. Bipartite edge prediction via transductive learning over product graphs. Proc 32nd> Int Conf on Machine Learning, p.1880-1888.
[16]Luo X, Zhou MC, Xia YN, et al., 2014. An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems. it IEEE Trans Ind Inform, 10(2):1273-1284.
[17]Mikolov T, Sutskever I, Chen K, et al., 2013a. Distributed representations of words and phrases and their compositionality. Proc 26th> Int Conf on Neural Information Processing Systems, p.3111-3119.
[18]Mikolov T, Chen K, Corrado G, et al., 2013b. Efficient estimation of word representations in vector space. https://arxiv.org/abs/1301.3781
[19]Reinanda R, Meij E, de Rijke M, 2015. Mining, ranking and recommending entity aspects. Proc 38th> Int ACM SIGIR Conf on Research and Development in Information Retrieval, p.263-272.
[20]Sauper C, Barzilay R, 2009. Automatically generating Wikipedia articles: a structure-aware approach. Proc 47th> Annual Meeting of the ACL and the 4th> Int Joint Conf on Natural Language Processing of the AFNLP, p.208-216.
[21]Strube M, Ponzetto SP, 2006. WikiRelate! Computing semantic relatedness using Wikipedia. Proc 21st> National Conf on Artificial Intelligence, p.1419-1424.
[22]Suchanek FM, Kasneci G, Weikum G, 2007. YAGO: a core of semantic knowledge. Proc 16th> Int Conf on World Wide Web, p.697-706.
[23]Tanaka S, Okazaki N, Ishizuka M, 2010. Learning web query patterns for imitating Wikipedia articles. Proc 23rd> Int Conf on Computational Linguistics, p.1229-1237.
[24]Wagstaff KL, Riloff E, Lanza NL, et al., 2016. Creating a Mars target encyclopedia by extracting information from the planetary science literature. AAAI Workshop on Knowledge Extraction from Text, p.532-536.
[25]Wulczyn E, West R, Zia L, et al., 2016. Growing Wikipedia across languages via recommendation. Proc 25th> Int Conf on World Wide Web, p.975-985.
[26]Zhao Y, Karypis G, 2002. Evaluation of hierarchical clustering algorithms for document datasets. Proc 11th> Int Conf on Information and Knowledge Management, p.515-524.
[27]Zhao Y, Karypis G, Fayyad U, 2005. Hierarchical clustering algorithms for document datasets. it Data Min Knowl Discov, 10(2):141-168.
Open peer comments: Debate/Discuss/Question/Opinion
<1>