CLC number: TP391.3
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
Crosschecked: 2017-11-01
Lei-lei Kong, Zhi-mao Lu, Hao-liang Qi, Zhong-yuan Han. A machine learning approach to query generation in plagiarism source retrieval[J]. Frontiers of Information Technology & Electronic Engineering, in press. https://doi.org/10.1631/FITEE.1601344
Chinese title: A machine learning based query generation method for plagiarism source retrieval
Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2025 Journal of Zhejiang University-SCIENCE