Informatics and Applications
2017, Volume 11, Issue 3, pp 73-79
METHODS FOR INTRINSIC PLAGIARISM DETECTION
- K. F. Safin
- M. P. Kuznetsov
- M. V. Kuznetsova
Abstract
There are two ways to find plagiarism in documents: “external” and “intrinsic” plagiarism detection. External plagiarism detection is the task with a known set of possible references. Intrinsic plagiarism detection aims at discovering plagiarism by analyzing only the document by itself. The paper investigates the methods of intrinsic plagiarism detection. The authors developed a plagiarism detection method based on constructing statistics from the features of the document parts and detecting outliers. The proposed algorithm was tested on the PAN-2011 collection for intrinsic plagiarism detection.
[+] References (10)
- Nikitov, A.V., O.A. Orchakov, and Ju. V. Chehovich. 2012. Plagiat v rabotakh studentov i aspirantov: Problema i metody protivodeystviya [Plagiarism in works of undergraduate and graduate students: Problem and methods of counteraction]. Universitetskoe upravlenie: Praktika i analiz [University Management: Practice and Analysis] 5:61-68.
- Zechner, M., M. Muhr, R. Kern, andM. Granitzer. 2009. External and intrinsic plagiarism detection using vector space models. CEUR Workshop Proceedings. 502:47—55.
- Tschuggnall, M., and G. Specht. 2013. Countering plagiarism by exposing irregularities in authors grammars. European Intelligence and Security Informatics Conference Proceedings. IEEE. 15—22.
- Eissen, S.M., and B. Stein. 2006. Intrinsic plagiarism detection. Advances in information retrieval. Eds. M. Lalmas, A. MacFarlane, S. M. Ruger, et al. Lecture notes in computer science ser. Springer. 3936:565—569.
- Stamatatos, E. 2009. Intrinsic plagiarism detection using character n-gram profiles. CEUR Workshop Proceedings. 502:38-46.
- Oberreuter, G., G. L’Huillier, S. Rl'os, and J. Velasquez. 2011. Outlier-based approaches for intrinsic and external plagiarism detection. Knowlege-based and intelligent information and engineering systems. Eds. A. Konig, A. Dengel, K. Hinkelmann, et al. Lecture notes in computer science ser. Springer. 6882:11-20.
- Bensalem, I., P. Rosso, and S. Chikhi. 2014. Intrinsic plagiarism detection using n-gram classes. Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA: Association for Computational Linguistics. 1459-1464.
- Vartapetiance, A., and L. Gillam. Quite simple approaches for authorship attribution, intrinsic plagiarism detection and sexual predator identification. Available at: http://epubs.surrey.ac.uk/id/eprint/766727 (accessed September 23, 2013).
- Kuznetsov, M., A. Motrenko, R. Kuznetsova, and V. Strijov. Methods for intrinsic plagiarism detection and author diarization. Available at: http://ceur-ws.org/Vol-1609/16090912.pdf (accessed September 6, 2016).
- Potthast, M., B. Stein, A. Barron-Cedeno, and P. Rosso. 2010. An evaluation framework for plagiarism detection. 23rd Conference (International) on Computational Linguistics Proceedings. Stroudsburg, PA: Association for Computational Linguistics. 997-1005.
[+] About this article
Title
METHODS FOR INTRINSIC PLAGIARISM DETECTION
Journal
Informatics and Applications
2017, Volume 11, Issue 3, pp 73-79
Cover Date
2017-09-30
DOI
10.14357/19922264170308
Print ISSN
1992-2264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
natural language processing; intrinsic plagiarism detection; outliers detection
Authors
K. F. Safin , , M. P. Kuznetsov , and M. V. Kuznetsova ,
Author Affiliations
Moscow Institute of Physics and Technology, 9 Institutskiy Per., Dolgoprudny, Moscow Region 141700, Russian Federation
Antiplagiat JSC, 33 Varshavskoe Shosse, Moscow 117105, Russian Federation
"Forecsys" LLC, 42 Vavilov Str., Moscow 119333, Russian Federation
|