DETEKSI PLAGIAT TESIS BERBAHASA INDONESIA MENGGUNAKAN METODE COSINE SIMILARITY
(Plagiarism Detection in Indonesian-Language Theses Using the Cosine Similarity Method)

Syukry Ansis, Endang Palupi Listyaningsih, Hari Soetanto

Abstract


This research evaluates the performance of the Cosine Similarity method against the Jaccard Similarity method for plagiarism detection and reports the similarity percentage each method produces. The sample data are student data from the Budi Luhur campus. The model is evaluated by comparing several original theses with documents that contain plagiarism. The original documents are first processed with Natural Language Processing (NLP) methods, an important one being the Jaro-Winkler method, which is used for spelling correction. Text mining algorithms are then applied for text processing. The results show that Cosine Similarity achieved a high accuracy of 96.63%, demonstrating that it classifies documents well as plagiarized or not plagiarized. Jaccard Similarity achieved a low accuracy of about 50.5%, but this result gives an overview of potential improvements or updates to the model that could raise its performance.
Keywords - Cosine Similarity, Jaccard Similarity, Thesis Classification, Threshold, NLP
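
The two similarity measures compared in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the regex tokenizer, the use of raw term counts (rather than any particular weighting), the `THRESHOLD` value of 0.5, and the two sample sentences are all assumptions made only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens (assumed tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def cosine_similarity(a, b):
    """Cosine of the angle between the raw term-count vectors of two texts."""
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def jaccard_similarity(a, b):
    """Size of the shared vocabulary divided by the size of the combined vocabulary."""
    sa, sb = set(tokenize(a)), set(tokenize(b))
    union = sa | sb
    if not union:
        return 0.0
    return len(sa & sb) / len(union)


# Hypothetical decision threshold; the abstract does not state the value used.
THRESHOLD = 0.5


def is_plagiarism(score, threshold=THRESHOLD):
    """Flag a document pair as plagiarism when its score reaches the threshold."""
    return score >= threshold


# Illustrative sentences, not taken from the study's data.
original = "metode cosine similarity mendeteksi plagiat tesis"
suspect = "metode cosine similarity mendeteksi plagiat skripsi"

cos_score = cosine_similarity(original, suspect)
jac_score = jaccard_similarity(original, suspect)
print(f"cosine:  {cos_score:.2%}  plagiarism: {is_plagiarism(cos_score)}")
print(f"jaccard: {jac_score:.2%}  plagiarism: {is_plagiarism(jac_score)}")
```

In this sketch the two sentences share five of six words, so the cosine score is 5/6 while the Jaccard score is 5/7. Jaccard is computed on word sets and ignores how often a word occurs, which is one reason the two measures can rank the same document pair differently.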





DOI: https://doi.org/10.35314/isi.v9i1.4003

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.



Copyright of Jurnal Inovtek Polbeng - Seri Informatika (ISSN: 2527-9866)


