SRL based Plagiarism Detection System for Malayalam Documents
Automatic techniques of measuring plagiarism between documents
have gained importance in the recent years because of the availability
of enormous volume of information over the internet. . The most
general form of detecting plagiarism is by computing similarity
between a source document and a possibly plagiarised document.
Existing plagiarism detection systems are mainly designed for
detection in English.. Moreover, plagiarism detection systems using
natural language processing techniques are still very limited.
Automated plagiarism detection systems so far have involved minimal
syntactic and semantic linguistic techniques. Even though, in some
systems shallow techniques have been included as part of the preprocessing stage, studies involving deep techniquesIare less. Very
negligible research has been done for plagiarism detection in
Malayalam text documents . This paper presents a method for
plagiarism detection in Malayalam documents based on extracting the
semantic roles and computing their similarity to detect plagiarism. The
technique can detect documents created by direct copy methods,
replacement of words with similar ones , changing the order of words
or restructuring the sentences and also converting the sentence from
active/ passive to passive/active.
Keywords: Plagiarism detection, semantic role labelling, Malayalam, Karaka relations.
Download Full-Text
ABOUT THE AUTHORS
Sindhu.L
Sindhu.L is working as Assistant Professor at College of Engineering, Poonjar. She took B.E degree in Computer Engineering from Madurai Kamaraj University and M.Tech in Software Engineering from Cochin University of Science and Technology.
Sumam Mary Idicula
Prof . Dr. Sumam Mary Idicula is currently the Head of the Department of Computer Science at Cochin University of Science and Technology. Dr. Sumam Mary Idicula took B.Sc(Engg) degree in Electrical Engineering from College of Engineering Trivandrum and took M.Tech degree in Computer and Information Science from Cochin University of Science & Technology. She took PhD degree in Computer Science from the same department. She is an active researcher in the field of Natural Language Processing, Datamining and Human Computer Interaction
Sindhu.L
Sindhu.L is working as Assistant Professor at College of Engineering, Poonjar. She took B.E degree in Computer Engineering from Madurai Kamaraj University and M.Tech in Software Engineering from Cochin University of Science and Technology.
Sumam Mary Idicula
Prof . Dr. Sumam Mary Idicula is currently the Head of the Department of Computer Science at Cochin University of Science and Technology. Dr. Sumam Mary Idicula took B.Sc(Engg) degree in Electrical Engineering from College of Engineering Trivandrum and took M.Tech degree in Computer and Information Science from Cochin University of Science & Technology. She took PhD degree in Computer Science from the same department. She is an active researcher in the field of Natural Language Processing, Datamining and Human Computer Interaction