Thursday, May 15, 2014

New Tests for Plagiarism

There are professional tools on the net to detect plagiarism. Those are effective, but do not always work well. For example, if there are HTML files from which someone has copied stuff to be passed as one's own the tools will detect the theft. But if one has copied from a PDF file, the tools fail. I have developed some tools out of experience, and they do not fail. They do not necessarily tell about the source, but they do tell if there has been plagiarism. They are as follows.

  1. The article has two parts, which may be admixed. One part has perfect English grammar and composition. The spelling is faultless. The statements hold profound wisdom (if the original source had profound wisdom). This is the stolen stuff. The other part has perfectly horrible English grammar and composition. Three or more words are joined at places, because the space bar has not been hit between the words. The spelling is atrocious. Punctuation marks are immediately preceding words rather than after the previous words. There is often no wisdom in the content. This is the original part of the article.
  2. There is a special test to detect theft from PDF files. The formatting of a PDF file is such that words do not flow automatically into the next lines when text copied from a PDF file is pasted into a word processor document. So one finds words broken by hyphens in the middle of lines. Look at the example shown in blue. There are professional to- ols on the net to detect plagiarism. Those are effective, but do not alw- ays work well. For example, if ther- e are HTML files from which someone has copied stuff to be passed as one's own the tools will det- ect the theft.
  3. The good part of the article does not remain true to the theme of the article. The stuff stolen from another article is often good for the topic of that article, but the topics of the two articles do not match well. As a result, the stolen stuff appears irrelevant.
If the stuff has been stolen from a book in print, the typing can be atrocious, but the content can be good. Luckily, a person who is too lazy to write something on his/her own is also too lazy to type anything, when there is stuff ready to be copied from and pasted. So plagiarism from printed material is less of a worry for an editor.

प्रशंसा करायचीय, नावे ठेवायचीयेत, काही विचारायचय, किंवा करायला आणखी चांगले काही सुचत नाहीये, तर क्लिक करा.

संपर्क