High School Cheaters Nabbed by Neural Network

Researchers trained a neural network to scrutinize high school essays and sniff out ghostwritten papers. Christopher Intagliata reports.

Illustration of a Bohr atom model spinning around the words Science Quickly with various science and medicine related icons around the text

Join Our Community of Science Lovers!

The English-language version of Wikipedia has almost six million articles. And if you're a cheating student, that's six million essays already written for you, footnotes and all. Except plagiarism isn't really an effective tactic—just plug the text into a search engine and game over.

But what about having a ghostwriter at a paper mill compose your final essay?

"Standard plagiarism software cannot detect this kind of cheating."


On supporting science journalism

If you're enjoying this article, consider supporting our award-winning journalism by subscribing. By purchasing a subscription you are helping to ensure the future of impactful stories about the discoveries and ideas shaping our world today.


Stephan Lorenzen, a data analyst at the University of Copenhagen. In Denmark, where he's based, ghostwriting is a growing problem at high schools. So Lorenzen and his colleagues created a program called Ghostwriter that can detect the cheats.

At its core is a neural network trained and tested on 130,000 real essays from 10,000 Danish students. After reading through tens of thousands of essays labeled as being written by the same author or not, the machine taught itself to tune into the characteristics that might tip off cheating. For example, did a student's essays share the same styles of punctuation? The same spelling mistakes? Were the abbreviations the same?

By scrutinizing inconsistencies like those, Ghostwriter was able to pinpoint a cheated essay nearly 90 percent of the time. The team presented the results at the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. [Magnus Stavngaard et al., Detecting Ghostwriters in High Schools]

There's one more aspect here that could help students. Your high school essays presumably get better over time as you learn to write—and the machine can detect that. "The final idea is to detect students who are at risk because their development in writing style isn't as you'd expect."

Teachers could thus give extra help to kids who really need it, while sniffing out the cheaters too.

—Christopher Intagliata

[The above text is a transcript of this podcast.]

It’s Time to Stand Up for Science

If you enjoyed this article, I’d like to ask for your support. Scientific American has served as an advocate for science and industry for 180 years, and right now may be the most critical moment in that two-century history.

I’ve been a Scientific American subscriber since I was 12 years old, and it helped shape the way I look at the world. SciAm always educates and delights me, and inspires a sense of awe for our vast, beautiful universe. I hope it does that for you, too.

If you subscribe to Scientific American, you help ensure that our coverage is centered on meaningful research and discovery; that we have the resources to report on the decisions that threaten labs across the U.S.; and that we support both budding and working scientists at a time when the value of science itself too often goes unrecognized.

In return, you get essential news, captivating podcasts, brilliant infographics, can't-miss newsletters, must-watch videos, challenging games, and the science world's best writing and reporting. You can even gift someone a subscription.

There has never been a more important time for us to stand up and show why science matters. I hope you’ll support us in that mission.

Thank you,

David M. Ewalt, Editor in Chief, Scientific American

Subscribe