
Clayton G
- Research Program Mentor
PhD (Doctor of Philosophy)
Expertise
data science, computer science, computational linguistics, natural language processing, web scraping (Twitter) and analytics, probability theory, statistics, calculus
Project ideas
Detecting bots on Twitter
Now that computers can generate very convincing text entirely on their own, people have become quite concerned about "fake news". In this project, we will investigate, in four steps, how easy it is to detect Tweets that have been written by computers: 1) Collect some data, some of it possibly already labelled as "fake". 2) Compare the statistical properties of "real" Tweets and "fake" Tweets. 3) Write a computer program, for example a Naive Bayes classifier, that labels new Tweets as "real" or "fake". 4) Evaluate how good the program is using a sensible metric, such as precision, recall, or F1 score.
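To give a feel for steps 3 and 4, here is a minimal sketch of a Naive Bayes classifier with add-one smoothing, scored with F1, using only the Python standard library. The Tweets in TRAIN and TEST are made up for illustration and are not real Twitter data; the function names (tokenize, train, predict, f1_score) are hypothetical, and a real project would likely need better tokenization and far more data.

```python
import math
from collections import Counter

# Toy, invented examples for illustration only; not real Twitter data.
TRAIN = [
    ("just had coffee with my sister", "real"),
    ("stuck in traffic again this morning", "real"),
    ("my cat knocked over the plant again", "real"),
    ("click here to win a free prize now", "fake"),
    ("win free followers now click this link", "fake"),
    ("free prize click the link to claim now", "fake"),
]
TEST = [
    ("coffee with my cat this morning", "real"),
    ("claim your free prize now", "fake"),
]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple assumption)."""
    return text.lower().split()


def train(examples):
    """Count words per label, Tweets per label, and collect the vocabulary."""
    word_counts = {}
    label_counts = Counter()
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        counts = word_counts.setdefault(label, Counter())
        for tok in tokenize(text):
            counts[tok] += 1
            vocab.add(tok)
    return word_counts, label_counts, vocab


def predict(text, model):
    """Return the label with the highest log-probability under Naive Bayes."""
    word_counts, label_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        # Log prior: fraction of training Tweets with this label.
        score = math.log(label_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # ignore words never seen in training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][tok] + 1) / (total_words + len(vocab))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def f1_score(gold, predicted, positive="fake"):
    """Harmonic mean of precision and recall for the positive label."""
    pairs = list(zip(gold, predicted))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


model = train(TRAIN)
gold = [label for _, label in TEST]
predictions = [predict(text, model) for text, _ in TEST]
print(predictions, f1_score(gold, predictions))
```

F1 is used here rather than plain accuracy because "fake" Tweets are often much rarer than "real" ones in practice, so a program that always answers "real" could look accurate while detecting nothing.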