PhD Doctor of Philosophy


data science, computer science, computational linguistics, natural language processing, web scraping (Twitter) and analytics, probability theory, statistics, calculus

Project ideas

Detecting bots on Twitter

Now that computers can generate very convincing text entirely on their own, people have become quite concerned about "fake news". In this project, we will investigate how easy it is to detect Tweets that have been written by computers. The work has four steps: 1) Collect some data, some of which may already be labelled as "fake". 2) Compare the statistical properties of "real" Tweets and "fake" Tweets. 3) Write a computer program, for example a Naive Bayes classifier, that labels new Tweets as "real" or "fake". 4) Evaluate how good the program is using a sensible metric.
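To give a flavour of steps 3 and 4, here is a minimal sketch of a multinomial Naive Bayes classifier with add-one smoothing, written in plain Python. Everything in it is an assumption made for illustration: the names `tokenize`, `NaiveBayes` and `f1_score` are hypothetical, the four training Tweets are invented toy examples rather than real data, and the F1 score is just one possible choice of "sensible metric". A real project would use a much larger labelled data set and a better tokenizer.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # number of Tweets per class
        self.word_counts = defaultdict(Counter)  # word frequencies per class
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for word in tokenize(text):
                if word in self.vocab:  # words never seen in training are skipped
                    score += math.log((self.word_counts[label][word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def f1_score(true_labels, predicted_labels, positive="fake"):
    """Harmonic mean of precision and recall for the `positive` class."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Invented toy data, for illustration only.
train_texts = [
    "win a free prize now click here",
    "free followers click this link now",
    "lovely walk in the park this morning",
    "great coffee with friends this morning",
]
train_labels = ["fake", "fake", "real", "real"]

model = NaiveBayes().fit(train_texts, train_labels)
test_texts = ["click now for a free prize", "coffee in the park"]
test_labels = ["fake", "real"]
predictions = [model.predict(t) for t in test_texts]
print(predictions, f1_score(test_labels, predictions))
```

Working with log probabilities avoids numerical underflow when many small probabilities are multiplied, and add-one smoothing stops a word that was never seen in one class from forcing that class's probability to zero. F1 is used here rather than plain accuracy because "fake" Tweets are likely to be rare in real data, where a classifier that always answers "real" could still score a high accuracy.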

Coding skills

Python (and Jupyter notebook), MATLAB, R, UNIX scripting, Java, C

