PhD candidate

Machine Learning, Data Science, Statistical Inference

Project ideas

Project ideas are meant to help inspire students' thinking about their own projects. Students are in the driver's seat of their research and are free to use any or none of the ideas shared by their mentors.

Machine Learning Application

The end goal of this project would be to build a predictive machine learning model on a dataset of your choice (or one we can find together). If you have no prior experience with machine learning, we would begin by going over the fundamental models and concepts of the field before moving on to the application. If you already have sufficient background, we can dive straight into the data analysis.
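To give a flavor of what "building a predictive model" involves, here is a minimal sketch in Python using only the standard library. It is not the project's prescribed method: the data are synthetic, and the function names (`fit_line`, `mean_squared_error`, `run_demo`) and the true slope and intercept are assumptions made purely for illustration. It fits a straight line on 80% of the data and measures the error on the held-out 20%.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_squared_error(xs, ys, slope, intercept):
    """Average squared gap between predictions and observed values."""
    return sum((slope * x + intercept - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def run_demo(seed=0, n=200):
    """Train on synthetic data and report the error on a held-out test set."""
    rng = random.Random(seed)
    # Hypothetical data: y = 2x + 1 plus Gaussian noise (an assumption).
    xs = [rng.uniform(0, 10) for _ in range(n)]
    ys = [2.0 * x + 1.0 + rng.gauss(0, 1) for x in xs]
    split = int(0.8 * n)  # first 80% for training, the rest for testing
    slope, intercept = fit_line(xs[:split], ys[:split])
    test_mse = mean_squared_error(xs[split:], ys[split:], slope, intercept)
    return slope, intercept, test_mse


if __name__ == "__main__":
    slope, intercept, test_mse = run_demo()
    print(f"slope={slope:.3f} intercept={intercept:.3f} test MSE={test_mse:.3f}")
```

Evaluating on data the model never saw during fitting is the key habit here; a real project would swap in an actual dataset and likely a richer model.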

Understanding the "Reproducibility Crisis"

In this project we would go over several important readings and papers to understand the so-called "reproducibility crisis" in science. The term refers to the large number of studies whose findings cannot be reproduced by other researchers. For instance, one psychology report was able to reproduce only 36 out of 100 findings from top journals. We would begin by discussing the statistical, game-theoretic, and structural causes of the crisis, and then discuss several new tools scientists have developed to overcome it. The culmination of this project could be a blog post summarizing what we have learned and offering suggestions for further improvement. Alternatively, a student could analyze these tools to compare their efficacy.
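One of the statistical causes can be illustrated with a short simulation. The sketch below is a toy model, not an analysis of any real dataset: the share of tested hypotheses that are true (`prior`), the statistical power, and the significance threshold `alpha` are all hypothetical values chosen for illustration, and the function names are made up here. It assumes only significant results get published and asks how many of those would come out significant again in a single repeat study.

```python
import random


def expected_replication_rate(prior, power, alpha):
    """Closed-form replication rate among published (significant) findings."""
    # Share of significant findings that reflect a true effect.
    ppv = prior * power / (prior * power + (1 - prior) * alpha)
    # A true effect replicates with probability `power`;
    # a false positive "replicates" only by chance, with probability `alpha`.
    return ppv, ppv * power + (1 - ppv) * alpha


def simulate_replication_rate(prior, power, alpha, n_studies=100_000, seed=0):
    """Monte Carlo version of the same toy model."""
    rng = random.Random(seed)
    published = 0
    replicated = 0
    for _ in range(n_studies):
        effect_is_real = rng.random() < prior
        p_significant = power if effect_is_real else alpha
        if rng.random() < p_significant:  # original study is significant
            published += 1
            if rng.random() < p_significant:  # independent repeat study
                replicated += 1
    return replicated / published


if __name__ == "__main__":
    # Hypothetical inputs: 10% of tested hypotheses true, 80% power, alpha 0.05.
    ppv, expected = expected_replication_rate(0.1, 0.8, 0.05)
    simulated = simulate_replication_rate(0.1, 0.8, 0.05)
    print(f"PPV={ppv:.2f} expected={expected:.2f} simulated={simulated:.2f}")
```

Under these assumed inputs, roughly half of the published findings replicate even though every individual study used a conventional significance threshold. Lowering `prior` or `power` pushes the rate down further, which is one way to explore the statistical side of the crisis.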

Coding skills

Python, R, C++

