American high schoolClass of 2022Fremont, CA
- "Creating a Data Visualization Dashboard to visualize interesting trends in a dataset containing titles from the library of Netflix’s content." with mentor Jim (Dec. 1, 2022)
Creating a Data Visualization Dashboard to visualize interesting trends in a dataset containing titles from the library of Netflix’s content.
Started Mar. 26, 2022
Abstract or project description
This project aims to create an online dashboard that visually presents information about a dataset containing over five thousand unique titles on Netflix. The end product will contain several graphs showcasing the correlation between specific columns of data. I plan on using Dash as the method of creating a dashboard and using the Python Language. The dataset being used was taken from the Kaggle website. The dataset contains 5850 rows and 15 columns. Each row is an entry for a title on Netflix and each column is a certain quality about it like whether it is a show or a movie, the name, the description, the year it was released, age rating(r, pg-13, tv-ma), the entire length in minutes, the genres that the movie or show satisfies, the country in which it was produced, the number of seasons there are if it is a show, several IMDB statistics and ratings, and the same for TMDB. There is also a secondary dataset that contains 54314 different people’s names each shown multiple times to amass more than 77 thousand rows and five columns. These columns include the actor’s name, the name of the character they played, and whether they were a director or actor. If they were the director, then they did not have a value in the column containing the names of the characters. I don’t plan on using this second dataset, instead focusing on the first one. I also plan to have multiple graphs with callback interactions and link different graphs together.