Polygence Scholar2021
Pierce Zhang
Tesla STEM High SchoolClass of 2023
About
Projects
- Bacteria Species Identification using DNA Sequences with mentor Jeremy (Mar. 6, 2022)
Project Portfolio
Bacteria Species Identification using DNA Sequences
Started Aug. 23, 2021Abstract or project description
From a data set containing partial genetic sequences of bacteria, identify the bacterial species. First, we could do exploratory analysis, calculating basic statistical properties such as A/T, C/G content. Second, we could use a basic model such as logistic regression as our initial classifier. Third, we could use either a random forest algorithm or a neural network as a more advanced classifier.