Final Project

Presentation video due date: April 25, 2023

Final report due date: May 9, 2023

Your final project is to build a series of ML models with a dataset of your choosing using either (1) at least 3 of the techniques we learned about in class or (2) a new technique which you developed yourself. You may use any libraries you want for your project implementation.

You may NOT copy code from public tutorials. There are a very limited number of datasets which tend to be used for ML tutorials, so we will know if you copy these tutorials. Most Kaggle datasets would not qualify for the final project since there are many code submissions publicly viewable associated with each Kaggle dataset. However, if the dataset you find on Kaggle has FEWER THAN 5 submissions, then that is acceptable.

If you use a dataset which can be imported from sklearn, TensorFlow, or PyTorch, then you must create a new methodology or evaluate an existing methodology which is not implemented by default in sklearn. For example, you cannot simply use sklearn on an sklearn dataset, but you can use sklearn on a new interesting dataset you find -OR- you can evaluate a new methodlogy on an sklearn dataset.

Rubric

3 points: Coding

4 points: Written Report

2 points: Discretionary Points / Course-Specific Requirements (435 vs 635)

1 point: Final Video

Submission instructions

Submit a PDF on Laulima. Make sure that your Colab notebook is shared with the course instructor and TA.