This is a template repository for the "Projects in Data Science" course. You should use this repository for your project.
If using github.itu.dk, you need to download the repository and make your own.
If you are using general Github, you can clone or fork the repository directly.
Your repository MUST be named 2026-PDS-XX where XX is your group name (e.g. 2026-PDS-Pandas).
Follow TA instructions when setting up the Python environment before running any code. Remember to export your Python library requirements by pip freeze > requirements.txt and attach it to the repo so we can evaluate your scripts.
The file hierarchy of your hand-in repo should be as follows:
ProjectInDataScience2026_ExamTemplate/
βββ data/
β ββ features.csv # all image file names, ground-truth labels, and chosen features
β β
β βββ imgs/ # skin images (to not add on GitHub)
β β βββ img_XX1.png
β β βββ img_XX2.png
β β ......
β β βββ img_XXX.png
β β
β βββ masks/ # masks images (to not add on GitHub)
β βββ mask_XX1.png
β βββ mask_XX2.png
β ......
β βββ mask_XXX.png
β
βββ src/
β βββ __init__.py
β βββ feature_A.py # code for feature A extraction
β βββ feature_B.py # code for feature B extraction
β ......
β βββ feature_X.py # code for feature X extraction
β
βββ result/
β βββ figures/ # Figures used in your report
β βββ models/ # Trained models
β βββ predictions/ # Probabilities outputed by the models
β βββ reports # Files related to the Mandatory assignment
β βββ report_GROUPEID.pdf
β βββ features_GROUPEID.csv
β
βββ main.py # script to train or evaluate models
βββ README.md
Notes:
- DO NOT upload your data (images) to Github.
- When the same code block needs to be executed multiple times in the script, make it a custom function instead. All the custom functions and modules should be grouped into different files under the "src" subfolder, based on the task they are designed for. Do not put everything in a single Python file or copy-paste the same code block across the script.