Skip to content

demirhalilbasic/uuap-case-study-1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧬 UuAP Case Study 1: DNA Sequence Classification via K-mers

Welcome to the main project repository for UuAP Case Study 1. This project focuses on the classification and machine learning analysis using purely data-driven k-mer frequency extraction over numerical string segments.

🎓 Academic Context

Course: Introduction to Data Analysis
Institution: IPI Academy Tuzla
Semester: Spring 2026

👨‍🎓 Author & Contact

Student: Demir Halilbašić
linkedin youtube


⚖️ Disclaimer & Ethical Statement

Important: The datasets used in this project were downloaded from the NCBI (National Center for Biotechnology Information) platform in FASTA format and serve exclusively for data analysis exercises. All copyrights and ownership of the genomic sequences belong entirely to NCBI.

This repository does not represent a validated biological research project, nor does it aim to establish genuine biological conclusions. The objective is simply to experiment with mathematical classifiers, sequence feature extraction, pattern matching, and analytical visualization entirely within a computational context.


🛠️ Project Structure

  • lab_frog_dna/: Main analytics directory.
    • Contains Python pipelines (lab_pipeline.py) used for algorithm evaluation and metric visualizations.
    • View the lab_frog_dna/README.md for a comprehensive visual gallery of our machine learning model accuracies and generated PCA charts.

About

Data-driven DNA sequence classification of amphibian species via k-mer analysis and predictive modeling.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages