MSc Data Science Student | Python Developer | Data Analyst
I'm passionate about turning complex data into clear, actionable insights. Currently completing my MSc in Data Science, I work across the full data science stack — from database design and big data processing to spatial analysis, interactive visualisations, and machine learning.
| Repository | Module | Semester | Focus |
|---|---|---|---|
| HIV-Antiretroviral-Therapy-ART-Coverage | DAS7000 — Data Analytics & Visualisation | S1 | EDA, Plotly, interactive mapping |
| air-quality-analysis | CMP7005 — Programming for Data Analysis | S1 | Python, pandas, Streamlit dashboard |
| CMP7005-Programming-for-Data-Analysis | CMP7005 — Programming for Data Analysis | S1 | Module overview & slides |
| Big-Data-for-Enterprise_S2_24 | DAS7001 — Big Data for Enterprise | S2 | MongoDB, NoSQL, enterprise architecture |
| Big-Data-Technologies | DSA7002 — Big Data Technologies | S2 | Apache Spark, PySpark, distributed computing |
| Geospatial-Analysis_S2_24 | DAS7003 — Geospatial Analysis | S2 | GeoPandas, spatial visualisation |
| CST7001-Research-and-Professional-Practice | CST7001 — Research & Professional Practice | S2 | Academic research, ethics, professional skills |
| Repository | Description |
|---|---|
| HIV-Antiretroviral-Therapy-ART-Coverage | Interactive EDA of global HIV ART coverage (2010–2021) using UNICEF/UNAIDS data, Plotly, Choropleth maps, and Sankey diagrams |
| air-quality-analysis | Beijing air quality analysis across 12 PRSA monitoring stations with a Streamlit dashboard |
| data-science-portfolio | Gold price prediction & house price prediction pipeline using Scikit-Learn |
Languages & Core Tools Python · SQL · Markdown · Git
Data Science & Machine Learning pandas · NumPy · Scikit-Learn · SciPy · Jupyter Notebooks
Data Visualisation Matplotlib · Seaborn · Plotly · Folium
Big Data & Databases Apache Spark · PySpark · MongoDB · PyMongo · Hadoop
Geospatial GeoPandas · Folium · QGIS
Web & Dashboards Streamlit
Version Control Git · GitHub
| Repository | Description |
|---|---|
| data-science-learning | Foundational Python and data analysis assignments — the starting point of this journey |
- MSc Dissertation (Semester 3)
- Expanding skills in deep learning and neural networks
- Building end-to-end ML pipelines for real-world datasets
GitHub: @Allenstrange
Feel free to explore the repositories and reach out if you'd like to connect or collaborate.
LinkedIn: Allen Chima
Portfolio: allenstrange.github.io
