Skip to content
View Allenstrange's full-sized avatar

Block or report Allenstrange

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Allenstrange/README.md

Hi, I'm Allen Chima

MSc Data Science Student | Python Developer | Data Analyst

I'm passionate about turning complex data into clear, actionable insights. Currently completing my MSc in Data Science, I work across the full data science stack — from database design and big data processing to spatial analysis, interactive visualisations, and machine learning.


MSc Data Science Modules

Repository Module Semester Focus
HIV-Antiretroviral-Therapy-ART-Coverage DAS7000 — Data Analytics & Visualisation S1 EDA, Plotly, interactive mapping
air-quality-analysis CMP7005 — Programming for Data Analysis S1 Python, pandas, Streamlit dashboard
CMP7005-Programming-for-Data-Analysis CMP7005 — Programming for Data Analysis S1 Module overview & slides
Big-Data-for-Enterprise_S2_24 DAS7001 — Big Data for Enterprise S2 MongoDB, NoSQL, enterprise architecture
Big-Data-Technologies DSA7002 — Big Data Technologies S2 Apache Spark, PySpark, distributed computing
Geospatial-Analysis_S2_24 DAS7003 — Geospatial Analysis S2 GeoPandas, spatial visualisation
CST7001-Research-and-Professional-Practice CST7001 — Research & Professional Practice S2 Academic research, ethics, professional skills

Data Science Projects

Repository Description
HIV-Antiretroviral-Therapy-ART-Coverage Interactive EDA of global HIV ART coverage (2010–2021) using UNICEF/UNAIDS data, Plotly, Choropleth maps, and Sankey diagrams
air-quality-analysis Beijing air quality analysis across 12 PRSA monitoring stations with a Streamlit dashboard
data-science-portfolio Gold price prediction & house price prediction pipeline using Scikit-Learn

Skills & Technologies

Languages & Core Tools Python · SQL · Markdown · Git

Data Science & Machine Learning pandas · NumPy · Scikit-Learn · SciPy · Jupyter Notebooks

Data Visualisation Matplotlib · Seaborn · Plotly · Folium

Big Data & Databases Apache Spark · PySpark · MongoDB · PyMongo · Hadoop

Geospatial GeoPandas · Folium · QGIS

Web & Dashboards Streamlit

Version Control Git · GitHub


Learning Journey

Repository Description
data-science-learning Foundational Python and data analysis assignments — the starting point of this journey

Currently Working On

  • MSc Dissertation (Semester 3)
  • Expanding skills in deep learning and neural networks
  • Building end-to-end ML pipelines for real-world datasets

GitHub: @Allenstrange

Feel free to explore the repositories and reach out if you'd like to connect or collaborate.


GitHub Stats

Allen's GitHub Stats

Top Languages

LinkedIn: Allen Chima

Portfolio: allenstrange.github.io

Popular repositories Loading

  1. HIV-Antiretroviral-Therapy-ART-Coverage HIV-Antiretroviral-Therapy-ART-Coverage Public

    Interactive EDA of global HIV ART coverage among adolescents (2010–2021) using UNICEF/UNAIDS data, Plotly, and advanced censored-value imputation

    Jupyter Notebook 1

  2. data-science-learning data-science-learning Public

    Foundational data science assignments in Python — EDA, data manipulation with pandas, visualisation, and statistics

    Jupyter Notebook

  3. Allenstrange Allenstrange Public

    GitHub profile README — MSc Data Science student portfolio showcasing projects in Python, Spark, MongoDB, Plotly, and Streamlit

    Jupyter Notebook

  4. data-science-portfolio data-science-portfolio Public

    Data science projects: gold price prediction and house price ML pipeline using Python and Scikit-Learn

    Jupyter Notebook

  5. air-quality-analysis air-quality-analysis Public

    Beijing air quality analysis across 12 PRSA monitoring stations with Streamlit dashboard — CMP7005 PRAC1 assessment

    Jupyter Notebook

  6. CMP7005-Programming-for-Data-Analysis CMP7005-Programming-for-Data-Analysis Public

    MSc Data Science — Programming for Data Analysis (CMP7005): module overview, PRES1 slides. PRAC1 project in air-quality-analysis repo