Skip to content
View saws-lab's full-sized avatar
  • University of Melbourne
  • Melbourne, Australia
  • LinkedIn in/syedawais

Highlights

  • Pro

Block or report saws-lab

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
saws-lab/README.md

Hi, I'm Awais 👋

I'm a Data Scientist with a PhD and deep expertise in machine learning, predictive modelling, and biological sequence data. I build end-to-end ML pipelines — from raw data and feature engineering through to deployed, production-ready applications. I've proven track record of turning complex, large-scale datasets into insights that shape strategic decision-making.

My domain is infectious disease: I develop models that turn genetic sequence data into actionable predictions, with real-world impact at scale. My work has been published in Nature Communications and adopted by the World Health Organization platform at GISAID.

🚀 What I Build

  • Predictive ML and NN models on high-dimensional biological data (ensemble methods, AdaBoost, Random Forest, XGBoost, MLP, CNN)
  • End-to-end pipelines — data ingestion, feature engineering, model training, evaluation, and deployment
  • Production web applications using Streamlit, deployed on cloud platforms with CI/CD
  • Interpretable AI — feature importance, permutation importance, SHAP-style analysis, identifying what drives model predictions

📦 Featured Projects

SAP_H3N2_ML — Influenza Antigenic Prediction Model

An AdaBoost regression and classification model trained on genetic sequence data to predict antigenic properties of influenza viruses. Trained on historical seasonal data; evaluated prospectively on future seasons.

  • 92% average AUROC across 14 held-out test seasons
  • Nonlinear feature mapping from sequence mutations to phenotypic outcomes
  • Adopted by the WHO for influenza surveillance and vaccine strain selection
  • Published: Nature Communications, 2024

SAP_H3N2_ML_webapp — Deployed Prediction App

A production Streamlit application serving the AdaBoost model to end users — interactive single-sample prediction and CSV batch processing with downloadable results.

🛠️ Tech Stack

Python scikit-learn Pandas NumPy Streamlit Jupyter GitHub Actions HuggingFace

📬 Connect

LinkedIn Email Google Scholar

Popular repositories Loading

  1. SAP_H3N2_ML SAP_H3N2_ML Public

    ML model for influenza antigenic surveillance · Nature Communications 2024 · Live demo on Hugging Face

    Jupyter Notebook 4 2

  2. SAP_H3N2_ML_webapp SAP_H3N2_ML_webapp Public

    Streamlit web app for seasonal antigenic prediction of influenza A (H3N2) from HA1 sequences using the SAP_H3N2_ML AdaBoost model.

    Python

  3. saws-lab saws-lab Public

    About me