-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathMultimodalPreprocessingPipeline.html
More file actions
86 lines (76 loc) · 4.09 KB
/
MultimodalPreprocessingPipeline.html
File metadata and controls
86 lines (76 loc) · 4.09 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
<!DOCTYPE HTML>
<!--
Massively by HTML5 UP
html5up.net | @ajlkn
Free for personal and commercial use under the CCA 3.0 license (html5up.net/license)
-->
<html>
<head>
<title>Joseph Ruff - Multimodal Biosignal Preprocessing Pipeline</title>
<meta charset="utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1, user-scalable=no" />
<link rel="stylesheet" href="assets/css/main.css" />
<noscript><link rel="stylesheet" href="assets/css/noscript.css" /></noscript>
</head>
<body class="is-preload">
<!-- Wrapper -->
<div id="wrapper">
<!-- Header -->
<header id="header">
<a href="https://josephruff.github.io" class="logo">Joseph Ruff</a>
</header>
<!-- Nav -->
<nav id="nav">
<ul class="links">
<li><a href="https://josephruff.github.io">Portfolio</a></li>
<li><a href="CurriculumVitae.html">Curriculum Vitae</a></li>
<li class="active"><a href="MultimodalPreprocessingPipeline.html">Preprocessing Pipeline</a></li>
</ul>
<ul class="icons">
<li><a href="Curriculum Vitae Joseph David Ruff.pdf" class="icon solid fa-download"><span class="label">Download CV</span></a></li>
<li><a href="https://www.linkedin.com/in/joseph-ruff-a99062393" class="icon brands fa-linkedin"><span class="label">Instagram</span></a></li>
<li><a href="https://github.com/JosephRuff" class="icon brands fa-github"><span class="label">GitHub</span></a></li>
</ul>
</nav>
<!-- Main -->
<div id="main">
<!-- Post -->
<section class="post">
<header class="major">
<h1>Multimodal Biosignal Preprocessing Pipeline</h1>
<p>Reproducible data engineering across EEG, ECG, and wearable HAR datasets for downstream self-supervised learning</p>
</header>
<hr>
<p>
This project implements a fully reproducible pipeline for downloading, preprocessing, and validating five open-access multimodal time-series datasets — PAMAP2, WISDM, mHealth, EEGMMIDB, and PTB-XL — in preparation for downstream self-supervised learning workflows.
</p>
<p>
The pipeline harmonises three wearable activity recognition datasets (HAR) to a common 20 Hz representation with a shared six-channel accelerometer/gyroscope schema and unified class label taxonomy, enabling a single model to train across datasets. EEG data from the PhysioNet EEG Motor Movement/Imagery Database is preprocessed using MNE, with event-aligned 4-second epochs extracted from motor imagery runs. 12-lead ECG data from PTB-XL is ingested via the PhysioNet AWS S3 mirror, bandpass filtered, and split into patient-safe train, validation, and test folds.
</p>
<p>
All outputs are stored as float32 NumPy arrays in a consistent [N, C, T] format alongside structured metadata CSVs covering subject provenance, label mappings, sampling rates, channel schemas, and QC flags. A validation script checks array integrity, label distributions, subject-level leakage controls, and HAR harmonisation across datasets.
</p>
<p>
The pipeline scored 3rd out of 17 submissions in a competitive technical assessment for a Research Assistant post at Imperial College London, with the preprocessing plan rated the most thoroughly reasoned of all submissions.
</p>
<hr>
<p style="text-align:center">
<a href="https://github.com/JosephRuff/multimodal-preprocessing-pipeline" class="button icon brands fa-github">Source Code</a>
</p>
</section>
</div>
<!-- Copyright -->
<div id="copyright">
<ul><li>© Joseph Ruff</li><li>Design: <a href="https://html5up.net">HTML5 UP</a></li></ul>
</div>
</div>
<!-- Scripts -->
<script src="assets/js/jquery.min.js"></script>
<script src="assets/js/jquery.scrollex.min.js"></script>
<script src="assets/js/jquery.scrolly.min.js"></script>
<script src="assets/js/browser.min.js"></script>
<script src="assets/js/breakpoints.min.js"></script>
<script src="assets/js/util.js"></script>
<script src="assets/js/main.js"></script>
</body>
</html>