Akatsuki(Team name)

SignSense(Project name)

Vihan 9.0 Project

This project explores a different approach: flex sensors on a glove, capturing finger bend angles directly, combined with an on-device LSTM model and a streaming inference pipeline that translates gestures into speech in real time.

Real-Time Sign Language to Speech (Arduino + Python)

This project converts live Arduino glove sensor data into:

Gesture -> Text -> Speech

Sensor Input Format

Arduino sends one CSV line per sample at 115200 baud:

timestamp,f0,f1,f2,f3,f4

f0-f4: flex sensors

Project Structure

hardware/
├── data/
├── models/
├── collect_data.py
├── train_model.py
├── realtime_predict.py
├── tts.py
├── utils.py
├── api.py
├── automate.py
├── gesture_ui.py
└── requirements.txt

1) Install Dependencies

pip install -r requirements.txt

2) Collect Gesture Data

Collect 40 samples (~2 sec) per gesture capture.

python collect_data.py --port COM3 --baudrate 115200 --samples 40

Port can be auto-detected now, so this also works:

python collect_data.py --baudrate 115200 --samples 40

You will be prompted for a gesture label repeatedly. Data is appended to:

data/.csv

3) Train Model

Uses sliding windows with:

window size = 40
step size = 20

python train_model.py --data-dir data --model-dir models --window-size 40 --step-size 20

Outputs:

models/gesture_model.pkl
models/label_encoder.pkl

4) Real-Time Prediction + Speech

python realtime_predict.py --port COM3 --baudrate 115200

Auto-detect port version:

python realtime_predict.py --baudrate 115200

Features included:

rolling buffer of 40 samples
smoothing with majority vote of last 5 predictions
confidence display (if model supports predict_proba)
idle/noise filtering
offline TTS with debounce (prevents repeating same word continuously)

Optional flags:

python realtime_predict.py --port COM3 --disable-tts
python realtime_predict.py --port COM3 --min-confidence 0.55

5) Optional FastAPI Backend

Run API server:

uvicorn api:app --reload --host 0.0.0.0 --port 8000

Predict endpoint:

POST /predict
Body: { "window": [[...11 values...], ...] }

Example request body shape should match your training window size (default 40x11).

6) Automation Script (Single Entry Point)

Use one launcher for all actions:

python automate.py collect --baudrate 115200 --samples 40
python automate.py train --data-dir data --model-dir models
python automate.py predict --baudrate 115200
python automate.py api --reload
python automate.py ui
python automate.py runtime-ui

7) Gesture UI (Letters, Numbers, Space, Custom)

Launch UI:

python automate.py ui

or:

streamlit run gesture_ui.py

UI features:

Auto/manual COM port selection
Add gestures using presets (A-Z, 0-9, SPACE) or custom labels
Collect repeated samples with one click
Train model from collected dataset
Live prediction preview with optional speech

8) Continuous Runtime Display (New Page)

Start dedicated runtime page:

python automate.py runtime-ui

Open http://localhost:8502.

Features:

Continuous prediction (no fixed runtime)
Large word display
Large first-letter display
Optional speech output
Start/Stop controls

Notes for Best Accuracy

Collect multiple sessions per gesture.
Include an idle label to reduce false positives.
Keep sampling rate consistent between collection and prediction.
Use balanced data across gesture classes.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
api.py		api.py
automate.py		automate.py
collect_data.py		collect_data.py
connection.py		connection.py
gesture_ui.py		gesture_ui.py
realtime_predict.py		realtime_predict.py
requirements.txt		requirements.txt
runtime_ui.py		runtime_ui.py
simulate_train.py		simulate_train.py
train_model.py		train_model.py
tts.py		tts.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Akatsuki(Team name)

SignSense(Project name)

Real-Time Sign Language to Speech (Arduino + Python)

Sensor Input Format

Project Structure

1) Install Dependencies

2) Collect Gesture Data

3) Train Model

4) Real-Time Prediction + Speech

5) Optional FastAPI Backend

6) Automation Script (Single Entry Point)

7) Gesture UI (Letters, Numbers, Space, Custom)

8) Continuous Runtime Display (New Page)

Notes for Best Accuracy

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Akatsuki(Team name)

SignSense(Project name)

Real-Time Sign Language to Speech (Arduino + Python)

Sensor Input Format

Project Structure

1) Install Dependencies

2) Collect Gesture Data

3) Train Model

4) Real-Time Prediction + Speech

5) Optional FastAPI Backend

6) Automation Script (Single Entry Point)

7) Gesture UI (Letters, Numbers, Space, Custom)

8) Continuous Runtime Display (New Page)

Notes for Best Accuracy

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages