Streaming on-device speech recognition for Android — NEON-accelerated, encrypted FastConformer (32M params), ~150 ms latency, no cloud. Powered by the VoxRT runtime.
-
Updated
Jun 4, 2026 - Kotlin
Streaming on-device speech recognition for Android — NEON-accelerated, encrypted FastConformer (32M params), ~150 ms latency, no cloud. Powered by the VoxRT runtime.
Streaming on-device speech recognition for iOS — NEON-accelerated, encrypted FastConformer (32M params), RTF 0.08–0.10 on iPhone 13 Pro Max. Built on the VoxRT custom Rust inference runtime. SwiftPM distribution.
Pure Rust implementation of NVIDIA's Parakeet-TDT-0.6B-v3 ASR model (25 languages) using Candle, targeting Apple Silicon
Train the full version of the FastConformer model.
OpenVINO INT8 ASR and speech-to-text model for Intel GPU: NVIDIA NeMo Parakeet TDT-CTC 110M converted from ONNX to OpenVINO IR.
Stream on-device speech recognition on Android using the custom VoxRT inference runtime with NeMo FastConformer support.
Perform on-device streaming speech recognition on iOS using the high-performance VoxRT inference runtime with custom NEON-accelerated kernels.
Add a description, image, and links to the fastconformer topic page so that developers can more easily learn about it.
To associate your repository with the fastconformer topic, visit your repo's landing page and select "manage topics."