On device speech-to-text for real-time transcription

Run fast, accurate speech-to-text directly on your devices with Shunya Labs’ ONNX-based English ASR model

Operates without network connectivity or in low-bandwidth environments

Optimized edge architectures for resource-constrained devices

Portable models deploy seamlessly across iOS, Android, Linux, and embedded systems

Fast local processing without round-trip delays

Ideal for healthcare, automotive, mobile apps, and privacy-critical use cases.

Small enough to fit on edge devices and existing servers without a hardware refresh

Transcribes as people speak, with sub-100 ms latency for partials

Transcribes as people speak, with sub-100 ms latency for partials

Trained on high entropy data to achieve industry best 3.10% WER