Switchboard for voice AI

Open source tools and examples for voice AI developers.

Built to address real-world constraints

Switchboard for Voice AI uses a hybrid on-device + cloud architecture to help you get the best of both worlds. On-device processing with hand-off to cloud only when necessary.

Reduce costs
Process audio on-device to minimize expensive API calls and bandwidth usage.
Improve latency
Local processing eliminates network round-trips for near-instant voice interactions.
Enhance privacy
Keep sensitive audio data on-device and send only processed text to the cloud.
Enable offline
Build voice AI features that work without an internet connection.

Learn more

Open source repositories

Production-ready examples and reusable components to accelerate your voice AI development

EdgeSpeech

React Native

On device speech recognition (ASR / STT) and text-to-speech (TTS) so that you can cut costs and latency while simplifying cloud infra. You only send text to the LLM so don't have to worry about webRTC, sockets, or scaling audio in the cloud.

^{STT (local)}^•^{LLM (cloud)}^•^{TTS (local)}

EdgeAudio

Swift, Kotlin

On-device preprocessing for speech to speech models (aka S2S or audio models). On device voice activity detection (VAD), echo cancellation, and other audio preprocessing runs locally before connecting to cloud-based speech model (such as OpenAI Realtime API) to optimize performance.

^VAD^•^{Echo Cancellation}^•^{Specific Speaker Recognition}^•^{OpenAI Realtime API}

EdgeWhisper

iOS, Android, macOS, Windows, Linux

Run OpenAI's Whisper speech recognition (ASR) model entirely on-device for maximum privacy and offline functionality across mobile and desktop platforms (iOS, Android, mac, Windows, Linux).

^{Whisper (local ASR)}

EdgeAgent

React Native

Run a full STT-LLM-TTS pipeline locally. The STT and LLM components each have optional hand-off (or fallback) to cloud alternatives.

^{STT (local)}^•^{LLM (local)}^•^{TTS (local)}

EmbeddedVoice

Linux (& custom upon request)

Optimized voice AI components for resource-constrained IoT and embedded systems, including smart speakers, wearables, and edge devices.

^Embedded^•^IoT^•^{Edge Computing}

Need to launch faster or build something custom?

Our team offers expert consulting and forward-deployed engineers to help you design, build, and ship exactly what you need on your timeline.

All of the above can be made available on any platform.

Switchboard for voice AI

Reduce costs

Improve latency

Enhance privacy

Enable offline

Open source repositories

EdgeSpeech

React Native

EdgeAudio

Swift, Kotlin

EdgeWhisper

iOS, Android, macOS, Windows, Linux

EdgeAgent

React Native

EmbeddedVoice

Linux (& custom upon request)

Need to launch faster or build something custom?

Get in touch

Additional resources

Voice AI resource hub

On-device speech recognition

Building voice AI agents