Voice AI — Switchboard Audio SDK | Switchboard Audio SDK
Open navigation menu

Switchboard for Voice AI

Open source tools and examples for voice AI developers.

Speech Recognition Text to Speech LLM Integration Echo Cancellation Voice Activity Detection Noise Suppression Turn Detection Speaker Isolation Speech Recognition Text to Speech LLM Integration Echo Cancellation Voice Activity Detection Noise Suppression Turn Detection Speaker Isolation

Built to address real-world constraints

Switchboard for Voice AI uses a hybrid on-device + cloud architecture to help you get the best of both worlds. On-device processing with hand-off to cloud only when necessary.

Reduce costs

Process audio on-device to minimize expensive API calls and bandwidth usage.

Improve latency

Local processing eliminates network round-trips for near-instant voice interactions.

Enhance privacy

Keep sensitive audio data on-device and send only processed text to the cloud.

Enable offline

Build voice AI features that work without an internet connection.

Hybrid on-device + cloud Voice AI architecture

Open source repositories

Production-ready examples and reusable components to accelerate your voice AI development

EdgeSpeech

React Native

On device speech recognition (ASR / STT) and text-to-speech (TTS) so that you can cut costs and latency while simplifying cloud infra. You only send text to the LLM so don't have to worry about webRTC, sockets, or scaling audio in the cloud.

STT (local)LLM (cloud)TTS (local)

EdgeAudio

Swift, Kotlin

On-device preprocessing for speech to speech models (aka S2S or audio models). On device voice activity detection (VAD), echo cancellation, and other audio preprocessing runs locally before connecting to cloud-based speech model (such as OpenAI Realtime API) to optimize performance.

VADEcho CancellationSpecific Speaker RecognitionOpenAI Realtime API

EdgeWhisper

iOS, Android, macOS, Windows, Linux

Run OpenAI's Whisper speech recognition (ASR) model entirely on-device for maximum privacy and offline functionality across mobile and desktop platforms (iOS, Android, mac, Windows, Linux).

Whisper (local ASR)

EdgeAgent

React Native

Run a full STT-LLM-TTS pipeline locally. The STT and LLM components each have optional hand-off (or fallback) to cloud alternatives.

STT (local)LLM (local)TTS (local)

EmbeddedVoice

Linux (& custom upon request)

Optimized voice AI components for resource-constrained IoT and embedded systems, including smart speakers, wearables, and edge devices.

EmbeddedIoTEdge Computing

The repos on this page were built with Switchboard. They stand alone, but you can also explore the lower level Switchboard repositories here.

Need to launch faster or build something custom?

Our team offers expert consulting and forward-deployed engineers to help you design, build, and ship exactly what you need on your timeline.

All of the above can be made available on any platform.

Get in touch

Have a question or want to work together? We'd love to hear from you.

Contact us

Additional resources

Guides, tutorials, and best practices for building production voice AI applications.

Explore

Learn how companies are implementing local speech-to-text to reduce costs and improve privacy.

Explore

Explore real-world implementations of conversational AI agents with voice interfaces.

Explore