voice_agent

Add an AI participant that listens to room audio and responds with synthesized speech.

Provider               provider value    Docs
OpenAI Realtime        openai_realtime   Speech-to-speech over WebSocket
Google Gemini Live     gemini_live       Full-duplex audio
Cloudflare Workers AI  cloudflare        Your own Cloudflare voice agent Worker

{
  "name": "bot",
  "type": "voice_agent",
  "inputs": ["mix_audio"],
  "config": {
    "provider": "openai_realtime",
    "instructions": "You are a helpful live show co-host.",
    "greeting": "Hello everyone!",
    "config": {
      "apiKey": "<openai-api-key>",
      "voice": "alloy"
    }
  }
}

Field         Description
provider      Provider id (see table)
instructions  System prompt (max 8000 chars)
greeting      Optional first utterance (max 2000 chars)
config        Provider-specific credentials and options
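
The limits in the field table can be checked on the client before a node definition is submitted. The following is a minimal sketch, not part of the product: the function name validate_voice_agent is hypothetical, and it assumes that instructions and greeting are only length-checked when present, since the table marks only greeting as optional and does not say which fields are required.

```python
from typing import Any

# Provider ids copied from the provider table.
PROVIDERS = {"openai_realtime", "gemini_live", "cloudflare"}
MAX_INSTRUCTIONS = 8000  # chars, per the field table
MAX_GREETING = 2000      # chars, per the field table


def validate_voice_agent(node: dict[str, Any]) -> list[str]:
    """Return a list of problems found in a voice_agent node (empty if none).

    Hypothetical helper: the checks mirror the documented limits only.
    """
    errors: list[str] = []
    if node.get("type") != "voice_agent":
        errors.append("type must be 'voice_agent'")

    cfg = node.get("config")
    if not isinstance(cfg, dict):
        errors.append("config must be an object")
        return errors

    provider = cfg.get("provider")
    if provider not in PROVIDERS:
        errors.append(f"unknown provider: {provider!r}")

    # Assumption: length limits apply only when the field is present.
    instructions = cfg.get("instructions")
    if isinstance(instructions, str) and len(instructions) > MAX_INSTRUCTIONS:
        errors.append(f"instructions exceeds {MAX_INSTRUCTIONS} chars")

    greeting = cfg.get("greeting")
    if isinstance(greeting, str) and len(greeting) > MAX_GREETING:
        errors.append(f"greeting exceeds {MAX_GREETING} chars")

    # The nested config holds provider-specific credentials and options.
    inner = cfg.get("config")
    if inner is not None and not isinstance(inner, dict):
        errors.append("config.config must be an object")

    return errors
```

Passing the example node shown earlier yields an empty list; a greeting longer than 2000 characters or a provider id outside the table yields one message per problem.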

Pricing: $0.025/min, charged while the agent is actively processing audio. See Node pricing.
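
For budgeting, the per-minute rate can be turned into a rough estimate. This sketch assumes the charge scales linearly with active processing time and applies no rounding or minimum increment; the page does not state the billing granularity, and the function name estimate_cost is hypothetical.

```python
from decimal import Decimal

# Rate taken from the pricing line: $0.025 per active minute.
RATE_PER_MINUTE = Decimal("0.025")


def estimate_cost(active_seconds: int) -> Decimal:
    """Estimate the charge in USD for the given active processing time.

    Assumption: linear billing with no rounding; actual invoicing may differ.
    """
    if active_seconds < 0:
        raise ValueError("active_seconds must be non-negative")
    return RATE_PER_MINUTE * Decimal(active_seconds) / Decimal(60)
```

Under these assumptions, a 90-minute show in which the agent is active the whole time (5400 seconds) comes to $2.25.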