Voice Agents quickstart
Get your API key
- Sign up at console.speechify.ai
- Go to API Keys
- Copy your default API key (or create a new one)
Install the SDK
The official Python and TypeScript SDKs auto-generate against the same OpenAPI spec — every method below is type-checked and version-pinned. Both read SPEECHIFY_API_KEY from the environment automatically.
Python
TypeScript
cURL
Create an agent
An agent bundles a prompt, a voice, and a default LLM. Voice IDs come from the regular /v1/voices catalog — anything that works for TTS works for Voice Agents, including your cloned voices.
Python
TypeScript
cURL
Start a conversation
POST /v1/agents/{id}/conversations provisions a realtime voice session, dispatches the agent, and returns a short-lived access token. The caller connects directly to the session with that token — audio never flows through our server.
Python
TypeScript
cURL
The response shape:
Connect from the browser
The upcoming @speechify/agents-js SDK handles the session connection, microphone capture, and audio playout in a single call — we’ll link it from here as soon as it publishes. Until then, the easiest path is the console Test Call button.
Test it from the console
The quickest path to hearing the agent without writing client code: open the agent in the console, click Test Call, and talk.
Inspecting conversations
Every turn is streamed to the control plane and persisted with timestamps.
Python
TypeScript
cURL
Next steps
Give the agent access to your backend, the caller’s device, or built-in actions like end_call.
Receive conversation.started, conversation.ended, message.created webhooks.
Use a custom voice with your agents.
Full schemas for /v1/agents, /v1/tools, /v1/conversations.