The Phone channel turns a conversation flow into a real voice agent. Inbound calls land in the flow; outbound calls let you dial out from the editor or a workflow. The actor speaks via TTS; the user speaks naturally; the conversation runs the same way it would in chat.Documentation Index
Fetch the complete documentation index at: https://docs.cogniagent.ai/llms.txt
Use this file to discover all available pages before exploring further.
When to use Phone
- A receptionist agent that answers inbound calls and routes to the right specialist.
- An outbound sales-development flow that places follow-up calls.
- Voice support for users who prefer talking over typing.
- Multi-leg flows (warm transfer to a human, then back to the AI).
Before you start
Phone uses Twilio for the carrier and self-hosted LiveKit SIP for the voice layer. You’ll need:- A Twilio account (or use the shared CogniAgent test number — workspaces get one free claim, time-limited).
- A workspace phone number added in Settings → Phone Numbers. Either bring your own Twilio number (BYOC) or claim the shared test number.
Adding the Phone channel
Choose voice and language
| Field | Notes |
|---|---|
| Voice | TTS voice (e.g. alloy). Test a few to find one that fits your brand. |
| Language | Locale code (en-US, en-GB, etc.). |
Inbound: how a call lands in the flow
The caller hears the first actor speak through TTS. They reply by voice. Speech-to-text runs locally; the actor sees text turns and replies normally. The text is spoken back via TTS. Most flows feel natural after the first few minutes of tuning the agent’s instructions for spoken cadence (shorter sentences, fewer enumerations).Outbound: placing a call
In Initiator mode:Set initial parameters
Anything the first actor needs —
customer_name, account_id, reason_for_call, etc.Hanging up
Voice calls need an explicit “end call” signal that text channels don’t. Phone-bound flows expose a special hang-up tool that the LLM can call when the conversation is genuinely done. In the actor’s instructions:Speech quality tips
Limits in v1
- No call recording. Phone audio isn’t stored. Transcripts (text turns) are stored normally.
- No DTMF input as a first-class flow trigger. If the user presses keys, the actor doesn’t see them.
- No call transfer to another phone number. Hand-off is within the flow only.
- One phone number per flow. You can have multiple workspace phone numbers, but each binds to one flow.
Next
Deploy a flow
Schedule the phone listener.
Start a conversation
Place an outbound call.
