Name	Name	Last commit message	Last commit date
parent directory ..
.env.example	.env.example
.gitignore	.gitignore
README.md	README.md
config.py	config.py
main.py	main.py
prompts.py	prompts.py
requirements.txt	requirements.txt
voice_agent.py	voice_agent.py
voice_session.py	voice_session.py

Sayna Python Example — SIP Webhook + Gemini Voice Agent

A minimal FastAPI service demonstrating the SIP webhook flow: a phone call arrives at Sayna, Sayna forwards a signed webhook here, and a background voice agent (Deepgram STT → Gemini stream → ElevenLabs TTS) holds a conversation with the caller until they hang up.

Architecture

┌──────────┐       ┌──────────┐  signed webhook   ┌──────────────────┐
│  Caller  │──SIP─>│  Sayna   │──────────────────>│  POST /sayna/    │
└──────────┘       │  Server  │                   │     webhook      │
                   │          │                   │                  │
                   │          │<──WebSocket──┐    │   (background)   │
                   └────┬─────┘    audio     │    └────────┬─────────┘
                        │                    │             │
                        ▼                    │             ▼
                   ┌──────────┐              │       ┌──────────────┐
                   │ LiveKit  │              └───────│  SaynaClient │
                   │   room   │                      │  + Gemini    │
                   └──────────┘                      └──────────────┘

Flow:

SIP call hits the Sayna server, which routes it into a LiveKit room.
Sayna posts a signed webhook to POST /sayna/webhook.
The route verifies the HMAC signature with WebhookReceiver and dispatches a background task.
The background VoiceSession opens a WebSocket to Sayna, joins the same LiveKit room as ai-agent, and speaks the greeting.
Each final STT transcript is fed to the VoiceAgent, which streams a Gemini response.
The agent yields one sentence at a time; each sentence is sent to TTS immediately for low-latency speech.
When the caller hangs up, participant_disconnected fires, the session disconnects, and history is cleared.

Prerequisites

Python 3.10+
A Google AI API key for Gemini
Docker (for the local Sayna + LiveKit + LiveKit SIP stack)

Run the Sayna stack

The bundled ../docker-compose.yml brings up Sayna (port 3002), LiveKit (7880), LiveKit SIP (5063), and Redis. From the examples/ directory:

export DEEPGRAM_API_KEY=...
export ELEVENLABS_API_KEY=...
docker compose up

The sayna service is configured with ../sayna.example.yaml, which forwards SIP webhooks to http://localhost:5002/sayna/webhook — the route exposed by this example.

Run the example

cd python-sayna-example
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

cp .env.example .env
# edit .env and set GOOGLE_API_KEY

python main.py

The server listens on http://0.0.0.0:5002 by default. Place a SIP call to the running LiveKit SIP gateway and the agent will answer.

Environment Variables

Variable	Required	Default	Description
`GOOGLE_API_KEY`	yes	—	Google AI API key for Gemini
`SAYNA_URL`	no	`http://localhost:3001`	Sayna API base URL
`SAYNA_API_KEY`	no	`secret-key-1234567890`	Matches `auth.api_secrets[0].secret` in `sayna.example.yaml`
`SAYNA_WEBHOOK_SECRET`	no	`hook-secret-1234567890`	Matches `sip.hook_secret` in `sayna.example.yaml`
`ELEVENLABS_VOICE_ID`	no	`ZIlrSGI4jZqobxRKprJz`	ElevenLabs voice the agent speaks with
`PORT`	no	`5002`	FastAPI bind port — must match `sayna.example.yaml` `hooks[].url`

Endpoints

Method	Path	Purpose
`GET`	`/`	Liveness check
`POST`	`/sayna/webhook`	Receives signed SIP webhooks from Sayna

Project Structure

python-sayna-example/
├── main.py            # FastAPI app + the two routes
├── config.py          # Settings (defaults from sayna.example.yaml)
├── prompts.py         # Voice assistant system prompt + fallbacks
├── voice_agent.py     # Gemini streaming + sentence extraction (no Sayna imports)
├── voice_session.py   # SaynaClient lifecycle + STT → agent → TTS glue
├── requirements.txt
├── .env.example
└── .gitignore

voice_agent.py and voice_session.py are intentionally decoupled — the agent module has no Sayna dependency, so it can be exercised against any text-in/text-out harness.

../nestjs-ai-sdk-server/ — Node.js sibling demonstrating the browser/POST /start flow
../sayna.example.yaml — Sayna server config used by this example
sayna-client on PyPI — the Python SDK
Sayna docs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

Sayna Python Example — SIP Webhook + Gemini Voice Agent

Architecture

Prerequisites

Run the Sayna stack

Run the example

Environment Variables

Endpoints

Project Structure

Related

Uh oh!

FilesExpand file tree

python-sayna-example

Directory actions

More options

Directory actions

More options

Latest commit

History

python-sayna-example

Folders and files

parent directory

README.md

Sayna Python Example — SIP Webhook + Gemini Voice Agent

Architecture

Prerequisites

Run the Sayna stack

Run the example

Environment Variables

Endpoints

Project Structure

Related