Basic Agent Creation
The simplest way to create an AI Voice Agent is usingAgentSession:
Agent Configuration
Required Components
- LLM - Large Language Model (the “brain”)
- STT - Speech-to-Text (for voice input)
- VAD - Voice Activity Detection (detects when user speaks)
- Turn Detection - Determines when user finishes speaking
Optional Components
- Tools - Functions the agent can call
- Instructions - Agent personality and behavior
- Name - Agent identifier
Configuration Methods
String-Based (Recommended)
Environment Variables
Agent with Instructions
Instructions define your AI Voice Agent’s personality and behavior:Agent with Tools
Add capabilities to your agent with tools:Complete Example
Next Steps
- Instructions → - Write effective instructions
- Context → - Understand conversation context
- Tools → - Add capabilities
- Python SDK → - Implementation details

