Skip to main content
Choose the best Speech-to-Text (STT) provider for your AI Voice Agents.

What are STT Providers?

STT providers convert spoken audio into text:
  • Real-time transcription - Convert speech as it happens
  • Multiple languages - Support for various languages
  • High accuracy - State-of-the-art transcription models
  • Streaming support - Low-latency processing

Available Providers

Choosing a Provider

Deepgram

Best for:
  • High accuracy requirements
  • Real-time streaming
  • Multiple language support
  • Fast response times
Requirements:
  • DEEPGRAM_API_KEY environment variable
Learn more →

Google Cloud STT

Best for:
  • Google Cloud ecosystem integration
  • High accuracy
  • Enterprise deployments
  • Vertex AI integration
Requirements:
  • GOOGLE_STT_API_KEY OR GOOGLE_STT_CREDENTIALS environment variable
Learn more →

Configuration

Basic Configuration

from kuralit.server.agent_session import AgentSession

# Using Deepgram
agent = AgentSession(
    stt="deepgram/nova-2:en-US",
    # ...
)

# Using Google Cloud STT
agent = AgentSession(
    stt="google/en-US",
    # ...
)

Next Steps