Shunya Labs Plans

Flexible pricing for every scenario

Pay as you go

Free

$200

of Credit

Then pay-as-you-go. No minimums. No expiration. No credit card required.

  • Industry leading speech to text foundation models
  • Advanced intelligence features
  • Custom voice agent orchestrations

Volume

$500

Prepaid credits for the year with up to 10% lower rates on all services. Credits are redeemed against actual usage.

  • Industry leading speech to text foundation models
  • Advanced intelligence features
  • Custom voice agent orchestrations

Enterprise

Custom pricing

For businesses with large volumes, data or deployment requirements, or support needs.

  • Access all models with our best discounts
  • Access to custom-trained speech-to-text models and intelligence features
  • Highest concurrency support
  • Self-hosted deployment options
  • Dedicated SLAs and support

Voice Agents

Calculate your per minute cost for voice agents.

Speech to text

LLM

Text to Speech

Estimated cost/minute

$0.0154
Speech to text
$ 0.0045
LLM
$ 0.0034
Text to Speech
$ 0.0075

Estimate your monthly custom plan for voice agents:

$ 18.48
10500
Hours required per month
Add $18 to wallet

Speech to Text

Industry-best speech to text foundation models for superior performance.

Model

Pay as you go(USD/min)

Volume(USD/min)

Zero STT

Supports 200+ languages

$0.0039
$0.0035

Zero STT Indic

Superior accuracy for Indic languages

$0.0045
$0.0040

Zero STT Codeswitch

Native codeswitch model for multilingual speech

$0.0050
$0.0045

Zero STT Med

Specialised model for healthcare transcriptions

$0.0050
$0.0045

Zero STT Numerical

Specialised model for transcripts containing numerical values

$0.0050
$0.0045

Audio Processing

Get better transcripts with cleaner audio.

Product

Pay as you go(USD/min)

Volume(USD/min)

Denoiser

$0.0039
$0.0034

Enhancer

$0.0039
$0.0034

Speech Intelligence Features

Get analytics directly from speech and formatted outputs for integration into your workflows.

Feature

Pay as you go(USD/min)

Volume(USD/min)

Language Identification

Automatically detect the language in your audio files

$0.0001
$0.00009

Translation

Translate audio during or after transcription

$0.0003
$0.00027

Transliteration

Convert output to your preferred script

$0.0003
$0.00027

Speaker Diarization

Separate transcripts by speaker automatically

$0.0012
$0.00100

Speaker Identification

Customize speaker labels for personalized transcripts

$0.0009
$0.00080

Word Timestamps

Word-level timing for precise navigation

$0.0012
$0.00100

Profanity and Keyword Hashing

Filter and mask profanity or custom keywords

$0.0003
$0.00027

Intent Detection

Understand the purpose behind every conversation

$0.0003
$0.00027

Sentiment Analysis

Track emotional tone across interactions

$0.0003
$0.00027

Emotion Diarization

Get granular emotion tracking throughout conversations

$0.0005
$0.00045

Summarization

Generate concise summaries from audio or text

$0.0003
$0.00027

Keyword Normalization

Standardize brand names, acronyms, and custom terminology

$0.0003
$0.00027

Medical Keyterm Correction

Ensure accurate transcription of medical terminology

$0.0003
$0.00027

Frequently Asked Questions