Token pricing for each tool
Approximate token cost for Translate Audio, Translate Video, Add Captions, AI Chat and other tools.
Written By Umakhan Magomedov
Last updated About 21 hours ago
This page lists the approximate token cost for each AI tool in VocaLingo. The app always shows an estimate before you start, so you can verify the cost for your specific input.
ℹ️ All prices are estimates. Actual charges may vary slightly based on final processing results.
Translate Video
Add Captions
Learn more: How Add Captions works.
Translate Audio
Translate Audio is charged per pipeline step: speech recognition, translation, and voiceover. Voiceover is optional and runs only when you tap play on the Translation tab. Full settings guide: Translate Audio settings.
Speech recognition
Example: 60-second voice message with ElevenLabs Scribe ≈ 0.8 tokens.
Translation
The first upload always uses Gemini 3. The model selected in Settings applies to re-translations only.
Voiceover (TTS)
Example: 30-second voiced translation with ElevenLabs ≈ 0.3 tokens. Same text with HeyGen ≈ 55 tokens.
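For a rough sense of how the per-step charges add up, the examples above can be turned into a small calculation. This is only a sketch: the per-second rates are back-calculated from the examples on this page and assume cost scales linearly with duration, which VocaLingo does not state. The function name, its parameters, and the `translation_tokens` placeholder are hypothetical; the in-app estimate remains the number to trust.

```python
# Illustrative sketch only, not official VocaLingo pricing.
# Rates are derived from the examples on this page, assuming linear scaling.
SCRIBE_TOKENS_PER_SEC = 0.8 / 60  # 60 s with ElevenLabs Scribe ≈ 0.8 tokens

TTS_TOKENS_PER_SEC = {
    "elevenlabs": 0.3 / 30,  # 30 s voiced translation ≈ 0.3 tokens
    "heygen": 55 / 30,       # same text with HeyGen ≈ 55 tokens
}


def estimate_translate_audio(audio_sec, translation_tokens=0.0,
                             voiceover_sec=0.0, voice_provider="elevenlabs"):
    """Sum the three pipeline steps of a Translate Audio run.

    translation_tokens is a placeholder: pass the translation cost
    shown in the app for your text.
    voiceover_sec stays 0 unless you play the voiceover.
    """
    recognition = audio_sec * SCRIBE_TOKENS_PER_SEC
    voiceover = voiceover_sec * TTS_TOKENS_PER_SEC[voice_provider]
    return round(recognition + translation_tokens + voiceover, 2)


if __name__ == "__main__":
    # Recognition only, then with an optional 30 s voiceover per provider.
    print(estimate_translate_audio(60))
    print(estimate_translate_audio(60, voiceover_sec=30))
    print(estimate_translate_audio(60, voiceover_sec=30, voice_provider="heygen"))
```

Under these assumptions the voiceover provider dominates the total: the same 30-second voiceover costs far more with HeyGen than with ElevenLabs, while speech recognition stays under one token for a one-minute message.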
Other tools
Speech to Text, Video to Text, Text Analysis, Text to Speech, Tourist Translator, Text Translator, AI Calls and AI Chat show a token estimate before you start. Cost depends on duration, selected model, text length and optional summaries.