🎉 New Year Sale: Get 20% OFF on all plans — Use code NEWYEAR2026.

Upgrade now
Speech to Text thumbnail

ModelsLab/Speech To Text

speech-to-text
Speech-to-Text transforms audio into written transcription, allowing spoken language to be converted into text for various applications.
API PlaygroundAPI Documentation

API Endpoint URL

Base URL for all API requests to this endpoint.

https://modelslab.com/api/v6/voice/speech_to_text

API Authentication

Authentication requires a valid API key included in the request. Generate and manage your API keys from your developer dashboard. Include the key in the key parameter for all API requests.

Integration Examples

Production-ready code samples for API integration

{
"init_audio": "https://pub-f3505056e06f40d6990886c8e14102b2.r2.dev/audio/tom_hanks_1.wav",
"language": "en",
"timestamp_level": null,
" webhook": null,
" track_id": null,
"key": "YOUR_API_KEY"
}

SDKs

Official SDKs

Production-ready SDKs and client libraries for all major programming languages

API Parameters

Technical specifications for API request parameters.

Field NameParameterTechnical Description
Init Audioinit_audioThe URL of the audio file to be transcribed. Supported formats: WAV, MP3,FLAC, OPUS. min: 5 seconds - max: 1 hour.
languagelanguageThe language code of the audio content in ISO 639-1 format (e.g. 'en' for English, 'es' for Spanish).
timestamp_leveltimestamp_levelLevel of detail for timestamps in transcription
webhook webhookURL to receive POST notification upon completion
track_id track_idID for webhook identification