ModelsLab/Speech To Text
speech-to-textSpeech-to-Text transforms audio into written transcription, allowing spoken language to be converted into text for various applications.
API Endpoint URL
Base URL for all API requests to this endpoint.
https://modelslab.com/api/v6/voice/speech_to_text
API Authentication
Authentication requires a valid API key included in the request. Generate and manage your API keys from your developer dashboard. Include the key in the key parameter for all API requests.
Integration Examples
Production-ready code samples for API integration
{"init_audio": "https://pub-f3505056e06f40d6990886c8e14102b2.r2.dev/audio/tom_hanks_1.wav","language": "en","timestamp_level": null," webhook": null," track_id": null,"key": "YOUR_API_KEY"}
SDKs
Official SDKs
Production-ready SDKs and client libraries for all major programming languages
API Parameters
Technical specifications for API request parameters.
| Field Name | Parameter | Technical Description |
|---|---|---|
| Init Audio | init_audio | The URL of the audio file to be transcribed. Supported formats: WAV, MP3,FLAC, OPUS. min: 5 seconds - max: 1 hour. |
| language | language | The language code of the audio content in ISO 639-1 format (e.g. 'en' for English, 'es' for Spanish). |
| timestamp_level | timestamp_level | Level of detail for timestamps in transcription |
| webhook | webhook | URL to receive POST notification upon completion |
| track_id | track_id | ID for webhook identification |