ModelsLab/Voice Cover
The Voice Cover endpoint transforms a song or audio file into the voice of a celebrity, fictional character, singer, or politician, selected by that character's model ID.
API Endpoint URL
Base URL for all API requests to this endpoint.
https://modelslab.com/api/v6/voice/voice_cover
API Authentication
Authentication requires a valid API key included in the request. Generate and manage your API keys from your developer dashboard. Include the key in the key parameter for all API requests.
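As a minimal sketch of this rule, the helper below reads the key from an environment variable and adds it to a request body under the `key` parameter. The variable name `MODELSLAB_API_KEY` and the helper name `with_api_key` are assumptions made for illustration; they are not part of the API.

```python
import os


def with_api_key(payload: dict, env_var: str = "MODELSLAB_API_KEY") -> dict:
    """Return a copy of payload with the API key set under "key".

    The environment variable name is a hypothetical choice; keep the key
    wherever your deployment stores secrets.
    """
    api_key = os.environ.get(env_var)
    if not api_key:
        raise RuntimeError(f"Set the {env_var} environment variable first")
    # Build a new dict so the caller's payload is not mutated.
    return {**payload, "key": api_key}
```

Keeping the key out of source code this way means the same request-building code can run unchanged across environments.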
Integration Examples
Production-ready code samples for API integration
{"init_audio": "https://www.youtube.com/watch?v=ixkoVwKQaJg", "model_id": "voice-cover", "language": "english", "pitch": "none", "base64": "false", "algorithm": "rmvpe", "rate": "0.5", "emotion": "neutral", "radius": "3", "speed": "1", "mix": "0.25", "originality": "0.33", "lead_voice_volume_delta": "+1", "backup_voice_volume_delta": "-2", "instrument_volume_delta": "+2", "reverb_size": "0.15", "wetness": "0.2", "damping": "0.7", "webhook": null, "track_id": null, "key": "YOUR_API_KEY"}
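The request body can be sent with Python's standard library alone. The sketch below assumes the endpoint accepts an HTTP POST with a JSON body and a `Content-Type: application/json` header; neither is stated in this section. The helper names `build_request` and `submit` are illustrative, and because the response schema is not described here, the decoded JSON is returned unchanged.

```python
import json
import urllib.request

VOICE_COVER_URL = "https://modelslab.com/api/v6/voice/voice_cover"

# Same fields as the example body; replace YOUR_API_KEY before sending.
EXAMPLE_PAYLOAD = {
    "init_audio": "https://www.youtube.com/watch?v=ixkoVwKQaJg",
    "model_id": "voice-cover",
    "language": "english",
    "pitch": "none",
    "base64": "false",
    "algorithm": "rmvpe",
    "rate": "0.5",
    "emotion": "neutral",
    "radius": "3",
    "speed": "1",
    "mix": "0.25",
    "originality": "0.33",
    "lead_voice_volume_delta": "+1",
    "backup_voice_volume_delta": "-2",
    "instrument_volume_delta": "+2",
    "reverb_size": "0.15",
    "wetness": "0.2",
    "damping": "0.7",
    "webhook": None,
    "track_id": None,
    "key": "YOUR_API_KEY",
}


def build_request(payload: dict) -> urllib.request.Request:
    """Wrap the payload in a POST request to the Voice Cover endpoint."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        VOICE_COVER_URL,
        data=body,
        headers={"Content-Type": "application/json"},  # assumed header
        method="POST",  # assumed HTTP method
    )


def submit(payload: dict, timeout: float = 60.0) -> dict:
    """Send the request and return the decoded JSON response as-is."""
    with urllib.request.urlopen(build_request(payload), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `submit(EXAMPLE_PAYLOAD)` performs the network request; `build_request` is kept separate so the body can be inspected or logged without sending anything.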
API Parameters
Technical specifications for API request parameters.
| Field Name | Parameter | Technical Description |
|---|---|---|
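| Init Audio | init_audio | Input audio to transform: a URL (as in the example request) or, when base64 is true, base64-encoded audio |
| Model ID | model_id | ID of the voice model for the target character |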
| language | language | Language of the voice. Options include arabic, brazilian portuguese, chinese, dutch, french, hindi, hungarian, italian, japanese, korean, polish, russian, turkish. Default is english. |
| pitch | pitch | Use none when both the singer in the provided audio and the model ID are male. Use m2f when the provided audio is male and the model ID is female. Use f2m when the provided audio is female and the model ID is male. |
| base64 | base64 | Indicates if the input sound clip is in base64 format. Defaults to false. |
| algorithm | algorithm | Algorithm used for voice cloning |
| rate | rate | Controls generated voice resemblance to training data |
| emotion | emotion | Emotion of the voice |
| radius | radius | Median filtering length to reduce voice artifacts |
| speed | speed | Playback speed of the speaker |
| mix | mix | Controls loudness similarity to original audio |
| originality | originality | Controls similarity to original vocals' voiceless consonants |
| lead_voice_volume_delta | lead_voice_volume_delta | Adjusts volume of lead vocals (-5 to +5) |
| backup_voice_volume_delta | backup_voice_volume_delta | Adjusts volume of backup vocals (-5 to +5) |
| instrument_volume_delta | instrument_volume_delta | Adjusts volume of instrumental tracks (-5 to +5) |
| reverb_size | reverb_size | Size of the reverb room |
| wetness | wetness | Reverb applied to original vocals |
| damping | damping | Damping factor for high frequencies in reverb |
| webhook | webhook | URL to receive POST notification upon completion |
| track_id | track_id | ID for webhook identification |
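Two rows in the table carry rules that are easy to get wrong by hand: the `pitch` value depends on the source and model voices, and the three volume deltas are limited to -5 to +5. The helpers below are a sketch of those rules, not part of any official SDK. The function names are hypothetical, the signed string format ("+1", "-2") is inferred from the example request, and whole-number deltas are an assumption. The table does not say which `pitch` value applies when both voices are female, so that case raises an error instead of guessing.

```python
def choose_pitch(source_voice: str, model_voice: str) -> str:
    """Map source and model voice genders to the documented pitch value."""
    documented = {
        ("male", "male"): "none",
        ("male", "female"): "m2f",
        ("female", "male"): "f2m",
    }
    try:
        return documented[(source_voice, model_voice)]
    except KeyError:
        # The parameter table gives no value for other combinations.
        raise ValueError(
            f"No documented pitch value for {source_voice!r} -> {model_voice!r}"
        ) from None


def format_volume_delta(delta: int) -> str:
    """Check the -5..+5 range and format the delta with an explicit sign."""
    if not -5 <= delta <= 5:
        raise ValueError(f"volume delta must be between -5 and +5, got {delta}")
    # Explicit sign mirrors the example request ("+1", "-2"); assumed format.
    return f"{delta:+d}"
```

For example, `choose_pitch("male", "female")` yields the value for a male recording paired with a female model, and `format_volume_delta(-2)` produces a string suitable for `backup_voice_volume_delta`.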