Clone an African voice in 60 seconds
VocaBusta's voice cloning is zero-shot — no training run, no waiting. Give it a short, clean reference clip and you get a voice you can synthesize with right away. Here's the whole loop.
1. Clone from a clip
Upload ~10–30 seconds of clear, single-speaker audio:
import os
import requests

# Keep the file open for the duration of the upload request.
with open("my-voice.wav", "rb") as f:
    res = requests.post(
        "https://api.satryx.ai/voice/clone",
        headers={"Authorization": f"Bearer {os.environ['SATRYX_API_KEY']}"},
        files={"file": f},
        data={"name": "Chidi", "description": "My narration voice"},
    )
res.raise_for_status()
voice_id = res.json()["voice_id"]  # cloned_…
The response also includes a preview_url, so you can hear the clone
immediately.
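Since the clip length matters, it can help to check it before uploading. The sketch below uses only Python's standard wave module to measure an uncompressed WAV file against the ~10–30 second guideline above. The helper names and the hard thresholds are illustrative assumptions, not part of the VocaBusta API, and the check says nothing about audio clarity or how many speakers are present.

```python
import wave


def clip_duration_seconds(path: str) -> float:
    """Return the duration of an uncompressed (PCM) WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_reference_length_ok(
    path: str, min_seconds: float = 10.0, max_seconds: float = 30.0
) -> bool:
    """Check a clip against the ~10-30 s guideline (thresholds are assumptions)."""
    return min_seconds <= clip_duration_seconds(path) <= max_seconds
```

Calling is_reference_length_ok("my-voice.wav") before the upload gives an early warning for clips that are too short or too long. Other formats such as MP3 would need a different reader.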
2. Speak with it
Pass the cloned_… id straight to text-to-speech:
audio = requests.post(
    "https://api.satryx.ai/voice/tts",
    headers={"Authorization": f"Bearer {os.environ['SATRYX_API_KEY']}"},
    json={"text": "Ndewo, this na my cloned voice.", "voice_id": voice_id},
)
audio.raise_for_status()  # avoid writing an error body to disk as audio
with open("clone.wav", "wb") as out:
    out.write(audio.content)
Dial exaggeration up for livelier delivery, or cfg_weight up to hew closer to
the reference timbre.
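One way to pass these controls is alongside text and voice_id in the request body. The sketch below builds such a body. It assumes exaggeration and cfg_weight are top-level JSON fields on the tts endpoint, which this guide does not confirm. The helper name, the placeholder voice id, and the numeric value are hypothetical, so check the API reference for the actual field placement and valid ranges.

```python
from typing import Any, Dict, Optional


def build_tts_payload(
    text: str,
    voice_id: str,
    exaggeration: Optional[float] = None,
    cfg_weight: Optional[float] = None,
) -> Dict[str, Any]:
    """Build a TTS request body, adding tuning fields only when they are set."""
    payload: Dict[str, Any] = {"text": text, "voice_id": voice_id}
    # Assumption: both controls are top-level JSON fields.
    if exaggeration is not None:
        payload["exaggeration"] = exaggeration
    if cfg_weight is not None:
        payload["cfg_weight"] = cfg_weight
    return payload


# "cloned_example" is a placeholder id and 0.7 is an illustrative value.
payload = build_tts_payload(
    "Ndewo, this na my cloned voice.", "cloned_example", exaggeration=0.7
)
```

Leaving a control unset keeps it out of the body, so the service's own default applies.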
3. Dub a video in that voice
Cloned voices drop straight into the dubbing pipeline. Map a speaker to your
clone in the voice_map when you render:
voice_map = {"SPEAKER_00": voice_id, "SPEAKER_01": "preserve"}
"preserve" keeps a speaker's own voice; your cloned_… id swaps in the clone.
See the Dubbing guide for the full analyze → translate → render flow.
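For videos with several speakers, the map can be built programmatically. This is a minimal sketch: it assumes you already have the list of speaker labels (for example from the analyze step), and the helper name and placeholder clone id are hypothetical. Every speaker without a clone falls back to "preserve".

```python
from typing import Dict, Iterable


def build_voice_map(speakers: Iterable[str], clones: Dict[str, str]) -> Dict[str, str]:
    """Map each speaker label to its clone id, or "preserve" if it has none."""
    return {speaker: clones.get(speaker, "preserve") for speaker in speakers}


# "cloned_example" stands in for a real cloned_… id.
voice_map = build_voice_map(
    ["SPEAKER_00", "SPEAKER_01"],
    {"SPEAKER_00": "cloned_example"},
)
```

Defaulting to "preserve" means a newly detected speaker keeps their own voice unless you explicitly assign a clone.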
A note on consent
Only clone voices you own or have explicit permission to clone. Cloning is powerful; use it responsibly.