Skip to main content

Clone an African voice in 60 seconds

· 2 min read
VocaBusta
VocaBusta Team

VocaBusta's voice cloning is zero-shot — no training run, no waiting. Give it a short, clean reference clip and you get a voice you can synthesize with right away. Here's the whole loop.

1. Clone from a clip

Upload ~10–30 seconds of clear, single-speaker audio:

import os, requests

with open("my-voice.wav", "rb") as f:
res = requests.post(
"https://api.satryx.ai/voice/clone",
headers={"Authorization": f"Bearer {os.environ['SATRYX_API_KEY']}"},
files={"file": f},
data={"name": "Chidi", "description": "My narration voice"},
)
res.raise_for_status()
voice_id = res.json()["voice_id"] # cloned_…

The response includes an instant preview_url so you can hear the clone immediately.

2. Speak with it

Pass the cloned_… id straight to text-to-speech:

audio = requests.post(
"https://api.satryx.ai/voice/tts",
headers={"Authorization": f"Bearer {os.environ['SATRYX_API_KEY']}"},
json={"text": "Ndewo, this na my cloned voice.", "voice_id": voice_id},
)
open("clone.wav", "wb").write(audio.content)

Dial exaggeration up for livelier delivery, or cfg_weight up to hew closer to the reference timbre.

3. Dub a video in that voice

Cloned voices drop straight into the dubbing pipeline. Map a speaker to your clone in the voice_map when you render:

voice_map = {"SPEAKER_00": voice_id, "SPEAKER_01": "preserve"}

"preserve" keeps a speaker's own voice; your cloned_… id swaps in the clone. See the Dubbing guide for the full analyze → translate → render flow.

Only clone voices you own or have explicit permission to clone. Cloning is powerful; use it responsibly.

Go deeper