Create Character Voices
Transform a single voice recording into distinct character voices for games, animations, and storytelling.
Voice swap with emotion preserved
Convert an audio recording into a different voice while preserving pacing, emotion, and delivery. Provide source audio and a target voice ID for speech-to-speech conversion. Suited to anonymization, dubbing, localization, and brand voice matching, with optional background-noise removal.
Convert speech audio into a different voice while preserving the original emotion, delivery, and pacing. Provide a source audio URL and a target voice ID to produce a transformed audio file with fine-grained control over stability, similarity, and style.
curl -H "Authorization: Bearer $TOOLROUTER_API_KEY" \
  -d '{
    "tool": "voice-transformer",
    "skill": "transform_voice",
    "input": {
      "audio_url": "https://example.com/podcast-clip.mp3",
      "voice_id": "JBFqnCBsd6RMkjVDRZzb"
    }
  }' \
  https://api.toolrouter.com/v1/tools/call

claude mcp add --transport stdio \
  --env TOOLROUTER_API_KEY=YOUR_API_KEY \
  toolrouter -- npx -y toolrouter-mcp

curl -H "Authorization: Bearer $TOOLROUTER_API_KEY" \
  -d '{"tool":"voice-transformer","skill":"transform_voice","input":{}}' \
  https://api.toolrouter.com/v1/tools/call

Transform a single voice recording into distinct character voices for games, animations, and storytelling.
Transform voice recordings to conceal speaker identity while preserving speech content and intelligibility.
Translate and dub your marketing videos into multiple languages to reach international audiences.
Clean up recordings by removing background noise, hum, wind, and environmental sounds.
Build a complete sonic brand identity with custom AI voice, voice transformations, and signature sound design.
It swaps the voice while keeping the pacing, emotion, and delivery of the original recording.
Yes. Pick a voice with `list_voices`, then pass that `voice_id` into `transform_voice`.
Yes. Turn on background-noise removal when the source audio has ambient noise or room tone.
The default model is English-only, so assume English-only support unless the tool output says otherwise.
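
The two-step flow mentioned above (pick a voice with `list_voices`, then pass its `voice_id` into `transform_voice`) can be sketched as a pair of shell helpers. This is a minimal sketch under stated assumptions, not an official client: `tr_payload` and `tr_call` are hypothetical helper names, the `Content-Type` header and the empty `input` for `list_voices` are assumptions, and the response format is not described on this page, so the output is simply printed as returned.

```shell
#!/usr/bin/env bash
# Sketch only. The request envelope (tool / skill / input) mirrors the curl
# example on this page; everything else is an assumption, as noted below.

TOOLROUTER_URL="https://api.toolrouter.com/v1/tools/call"

# Build the JSON envelope for one voice-transformer skill call.
#   $1 = skill name, $2 = input object as a JSON string (defaults to {}).
# Hypothetical helper; the input is inserted as-is, so it must be valid JSON.
tr_payload() {
  local skill="$1" input="${2:-}"
  [ -n "$input" ] || input='{}'
  printf '{"tool":"voice-transformer","skill":"%s","input":%s}' "$skill" "$input"
}

# POST one skill call and print the raw response body.
# --fail makes curl exit non-zero on HTTP errors (4xx/5xx).
# Assumption: the API accepts a JSON body with a Content-Type header.
tr_call() {
  curl --fail --silent --show-error \
    -H "Authorization: Bearer $TOOLROUTER_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$(tr_payload "$1" "${2:-}")" \
    "$TOOLROUTER_URL"
}

# Step 1: list the available voices and choose a voice_id from the output.
#   (Assumes list_voices needs no input.)
#   tr_call list_voices
#
# Step 2: pass the chosen voice_id, plus the source audio URL, to transform_voice.
#   tr_call transform_voice '{"audio_url":"https://example.com/podcast-clip.mp3","voice_id":"JBFqnCBsd6RMkjVDRZzb"}'
```

Keeping payload construction separate from the network call makes it possible to inspect the exact request body (for example with `tr_payload list_voices`) before sending anything.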