Skip to content

Tools

Synthesize speech from text.

ParameterDefaultDescription
text(required)Text to synthesize
voiceVoice ID or preset name
speed1.0Speed multiplier (0.5–2.0)
moodHumor mood: dry, roast, chaotic, cheeky, cynic, zoomer
format"wav"Output format: wav, mp3, ogg, raw
artifactMode"path"Delivery mode: path or base64
outputDirSubdirectory within output root
sfxfalseEnable [ding], [chime] etc.

Multi-speaker dialogue synthesis using Speaker: line format.

ParameterDefaultDescription
script(required)Dialogue in Speaker: line format
castSpeaker-to-voice mapping, e.g. { "Alice": "af_sky" }
speed1.0Speed multiplier
concattrueCombine all lines into a single file
debugfalseInclude cue_sheet in response
artifactMode"path"Delivery mode: path or base64
outputDirSubdirectory within output root

When no cast is provided, speakers are auto-assigned voices from the approved list.

Returns engine health, available voices, presets, and backend info. Takes no arguments.

Use this to check what voices are available, which backend is active, and whether the engine is healthy.

Stop or rollback active synthesis.

ParameterDefaultDescription
streamIdSpecific stream to interrupt
reasonWhy: user_spoke, context_change, timeout, manual

Ephemeral micro-utterances for ambient narration. Requires --ambient flag or VOICE_SOUNDBOARD_AMBIENT_ENABLED=1.

ParameterDefaultDescription
text(required)Max 500 chars, auto-redacted
category"general"One of: general, thinking, observation, debug