stepfun-tts Guide

Name: stepfun-tts
Author: daymade

Generate Chinese / Japanese speech with StepFun's stepaudio-2.5-tts — Contextual TTS that replaces step-tts-2's `voice_label` with natural-language `instruction` (≤200 chars) plus inline `()` parentheses for句内 prosody. Use when the user wants emotional / prosody control over voice synthesis (whisper, pause, stress, mood pivot mid-sentence), batch-generates game / app voice lines, migrates from `step-tts-2` (the `voice_label → instruction` breaking change), or hits StepFun's stricter 2.5-era censorship (死/消失/political terms). Triggers on 阶跃 TTS, StepAudio 合成, 语音合成, 配音, 文本转语音, TTS 升级, 迁移 step-tts-2. For transcription with the sibling stepaudio-2.5-asr model, use the stepfun-asr skill instead.

1,118 starsby daymade

When to use stepfun-tts

How to use stepfun-tts

stepfun-tts is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.

Skill source

https://raw.githubusercontent.com/daymade/claude-code-skills/main/daymade-audio/stepfun-tts/SKILL.md

Details

PlatformClaude

CategoryCode & Development

Invocationuser-invocable

Modelany

Maintainerdaymade

LicenseMIT

stepfun-tts Guide

When to use stepfun-tts

How to use stepfun-tts

Details

Resources