stepfun-asr Guide

Name: stepfun-asr
Author: daymade

Transcribe audio with StepFun's stepaudio-2.5-asr — an SSE endpoint (NOT /v1/audio/transcriptions) with 32K context, ~85-101x RTF on long audio, and a single-call ceiling around 30 minutes (no client-side chunking). Use when transcribing Chinese / English audio with StepFun, when long-form recordings (5-30 min) need to land in one request, when migrating from step-asr / step-asr-1.1, or when hitting the misleading `model stepaudio-2.5-asr not supported` error (which actually means wrong endpoint). Triggers on 阶跃 ASR, StepFun ASR, stepaudio-2.5-asr, 转录, 语音识别, 长音频转写, 语音转文字. For TTS with the sibling stepaudio-2.5-tts model, use the stepfun-tts skill instead.

1,118 starsby daymade

When to use stepfun-asr

How to use stepfun-asr

stepfun-asr is a Claude skill in the SKILL.md format. Add it to your Claude environment from the source repository below, then it activates as a user-invocable skill when your task matches its description.

Skill source

https://raw.githubusercontent.com/daymade/claude-code-skills/main/daymade-audio/stepfun-asr/SKILL.md

Details

PlatformClaude

CategoryBackend & APIs

Invocationuser-invocable

Modelany

Maintainerdaymade

LicenseMIT

stepfun-asr Guide

When to use stepfun-asr

How to use stepfun-asr

Details

Resources