Paraformer V2 Speech Recognition
- Audio file recognition with the Paraformer V2 model; supports Chinese (including dialects), English, Japanese, and other languages
- Supports speaker diarization, hot word customization, and disfluency filtering
- Processing is asynchronous: use the returned task ID to query the task status
- Recognition results are returned in the results field of the task detail
Authorizations
All APIs require Bearer Token authentication
Add to request header:
Authorization: Bearer YOUR_API_KEY
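A minimal sketch of building the header shown above. The helper name auth_headers is hypothetical and not part of the API; only the header name and the Bearer prefix come from this page.

```python
def auth_headers(api_key: str) -> dict[str, str]:
    """Build the Authorization header described above.

    auth_headers is an illustrative helper, not part of the API.
    """
    # Reject empty keys and keys with stray whitespace, which would
    # produce a malformed header value.
    if not api_key or api_key.strip() != api_key:
        raise ValueError("API key must be non-empty with no surrounding whitespace")
    return {"Authorization": f"Bearer {api_key}"}
```

The returned dictionary can be passed as the headers argument of whichever HTTP client is in use.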
Body
paraformer-v2: Supports Chinese, English, Japanese, and other languages
paraformer-8k-v2: 8kHz sample rate, Chinese only
"paraformer-v2"
"paraformer-8k-v2"
Audio file URL list
Notes:
- Supports publicly accessible URLs via HTTP/HTTPS
- Up to 100 URLs per request
- Supported formats: aac, amr, avi, flac, flv, m4a, mkv, mov, mp3, mp4, mpeg, ogg, opus, wav, webm, wma, wmv
- Each file must not exceed 2 GB in size or 12 hours in duration
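The limits above can be checked on the client before a request is sent. The sketch below is one possible pre-flight check, not the service's own validation: the function name check_file_urls is hypothetical, and it assumes the format can be inferred from the file extension in the URL path, which this page does not state. File size and duration are not checked because they cannot be known from the URL alone.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Formats and URL count limit as listed in the notes above.
SUPPORTED_FORMATS = {
    "aac", "amr", "avi", "flac", "flv", "m4a", "mkv", "mov", "mp3",
    "mp4", "mpeg", "ogg", "opus", "wav", "webm", "wma", "wmv",
}
MAX_URLS = 100


def check_file_urls(urls: list[str]) -> list[str]:
    """Return a list of problems; an empty list means all checks passed."""
    problems = []
    if not 1 <= len(urls) <= MAX_URLS:
        problems.append(f"expected 1 to {MAX_URLS} URLs, got {len(urls)}")
    for url in urls:
        parsed = urlparse(url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            problems.append(f"not an HTTP/HTTPS URL: {url}")
            continue
        # Assumption: the extension in the URL path reflects the file format.
        ext = PurePosixPath(parsed.path).suffix.lower().lstrip(".")
        if ext not in SUPPORTED_FORMATS:
            problems.append(f"unsupported or missing file extension: {url}")
    return problems
```

For example, check_file_urls(["https://example.com/audio/meeting.wav"]) returns an empty list, while an ftp:// URL or a .txt file is reported as a problem.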
1 - 100 elements
["https://example.com/audio/meeting.wav"]
Language hints for recognition
Notes:
- Only supported by paraformer-v2, not applicable to paraformer-8k-v2
- Supported language codes: zh (Chinese), en (English), ja (Japanese), yue (Cantonese), ko (Korean), de (German), fr (French), ru (Russian)
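The two rules above (model restriction and allowed codes) can be sketched as a small client-side check. The function name check_language_hints is hypothetical; the model names and language codes are taken from this page.

```python
# Language codes listed in the notes above.
SUPPORTED_LANGUAGE_CODES = {"zh", "en", "ja", "yue", "ko", "de", "fr", "ru"}


def check_language_hints(model: str, hints: list[str]) -> list[str]:
    """Return a list of problems; an empty list means the hints look valid."""
    problems = []
    # Language hints only apply to paraformer-v2.
    if hints and model != "paraformer-v2":
        problems.append(f"language hints are not applicable to {model}")
    unknown = sorted(set(hints) - SUPPORTED_LANGUAGE_CODES)
    if unknown:
        problems.append("unsupported language codes: " + ", ".join(unknown))
    return problems
```

Passing hints together with paraformer-8k-v2, or passing a code outside the list, yields a non-empty problem list.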
["zh", "en"]
Audio track index
Notes:
- Index starts from 0, [0] means the first track
- Default is [0] (only process the first track)
- Each specified track is billed independently
Do not pass this parameter unless necessary.
[0]
Recognition configuration
Notes:
- Includes disfluency filtering, timestamp alignment, hot words, and sensitive word filter settings
- If not provided, default configuration is used
Do not pass this parameter unless necessary.
Speaker diarization configuration
Notes:
- Includes diarization toggle and speaker count hint
- If not provided, speaker diarization is not enabled
Do not pass this parameter unless necessary.
Response
Task created successfully
Task creation timestamp
1757165031
Task ID
"task-unified-1757165031-uyujaw3d"
Actual model name used
Specific task type
audio.generation.task
Task progress percentage (0-100)
0 <= x <= 100
0
Task status
pending, processing, completed, failed
"pending"
Asynchronous task info
Task output type
audio
"audio"
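Because the task is asynchronous, a client typically polls the task detail until the status reaches completed or failed. The sketch below shows one way to do that. The query endpoint is not documented on this page, so the lookup is passed in as a caller-supplied function (fetch_task, hypothetical), and reading the status from a "status" key is an assumption about the response shape. Only the four status values come from this page.

```python
import time
from typing import Callable

# Status values listed above; the last two end the task.
TERMINAL_STATUSES = {"completed", "failed"}
KNOWN_STATUSES = {"pending", "processing"} | TERMINAL_STATUSES


def wait_for_task(
    task_id: str,
    fetch_task: Callable[[str], dict],  # hypothetical: returns the task detail
    interval_seconds: float = 5.0,
    max_attempts: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll until the task is completed or failed, then return its detail."""
    for attempt in range(max_attempts):
        task = fetch_task(task_id)
        status = task.get("status")  # assumed key name
        if status not in KNOWN_STATUSES:
            raise ValueError(f"unexpected task status: {status!r}")
        if status in TERMINAL_STATUSES:
            return task
        # Wait between polls, but not after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    raise TimeoutError(f"task {task_id} did not finish after {max_attempts} polls")
```

The caller should still inspect the returned status, since a failed task is returned rather than raised. The interval and attempt limit are illustrative defaults, not values recommended by the service.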