MiniMax Preset Voice List
Authorizations
All APIs require Bearer Token authentication
Add to request header:
Authorization: Bearer YOUR_API_KEY
Query Parameters
Language version of the returned data: zh (Chinese, default) / en (English). The voice_id is identical between the two versions; only the voice_name / language / accent / gender / age / tag_list / description text switches with the language.
zh, en "zh"
Response
Query successful — returns an array of preset voices
Internal database ID of the voice (numeric string) — for internal traceability only; do not pass it to speech synthesis requests
"295"
Voice ID. This is the value to pass directly into the voice_setting.voice_id field of speech synthesis requests.
"Arabic_CalmWoman"
Display name of the voice (switches between Chinese / English based on language)
"Calm Woman"
Language the voice belongs to (switches between Chinese / English based on language, e.g. Arabic / 阿拉伯语, Chinese (Mandarin) / 中文-普通话)
"Arabic"
Accent tag (switches between Chinese / English based on language, e.g. Standard / 标准口音). May be an empty string for some newer entries.
"Standard"
Gender (switches between Chinese / English based on language, e.g. Male / Female / Character, or 男 / 女 / 角色)
"Female"
Age group (switches between Chinese / English based on language, e.g. Young / Middle-aged / Elderly / Child, or 青年 / 中年 / 老年 / 儿童). May be an empty string for some newer entries.
"Middle-aged"
Style / scene tag list (switches between Chinese / English based on language). May be an empty array for some newer entries.
Cover image URL. May be an empty string for some newer entries.
Sample audio URL (we recommend previewing it before selection)
Textual description of the voice (switches between Chinese / English based on language). May be an empty string for some newer entries.