MiniMax Preset Voice List

[ { "id": "298735722549316", "voice_id": "Afrikaans_female_1_v1", "voice_name": "Outgoing Girl", "language": "阿非利卡语", "accent": "标准口音", "gender": "女", "age": "青年", "tag_list": [ "广告与预告", "播客与社媒", "企业宣传与旁白", "明亮清脆", "语速轻快", "电台主播感" ], "cover_url": "https://cdn.hailuoai.video/moss/staging/2024-11-21-14/moss-audio/voice_cover//1732170987980918894-207331589840977.png?x-oss-process=image/resize,p_50/format,webp", "sample_audio": "https://filecdn.minimax.chat/public/6579ac88-28ed-46a4-b930-eed7602faa1d.mp3", "description": "这是一个充满活力的青年女声。其声线明亮清脆且极具穿透力，说话节奏语速轻快、干脆利落。整体风格热情洋溢且具有极强的感染力，自带一种专业的电台主播感，非常适合用于商业广告、社交媒体短视频或快节奏的广播节目等场景。" }, { "id": "295", "voice_id": "Arabic_CalmWoman", "voice_name": "Calm Woman", "language": "阿拉伯语", "accent": "", "gender": "女", "age": "中年", "tag_list": [], "cover_url": "", "sample_audio": "https://ftres.oss-cn-beijing.aliyuncs.com/aitools/voices/Arabic_CalmWoman_audio.mp3", "description": "" } ]

Authorizations

Authorization

string

header

required

All APIs require Bearer Token authentication

Add to request header:

Authorization: Bearer YOUR_API_KEY

Query Parameters

language

enum<string>

default:zh

Language version of the returned data: zh (Chinese, default) / en (English). The voice_id is identical between the two versions; only the voice_name / language / accent / gender / age / tag_list / description text switches with the language.

Available options:

zh,

en

Example:

"zh"

Response

Query successful — returns an array of preset voices

string

required

Internal database ID of the voice (numeric string) — for internal traceability only; do not pass it to speech synthesis requests

Example:

"295"

voice_id

string

required

Voice ID. This is the value to pass directly into the voice_setting.voice_id field of speech synthesis requests.

Example:

"Arabic_CalmWoman"

voice_name

string

required

Display name of the voice (switches between Chinese / English based on language)

Example:

"Calm Woman"

language

string

Language the voice belongs to (switches between Chinese / English based on language, e.g. Arabic / 阿拉伯语, Chinese (Mandarin) / 中文-普通话)

Example:

"Arabic"

accent

string

Accent tag (switches between Chinese / English based on language, e.g. Standard / 标准口音). May be an empty string for some newer entries.

Example:

"Standard"

gender

string

Gender (switches between Chinese / English based on language, e.g. Male / Female / Character, or 男 / 女 / 角色)

Example:

"Female"

age

string

Age group (switches between Chinese / English based on language, e.g. Young / Middle-aged / Elderly / Child, or 青年 / 中年 / 老年 / 儿童). May be an empty string for some newer entries.

Example:

"Middle-aged"

Style / scene tag list (switches between Chinese / English based on language). May be an empty array for some newer entries.

cover_url

string

Cover image URL. May be an empty string for some newer entries.

sample_audio

string

Sample audio URL (we recommend previewing it before selection)

description

string

Textual description of the voice (switches between Chinese / English based on language). May be an empty string for some newer entries.