- Large Language Model
- API Migration Guide
- Exclusive Feature
- Model Support
- OpenAI
- Anthropic
- Gemini
- China Model
- Chat (Baidu ERNIE)
- Chat (Tongyi Qianwen)
- Chat (Tongyi Qianwen-VL)
- Chat(Tongyi Qianwen-OCR)
- Chat (Zhipu GLM-4)
- Chat (Zhipu GLM-4V)
- Chat (Baichuan AI)
- Chat (Moonshot AI)
- Chat (Moonshot AI-Vision)
- Chat (01.AI)
- Chat (01.AI-VL)
- Chat (DeepSeek)
- Chat (DeepSeek-VL2)
- Chat (ByteDance Doubao)
- Chat (ByteDance Doubao-Vision)
- Chat(ByteDance Doubao Image Generation)
- Chat (Stepfun)
- Chat (Stepfun Multimodal)
- Chat (iFLYTEK Spark)
- Chat (SenseTime)
- Chat(Minimax)
- Chat (Tencent Hunyuan)
- SiliconFlow
- Open Source Model
- Large Language Model
- Chat(LLaMA3.3)
- Chat(LLaMA3.2 multimodal)
- Chat(LLaMA3.1)
- Chat(Mixtral-8x7B)
- Chat(Mistral-Large-2411)
- Chat(Mistral-small-2503)
- Chat(Pixtral-Large-2411multimodal)
- Chat(Gemma-7B、Gemma-3-27b-it)
- Chat(Gemma2-9B)
- Chat(Command R+)
- Chat(Qwen2)
- Chat(Qwen2.5)
- Chat(Qwen2.5-VL)
- Chat(Llama-3.1-nemotron)
- Chat(QwQ-32B、QwQ-Plus、QwQ-32B-Preview)
- Chat(Mistral-Large-2411) Copy
- Expert Model
- Other Models
- Image Generation
- DALL.E
- Stability.ai
- Text-to-image (Image Generation-V1)
- Generate (Image Generation-SD2)
- Generate (Image Generation-SD3-Ultra)
- Generate (Image Generation-SD3)
- Generate(Image Generation-SD3.5-Large)
- Generate(Image Generation-SD3.5-Medium)
- Generate(Image to Image-SD3)
- Generate(Image to Image-SD3.5-Large)
- Generate(Image to Image-SD3.5-Medium)
- Midjourney
- Midjourney-Relax
- 302.AI
- Glif
- Flux
- Ideogram
- Recraft
- Luma
- Doubao
- Google
- Minimax
- ZHIPU
- Baidu
- Image Processing
- 302.AI
- Upscale
- Upscale-V2
- Upscale-V3
- Upscale-V4
- Super-Upscale
- Super-Upscale-V2
- Face-upscale
- Colorize
- Colorize-V2
- Removebg
- Removebg-V2
- Inpaint
- Erase
- Face-to-many
- Llava
- Relight
- Relight-background
- Relight-V2
- Face-swap-V2
- Fetch
- HtmltoPng
- SvgToPng
- image-translate
- image-translate-query
- image-translate-redo
- Flux-selfie
- Trellis(Image to 3D model)
- Pose-Transfer(Human Pose Transformation)
- Pose-Transfer(Human Pose Transformation Result)
- Virtual-Tryon
- Virtual-Tryon(Fetch Result)
- Denoise(AI Denoising)
- Deblur(AI Deblurring)
- 302.AI-ComfyUI
- Create Outfit Change Task
- Query Outfit Change Task Status
- Create Face Swap Task
- Query Face Swap Task Status
- Create a Task to Replace Any Item
- Check the Status of Any Object Replacement Task
- Create a Task to Transform Cartoon Characters into Real People
- Query the status of the task to turn a manga character into a real person
- Create Style Transfer Task
- Query the status of the style transfer task
- Vectorizer
- Stability.ai
- Glif
- Clipdrop
- Recraft
- BRIA
- Flux
- Flux-V1.1-Ultra-Redux(Image-to-image generation-Ultra)
- Flux-V1.1-Pro-Redux(Image-to-image generation-Pro)
- Flux-Dev-Redux(Image-to-image generation-Dev)
- Flux-Schnell-Redux(Image-to-image generation-Schnell)
- Flux-V1-Pro-Canny(Object consistency)
- Flux-V1-Pro-Depth(Depth consistency)
- Flux-V1-Pro-Fill(Partial repainting)
- Hyper3D
- Tripo3D
- FASHN
- Ideogram
- Doubao
- Kling
- 302.AI
- Video Generation
- Unified Interface
- 302.AI
- Stable Diffusion
- Luma AI
- Runway
- Kling
- Txt2Video(Text to Video 1.0 Rapid-5s)
- Txt2Video_HQ(Text to Video 1.5 HQ-5s)
- Txt2Video_HQ(Text to Video 1.5 HQ-10s)
- Image2Video(Image to Video 1.0 Rapid-5s)
- Image2Video(Image to Video 1.0 Rapid-10s)
- Image2Video(Image to Video 1.5 Rapid-5s)
- Image2Video(Image to Video 1.5 Rapid-10s)
- Image2Video_HQ(Image to Video 1.5 HQ-5s)
- Image2Video_HQ(Image to Video 1.5 HQ-10s)
- Extend_Video
- Txt2Video(Text to Video 1.6 Standard-5s)
- Txt2Video(Text to Video 1.6 Standard-10s)
- Txt2Video(Text to Video 1.6 HQ-5s)
- Txt2Video(Text to Video 1.6 HQ-10s)
- Image2Video(Image to Video 1.6 Standard-5s)
- Image2Video(Image to Video 1.6 Standard-10s)
- Image2Video(Image to Video 1.6 HQ-5s)
- Image2Video(Image to Video 1.6 HQ-10s)
- Image2Video (Multiple pictures for reference)
- Fetch
- CogVideoX
- Minimax
- Pika
- PixVerse
- Genmo
- Hedra
- Haiper
- Sync.
- Lightricks
- Hunyuan
- Vidu
- Tongyi Wanxiang
- Jimeng
- SiliconFlow
- Kunlun Tech
- Audio/Video Processing
- 302.AI
- Stable-Audio(instrumental generation)
- Transcript (Audio/Video to Text)
- Transcriptions(Speech to Text)
- Alignments(Subtitle Timing)
- WhisperX
- F5-TTS(Text to Speech)
- F5-TTS (Asynchronous Text-to-Speech)
- F5-TTS (Asynchronously Retrieve Results)
- mmaudio(Text-to-Speech)
- mmaudio(AI Video Voiceover)
- mmaudio (Asynchronous Result Retrieval)
- Diffrhythm(Song Generation)
- OpenAI
- Azure
- Suno
- Doubao
- Fish Audio
- Minimax
- Dubbingx
- Udio
- Elevenlabs
- Information Processing
- 302.AI
- Admin Dashboard
- Information search
- Xiaohongshu_Search
- Xiaohongshu_Note
- Get_Home_Recommend
- Tiktok_Search
- Douyin_Search
- Twitter_Search
- Twitter_Post(X_Post)
- Twitter_User(X_User)
- Weibo_Post
- Search_Video
- Youtube_Info
- Youtube_Subtitles(Youtube Obtain Subtitles)
- Bilibili_Info(Bilibili Obtain Video Information)
- MP_Article_List(Get the list of WeChat official account articles)
- MP_Article(Retrieve WeChat Official Account articles)
- File processing
- Code execution
- Tavily
- SearchAPI
- Search1API
- Doc2x
- Glif
- Jina
- DeepL
- Exa
- Bocha AI
- RSSHub
- Firefly card
- Youdao
- Mistral
- 302.AI
- RAG-related
- Tools API
- AI Video Creation Hub
- AI Paper Writing
- AI Podcast Production
- AI Writing Assistant
- AI Video Real-Time Translation
- AI Document Editor
- AI Prompt Expert
- Al Answer Machine
- AI 3D Modeling
- AI Search Master 3.0
- AI Vector Graphics Generation
- Web Data Extraction Tool
- AI PPT Generator
- Generate PPT interface with one click
- File parsing
- Generate an outline
- Generate outline content
- Get template options
- Generate PPT interface (synchronous interface)
- Load PPT data
- Generate PPT interface (asynchronous interface)
- Asynchronous query generates PPT status
- Download PPT
- Add/update custom PPT templates
- Pagination query PPT template
- AI Academic Paper Search
- Help Center
Generate(Music Generation)
POST
/udio/generate-proxy
This setting specifies the start and end positions for generating clips within a full song: 0% (0) represents the beginning, 50% (0.5) the middle, and 100% (1) the end. Clicking the value indicator enables automatic mode, letting the model determine the optimal position. This is particularly useful when using the Song Expansion feature.
This setting specifies the placement of lyrics within the complete song: 8% places them at the beginning, 50% at the middle, and 90% generates a snippet toward the end. This feature is particularly useful when using Song Extension.
This setting controls the extent to which the prompt influences the generated content. Higher values enhance adherence to prompt but may produce less natural-sounding audio.
This setting controls the influence of lyrics on the generated content. Lower values may yield a more natural sound, but some lyrical content might be disregarded.
This setting controls how the model prioritizes clarity and instrument separation in audio generation. Higher values may result in clearer audio, but the sound may feel less natural.
This is an experimental feature used to balance generation quality and speed.
Udio130-v1.5:0.1 PTC/call
Request
Authorization
Header Params
Authorization
string
optional
Example:
Bearer {{YOUR_API_KEY}}
Content-Type
string
optional
Example:
application/json
Accept
string
optional
Example:
application/json
Body Params application/json
gen_params
object
required
prompt
string
required
lyrics
string
required
lyrics_type
enum<string>
required
Allowed values:
userinstrumentalgenerate
model_type
enum<string>
required
Allowed values:
udio32-v1.5udio130-v1.5
config
object
optional
negative_prompt
string
optional
song_section_start
number
required
song_section_end
number
required
lyrics_placement_start
number
required
lyrics_placement_end
number
required
prompt_strength
number
required
clarity_strength
number
required
lyrics_strength
number
required
generation_quality
enum<number>
required
Allowed values:
0.250.50.751
seed
number
required
bypass_prompt_optimization
boolean
required
Example
{
"gen_params": {
"prompt": "a song about the moon",
"lyrics": "",
"lyrics_type": "generate",
"bypass_prompt_optimization": false,
"seed": -1,
"song_section_start": 0,
"prompt_strength": 0.5,
"clarity_strength": 0.25,
"lyrics_strength": 0.5,
"generation_quality": 0.75,
"negative_prompt": "",
"model_type": "udio32-v1.5",
"config": {
"mode": "regular"
}
}
}
Request samples
Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl --location --request POST 'https://api.302.ai/udio/generate-proxy' \
--header 'Authorization: Bearer sk-jls4AaVBGoe1GwZD64qZA1qyKTN1MPHa4NmvH1cT68z7K1Zz' \
--header 'Accept: application/json' \
--header 'Content-Type: application/json' \
--data-raw '{
"gen_params": {
"prompt": "a song about the moon",
"lyrics": "",
"lyrics_type": "generate",
"bypass_prompt_optimization": false,
"seed": -1,
"song_section_start": 0,
"prompt_strength": 0.5,
"clarity_strength": 0.25,
"lyrics_strength": 0.5,
"generation_quality": 0.75,
"negative_prompt": "",
"model_type": "udio32-v1.5",
"config": {
"mode": "regular"
}
}
}'
Responses
🟢200成功
application/json
Body
code
string
required
message
string
required
data
string
required
Example
{
"code": "success",
"message": "",
"data": "69c61512-278e-4ff4-b6ea-f3b278329b04"
}
Modified at 2024-12-13 08:12:10