Character (Character Feature Preservation)
POST
/doubao/drawing/character
The General 2.0 - Character Feature Preservation model (formerly General 2.0S - Subject Preservation) integrates proprietary subject preservation technology that covers both subject appearance and facial features. Subject appearance applies to a wide range of subjects, including humans, animals, and objects. During image generation, the subject appearance and facial features from the reference image are combined with the input text, so the output matches both the subject in the reference image and the textual description.
Request
Authorization
Header Params
Authorization
string
required
Example:
Bearer {{YOUR_API_KEY}}
Body Params application/json
prompt
string
required
The prompt may be written in Chinese or English. To improve the accuracy of text rendered in the image, enclose the desired text in quotation marks. For example:
"A Christmas poster with the text 'Merry Christmas' written on it."
image_urls
array[string]
required
seed
integer
optional
Default value: -1
scale
number
optional
Default value: 3.5
Value range: [1, 10]
return_url
boolean
optional
logo_info
object
optional
add_logo
boolean
optional
position
integer
optional
language
integer
optional
opacity
number
optional
logo_text_content
string
optional
ddim_steps
integer
optional
Default value: 9
Value range: [1, 200]
width
integer
optional
Default value: 512
Value range: [256, 768]
height
integer
optional
Default value: 512
Value range: [256, 768]
cfg_rescale
number
optional
ref_ip_weight
number
optional
The higher the weight, the more closely the generated result resembles the subject in the reference image.
Default value: 0.7
Value range: [0, 1]
ref_id_weight
number
optional
Default value: 0.36
Value range: [0, 1]
Recommended range: [0.2, 0.4]
use_sr
boolean
optional
false: Text-to-Image
Default value: true
Example
{
    "image_urls": [
        "https://xxx"
    ],
    "prompt": "A mighty army of thousands of soldiers and horses",
    "desc_pushback": true,
    "seed": -1,
    "scale": 3.5,
    "ddim_steps": 9,
    "width": 512,
    "height": 512,
    "cfg_rescale": 0.7,
    "ref_ip_weight": 0.7,
    "ref_id_weight": 0.36,
    "use_sr": true,
    "return_url": true,
    "logo_info": {
        "add_logo": false,
        "position": 0,
        "language": 0,
        "opacity": 0.3,
        "logo_text_content": "Visible watermark text goes here"
    }
}
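The value ranges listed for the body parameters can be checked on the client before a request is sent. The following is a minimal sketch, not part of the official API: the helper name `build_character_body` and the `RANGES` table are hypothetical, and only the ranges stated in the parameter list above are enforced. Parameters without a documented range are passed through unchanged.

```python
# Inclusive value ranges, copied from the parameter descriptions above.
RANGES = {
    "scale": (1, 10),
    "ddim_steps": (1, 200),
    "width": (256, 768),
    "height": (256, 768),
    "ref_ip_weight": (0, 1),
    "ref_id_weight": (0, 1),
}


def build_character_body(prompt, image_urls, **options):
    """Hypothetical helper: assemble the JSON body and check documented ranges."""
    if not prompt:
        raise ValueError("prompt is required")
    if not image_urls:
        raise ValueError("image_urls is required")
    # Accept a single URL string as well as a list of URLs.
    if isinstance(image_urls, str):
        image_urls = [image_urls]

    body = {"prompt": prompt, "image_urls": list(image_urls)}
    for name, value in options.items():
        if name in RANGES:
            low, high = RANGES[name]
            if not low <= value <= high:
                raise ValueError(f"{name}={value} is outside [{low}, {high}]")
        body[name] = value
    return body
```

For example, `build_character_body("a poster", "https://xxx", width=1024)` raises `ValueError` because 1024 exceeds the documented maximum width of 768.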
Request samples
curl --location --request POST 'https://api.302.ai/doubao/drawing/character' \
--header 'Authorization: Bearer {{YOUR_API_KEY}}' \
--header 'Content-Type: application/json' \
--data-raw '{
    "image_urls": ["https://xxx"],
    "prompt": "A mighty army of thousands of soldiers and horses",
    "desc_pushback": true,
    "seed": -1,
    "scale": 3.5,
    "ddim_steps": 9,
    "width": 512,
    "height": 512,
    "cfg_rescale": 0.7,
    "ref_ip_weight": 0.7,
    "ref_id_weight": 0.36,
    "use_sr": true,
    "return_url": true,
    "logo_info": {
        "add_logo": false,
        "position": 0,
        "language": 0,
        "opacity": 0.3,
        "logo_text_content": "Visible watermark text goes here"
    }
}'
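The same request can be issued from Python using only the standard library. This is a sketch under assumptions: the function names `make_request` and `send` are illustrative, and the API key is expected to be supplied by the caller rather than hard-coded. The URL, method, and headers mirror the curl sample above.

```python
import json
import urllib.request

API_URL = "https://api.302.ai/doubao/drawing/character"


def make_request(api_key, body):
    """Build (but do not send) the POST request for the endpoint."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call would look like `send(make_request(api_key, {"prompt": "...", "image_urls": ["https://xxx"]}))`. The 60-second timeout is an arbitrary choice for illustration, not a documented limit.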
Responses
🟢 200 OK
application/json
Body
completed_at
string
required
created_at
string
required
error
string
required
id
string
required
model
string
required
output
string
required
started_at
string
required
status
string
required
Example
{
    "completed_at": "",
    "created_at": "",
    "error": "",
    "id": "",
    "model": "",
    "output": "",
    "started_at": "",
    "status": ""
}
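Because every field in the response body is marked as required, a client can check for their presence before using the result. The sketch below is illustrative only: `CharacterResult` and `parse_result` are hypothetical names, and no assumption is made about the possible values of `status` or the format of `output`, since this page does not document them.

```python
import json
from dataclasses import dataclass, fields


@dataclass
class CharacterResult:
    """Mirror of the eight string fields listed in the 200 response body."""
    completed_at: str
    created_at: str
    error: str
    id: str
    model: str
    output: str
    started_at: str
    status: str


def parse_result(raw):
    """Parse a JSON response string and verify that all required fields exist."""
    data = json.loads(raw)
    names = [f.name for f in fields(CharacterResult)]
    missing = [name for name in names if name not in data]
    if missing:
        raise ValueError(f"response is missing fields: {missing}")
    # Ignore any extra keys so that undocumented fields do not break parsing.
    return CharacterResult(**{name: data[name] for name in names})
```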
Modified at 2025-01-14 09:36:56