302.AI API Document

Rerank(Multimodal Reordering)

POST /jina/v1/rerank
Jina's multimodal reranker scores images and text together against a single query, so both kinds of document can appear in one ranked list.
Supported model: jina-reranker-m0
Official documentation: https://api.jina.ai/redoc#tag/rerank/operation/rank_v1_rerank_post
Price: 0.02 PTC per 1M tokens
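
As a rough illustration of the price line, the sketch below estimates the charge for a given token count. It assumes billing is strictly linear in tokens with no minimum charge or rounding, which this page does not state; the function name `estimate_cost_ptc` is hypothetical, and how images are counted as tokens is not covered here.

```python
# Price taken from this page: 0.02 PTC per 1M tokens.
PRICE_PTC_PER_MILLION_TOKENS = 0.02


def estimate_cost_ptc(token_count: int) -> float:
    """Estimate the charge in PTC, assuming purely linear per-token billing."""
    if token_count < 0:
        raise ValueError("token_count must be non-negative")
    return token_count / 1_000_000 * PRICE_PTC_PER_MILLION_TOKENS


if __name__ == "__main__":
    # Example: estimate the cost of a request that consumes 250,000 tokens.
    print(f"{estimate_cost_ptc(250_000):.4f} PTC")
```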

Request

Authorization
Provide your bearer token in the Authorization header when making requests to protected resources.
Example:
Authorization: Bearer ********************
Header Params
Authorization  string  optional
Example: Bearer {{YOUR_API_KEY}}
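Body Params application/json
model  string  required
  The reranker model to use, for example jina-reranker-m0.
input  array[string]  required
  Listed in the schema, but not used in the example below.
query  string  required
  The search query that the documents are ranked against.
documents  array  required
  The items to rank. The schema lists array[string], but in the example below each item is an object with either a "text" field or an "image" field (an image URL or a base64-encoded image).
top_n  integer  required
  Listed as required in the schema, although the example below omits it.
return_documents  boolean
  Not listed in the schema, but set to false in the example below.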
Example
{
    "model": "jina-reranker-m0",
    "query": "small language model data extraction",
    "documents": [
        {
            "image": "https://raw.githubusercontent.com/jina-ai/multimodal-reranker-test/main/handelsblatt-preview.png"
        },
        {
            "image": "https://raw.githubusercontent.com/jina-ai/multimodal-reranker-test/main/paper-11.png"
        },
        {
            "image": "https://raw.githubusercontent.com/jina-ai/multimodal-reranker-test/main/wired-preview.png"
        },
        {
            "text": "We present ReaderLM-v2, a compact 1.5 billion parameter language model designed for efficient web content extraction. Our model processes documents up to 512K tokens, transforming messy HTML into clean Markdown or JSON formats with high accuracy -- making it an ideal tool for grounding large language models. The models effectiveness results from two key innovations: (1) a three-stage data synthesis pipeline that generates high quality, diverse training data by iteratively drafting, refining, and critiquing web content extraction; and (2) a unified training framework combining continuous pre-training with multi-objective optimization. Intensive evaluation demonstrates that ReaderLM-v2 outperforms GPT-4o-2024-08-06 and other larger models by 15-20% on carefully curated benchmarks, particularly excelling at documents exceeding 100K tokens, while maintaining significantly lower computational requirements."
        },
        {
            "image": "https://jina.ai/blog-banner/using-deepseek-r1-reasoning-model-in-deepsearch.webp"
        },
        {
            "text": "Is data extraction needed? Why not use regex? If you use regex, wouldn't it solve everything?"
        },
        {
            "text": "During the California Gold Rush, some merchants made more money selling supplies to miners than the miners made finding gold."
        },
        {
            "text": "Die wichtigsten Beiträge unserer Arbeit sind zweifach: Erstens führen wir eine neuartige dreistufige Datensynthese-Pipeline namens Draft-Refine-Critique ein, die durch iterative Verfeinerung hochwertige Trainingsdaten generiert; und zweitens schlagen wir eine umfassende Trainingsstrategie vor, die kontinuierliches Vortraining zur Längenerweiterung, überwachtes Feintuning mit spezialisierten Kontrollpunkten, direkte Präferenzoptimierung (DPO) und iteratives Self-Play-Tuning kombiniert. Um die weitere Forschung und Anwendung der strukturierten Inhaltsextraktion zu erleichtern, ist das Modell auf Hugging Face öffentlich verfügbar."
        },
        {
            "image": "iVBORw0KGgoAAAANSUhEUgAAAMwAAADACAMAAAB/Pny7AAAA7VBMVEX///8AAABONC780K49Wv5gfYu8vLwiIiIAvNRHLypceJ5hfoc4Vf//1bL8/PxSbsCCgoLk5OQpKSlOQDXctpgZEA9AXv8SG0sGCRorHRocKnY4U+sKDQ7rwqISGBssOkE+Pj5fX19MY29ZdIF1YFGHcF68m4EjLTKSkpInOqIcJSndzbU9UFlcv87DyrvrzrF1wcpOTk6jo6OixsE7MCg4JSHLy8skNZLNqo4EBQ9kU0VZSj0uJh93d3cyMjKihnBvamZca3KoqbI8R5YaLI41R3omM1lNZ7EAAEEbIy46TGcwPk8jEQyIw8eZjobFTeMIAAAFHUlEQVR4nO3da0PaOhwG8CGOHqYwKqBjFKQ6sJt63Biy6Siw+/18/48zSP7FhqU5XNr04vP4igRCfmsX2jSFBw+2TTm0bN2V7ePkQooTt2SWvhGOxejHLZml3w4H0wYm5ACTWExIA0A8GNN+5c/YYn2pF7dNh7dX0YvpyP5hG8WdLdPgDdnAAANM6jD1dGMa10K2tXiYTp9HzxmBh9l6U8gxlI4JDDDAABNRyibLsFNnCRtzzZutc8x4yN8tqhG6cGDNQ4qwLV6KtGnYe1kHhagwRkif9StheAxggAEGmJRidmiyhj5vDjosoc+qa8JQ6sIWCn0CSiumCAwwwNxfzA5N+tQzgaE0gAEGGGBCU5hDFmfUYNFpCR/jjFkGWjdJVJgKb1DvJgEGGGCAiQXjzeEXpaVi6GJuUVrppRgrRnZ4cJ2TpeFhpLU5oaFYMEU5xgIGGGDuDybXEMMLB5Meyy11VKgcUSVlwkstek7oszPrYKS5bZVYurLKwduSPzVpCwnCvKuV8vMEYfJ3AQaYLGBc3uCvjTHVBGEKlXmcqWoBoxxT7bJMWry/va4kk5qIoeJRRBi6japg5IJXAMkx3RbLoqstWfJieGGtGhGGopwEDMDkS/mNUmolEbNpgAEmuxi+OoTmAKxB1Z8Jde2KR97vK1ktYSy6RUjTchNxaeWoV/OHht3z35fzvPxXannNKi/FSsIYfb5UM/Tlp3KMuOh1UBOO52lgPr/8h0WOeckrX0sxelc1/YWR9BcYYO43ZkeBGaUM482biHNB72hypZUujBcR86wlDMapx8h6CgwwwGQTQ3M12cCIVytSjskBAwww/4ORXqBMKWZo80hNSszVb9mchbIyaox3B+14bUz+6pxFPtd0LquMGkORf+2EGrN+gAEGmIRijANf2qnGlIcFf1wrVIx3gfbZSAtmKfRlbeFhhL1XN6YNDDDRY7L0f8ZZDM3B07MB/ZZmae2MXszQYStr/lNNnMstrZ4stKzRqPAMtWI8Ez8ukF/SCNihxLU+YjR9vZESI7/YFIAZAAMMMMuLGlRRYsZxYkyXzdxMxeUmyvSmdnCmcWJo6sZ0qyvHNVVJwJfRl23FrrMUOwH9Vcacro6JdU9aJcAkNaa9OsZOOqbssrvtO3T1oz4a+DKi5YJGhz3JTfoAQFM3Q9rbbsXDe7qzaUpPSjrGC52ydcXPfLqxIQk/AbJOPIx4OAZM/AEmqcniACAfmlOKkQeYGANMUgNMjFFORzjts8C0HeVLY8HYwkVnMcbJQ0VOVK/U+ysnC4xqT7pQYS5UrwQGGGASjaHfJbVz7XlokaPV9sdSj2ZLT/a3MMPo/N1Ts+KyS6fvT1iOeV/OToScqjCn4nPPuOWYP3rPGncrmn6yhdZoUn8vOOZY2X0l7ZhjaM885a1ruj7jrTeLFqP5x3SAASaS8CFzhrmZJToMa32GiXSENvk6xg8fP72Z5dNjns83rC9fvj7eMF+/sAZuPtNj3vrHD/zdotpABb4DfGsesuzuz7P7/Akrfdrkj9fObvMpa+DJc2qQt978xt8t4ltOjpq7vhzeYTbMAnMolB6x0qjvnwEGGGCAAQYYYI
ABJjmY74+E/ODnMz8fbZyfrAHrh1j6XQvmxemeP4uTs70Nszg5E0tfaMIIJ4phn2l6pcAAAwwwwAADDDBRYvYWfz6Mr3Bv6U9V4MP46jVhMnXUfCTMkN9NnG82b76/vzRx7rWLkzNggAEGmCxg/gAcTwKRD+vGjgAAAABJRU5ErkJggg=="
        }
    ],
    "return_documents": false
}
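
The request body above can be assembled programmatically. The following is a minimal sketch, not an official client: the helper names (`text_doc`, `image_url_doc`, `image_bytes_doc`, `build_rerank_payload`) are hypothetical, and it assumes, based only on the example, that a base64 image is sent as a bare string without a `data:` prefix and that `top_n` may be left out.

```python
import base64
import json


def text_doc(text: str) -> dict:
    """A text document, shaped like the "text" items in the example."""
    return {"text": text}


def image_url_doc(url: str) -> dict:
    """An image document referenced by URL."""
    return {"image": url}


def image_bytes_doc(raw: bytes) -> dict:
    """An image document sent inline as base64 (assumed: no data: prefix)."""
    return {"image": base64.b64encode(raw).decode("ascii")}


def build_rerank_payload(query, documents, return_documents=False, top_n=None):
    """Build the JSON body; top_n is only included when the caller sets it."""
    payload = {
        "model": "jina-reranker-m0",
        "query": query,
        "documents": list(documents),
        "return_documents": return_documents,
    }
    if top_n is not None:
        payload["top_n"] = top_n
    return payload


if __name__ == "__main__":
    docs = [
        text_doc("Is data extraction needed? Why not use regex?"),
        image_url_doc("https://jina.ai/blog-banner/using-deepseek-r1-reasoning-model-in-deepsearch.webp"),
    ]
    print(json.dumps(build_rerank_payload("small language model data extraction", docs), indent=4))
```

Keeping text and image items in one list mirrors the example, where both kinds are ranked against the same query.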

Request samples

Shell
curl --location --request POST 'https://api.302.ai/jina/v1/rerank' \
--header 'Authorization: Bearer YOUR_API_KEY' \
--header 'Content-Type: application/json' \
--data-raw '{
    "model": "jina-reranker-m0",
    "query": "small language model data extraction",
    "documents": [
        {
            "image": "https://raw.githubusercontent.com/jina-ai/multimodal-reranker-test/main/handelsblatt-preview.png"
        },
        {
            "image": "https://raw.githubusercontent.com/jina-ai/multimodal-reranker-test/main/paper-11.png"
        },
        {
            "image": "https://raw.githubusercontent.com/jina-ai/multimodal-reranker-test/main/wired-preview.png"
        },
        {
            "text": "We present ReaderLM-v2, a compact 1.5 billion parameter language model designed for efficient web content extraction. Our model processes documents up to 512K tokens, transforming messy HTML into clean Markdown or JSON formats with high accuracy -- making it an ideal tool for grounding large language models. The models effectiveness results from two key innovations: (1) a three-stage data synthesis pipeline that generates high quality, diverse training data by iteratively drafting, refining, and critiquing web content extraction; and (2) a unified training framework combining continuous pre-training with multi-objective optimization. Intensive evaluation demonstrates that ReaderLM-v2 outperforms GPT-4o-2024-08-06 and other larger models by 15-20% on carefully curated benchmarks, particularly excelling at documents exceeding 100K tokens, while maintaining significantly lower computational requirements."
        },
        {
            "image": "https://jina.ai/blog-banner/using-deepseek-r1-reasoning-model-in-deepsearch.webp"
        },
        {
            "text": "Is data extraction needed? Why not use regex? If you use regex, wouldn'\''t it solve everything?"
        },
        {
            "text": "During the California Gold Rush, some merchants made more money selling supplies to miners than the miners made finding gold."
        },
        {
            "text": "Die wichtigsten Beiträge unserer Arbeit sind zweifach: Erstens führen wir eine neuartige dreistufige Datensynthese-Pipeline namens Draft-Refine-Critique ein, die durch iterative Verfeinerung hochwertige Trainingsdaten generiert; und zweitens schlagen wir eine umfassende Trainingsstrategie vor, die kontinuierliches Vortraining zur Längenerweiterung, überwachtes Feintuning mit spezialisierten Kontrollpunkten, direkte Präferenzoptimierung (DPO) und iteratives Self-Play-Tuning kombiniert. Um die weitere Forschung und Anwendung der strukturierten Inhaltsextraktion zu erleichtern, ist das Modell auf Hugging Face öffentlich verfügbar."
        },
        {
            "image": "iVBORw0KGgoAAAANSUhEUgAAAMwAAADACAMAAAB/Pny7AAAA7VBMVEX///8AAABONC780K49Wv5gfYu8vLwiIiIAvNRHLypceJ5hfoc4Vf//1bL8/PxSbsCCgoLk5OQpKSlOQDXctpgZEA9AXv8SG0sGCRorHRocKnY4U+sKDQ7rwqISGBssOkE+Pj5fX19MY29ZdIF1YFGHcF68m4EjLTKSkpInOqIcJSndzbU9UFlcv87DyrvrzrF1wcpOTk6jo6OixsE7MCg4JSHLy8skNZLNqo4EBQ9kU0VZSj0uJh93d3cyMjKihnBvamZca3KoqbI8R5YaLI41R3omM1lNZ7EAAEEbIy46TGcwPk8jEQyIw8eZjobFTeMIAAAFHUlEQVR4nO3da0PaOhwG8CGOHqYwKqBjFKQ6sJt63Biy6Siw+/18/48zSP7FhqU5XNr04vP4igRCfmsX2jSFBw+2TTm0bN2V7ePkQooTt2SWvhGOxejHLZml3w4H0wYm5ACTWExIA0A8GNN+5c/YYn2pF7dNh7dX0YvpyP5hG8WdLdPgDdnAAANM6jD1dGMa10K2tXiYTp9HzxmBh9l6U8gxlI4JDDDAABNRyibLsFNnCRtzzZutc8x4yN8tqhG6cGDNQ4qwLV6KtGnYe1kHhagwRkif9StheAxggAEGmJRidmiyhj5vDjosoc+qa8JQ6sIWCn0CSiumCAwwwNxfzA5N+tQzgaE0gAEGGGBCU5hDFmfUYNFpCR/jjFkGWjdJVJgKb1DvJgEGGGCAiQXjzeEXpaVi6GJuUVrppRgrRnZ4cJ2TpeFhpLU5oaFYMEU5xgIGGGDuDybXEMMLB5Meyy11VKgcUSVlwkstek7oszPrYKS5bZVYurLKwduSPzVpCwnCvKuV8vMEYfJ3AQaYLGBc3uCvjTHVBGEKlXmcqWoBoxxT7bJMWry/va4kk5qIoeJRRBi6japg5IJXAMkx3RbLoqstWfJieGGtGhGGopwEDMDkS/mNUmolEbNpgAEmuxi+OoTmAKxB1Z8Jde2KR97vK1ktYSy6RUjTchNxaeWoV/OHht3z35fzvPxXannNKi/FSsIYfb5UM/Tlp3KMuOh1UBOO52lgPr/8h0WOeckrX0sxelc1/YWR9BcYYO43ZkeBGaUM482biHNB72hypZUujBcR86wlDMapx8h6CgwwwGQTQ3M12cCIVytSjskBAwww/4ORXqBMKWZo80hNSszVb9mchbIyaox3B+14bUz+6pxFPtd0LquMGkORf+2EGrN+gAEGmIRijANf2qnGlIcFf1wrVIx3gfbZSAtmKfRlbeFhhL1XN6YNDDDRY7L0f8ZZDM3B07MB/ZZmae2MXszQYStr/lNNnMstrZ4stKzRqPAMtWI8Ez8ukF/SCNihxLU+YjR9vZESI7/YFIAZAAMMMMuLGlRRYsZxYkyXzdxMxeUmyvSmdnCmcWJo6sZ0qyvHNVVJwJfRl23FrrMUOwH9Vcacro6JdU9aJcAkNaa9OsZOOqbssrvtO3T1oz4a+DKi5YJGhz3JTfoAQFM3Q9rbbsXDe7qzaUpPSjrGC52ydcXPfLqxIQk/AbJOPIx4OAZM/AEmqcniACAfmlOKkQeYGANMUgNMjFFORzjts8C0HeVLY8HYwkVnMcbJQ0VOVK/U+ysnC4xqT7pQYS5UrwQGGGASjaHfJbVz7XlokaPV9sdSj2ZLT/a3MMPo/N1Ts+KyS6fvT1iOeV/OToScqjCn4nPPuOWYP3rPGncrmn6yhdZoUn8vOOZY2X0l7ZhjaM885a1ruj7jrTeLFqP5x3SAASaS8CFzhrmZJToMa32GiXSENvk6xg8fP72Z5dNjns83rC9fvj7eMF+/sAZuPtNj3vrHD/zdotpABb4DfGsesuzuz7P7/Akrfdrkj9fObvMpa+DJc2qQt978xt8t4ltOjpq7vhzeYTbMAnMolB6x0qjvnwEGGGCAAQYYYI
ABJjmY74+E/ODnMz8fbZyfrAHrh1j6XQvmxemeP4uTs70Nszg5E0tfaMIIJ4phn2l6pcAAAwwwwAADDDBRYvYWfz6Mr3Bv6U9V4MP46jVhMnXUfCTMkN9NnG82b76/vzRx7rWLkzNggAEGmCxg/gAcTwKRD+vGjgAAAABJRU5ErkJggg=="
        }
    ],
    "return_documents": false
}'
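
For readers who prefer Python, the curl call can be sketched with the standard library alone. This is an illustrative equivalent, not an official sample: the function names `make_rerank_request` and `send` are hypothetical, and the 60-second timeout is an arbitrary choice. The URL, method, and headers are taken from the curl command.

```python
import json
import urllib.request

API_URL = "https://api.302.ai/jina/v1/rerank"


def make_rerank_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl sample."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    body = {
        "model": "jina-reranker-m0",
        "query": "small language model data extraction",
        "documents": [{"text": "Is data extraction needed? Why not use regex?"}],
        "return_documents": False,
    }
    request = make_rerank_request("YOUR_API_KEY", body)
    # Uncomment after substituting a real key:
    # print(send(request))
```

Separating request construction from sending makes it possible to inspect the headers and body without making a network call.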

Responses

🟢200 Success
application/json
Body
object {0}
Example
{}
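
The example body above is empty, so this page does not document the response fields. The sketch below assumes the response follows the upstream Jina rerank format, with a `results` list whose entries carry an `index` into the submitted `documents` and a `relevance_score`. Treat that shape as an assumption to verify against a real response; the function name `rank_documents` is hypothetical.

```python
def rank_documents(response: dict, documents: list) -> list:
    """Pair each submitted document with its score, best match first.

    Assumes response["results"] is a list of
    {"index": int, "relevance_score": float} entries (not confirmed by this page).
    """
    results = response.get("results", [])
    ranked = sorted(results, key=lambda r: r["relevance_score"], reverse=True)
    return [(documents[r["index"]], r["relevance_score"]) for r in ranked]


if __name__ == "__main__":
    submitted = [{"text": "gold rush"}, {"text": "regex vs extraction"}]
    # Hand-written stand-in for a response, used only to exercise the function.
    fake_response = {
        "results": [
            {"index": 0, "relevance_score": 0.1},
            {"index": 1, "relevance_score": 0.8},
        ]
    }
    for document, score in rank_documents(fake_response, submitted):
        print(score, document)
```

Mapping results back through `index` matters because the request example sets `return_documents` to false, in which case the caller has to look up the original items itself.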