AI Models Directory

Aion 1.0
AionLabs
Aion-1.0 is a multi-model system designed for high performance across various tasks, including reaso…

Aion 2.0
AionLabs
Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytelling. It is p…
Alex
Replicate
A relaxed, informal male voice for chatting with friends.
Arthur
Replicate
A noble, chivalrous young male voice for heroic tales.

BRIA RMBG 2.0
Replicate
Remove image backgrounds with AI precision.
Chloe
Replicate
A cheerful, bubbly young female voice that radiates positivity.
Claude 3 Haiku
Anthropic
Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick …
Claude 3.5 Haiku
Anthropic
Claude 3.5 Haiku offers enhanced capabilities in speed, coding accuracy, and tool use. Engi…
Claude Haiku (Latest)
Anthropic
This model always redirects to the latest model in the Anthropic Claude Haiku family.
Claude Haiku 4.5
Anthropic
Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intellige…
Claude Opus 4
Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustaine…
Claude Opus 4.1
Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance i…
Claude Opus 4.5
Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, …
Claude Opus 4.6
Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built …
Claude Opus 4.6 Fast
Anthropic
Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with higher out…
Claude Opus 4.7
Anthropic
Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous age…
Claude Sonnet (Latest)
Anthropic
This model always redirects to the latest model in the Anthropic Claude Sonnet family.
Claude Sonnet 4.5
Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents…
Claude Sonnet 4.6
Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across codi…
David
Replicate
A deep, authoritative male voice ideal for professional presentations.

DeepSeek Chat V3
Specialized chat model from DeepSeek, optimized for conversation.

DeepSeek R1
DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with full…

DeepSeek R1 Distill Llama 70B
DeepSeek
DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](…

DeepSeek R1 Distill Qwen 32B
DeepSeek
DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://hug…

DeepSeek V3.2 Speciale
DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning an…

DeepSeek V4 Flash
DeepSeek
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total …

DeepSeek V4 Pro
DeepSeek
DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters a…

DeOldify
Ariel Replicate
Colorize black and white images.

Devstral 2512
Mistral AI
Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It …
Dewi
Replicate
A serene, calm female voice in Indonesian.
Emily
Replicate
An energetic, uplifting young female voice full of motivation.

ERNIE Image
wavespeed
Baidu's text-to-image model. Supports English, Chinese, and Japanese.

ERNIE Image Turbo
wavespeed
Fast 8-step distilled ERNIE image generation.
Ethan
Replicate
A polite, well-mannered young male voice for formal settings.

Flux 1.1 Pro
Replicate
Professional Flux model with excellent prompt adherence.

Flux 2 Max
Replicate
Advanced image editing with improved prompt understanding.

Flux 2 Pro
Replicate
Professional image editing with aspect ratio control.

Flux Dev
Replicate
12B parameter model, supports img2img with prompt strength.

Flux Fast
Replicate
Fastest Flux endpoint, optimized by PrunaAI.

Flux Fill Pro
Black Forest Labs
Professional inpainting to remove objects.

Flux Kontext Max
Replicate
Premium text-based image editing with max performance.

Flux Kontext Pro
Replicate
Text-based image editing with natural language.

Flux Schnell
Replicate
Fast Flux model for quick iterations.

Flux.2 Flex
Replicate
Flexible model for various styles.

Flux.2 Klein 4B
openrouter
Fast and economical, good for drafts.

Flux.2 Max
openrouter
Maximum capability Flux model.

Flux.2 Pro
openrouter
Top-tier Flux model for demanding tasks.

Gemini 2.5 Flash
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced re…

Gemini 2.5 Pro
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathem…

Gemini 3 Pro Image Preview
openrouter
Google's latest high-quality image model preview.

Gemini 3.1 Flash Image Preview
openrouter
Latest Gemini image model preview.

Gemini 3.1 Flash Lite
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases.…

Gemini 3.1 Pro
Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineerin…

Gemini Flash (Latest)
This model always redirects to the latest model in the Google Gemini Flash family.

Gemini Pro (Latest)
This model always redirects to the latest model in the Google Gemini Pro family.

GFPGAN Face Restoration
Replicate
Restore old, damaged, or low-quality faces with AI.

GLM 4.5
Z.AI
GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leve…

GLM 4.5 Air
Z.AI
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for a…

GLM 4.5V
Z.AI
GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture…

GLM 4.6
Z.AI
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The c…

GLM 4.6V
Z.AI
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-contex…

GLM 4.7
Z.AI
GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming c…

GLM 4.7 Flash
Z.AI
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficienc…

GLM 5.1 Reasoning
Z.AI
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling …

GLM-5
Z.AI
GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long…

GLM-5 Turbo
Z.AI
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-dri…

GLM-5V-Turbo
Z.AI
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding…

GPT (Latest)
OpenAI
This model always redirects to the latest model in the OpenAI GPT family.

GPT Chat Latest
OpenAI
GPT Chat Latest points to OpenAI's stable API alias `chat-latest` that always resolves to the latest…

GPT Image 2
wavespeed
OpenAI GPT Image 2 text-to-image. Quality & resolution based pricing.

GPT Image 2 Edit
wavespeed
OpenAI GPT Image 2 editing. Modify images with text prompts.

GPT Mini (Latest)
OpenAI
This model always redirects to the latest model in the OpenAI GPT Mini family.

GPT-4o
OpenAI
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text…

GPT-4o Mini
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text…

GPT-4o Mini Search Preview
OpenAI
GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained …

GPT-5 Mini
OpenAI
GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It prov…

GPT-5 Nano
OpenAI
GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, r…

GPT-5.1 Codex Max
OpenAI
GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context s…

GPT-5.2
GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long c…

GPT-5.2 Chat
OpenAI
GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-late…

GPT-5.2 Codex
OpenAI
GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding …

GPT-5.3 Chat
OpenAI
GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, m…

GPT-5.3 Codex
OpenAI
GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engine…

GPT-5.4
GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It…

GPT-5.4 Mini
OpenAI
GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for…

GPT-5.4 Nano
OpenAI
GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for…

GPT-5.4 Pro
GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhance…

GPT-5.5
OpenAI
GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 …

GPT-5.5 Pro
OpenAI
GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, …
Grace
Replicate
A gentle, soothing female voice ideal for relaxation and meditation.

Granite 4.0 H Micro
IBM
Granite-4.0-H-Micro is a 3B-parameter model from the Granite 4 family of models. These models are the late…

Grok 2 Image
wavespeed
xAI's first image model based on Aurora architecture. Exceptional photorealism.

Grok 4 Fast
xAI
Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window…

Grok 4.1 Fast
xAI
Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like cust…

Grok 4.20 Beta
xAI
Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabil…

Grok 4.20 Multi-Agent
xAI
Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workfl…

Grok 4.3
xAI
Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is su…

Grok Imagine
Replicate
High-realism image generation from xAI.

Grok Imagine
wavespeed
xAI's flagship text-to-image model with exceptional aesthetic quality.

Grok Imagine Edit
wavespeed
xAI's image editing model. Precise control over edits with text instructions.

Grok Imagine Video
Replicate
xAI's flagship text-to-video model.

Grok Imagine Video Edit
wavespeed

Grok Imagine Video Extend
wavespeed

Grok Imagine Video I2V
wavespeed

Grok Imagine Video T2V
wavespeed

Hailuo 2.3
Replicate
High quality image-to-video with fine details.

Hailuo 2.3 Fast
Replicate
Faster version of Hailuo 2.3.

HappyHorse 1.0 Edit
wavespeed

HappyHorse 1.0 Extend
wavespeed

HappyHorse 1.0 I2V
wavespeed

HappyHorse 1.0 Ref-to-Video
wavespeed

HappyHorse 1.0 T2V
wavespeed
Alibaba's text-to-video model.
Henry
Replicate
A sophisticated, refined male voice for high‑end presentations.

HiDream L1 Fast
Replicate
Performance-optimized text-to-image with fast generation.

HiDream-I1
wavespeed
17B parameter open-source model with state-of-the-art quality and speed.

Hunter Alpha
Stealth (OpenRouter)
1T parameter omni-modal model with vision, reasoning, and agentic workflows. Free tier.

Hunyuan3D 3.1
Tencent
Generate 3D models from text or images.

HY3 Preview
Tencent
Tencent's preview model, free tier with limited usage.

Ideogram V2
Replicate
Second generation model for graphic design.

Ideogram V2 Turbo
Replicate
Fast version of V2.

Ideogram V3 Balanced
Replicate
Balance between speed and text quality.

Ideogram V3 Quality
Replicate
Highest quality Ideogram with excellent typography.

Ideogram V3 Turbo
Replicate
Fast text generation under 5 seconds.

Imagen 3
Replicate
Google's latest text-to-image model with high detail.

Imagen 3 Fast
Replicate
Optimized for low latency with Imagen 3 quality.

Imagen 4
Replicate
Google's flagship text-to-image model, high quality.

Imagen 4 Fast
Replicate
Fast version of Imagen 4, slightly lower quality.

Imagen 4 Ultra
Replicate
Ultra high-quality image generation from Google.

Intellect-3
Prime Intellect
INTELLECT-3 is a 106B-parameter Mixture-of-Experts model (12B active) post-trained from GLM-4.5-Air-…
James
Replicate
A warm, approachable male voice perfect for everyday conversation.

Kimi (Latest)
Moonshot AI
This model always redirects to the latest model in the MoonshotAI Kimi family.

Kimi K2
Moonshot AI
Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, …

Kimi K2 Thinking
Moonshot AI
Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 serie…

Kimi K2-0905
Moonshot AI
Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixt…

Kimi K2.5
Moonshot AI
Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capabi…

Kimi K2.6
Moonshot AI
Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, codin…

Kling 2.1 Master
Replicate
Ultimate quality, best for final renders.

Kling 2.1 Pro
Replicate
Professional grade, 1080p resolution.

Kling 2.1 Standard
Replicate
Balanced quality and speed, 720p up to 10s.

Kling 2.5 Turbo Pro
Replicate
Blazing fast, high quality 1080p.

Kling 2.6
Replicate
Next-gen model with optional native audio.

Kling 2.6 Pro T2V
wavespeed

Kling O3 4K I2V
wavespeed

Kling O3 4K Ref-to-Video
wavespeed

Kling O3 4K T2V
wavespeed

Kling V3 Omni
Replicate
Multimodal video model with advanced audio understanding.

Kling V3.0 4K I2V
wavespeed

Kling V3.0 4K T2V
wavespeed
Kling V3.0 4K text-to-video.
Leo
Replicate
A firm, resolute male voice that conveys confidence and purpose.

Leonardo Motion 2.0
Replicate
Add motion to static images.

Llama 3.2 11B Vision Instruct
Meta
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks comb…

Llama 3.3 70B Instruct
Meta
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned gen…

Llama 4 Maverick
Meta
Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built o…

Llama Nemotron Embed VL
Nvidia
Free embedding model with vision-language understanding.

Lucid Origin
Replicate
Leonardo.ai's all-purpose text-to-image model.

Lyria 3
Generates short music clips up to 30 seconds. Ideal for quick prototyping.

Lyria 3 Pro
Generates full songs up to 3 minutes with detailed structure and high‑quality vocals.
Maya
Replicate
An assertive, confident female voice in Indonesian.

Mercury 2
Inception
Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead …
Mia
Replicate
A light, airy female voice, great for children's content.

MiMo V2 Flash
Xiaomi
MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Ex…

MiMo V2 Omni
Xiaomi
MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs w…

MiMo V2 Pro
Xiaomi
MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M conte…

MiMo V2.5
Xiaomi
MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughl…

MiMo V2.5 Pro
Xiaomi
MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilit…

MiniMax Image 01
Replicate
Focus on Asian aesthetics and speed.

MiniMax M2 Her
Replicate
MiniMax M2-her is a dialogue-first large language model built for immersive roleplay, character-driv…

MiniMax M2.5
Replicate
MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diver…

MiniMax M2.7
Replicate
MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productiv…

MiniMax Music 1.5
Replicate
Creates songs up to 4 minutes. 2 free trials!

MiniMax Music 2.5
Replicate
Full song generation with rich instrumentation and natural vocals.

Ministral 14B 2512
Mistral AI
The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and perfor…

Ministral 3B 2512
Mistral AI
The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language …

Ministral 8B 2512
Mistral AI
A balanced model in the Ministral 3 family, Ministral 3 8B is a powerful, efficient tiny language mo…

Mistral Large
This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a propriet…

Mistral Medium 3.5
Mistral AI
Mistral's 128B dense model with 256k context, vision, and hybrid reasoning. Great for coding and age…

Mixtral 8x22B Instruct
Mistral AI
Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). …

Nano Banana
Replicate
Image editing with img2img support.

Nano Banana Pro
Replicate
Advanced image editing with multi-image input support.

Nano Banana Pro Edit
wavespeed
Edit images with Nano Banana Pro.

Nano Banana Pro Edit Ultra
wavespeed
Ultra high-resolution image editing with Nano Banana Pro.

Nemotron 3 Nano
Nvidia
NVIDIA Nemotron 3 Nano 30B A3B is a small MoE language model with the highest compute efficiency and acc…

Nemotron 3 Super
Nvidia
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters fo…
Nora
Replicate
A serene, spiritual female voice with a calm and measured tone.

Nova 2 Lite V1
Amazon
Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, …

Nova Lite V1
Amazon
Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon focused on fast processing…

Nova Micro V1
Amazon
Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon …

Nova Premier V1
Amazon
Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks an…

Nova Pro V1
Amazon
Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of …

Nucleus
wavespeed
Creative-focused model with minimal restrictions. Explore diverse artistic styles.
Oliver
Replicate
A calm, deliberate male speaker, excellent for educational content.

OpenAI o1
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before…

OpenAI o1-preview
Preview model with latest features from OpenAI, ideal for testing.

OpenAI o1-pro
The o1 series of models are trained with reinforcement learning to think before they answer and perf…

P-Image
Replicate
High-quality image generation with strong prompt adherence.
Pak Budi
Replicate
A calm, authoritative male leader voice in Indonesian.

Phi-4
Microsoft
[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and ca…

Phi-4 Mini Instruct
Microsoft
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly avai…

PixVerse Ref-to-Video
wavespeed

PixVerse Transition
wavespeed

PixVerse V4
Replicate
Fast generation with good quality.

PixVerse V4.5
Replicate
Enhanced with motion control and multi-image fusion.

Proteus V0.3
Replicate
Community model fine-tuned for photorealism and anime.

Qwen 3 Max
Model from Alibaba, very economical for daily use.

Qwen 3.5 35B
Alibaba
The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture tha…

Qwen 3.5 9B
Alibaba
Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reas…

Qwen 3.5 Flash (02-23)
Alibaba
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a…

Qwen 3.5 Plus (2026-04-20)
Alibaba
Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, …

Qwen 3.6 27B
Alibaba
Qwen3.6 27B is a dense 27-billion-parameter language model from the Qwen Team at Alibaba, released i…

Qwen 3.6 35B A3B
Alibaba
Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total paramete…

Qwen 3.6 Flash
Alibaba
Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, …

Qwen 3.6 Max Preview
Alibaba
Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-…

Qwen 3.6 Plus
Alibaba
Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse m…

Qwen Image
Alibaba
Multimodal model with excellent text rendering.

Qwen Image 2.0
wavespeed
Alibaba's 7B text-to-image foundation model. Excellent photorealism and typography.

Qwen Image 2.0 Edit
wavespeed
Alibaba's 7B unified image editing model. Edit existing images with text instructions.

Qwen Image 2.0 Pro Edit
wavespeed
Pro version of Qwen Image 2.0 Edit with higher quality and better detail preservation.

Qwen3 VL 235B Thinking
Alibaba
Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual un…

RAD Posters
Replicate
Generate stunning posters in RPS style.

Real-ESRGAN
Replicate
Upscale images up to 4x with AI enhancement.

Recraft V3
Replicate
Legacy version, more affordable.

Recraft V4
Replicate
Standard version for fast design generation.

Recraft V4 Pro
Replicate
Professional design-focused image generation.

Recraft V4 Pro SVG
Replicate
Generate detailed editable SVG vector graphics.
Rina
Replicate
A soft‑spoken, gentle female voice in Indonesian.

Riverflow V2 Fast
openrouter
Quick generation with decent quality.

Riverflow V2 Fast Preview
openrouter
Preview version, fast results.

Riverflow V2 Max Preview
openrouter
Maximum quality preview.

Riverflow V2 Pro
openrouter
Professional grade image generation.

Riverflow V2 Standard Preview
openrouter
Balanced speed and quality.
Rudi
Replicate
A compassionate, caring male voice in Indonesian.
Sarah
Replicate
A wise, mature female voice, perfect for storytelling and narration.
Sari
Replicate
A cute, sweet young female voice in Indonesian.

SD 3.5 Large
Replicate
8B parameter multimodal diffusion transformer.

SDXL
Replicate
Stable Diffusion XL, versatile and powerful.

SDXL Lightning 4-Step
Replicate
Ultra-fast 4-step generation, high quality.

Seed 2.0 Lite
Replicate
Seed-2.0-Lite is a versatile, cost‑efficient enterprise workhorse that delivers strong multimodal an…

Seedance 1 Lite
Replicate
Lightweight, fast, and affordable.

Seedance 1 Pro
Replicate
Professional grade with higher detail.

Seedance 1 Pro Fast
Replicate
Faster Pro version.

Seedance 2.0
Replicate
Multimodal video model with support for up to 9 reference images.

Seedance 2.0 Fast Video Edit
wavespeed

Seedance 2.0 Fast Video Edit Turbo
wavespeed

Seedance 2.0 Fast Video Extend
wavespeed

Seedance 2.0 Video Edit
wavespeed

Seedance 2.0 Video Edit Turbo
wavespeed

Seedance 2.0 Video Extend
wavespeed

Seedream 3
Replicate
Stable text-to-image with good quality.

Seedream 4
Replicate
ByteDance's advanced image model with aspect ratio control.

Seedream 4 Edit
wavespeed
Older Seedream 4 image editing model.

Seedream 4.5
openrouter
ByteDance's advanced image model.

Seedream 4.5
wavespeed
ByteDance's latest text-to-image model.

Seedream 4.5 Edit
wavespeed
Edit images using the latest Seedream 4.5 model.

Seedream 4.5 Edit Sequential
wavespeed
Generate a sequence of edited images with Seedream 4.5.

Seedream 5.0 Lite
Replicate
Latest ByteDance model with multi-step reasoning.

Sonar
Perplexity
Sonar is lightweight, affordable, fast, and simple to use — now featuring citations and the ability …

Sonar Deep Research
Perplexity
Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and re…

Sonar Pro
Perplexity
Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexi…

Sonar Pro Search
Perplexity
Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity's most ad…

Sonar Reasoning Pro
Perplexity
Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexi…
Sophie
Replicate
A sweet, pleasant young female voice perfect for audiobooks.

Stable Diffusion
Replicate
Original Stable Diffusion model.

Step 3.5 Flash
StepFun
Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of …

Sticker Maker
Replicate
Generate stickers with transparent backgrounds.

Trinity Large Preview
Arcee AI
Trinity-Large-Preview is a frontier-scale open-weight language model from Arcee, built as a 400B-par…

Trinity Large Preview
Arcee AI
Large preview model from Arcee AI, free to use.

Trinity Large Thinking
Arcee AI
Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows…

VEED Fabric 1.0
Replicate
Lip-sync animation from image and audio.

Veo 2
Replicate
Google's premium text-to-video model.

Veo 3
Replicate
Latest Veo with optional audio generation.

Veo 3 Fast
Replicate
Faster Veo 3, slightly lower quality but great speed.

Veo 3.1
Replicate
Refined Veo 3.1 with improved prompt adherence and JSON support.

Veo 3.1 Fast
Replicate
Fast version of Veo 3.1 with JSON prompt support.
Victoria
Replicate
A commanding, regal female voice that exudes authority.

Wan 2.1 1.3B
Replicate
Lightweight video model, 5 seconds 480p. Fast and economical.

Wan 2.1 I2V 480p (Wavespeed)
Replicate
Optimized Wan 2.1 for image-to-video at 480p.

Wan 2.1 I2V 480p Ultra Fast
wavespeed

Wan 2.2 I2V Fast
Replicate
Optimized fast image-to-video with interpolated frames.

Wan 2.5 I2V
Replicate
Image-to-video with precise motion control.

Wan 2.5 T2V
Replicate
Advanced text-to-video with high quality output.

Wan 2.6 Image Edit
wavespeed
Alibaba's Wan 2.6 image editing model.

Wan 2.6 T2V
wavespeed

Wan 2.7 Image
Replicate
Next-gen text-to-image with high quality output.

Wan 2.7 Image Pro
Replicate
Professional grade image generation with advanced controls.

Wavespeed Chroma
wavespeed
8.9B parameter model built on FLUX.1-schnell. Ultra-fast with unique visual style.

Wavespeed InstantCharacter
wavespeed
Generate consistent characters from text prompts. Ideal for storytelling and branding.

Z-Image Turbo
Replicate
Super fast 6B parameter model, sub-second generation.

Z-Image Turbo
wavespeed
Ultra-fast 6B parameter image generation.
Zara
Replicate
An outgoing, lively female voice full of energy and enthusiasm.