Enjoy 50% OFF Vidu Q3 & Q3 Pro models • Only on WaveSpeedAI | May 20 – Jun 2
Home/Explore/WaveSpeed/Qwen Image/Text To Image Lora

Qwen Image Text to Image LoRA

wavespeed-ai /

Qwen-Image LoRA is a 20B MMDiT next-gen text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support
Input
width
height
1024 × 1024 px
Range: 256 - 1536
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Idle

Valentin in a natural daylight selfie at a cafe entrance. He looks seriously into the camera, wearing a black coat or jacket and wireless earbud. Background includes wooden frames, warm pendant lights, and urban cafe details. With text "WaveSpeedAI"

$0.025per run·~40 / $1

ExamplesView all

Valentin in a natural daylight selfie at a cafe entrance. He looks seriously into the camera, wearing a black coat or jacket and wireless earbud. Background includes wooden frames, warm pendant lights, and urban cafe details. With text "WaveSpeedAI"

Valentin in a natural daylight selfie at a cafe entrance. He looks seriously into the camera, wearing a black coat or jacket and wireless earbud. Background includes wooden frames, warm pendant lights, and urban cafe details. With text "WaveSpeedAI"

realism, a female inventor with auburn hair in an intricate updo and goggles on her head, her eyes full of intellect. She wears a leather corset and a multi-layered skirt, standing in her workshop. The room is filled with brass gears, complex clockwork devices, and glowing vacuum tubes. Warm light from gas lamps illuminates the scene. Steampunk style, highly detailed, retro-futurism, masterpiece.

realism, a female inventor with auburn hair in an intricate updo and goggles on her head, her eyes full of intellect. She wears a leather corset and a multi-layered skirt, standing in her workshop. The room is filled with brass gears, complex clockwork devices, and glowing vacuum tubes. Warm light from gas lamps illuminates the scene. Steampunk style, highly detailed, retro-futurism, masterpiece.

A glamorous woman with a sharp bob haircut and dark lipstick. She is dressed in a stunning black and gold sequined flapper dress with long pearls. She leans against a gilded Art Deco bar, with a jazz band softly blurred in the background. Sophisticated, low-key lighting creates a luxurious and intimate mood, Great Gatsby era, glamorous, geometric patterns.

A glamorous woman with a sharp bob haircut and dark lipstick. She is dressed in a stunning black and gold sequined flapper dress with long pearls. She leans against a gilded Art Deco bar, with a jazz band softly blurred in the background. Sophisticated, low-key lighting creates a luxurious and intimate mood, Great Gatsby era, glamorous, geometric patterns.

A majestic Afrofuturist queen with magnificent braided hair adorned with golden rings and cybernetic circuits. She wears a vibrant robe that merges traditional Kente patterns with glowing energy lines. The background is a futuristic African metropolis with unique architecture and flying vehicles. Vibrant, vivid colors, sci-fi art, character portrait.

A majestic Afrofuturist queen with magnificent braided hair adorned with golden rings and cybernetic circuits. She wears a vibrant robe that merges traditional Kente patterns with glowing energy lines. The background is a futuristic African metropolis with unique architecture and flying vehicles. Vibrant, vivid colors, sci-fi art, character portrait.

A close-up portrait of a stylish woman with wavy, dark brown long hair and a warm smile, wearing a beige cashmere sweater. The background is a blurred city street with a soft bokeh effect. Natural afternoon light, cinematic, photorealistic, high detail, 8K.

A close-up portrait of a stylish woman with wavy, dark brown long hair and a warm smile, wearing a beige cashmere sweater. The background is a blurred city street with a soft bokeh effect. Natural afternoon light, cinematic, photorealistic, high detail, 8K.

realism, a young woman sitting alone in a laundromat at midnight, wearing headphones, staring at the rotating dryer drum, neon reflections on the glass, a subtle expression of nostalgia on her face

realism, a young woman sitting alone in a laundromat at midnight, wearing headphones, staring at the rotating dryer drum, neon reflections on the glass, a subtle expression of nostalgia on her face

realism, a woman with healthy tanned skin and natural long curly hair, with a few wildflowers woven into it. She wears a fringed, off-white linen dress and sits barefoot in a golden field at sunset, holding a guitar. The lighting is warm and soft, creating a free-spirited and romantic atmosphere, photorealistic, golden hour lighting.

realism, a woman with healthy tanned skin and natural long curly hair, with a few wildflowers woven into it. She wears a fringed, off-white linen dress and sits barefoot in a golden field at sunset, holding a guitar. The lighting is warm and soft, creating a free-spirited and romantic atmosphere, photorealistic, golden hour lighting.

real life anime, a woman with curly hair tied loosely, wearing a paint-stained oversized white shirt, barefoot, standing in a spacious industrial loft with large windows and exposed brick walls. She’s holding a large brush, working on a colorful abstract canvas. Natural light pouring in, art supplies scattered around, expressive, richly detailed scene.

real life anime, a woman with curly hair tied loosely, wearing a paint-stained oversized white shirt, barefoot, standing in a spacious industrial loft with large windows and exposed brick walls. She’s holding a large brush, working on a colorful abstract canvas. Natural light pouring in, art supplies scattered around, expressive, richly detailed scene.

realism, a woman like a mermaid, with flowing, long, blue hair and shimmering scales. She swims gracefully in clear tropical waters filled with coral and strange marine life. Sunlight penetrates the water's surface, creating moving beams of light that illuminate the entire scene—dreamy, vibrant, light and shadow effects, underwater photography, highly detailed.

realism, a woman like a mermaid, with flowing, long, blue hair and shimmering scales. She swims gracefully in clear tropical waters filled with coral and strange marine life. Sunlight penetrates the water's surface, creating moving beams of light that illuminate the entire scene—dreamy, vibrant, light and shadow effects, underwater photography, highly detailed.

realism, a young scholar with glasses, wearing a tweed blazer, sits in a grand, ancient library. Sunlight streams through a massive arched window, illuminating dust motes dancing in the air. An open book rests on her lap as she looks up thoughtfully. Warm and cozy atmosphere, light academia aesthetic, narrative lighting, photorealistic.

realism, a young scholar with glasses, wearing a tweed blazer, sits in a grand, ancient library. Sunlight streams through a massive arched window, illuminating dust motes dancing in the air. An open book rests on her lap as she looks up thoughtfully. Warm and cozy atmosphere, light academia aesthetic, narrative lighting, photorealistic.

A resilient female survivor with wind-swept short hair and a determined gaze. She wears patched-up leather gear and tactical equipment, holding a modified staff. She stands on a hill overlooking the ruins of a city at dusk, against a dramatic orange sky. Cinematic, post-apocalyptic style, realism, atmospheric lighting, wide-angle shot.

A resilient female survivor with wind-swept short hair and a determined gaze. She wears patched-up leather gear and tactical equipment, holding a modified staff. She stands on a hill overlooking the ruins of a city at dusk, against a dramatic orange sky. Cinematic, post-apocalyptic style, realism, atmospheric lighting, wide-angle shot.

Related Models

README

Qwen-Image-LoRA

Qwen-Image-LoRA extends the base 20B MMDiT text-to-image model by allowing users to plug in custom LoRA weights (.safetensors) for fine-tuned control over style, characters, or artistic domains. This makes it a versatile tool for creators who want both world-class text rendering and personalized generation.

Why it looks great

  • LoRA integration: Import external .safetensors LoRA weights and control blending strength via scale.
  • SOTA text rendering: Rivals GPT-4o in English and is best-in-class for Chinese typography.
  • In-pixel text generation: Text is seamlessly integrated into images (no overlays).
  • Bilingual support: Handles Chinese & English with diverse fonts and complex layouts.
  • General image excellence: Photorealistic, anime, impressionist, or minimalist styles—all supported.

Limits and Performance

  • Max resolution per job: up to 1024 × 1024 pixels
  • LoRA path: provide <owner>/<model-name> or external .safetensors URL
  • LoRA scale: adjustable strength (default = 1.0)
  • Output formats: JPEG / PNG / WEBP
  • Processing speed: ~6–10 seconds per image
  • Input prompt: supports multi-line descriptive text

Pricing

  • $0.025 per image
  • Each image is billed individually.

How to Use

  1. Enter a prompt (supports detailed narrative & embedded text).
  2. Set size (width & height, up to 1024×1024).
  3. Add one or more LoRAs:
  • Paste the path/URL of the LoRA .safetensors file.
  • Adjust the scale (e.g., 0.5 for subtle effect, 1.0 for full strength).
  1. (Optional) Set seed for reproducibility (-1 = random).
  2. Choose output format (JPEG / PNG).
  3. Run → preview results → iterate with different LoRA scales.

Pro tips for best quality

  • Use specific LoRAs for characters, art styles, or IP consistency.
  • Combine multiple LoRAs for hybrid results (e.g., anime + steampunk).
  • Adjust scale carefully—too high may distort, too low may fade.
  • Lock the seed to maintain subject consistency when swapping LoRAs.

Reference

Note

Accessibility:This website uses AI models provided by third parties.

Qwen Image Text To Image Lora API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image/text-to-image-lora with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Qwen Image Text To Image Lora below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image/text-to-image-lora" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "seed": -1,
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/qwen-image/text-to-image-lora", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "size": "1024*1024",
        "seed": -1,
        "output_format": "jpeg",
        "enable_sync_mode": false,
        "enable_base64_output": false
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/qwen-image/text-to-image-lora",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "seed": -1,
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}
)

print(output["outputs"][0])  # → URL of the generated output

Qwen Image Text To Image Lora API — Frequently asked questions

What is the Qwen Image Text To Image Lora API?

Qwen Image Text To Image Lora is a WaveSpeedAI model for AI inference, exposed as a REST API on WaveSpeedAI. Qwen-Image LoRA is a 20B MMDiT next-gen text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Qwen Image Text To Image Lora API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/qwen-image-text-to-image-lora.

How much does Qwen Image Text To Image Lora cost per run?

Qwen Image Text To Image Lora starts at $0.025 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Qwen Image Text To Image Lora accept?

Key inputs: `prompt`, `size`, `seed`, `enable_base64_output`, `enable_sync_mode`, `loras`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/qwen-image-text-to-image-lora.

How long does Qwen Image Text To Image Lora take to generate?

Average end-to-end generation time on WaveSpeedAI is around 32 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Qwen Image Text To Image Lora outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.