Image Generation

Generate any image, from any input, with any style. WaveSpeed brings together every AI image generation method — text-to-image, image-to-image, editing, upscaling, and more — in one unified platform built for speed.

Try for Free View API Docs

Every Way to Generate Images, One Platform

WaveSpeed doesn't limit you to one model or one method. Generate, edit, enhance, and transform images across every major AI approach.

📸Text to Image

Generate images purely from text prompts with full control over style, composition, and mood.

Prompt	Style	Notes
"Japanese zen garden at dawn, raked sand patterns, single cherry blossom tree, misty atmosphere, photorealistic"	Photorealistic	Detailed scene generation with environmental lighting and mood control
"Retro travel poster for Mars, 1960s NASA style, bold typography, flat colors"	Graphic Design	Design-ready output for print, social, or branding use

🔄Image to Image

Transform existing images into new styles, moods, or levels of detail while preserving structure.

Input	Transformation	Notes
Rough pencil sketch of a character	→ polished digital illustration with full color and lighting	Sketch to Render — concept artists go from draft to presentation in one step
Daytime photo of a building	→ same building at night with neon signage and rain reflections	Scene Relighting — transform existing photos into entirely new moods

✂️Image Editing

Make localized edits — swap backgrounds, remove objects, and recompose scenes without manual masking.

Input	Edit	Notes
Portrait photo	→ background swapped from office to outdoor beach setting	Background Swap — seamless compositing without manual masking
Product photo	→ object removed from the background, clean transparent output	Object Removal — e-commerce-ready asset prep in seconds

⬆️Upscaling & Enhancement

Restore, upscale, and enhance images for print, production, or archival use.

Input	Output	Notes
Low-res 512px thumbnail	→ sharp 2048px image with recovered detail	4x Upscale — rescue old or web-sourced images for print or production
Blurry group photo	→ enhanced facial detail on every face in frame	Face Enhancement — restore portraits without manual retouching

Image Generation Gallery

Real outputs across every generation method — all produced on WaveSpeed.

Every image above was generated or processed on WaveSpeed. Results vary by model and parameters.

Q & A

What is image generation?

Image generation uses AI to create or transform images from various inputs — text prompts, existing images, sketches, or photos. It covers a range of methods including text-to-image, image-to-image, editing, upscaling, and background removal. WaveSpeed provides all of these through one platform.

How is this different from the Text to Image page?

Text to Image focuses specifically on generating images from text prompts. Image Generation is the broader overview — covering every method of creating and transforming images with AI, including image-to-image, editing, upscaling, and enhancement.

Which models does WaveSpeed support for image generation?

WaveSpeed hosts FLUX.1, FLUX.1 Kontext, Seedream V4, Z-Image, Nano Banana Pro, Stable Diffusion XL, ESRGAN, GFPGAN, and more. New models are added as they release. Browse the full catalog on the Explore Models page.

Are the enhancement tools free?

Yes. Image Upscaler, Face Enhancer, Background Remover, and several other tools run locally and for free through the WaveSpeed Desktop app — no API key required, no uploads, fully private.

Can I access image generation via API?

Yes. All cloud-based image generation models are available through WaveSpeed's unified REST API. Batch processing, LoRA support, negative prompting, and full parameter control are included. Official Python and JavaScript SDKs are provided.