WaveSpeed Blog - Page 56

Introducing WaveSpeedAI Molmo2 Video Qa on WaveSpeedAI

Molmo2-4B Video QA: Answer questions about video content with temporal understanding. Open-source vision-language model. Ready-to-use REST API, no cold starts,

Jan 16, 2026 5 min read

Introducing WaveSpeedAI Molmo2 Video Understanding on WaveSpeedAI

Molmo2-4B Video Understanding: Analyze videos with specialized tasks (general, summary, analysis, counting, scene description). Open-source vision-language mode

Jan 16, 2026 5 min read

Introducing WaveSpeedAI Openai Whisper With Video on WaveSpeedAI

OpenAI Whisper Large v3 (Video-to-Text) delivers high-accuracy multilingual transcription directly from video files, with automatic language detection and optio

Jan 16, 2026 4 min read

Introducing WaveSpeedAI Paddle Ocr on WaveSpeedAI

PaddleOCR-VL is an ultra-compact 0.9B parameter vision-language model for document parsing, supporting 109 languages with text, table, formula, and chart recogn

Jan 16, 2026 5 min read

Introducing WaveSpeedAI Qwen Image 2512 LoRA Trainer on WaveSpeedAI

Qwen-Image-2512 LoRA Trainer lets you train custom LoRA models 10x faster with style, character, and object training. From concept to model in minutes, not hour

Jan 16, 2026 5 min read

Introducing WaveSpeedAI Qwen Image Text-to-Image 2512 LoRA on WaveSpeedAI

Qwen-Image-2512 LoRA is an enhanced 20B MMDiT text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST infer

Jan 16, 2026 5 min read

Introducing WaveSpeedAI Video Background Remover on WaveSpeedAI

WaveSpeed Video Background Remover replaces or removes video backgrounds with a custom image. Upload or paste a link to your video, then provide a background im

Jan 16, 2026 5 min read

Introducing WaveSpeedAI Z Image Turbo Controlnet on WaveSpeedAI

Z-Image-Turbo ControlNet generates images guided by structural control signals (depth, canny edge, pose) for precise composition control. Ready-to-use REST infe

Jan 16, 2026 6 min read

Introducing xAI Grok 2 Image on WaveSpeedAI

Grok 2 Image is xAI’s latest image generation model that turns simple text prompts into sharp, photorealistic visuals in seconds. From product shots to social

Jan 16, 2026 5 min read

Introducing Z AI CogView 4 on WaveSpeedAI

Z-AI CogView-4 generates high-quality images from text prompts with a quick and accurate understanding of user descriptions, letting AI express images more prec

Jan 16, 2026 5 min read

Introducing Z AI Glm Image Text-to-Image on WaveSpeedAI

Z-AI GLM Image generates high-quality images from text prompts, with enhanced understanding of user descriptions, resulting in images that are more precise and

Jan 16, 2026 5 min read

Introducing Z AI Glm Image Edit on WaveSpeedAI

GLM-Image Edit is a powerful image-to-image editing model that transforms images based on text prompts. Ready-to-use REST inference API, best performance, no co

Jan 16, 2026 5 min read