Introducing WaveSpeedAI Molmo2 Video Qa on WaveSpeedAI
Molmo2-4B Video QA: Answer questions about video content with temporal understanding. Open-source vision-language model. Ready-to-use REST API, no cold starts,
Introducing WaveSpeedAI Molmo2 Video Understanding on WaveSpeedAI
Molmo2-4B Video Understanding: Analyze videos with specialized tasks (general, summary, analysis, counting, scene description). Open-source vision-language mode
Introducing WaveSpeedAI Openai Whisper With Video on WaveSpeedAI
OpenAI Whisper Large v3 (Video-to-Text) delivers high-accuracy multilingual transcription directly from video files, with automatic language detection and optio
Introducing WaveSpeedAI Paddle Ocr on WaveSpeedAI
PaddleOCR-VL is an ultra-compact 0.9B parameter vision-language model for document parsing, supporting 109 languages with text, table, formula, and chart recogn
Introducing WaveSpeedAI Qwen Image 2512 LoRA Trainer on WaveSpeedAI
Qwen-Image-2512 LoRA Trainer lets you train custom LoRA models 10x faster with style, character, and object training. From concept to model in minutes, not hour
Introducing WaveSpeedAI Qwen Image Text-to-Image 2512 LoRA on WaveSpeedAI
Qwen-Image-2512 LoRA is an enhanced 20B MMDiT text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST infer
Introducing WaveSpeedAI Video Background Remover on WaveSpeedAI
WaveSpeed Video Background Remover replaces or removes video backgrounds with a custom image. Upload or paste a link to your video, then provide a background im
Introducing WaveSpeedAI Z Image Turbo Controlnet on WaveSpeedAI
Z-Image-Turbo ControlNet generates images guided by structural control signals (depth, canny edge, pose) for precise composition control. Ready-to-use REST infe
Introducing xAI Grok 2 Image on WaveSpeedAI
Grok 2 Image is xAI’s latest image generation model that turns simple text prompts into sharp, photorealistic visuals in seconds. From product shots to social
Introducing Z AI CogView 4 on WaveSpeedAI
Z-AI CogView-4 generates high-quality images from text prompts with a quick and accurate understanding of user descriptions, letting AI express images more prec
Introducing Z AI Glm Image Text-to-Image on WaveSpeedAI
Z-AI GLM Image generates high-quality images from text prompts, with enhanced understanding of user descriptions, resulting in images that are more precise and
Introducing Z AI Glm Image Edit on WaveSpeedAI
GLM-Image Edit is a powerful image-to-image editing model that transforms images based on text prompts. Ready-to-use REST inference API, best performance, no co