Generate Music

Generate studio-quality soundtracks with WaveSpeedAI's advanced AI music creation and editing tools.

Our selection

minimax/music-v1.5
text-to-audio

MiniMax Music v1.5 turns text prompts into high-quality, diverse music (Text-to-Audio) using advanced AI for versatile tracks. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

All models

13 models
minimax/music-v1.5
text-to-audio

MiniMax Music v1.5 turns text prompts into high-quality, diverse music (Text-to-Audio) using advanced AI for versatile tracks. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/ace-step/audio-outpaint
audio-to-audio

ACE-Step Audio Outpaint generates seamless start or end extensions that match the original, ideal for intros, outros and longer tracks. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/ace-step/audio-inpaint
audio-to-audio

ACE-Step Audio Inpaint edits a specific audio segment to change lyrics or style while preserving the surrounding audio. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

minimax/music-01
text-to-audio

MiniMax Music-01 synthesizes accompaniment and vocals simultaneously to produce complete songs across diverse styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

minimax/music-02
text-to-audio

Minimax Music-02 is a compact, fast, cost-effective MoE music generator (230B params, 10B active) for high-quality music production. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

minimax/music-2.5
text-to-audio

MiniMax Music 2.5 is a full-dimensional breakthrough in AI music generation with high-fidelity audio, humanized vocals, and precise creative control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/heartmula/transcribe-lyrics
speech-to-text

HeartMuLa Transcribe extracts lyrics from audio files using advanced AI. Supports multilingual transcription. Ready-to-use REST inference API with best performance, no coldstarts, and affordable pricing.

wavespeed-ai/heartmula/generate-music
text-to-audio

HeartMuLa is a state-of-the-art music generation model that creates high-quality songs from lyrics and style tags. Ready-to-use REST inference API with best performance, no coldstarts, and affordable pricing.

elevenlabs/music
text-to-audio

ElevenLabs Music generates original songs from text descriptions. Create instrumentals or full compositions with customizable duration. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/ace-step-1.5
text-to-audio

ACE-Step 1.5 generates up to 4-minute music with lyrics from text. Supports 50+ languages, high acoustic fidelity, and runs efficiently on consumer hardware. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/lyria-3-pro/music
text-to-audio

Google Lyria 3 Pro generates high-quality music tracks from text prompts and optional image input. Pro tier delivers enhanced audio quality and richer compositions. Produces complete songs with lyrics, descriptions, and audio output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

google/lyria-3-clip/music
text-to-audio

Google Lyria 3 Clip generates novel music tracks from text prompts and optional image input. Produces complete songs with lyrics, descriptions, and audio output. Supports negative prompts and seed control for reproducible results. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

minimax/music-2.6
text-to-audio

MiniMax Music 2.6 generates complete songs with vocals and instrumentals from text prompts and lyrics. Supports instrumental-only mode, auto lyrics generation, structure tags for song arrangement, and configurable audio quality. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Generate Music API — pricing & performance

Run any model in the Generate Music collection through a single REST API. Pay per generation — no subscriptions, no minimums — with industry-leading latency on a 99.9% uptime infrastructure.
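As a rough illustration of what "a single REST API" can look like from the client side, the sketch below builds (but does not send) a POST request for one model in the collection. The base URL, path layout, bearer-token header, and the "prompt" field are assumptions made for this example and are not confirmed by this page; check the individual model page for the actual endpoint and parameters.

```python
import json
import urllib.request

# Assumption: illustrative base URL and path layout, not taken from this page.
API_BASE = "https://api.wavespeed.ai/api/v3"


def build_request(model_id: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build (without sending) a POST request for one generation job."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/{model_id}",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumption: bearer-token authentication.
            "Authorization": f"Bearer {api_key}",
        },
    )


# "prompt" is a hypothetical input field; each model defines its own schema.
req = build_request(
    "minimax/music-v1.5",
    {"prompt": "warm lo-fi piano with soft vinyl crackle"},
    "YOUR_API_KEY",
)
```

Because only the model identifier in the path changes, switching to another model in the collection would mean passing a different `model_id` and a payload that matches that model's inputs.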

Why run Generate Music on WaveSpeedAI

Transparent pricing

Per-call pricing for every Generate Music model. The price is listed on each model page — no platform fees on top.

Optimized for low latency

Most Generate Music image models complete in under 2 seconds. Video and 3D models run several times faster than self-hosted alternatives.

99.9% uptime

Multi-region failover and automatic retries keep your production traffic online — even during provider outages.

Frequently asked questions

How much does the Generate Music API cost?

Each model has its own per-call price listed on the model page. We bill per successful generation, with no subscription fees or minimums.

How fast are Generate Music models on WaveSpeedAI?

Image models in this collection typically complete in under 2 seconds. Video and 3D models depend on duration and resolution but are usually several times faster than self-hosted runs.

Can I try the API without a credit card?

Yes — every account gets $1 in free credits on signup, enough to try most Generate Music models without a credit card.
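To get a feel for how far a fixed credit balance goes under per-call billing, the sketch below divides a budget by a per-call price. The prices used are hypothetical placeholders, not real WaveSpeedAI prices; substitute the price shown on the model page.

```python
from decimal import Decimal


def generations_within_budget(budget_usd: str, price_per_call_usd: str) -> int:
    """Return how many whole generations a budget covers at a flat per-call price."""
    budget = Decimal(budget_usd)
    price = Decimal(price_per_call_usd)
    if price <= 0:
        raise ValueError("price_per_call_usd must be positive")
    # Floor division: a partially funded generation does not count.
    return int(budget // price)


# Hypothetical per-call prices, used only for illustration.
free_credit = "1.00"
cheap_model_runs = generations_within_budget(free_credit, "0.03")
pricier_model_runs = generations_within_budget(free_credit, "0.25")
```

Decimal is used instead of float so that small currency amounts divide without rounding surprises.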

Are there rate limits?

Standard accounts have generous concurrent-job limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity — contact sales for details.
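One common way for a client to stay under a concurrent-job limit is to cap in-flight jobs with a semaphore. The sketch below is a generic pattern and not WaveSpeedAI client code; `run_bounded` and the `max_concurrent` value are hypothetical, and the real limit for an account is not stated on this page.

```python
import asyncio
from typing import Awaitable, Callable, List


async def run_bounded(
    jobs: List[Callable[[], Awaitable[str]]],
    max_concurrent: int,
) -> List[str]:
    """Run all jobs, allowing at most max_concurrent to be in flight at once."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(job: Callable[[], Awaitable[str]]) -> str:
        # Wait for a free slot before starting the job.
        async with semaphore:
            return await job()

    # gather preserves the order of the input jobs in its results.
    return await asyncio.gather(*(guarded(job) for job in jobs))
```

Each job here would be a coroutine function that submits one generation request and waits for its result; the semaphore keeps the number of simultaneous submissions at or below the chosen cap.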