Introducing ElevenLabs Music on WaveSpeedAI
ElevenLabs Music generates original songs from text descriptions. Create instrumentals or full compositions with customizable duration. Ready-to-use REST infere
ElevenLabs Music: Generate Original Songs From Text Prompts on WaveSpeedAI
ElevenLabs Music is a powerful AI music generation model that creates full songs — complete with vocals, instrumentals, and production — from simple text descriptions. Now available on WaveSpeedAI with instant inference, no cold starts, and straightforward per-second pricing, ElevenLabs Music lets developers and creators generate professional-quality tracks through a single API call.
Whether you need a 40-second jingle for a social media clip or a 5-minute anthem for a marketing campaign, ElevenLabs Music delivers studio-grade output trained on licensed data and cleared for commercial use. Try it now on WaveSpeedAI.
How ElevenLabs Music Works
ElevenLabs Music takes a text prompt combining style description and optional lyrics, then generates a complete audio track. The model was trained on commercially licensed data — a significant advantage over competitors that faced copyright litigation throughout 2025.
Here’s the workflow:
- Describe your style — specify genre, mood, instruments, tempo, and vocal characteristics
- Add structured lyrics — use markers like
[Verse],[Chorus], and[Bridge]to control song arrangement - Set parameters — choose duration (in milliseconds), output format, and whether to include vocals
- Generate — the model produces a finished track ready for download
The model supports any genre — from hard trap and hip hop to ambient electronic, pop ballads, jazz, and classical orchestral pieces. Its vocal generation is particularly notable: ElevenLabs’ expertise in voice synthesis produces AI vocals with natural breathiness, dynamics, and emotional expression that surpass many competitors.
Key Features of ElevenLabs Music
- Complete songs with AI vocals — Generate tracks with realistic singing voices in multiple styles, from rap to melodic pop, or switch to instrumental-only mode
- Commercially licensed output — Unlike some competitors, ElevenLabs built its music model on licensed training data from day one, making generated tracks safe for commercial projects
- Flexible duration control — Set output length from short jingles to full 5-minute compositions with millisecond precision
- Multiple output formats — Export as MP3 (standard or high quality) or WAV at sample rates up to CD quality (44.1kHz)
- Structured lyrics support — Use
[Verse],[Chorus],[Bridge]markers for professional song arrangement - Built-in Prompt Enhancer — Automatically refines your style descriptions for better results
- No cold starts on WaveSpeedAI — Instant inference means you get your track without waiting for model spin-up
Best Use Cases for ElevenLabs Music
Social Media Content at Scale
Content creators producing daily videos for TikTok, YouTube Shorts, or Instagram Reels need unique background music that won’t trigger copyright strikes. ElevenLabs Music generates original tracks tailored to each video’s mood in seconds, eliminating the need for royalty-free music libraries with overused tracks.
Podcast and Video Intros
Podcasters and YouTubers can generate custom theme music that perfectly matches their brand. Set force_instrumental to true, describe the vibe — “upbeat lo-fi hip hop, warm piano chords, vinyl crackle, relaxing” — and get a unique intro that no other creator has.
Game and App Soundtracks
Indie game developers and app creators can generate adaptive background music for different scenes or moods without hiring a composer. Generate ambient tracks for menus, energetic battle music, or emotional story beats — all through the API.
Rapid Song Prototyping for Musicians
Songwriters and producers can quickly prototype ideas before committing studio time. Write lyrics, describe the production style, and hear a full arrangement in under a minute. It’s like having a demo studio available 24/7.
Marketing and Advertising Campaigns
Brands running multi-channel campaigns need unique music that aligns with their message. Generate custom jingles, background scores for product videos, or full tracks for branded content — all commercially cleared and ready for broadcast.
E-Learning and Training Content
Course creators and corporate trainers can generate appropriate background music for educational videos, making content more engaging without the complexity of music licensing agreements.
ElevenLabs Music Pricing and API Access on WaveSpeedAI
WaveSpeedAI offers simple per-second pricing with no subscriptions or credit systems:
| Duration | Cost |
|---|---|
| Per second | $0.0083 |
| 40-second track | ~$0.33 |
| 2-minute track | ~$1.00 |
| 5-minute track | ~$2.49 |
API Example
import wavespeed
output = wavespeed.run(
"elevenlabs/music",
{
"prompt": "Indie folk, acoustic guitar, warm female vocals, nostalgic autumn mood.\n[Verse] Walking through the golden leaves\nMemories beneath the trees\n[Chorus] Take me back to simpler days\nWhen the world was soft and haze",
"music_length_ms": 60000,
"output_format": "mp3_high_quality"
},
)
print(output["outputs"][0])
API Parameters
| Parameter | Required | Description |
|---|---|---|
prompt | Yes | Style tags + optional lyrics with structure markers |
music_length_ms | No | Duration in ms (default: 40000) |
force_instrumental | No | Skip vocals, generate instrumental only |
output_format | No | mp3_standard, mp3_high_quality, wav_16khz, wav_22khz, wav_24khz, or wav_cd_quality |
WaveSpeedAI advantages: No cold starts, pay only for what you generate, standard REST API, no subscription required.
Tips for Best Results with ElevenLabs Music
-
Front-load your style description — Put genre, instruments, mood, and tempo before any lyrics. The model weights the beginning of the prompt more heavily.
-
Be specific with genre tags — “Dark trap, 808 bass, hi-hats, aggressive male vocals” works better than just “rap.”
-
Use structure markers —
[Verse],[Chorus],[Bridge],[Outro]give the model a roadmap for arrangement. Without them, you get a less structured output. -
Match duration to content — A verse + chorus typically needs 60-90 seconds. Don’t set
music_length_msto 300000 if you only have one verse of lyrics. -
Use
force_instrumentalfor background music — When you need underscore for video or podcasts, this produces cleaner results than leaving lyrics empty. -
Choose WAV for post-production — If you plan to edit the track in a DAW, use
wav_cd_quality. Usemp3_standardfor direct playback in apps or web content. -
Iterate with the Prompt Enhancer — If your first generation isn’t quite right, let the built-in enhancer refine your description before regenerating.
Frequently Asked Questions
What is ElevenLabs Music?
ElevenLabs Music is an AI model that generates original songs — with or without vocals — from text descriptions, supporting any genre, custom lyrics, and multiple audio output formats.
How much does ElevenLabs Music cost on WaveSpeedAI?
ElevenLabs Music costs $0.0083 per second on WaveSpeedAI, meaning a typical 40-second track costs approximately $0.33 and a full 5-minute song costs about $2.49 — with no subscription required.
Can I use ElevenLabs Music via API?
Yes. WaveSpeedAI provides a REST API for ElevenLabs Music with no cold starts and instant inference. You can integrate music generation into any application with a single API call.
Is music generated by ElevenLabs Music safe for commercial use?
Yes. ElevenLabs trained its music model on commercially licensed data, making generated output suitable for commercial projects including marketing, advertising, and published content.
What genres can ElevenLabs Music generate?
ElevenLabs Music supports virtually any genre — hip hop, pop, rock, electronic, jazz, classical, folk, ambient, country, R&B, and more. Specify your desired genre and style in the text prompt.
Start Creating Music with ElevenLabs Music on WaveSpeedAI
Skip the complexity of music licensing, composer fees, and stock music libraries. ElevenLabs Music on WaveSpeedAI gives you instant access to AI-generated songs and instrumentals through a simple API — no cold starts, no subscriptions, just pay per second of audio generated.




