Sfx 1.6 Text to Audio

Mirelo SFX 1.6 Text to Audio is a fast AI audio generation model that creates sound effects and ambient audio directly from text prompts, with optional seamless ambience looping. It is available as a ready-to-use REST inference API for sound effect generation, game audio, video production, cinematic sound design, background ambience, and loopable audio assets, with simple integration, no cold starts, and affordable pricing.


Mirelo AI SFX 1.6 Text-to-Audio

Mirelo AI SFX 1.6 Text-to-Audio generates sound effects, ambience, and short audio clips from natural-language prompts. It supports loop-friendly ambience mode, multiple variations, flexible duration control, and optional doubled loop output for seamless background audio workflows.

Why Choose This?

  • Prompt-based audio generation: Generate sound effects or ambient audio directly from a text description.

  • Flexible duration control: Choose the target duration for the generated clip, from short effects to longer ambient beds.

  • Loop-friendly ambience mode: Enable ambience to generate audio designed for seamless looping.

  • Multiple variations: Generate up to 4 different versions in one request with num_samples.

  • Optional doubled loop output: When using ambience mode, enable double_output to concatenate the loop with itself for a longer seamless result.

  • Production-ready API: Useful for games, film, podcasts, background ambience, sound design, and content production workflows.

Parameters

Parameter      Required  Description
text_prompt    Yes       Text prompt describing the sound effect or ambient audio to generate. Minimum length: 4 characters.
duration       No        Target duration in seconds. Range: 0.1–60. Default: 10.
ambience       No        When true, generate and stitch the result so the tile loops seamlessly. Default: false.
double_output  No        Only used when ambience is true: concatenate the loop with itself for a 2x-length output. Default: false.
num_samples    No        Number of variations to generate. Range: 1–4. Default: 1.
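
The constraints in the parameter table can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official SDK: the helper name build_payload is hypothetical, and rejecting double_output without ambience is a choice made here to surface mistakes early (the API itself may simply ignore the flag).

```python
from typing import Any


def build_payload(
    text_prompt: str,
    duration: float = 10,
    ambience: bool = False,
    double_output: bool = False,
    num_samples: int = 1,
) -> dict[str, Any]:
    """Validate inputs against the documented ranges and return a request body."""
    if len(text_prompt) < 4:
        raise ValueError("text_prompt must be at least 4 characters")
    if not 0.1 <= duration <= 60:
        raise ValueError("duration must be between 0.1 and 60 seconds")
    if num_samples not in (1, 2, 3, 4):
        raise ValueError("num_samples must be an integer from 1 to 4")
    if double_output and not ambience:
        # Documented as "only used when ambience is true"; this sketch rejects it.
        raise ValueError("double_output requires ambience=True")
    return {
        "text_prompt": text_prompt,
        "duration": duration,
        "ambience": ambience,
        "double_output": double_output,
        "num_samples": num_samples,
    }


payload = build_payload("rain on a tin roof", duration=5, num_samples=2)
```

The returned dictionary uses the parameter names from the table, so it can be serialized to JSON and used as the request body.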

How to Use

  1. Write your prompt — describe the sound, mood, texture, or environment you want.
  2. Set duration — choose how long the generated audio should be.
  3. Enable ambience (optional) — turn this on if you want a seamless loopable result.
  4. Enable double output (optional) — when using ambience, use this to produce a doubled loop.
  5. Set number of samples — choose how many variations you want, from 1 to 4.
  6. Submit — run the model and download the generated audio.
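
The steps above roughly correspond to filling in one JSON body and sending it in a single POST request. Here is a minimal sketch using only the Python standard library. The endpoint is the one given in the Quick start section of this page; the helper name build_request, the prompt text, and the YOUR_API_KEY placeholder are illustrative assumptions. Nothing is sent over the network until urlopen is called.

```python
import json
import urllib.request

# Endpoint as given in the Quick start section of this page.
ENDPOINT = "https://api.wavespeed.ai/api/v3/mirelo-ai/sfx-1.6/text-to-audio"


def build_request(payload: dict, api_key: str) -> urllib.request.Request:
    """Return a ready-to-send POST request; it is not sent until urlopen() is called."""
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


payload = {
    "text_prompt": "Gentle forest ambience with birdsong and a distant stream",  # step 1
    "duration": 20,         # step 2
    "ambience": True,       # step 3
    "double_output": True,  # step 4
    "num_samples": 2,       # step 5
}
request = build_request(payload, api_key="YOUR_API_KEY")
# Step 6: urllib.request.urlopen(request) would submit the job.
```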

Example Prompt

Dark cinematic ambience with distant thunder, soft low-frequency rumble, subtle wind, and evolving tension

Pricing

Pricing is based on generated duration and number of samples.

Duration  1 Sample  2 Samples  3 Samples  4 Samples
1s        $0.01     $0.02      $0.03      $0.04
5s        $0.05     $0.10      $0.15      $0.20
10s       $0.10     $0.20      $0.30      $0.40
20s       $0.20     $0.40      $0.60      $0.80
30s       $0.30     $0.60      $0.90      $1.20
60s       $0.60     $1.20      $1.80      $2.40

Billing Rules

  • Pricing is $0.01 per generated second, per sample
  • text_prompt does not affect pricing
  • ambience and double_output are assumed not to affect billing; the prices above do not account for them
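
The pricing table reduces to one multiplication: rate × duration × number of samples. A small sketch of that arithmetic follows. The function name estimate_cost is hypothetical, and the sketch assumes ambience and double_output do not change the charge, as stated in the billing rules. How fractional seconds are rounded is not stated on this page, so no rounding is applied.

```python
from decimal import Decimal

PRICE_PER_SECOND = Decimal("0.01")  # USD per generated second, per sample


def estimate_cost(duration: float, num_samples: int = 1) -> Decimal:
    """Estimate the charge in USD for one request (no rounding applied)."""
    return PRICE_PER_SECOND * Decimal(str(duration)) * num_samples


print(estimate_cost(30, 3))  # 0.90, matching the 30s / 3 Samples cell
```

Decimal is used instead of float so that the results compare exactly against the table values.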

Best Use Cases

  • Sound effects — Generate short custom SFX for games, apps, and media.
  • Ambient loops — Create seamless background beds for environments and scenes.
  • Content production — Add generated sound design to videos, podcasts, or social content.
  • Creative prototyping — Explore multiple sound directions quickly with several variations.
  • Game and app audio — Produce loopable background textures and interactive sound assets.

Pro Tips

  • Be specific in your prompt about texture, mood, environment, and intensity.
  • Use ambience when the output needs to loop smoothly.
  • Turn on double_output only when you want a longer looped deliverable.
  • Increase num_samples when you want multiple creative options from the same prompt.
  • Start with shorter durations for testing, then scale up once the direction feels right.

Notes

  • text_prompt is required.
  • duration supports 0.1–60 seconds.
  • num_samples supports 1–4.
  • double_output only applies when ambience is enabled.
  • Pricing is based on requested generation duration and sample count.
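
The interaction between ambience and double_output can be summarized in a few lines. This is a rough sketch with a hypothetical helper name. It assumes the generated clip matches the target duration exactly, which the page does not guarantee, since duration is described as a target.

```python
def expected_output_seconds(
    duration: float,
    ambience: bool = False,
    double_output: bool = False,
) -> float:
    """Approximate clip length, assuming the target duration is hit exactly."""
    # double_output only applies when ambience is enabled: the loop is
    # concatenated with itself, giving a 2x-length result.
    if ambience and double_output:
        return duration * 2
    return duration


loop_length = expected_output_seconds(10, ambience=True, double_output=True)
```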

Related Models

  • Mirelo AI SFX 1.6 Extend Audio — Extend an existing audio clip with newly generated continuation.
  • Mirelo AI SFX 1.6 Inpaint Audio — Regenerate a selected segment inside an existing audio clip.
  • Other Mirelo AI sound generation workflows — Useful when you need continuation or localized audio editing instead of fresh generation.

Sfx 1.6 Text To Audio API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/mirelo-ai/sfx-1.6/text-to-audio with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Sfx 1.6 Text To Audio below.

HTTP example

# Submit the prediction (text_prompt is required)
curl -X POST "https://api.wavespeed.ai/api/v3/mirelo-ai/sfx-1.6/text-to-audio" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "text_prompt": "Dark cinematic ambience with distant thunder, soft low-frequency rumble, subtle wind, and evolving tension",
    "duration": 10,
    "ambience": false,
    "double_output": false,
    "num_samples": 1
  }'

# The response includes a prediction id. Substitute it for {request_id} and poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
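
Node.js example

// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS modules do not support top-level await, so the call is wrapped in an async function.
async function main() {
  const result = await client.run("mirelo-ai/sfx-1.6/text-to-audio", {
    "text_prompt": "Dark cinematic ambience with distant thunder, soft low-frequency rumble, subtle wind, and evolving tension", // required
    "duration": 10,
    "ambience": false,
    "double_output": false,
    "num_samples": 1
  });

  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);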
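
Python example

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "mirelo-ai/sfx-1.6/text-to-audio",
    {
        "text_prompt": "Dark cinematic ambience with distant thunder, soft low-frequency rumble, subtle wind, and evolving tension",  # required
        "duration": 10,
        "ambience": False,  # Python booleans are capitalized
        "double_output": False,
        "num_samples": 1,
    },
)

print(output["outputs"][0])  # → URL of the generated output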
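
For callers who prefer not to use the SDK, the submit-then-poll flow described in the quick start can be sketched with the standard library alone. The helper names below are hypothetical. The sketch assumes the status field sits at data.status next to data.outputs, and that a failed job reports the status "failed"; neither detail is confirmed on this page, so check the API reference before relying on them.

```python
import json
import time
import urllib.request
from typing import Callable

RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def fetch_result(request_id: str, api_key: str) -> dict:
    """GET the prediction result and return the parsed JSON body."""
    req = urllib.request.Request(
        RESULT_URL.format(request_id=request_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_outputs(
    fetch: Callable[[], dict],
    interval: float = 1.0,
    timeout: float = 120.0,
) -> list:
    """Call fetch() until the status is "completed", then return data.outputs."""
    deadline = time.monotonic() + timeout
    while True:
        data = fetch().get("data", {})
        status = data.get("status")  # assumed location of the status field
        if status == "completed":
            return data["outputs"]
        if status == "failed":  # assumed failure status, not confirmed here
            raise RuntimeError(f"prediction failed: {data.get('error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError("prediction did not complete in time")
        time.sleep(interval)
```

A typical call would be wait_for_outputs(lambda: fetch_result(request_id, api_key)), where request_id comes from the submission response. Passing the fetch step in as a callable keeps the polling loop testable without network access.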

Sfx 1.6 Text To Audio API — Frequently asked questions

What is the Sfx 1.6 Text To Audio API?

Sfx 1.6 Text To Audio is a Mirelo AI model for audio generation, exposed as a REST API on WaveSpeedAI. It creates sound effects and ambient audio directly from text prompts, with optional seamless ambience looping. You can call it programmatically or try it from the playground above.

How do I call the Sfx 1.6 Text To Audio API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/mirelo-ai/mirelo-ai-sfx-1.6-text-to-audio.

How much does Sfx 1.6 Text To Audio cost per run?

Sfx 1.6 Text To Audio starts at $0.01 per run. The final charge scales with duration and num_samples at $0.01 per generated second, per sample, so a 10-second clip with 2 samples costs $0.20. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Sfx 1.6 Text To Audio accept?

Key inputs: `duration`, `ambience`, `double_output`, `num_samples`, `text_prompt`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/mirelo-ai/mirelo-ai-sfx-1.6-text-to-audio.

How do I get started with the Sfx 1.6 Text To Audio API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Sfx 1.6 Text To Audio outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Mirelo AI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.