ElevenLabsReleased on Dec 1, 2024

Eleven Flash v2.5: Benchmarks, Pricing & Context Window

Eleven Flash v2.5 is a text-to-speech model from ElevenLabs, released in December 2024.

Ultra-low latency (~75ms) speech synthesis for real-time applications. Supports 32 languages, 40K character limit. Ideal for conversational AI and live streaming.

Input
Text
Output
Audio

Eleven Flash v2.5 pricing

Providers

Eleven Flash v2.5 starts at $1250000 per million input tokens via Elevenlabs.

ProviderInput $/MOutput $/MMax InputMax OutputLatency sThroughputQuantInputOutput
Elevenlabs logoElevenlabs
$125000010.0K

Eleven Flash v2.5 API

POST/v1/tts/synthesize

Run a request to see the response

Use it in your code

OpenAI-compatible endpoint through the LLM Stats gateway.

import requests

response = requests.post(
    "https://gateway.llm-stats.com/v1/tts/synthesize",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "model_id": "eleven_flash_v2_5",
        "text": "Hello, this is a test.",
        "format": "mp3",
        "sample_rate": 24000,
    },
)

with open("output.mp3", "wb") as f:
    f.write(response.content)

Need an API key? Create one above in the playground, or read the API documentation.

Eleven Flash v2.5 license

Eleven Flash v2.5 is released under the Proprietary license, which restricts commercial use.

License
Proprietary
Non-commercial

Proprietary license - usage restrictions apply

FAQ

Common questions about Eleven Flash v2.5.

Who created Eleven Flash v2.5?

Eleven Flash v2.5 was created by ElevenLabs.

What is the license for Eleven Flash v2.5?

Eleven Flash v2.5 is released under the Proprietary license.

Is Eleven Flash v2.5 multimodal?

Yes, Eleven Flash v2.5 is a multimodal model that can process both text and images as input.