ElevenLabs · Released on Jan 15, 2025

Eleven v3: Benchmarks, Pricing & Context Window

Name: Eleven v3
Author: ElevenLabs

Eleven v3 is a text-to-speech model from ElevenLabs, released in January 2025.

Eleven v3 is positioned as the most expressive AI text-to-speech model, with high emotional range and contextual understanding. It supports 70+ languages and has a 3K character limit. It is best suited to audiobooks and emotional dialogue.
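Longer texts such as audiobook chapters will exceed the character limit, so they need to be split before synthesis. The sketch below is one possible way to do that, not part of any ElevenLabs SDK: `split_for_tts` and `MAX_CHARS` are hypothetical names, and it assumes that "3K" means 3,000 characters per request.

```python
import re

MAX_CHARS = 3000  # assumption: "3K character limit" read as 3,000 characters per request


def split_for_tts(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence boundaries."""
    # Split after sentence-ending punctuation that is followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Hard-wrap any single sentence that is itself longer than the limit.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # The sentence does not fit: close the current chunk and start a new one.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk can then be sent as the text of its own request, and the resulting audio segments joined in order.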

Input: Text
Output: Audio
Speed: —

Cost: $6,666,667 / 1M tokens blended (8:1 in:out) · $7,500,000 in · $0.00 out
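The headline cost figure is consistent with a weighted average of the input and output prices at the stated 8:1 in:out ratio. The snippet below reproduces that arithmetic from the figures listed on this page. The function name `blended_price` is hypothetical, and reading the headline number as a weighted average is an assumption that happens to match the listed values.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 8, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Figures as listed on this page: $7,500,000 in, $0.00 out, 8:1 in:out.
print(round(blended_price(7_500_000, 0.0)))  # 6666667
```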

Eleven v3 pricing

Providers

Eleven v3 starts at $7,500,000 per million input tokens via ElevenLabs.

Provider	Input $/M	Output $/M	Max Input	Max Output	Latency (s)	Throughput	Quant	Input modality	Output modality
ElevenLabs	$7,500,000	—	750	—	—	—	—	Text	Audio

Eleven v3 API

POST /v1/tts/synthesize

Model: eleven_v3

Parameters: API key (required), Text (required), Voice ID, Format, Sample rate (Hz), Speed


Use it in your code

The same request can be sent from code through the LLM Stats gateway, which exposes an OpenAI-compatible endpoint.

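import requests

# Send the text to the gateway's synthesis endpoint.
response = requests.post(
    "https://gateway.llm-stats.com/v1/tts/synthesize",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "model_id": "eleven_v3",
        "text": "Hello, this is a test.",
        "format": "mp3",
        "sample_rate": 24000,
    },
    timeout=60,
)
# Fail loudly on an HTTP error instead of writing an error body to disk.
response.raise_for_status()

# The response body is the encoded audio; save it as-is.
with open("output.mp3", "wb") as f:
    f.write(response.content)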
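The playground also lists Voice ID and Speed fields that the request above does not set. The helper below is a rough sketch of how the optional fields might be added to the request body. `build_tts_payload` is a hypothetical name, and the JSON keys `voice_id` and `speed` are assumptions, since this page does not show how those fields are spelled in the API. Only `model_id`, `text`, `format`, and `sample_rate` come from the example above.

```python
from typing import Any, Optional


def build_tts_payload(
    text: str,
    model_id: str = "eleven_v3",
    audio_format: str = "mp3",
    sample_rate: int = 24000,
    voice_id: Optional[str] = None,
    speed: Optional[float] = None,
) -> dict[str, Any]:
    """Assemble the JSON body for POST /v1/tts/synthesize."""
    if not text:
        raise ValueError("text must not be empty")
    payload: dict[str, Any] = {
        "model_id": model_id,
        "text": text,
        "format": audio_format,
        "sample_rate": sample_rate,
    }
    # Assumed key names: the page lists "Voice ID" and "Speed" as fields
    # but does not show how they are spelled in the JSON body.
    if voice_id is not None:
        payload["voice_id"] = voice_id
    if speed is not None:
        payload["speed"] = speed
    return payload
```

The returned dictionary can be passed as the `json=` argument of `requests.post`. Optional fields are left out entirely when unset, so the gateway's own defaults apply.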


Eleven v3 license

Eleven v3 is released under a proprietary license that restricts commercial use.

License: Proprietary; Non-commercial


FAQ

Common questions about Eleven v3.

Who created Eleven v3?

Eleven v3 was created by ElevenLabs.

What is the license for Eleven v3?

Eleven v3 is released under a proprietary, non-commercial license.

Is Eleven v3 multimodal?

Eleven v3 is a text-to-speech model: it takes text as input and produces audio as output. This page lists no image input for it.