CartesiaReleased on Sep 1, 2024

Sonic Multilingual: Benchmarks, Pricing & Context Window

Name: Sonic Multilingual
Author: Cartesia

Sonic Multilingual is a text-to-speech model from Cartesia, released in September 2024.

Cartesia Sonic Multilingual TTS model

Input

Text

Output

Audio

Speed

—

Cost

$3333333/ 1M · 8:1 in:out

$3750000 in · $0.00 out

Sonic Multilingual pricing

Providers

Sonic Multilingual starts at $3750000 per million input tokens via Cartesia.

Provider	Input $/M	Output $/M	Max Input	Max Output	Latency s	Throughput	Quant	Input	Output
Cartesia	$3750000	—	2.5K	—	—	—	—

Sonic Multilingual API

POST/v1/tts/synthesize

Modelsonic-multilingual

API key●

Text●

Voice ID

Format

Sample rate (Hz)

Speed

Run a request to see the response

Use it in your code

OpenAI-compatible endpoint through the LLM Stats gateway.

import requests

response = requests.post(
    "https://gateway.llm-stats.com/v1/tts/synthesize",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "model_id": "sonic-multilingual",
        "text": "Hello, this is a test.",
        "format": "mp3",
        "sample_rate": 24000,
    },
)

with open("output.mp3", "wb") as f:
    f.write(response.content)

Need an API key? Create one above in the playground, or read the API documentation.

Sonic Multilingual license

Sonic Multilingual is released under the Proprietary license, which restricts commercial use.

License: Proprietary; Non-commercial

Proprietary license - usage restrictions apply

FAQ

Common questions about Sonic Multilingual.

Who created Sonic Multilingual?

Sonic Multilingual was created by Cartesia.

What is the license for Sonic Multilingual?

Sonic Multilingual is released under the Proprietary license.

Is Sonic Multilingual multimodal?

Yes, Sonic Multilingual is a multimodal model that can process both text and images as input.