Inworld AIReleased on Sep 1, 2024

Inworld TTS-1: Benchmarks, Pricing & Context Window

Inworld TTS-1 is a text-to-speech model from Inworld AI, released in September 2024.

Rich, expressive speech with low-latency streaming. Supports 12 languages including English, Spanish, French, Korean, Dutch, and Chinese.

Input
Text
Output
Audio

Inworld TTS-1 API

POST/v1/tts/synthesize

Run a request to see the response

Use it in your code

OpenAI-compatible endpoint through the LLM Stats gateway.

import requests

response = requests.post(
    "https://gateway.llm-stats.com/v1/tts/synthesize",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "model_id": "inworld-tts-1",
        "text": "Hello, this is a test.",
        "format": "mp3",
        "sample_rate": 24000,
    },
)

with open("output.mp3", "wb") as f:
    f.write(response.content)

Need an API key? Create one above in the playground, or read the API documentation.

Inworld TTS-1 license

Inworld TTS-1 is released under the Proprietary license, which restricts commercial use.

License
Proprietary
Non-commercial

Proprietary license - usage restrictions apply

FAQ

Common questions about Inworld TTS-1.

Who created Inworld TTS-1?

Inworld TTS-1 was created by Inworld AI.

What is the license for Inworld TTS-1?

Inworld TTS-1 is released under the Proprietary license.

Is Inworld TTS-1 multimodal?

Yes, Inworld TTS-1 is a multimodal model that can process both text and images as input.