Inworld AIReleased on Sep 1, 2024

Inworld TTS-1-Max: Benchmarks, Pricing & Context Window

Inworld TTS-1-Max is a text-to-speech model from Inworld AI, released in September 2024.

More expressive, contextually aware speech with highest quality. Preview model not optimized for real-time use. Supports 12 languages.

Input
Text
Output
Audio

Inworld TTS-1-Max API

POST/v1/tts/synthesize

Run a request to see the response

Use it in your code

OpenAI-compatible endpoint through the LLM Stats gateway.

import requests

response = requests.post(
    "https://gateway.llm-stats.com/v1/tts/synthesize",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "model_id": "inworld-tts-1-max",
        "text": "Hello, this is a test.",
        "format": "mp3",
        "sample_rate": 24000,
    },
)

with open("output.mp3", "wb") as f:
    f.write(response.content)

Need an API key? Create one above in the playground, or read the API documentation.

Inworld TTS-1-Max license

Inworld TTS-1-Max is released under the Proprietary license, which restricts commercial use.

License
Proprietary
Non-commercial

Proprietary license - usage restrictions apply

FAQ

Common questions about Inworld TTS-1-Max.

Who created Inworld TTS-1-Max?

Inworld TTS-1-Max was created by Inworld AI.

What is the license for Inworld TTS-1-Max?

Inworld TTS-1-Max is released under the Proprietary license.

Is Inworld TTS-1-Max multimodal?

Yes, Inworld TTS-1-Max is a multimodal model that can process both text and images as input.