AssemblyAIReleased on Jan 1, 2024

Nano: Benchmarks, Pricing & Context Window

Nano is a speech-to-text model from AssemblyAI, released in January 2024.

Fastest speech recognition

Input
Audio
Output
Text

Nano API

POST/v1/stt/transcribe

Any audio format up to 25 MB.

Missing Audio file

Run a request to see the response

Use it in your code

OpenAI-compatible endpoint through the LLM Stats gateway.

import requests

with open("audio.mp3", "rb") as f:
    response = requests.post(
        "https://gateway.llm-stats.com/v1/stt/transcribe",
        headers={"Authorization": "Bearer YOUR_API_KEY"},
        files={"file": f},
        data={"model_id": "nano"},
    )

print(response.json()["text"])

Need an API key? Create one above in the playground, or read the API documentation.

Nano license

Nano is released under the Proprietary license, which restricts commercial use.

License
Proprietary
Non-commercial

Proprietary license - usage restrictions apply

FAQ

Common questions about Nano.

Who created Nano?

Nano was created by AssemblyAI.

What is the license for Nano?

Nano is released under the Proprietary license.

Is Nano multimodal?

Yes, Nano is a multimodal model that can process both text and images as input.