FAQ
Common questions about Novita.
What is Novita?
Novita is an API provider that hosts large language models. Active models: 30; From (input): $0.08 / 1M tok; Avg throughput: 51 tok/s; Avg latency: 0.95 s; Max context: 262K.
How many models does Novita offer?
Novita currently serves 30 active models out of 41 historical offerings on LLM Stats.
What is Novita's API pricing?
Novita input pricing starts from $0.08 per 1M tokens, with the most expensive offering at $0.98 per 1M tokens. See the Pricing tab above for the full per-model breakdown.
How fast is Novita?
Novita averages 51 output tokens per second across its catalog, with average latency of 0.95s. Per-model performance is shown in the Performance tab.
Does Novita support multimodal models?
Yes. Novita's catalog includes 13 vision-capable models. See the Models and Capabilities tabs for the full per-model breakdown.
Whose models does Novita host?
Novita hosts models from Baidu, DeepSeek, Google, MiniMax, Moonshot AI, and OpenAI, plus 3 more. See the Models tab for the full catalog grouped by creator.
How do I start using Novita?
Sign up at https://novita.ai/ to get an API key, then call Novita's API directly from your application. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.