API Provider1 active models2 organizationssambanova.ai

Sambanova: API pricing, performance & models

Sambanova hosts 1 active AI models, with input pricing from $0.40 per 1M tokens, averaging 328 tok/s output throughput, with up to 128K context window. Compare Sambanova's API pricing, latency, and feature support against other LLM providers.

1Active
Pricing
$0.400/MFrom
$0.400/MAvg
Performance
328tok/sThroughput
1.08sLatency
128KMax

Catalog

Type
Price
1 model

FAQ

Common questions about Sambanova.

What is Sambanova?

Sambanova is an API provider that hosts large language models. Active models: 1; From (input): $0.40 / 1M tok; Avg throughput: 328 tok/s; Avg latency: 1.08 s; Max context: 128K.

How many models does Sambanova offer?

Sambanova currently serves 1 active models out of 6 historical offerings on LLM Stats.

What is Sambanova's API pricing?

Sambanova input pricing starts from $0.40 per 1M tokens, with the most expensive offering at $0.4 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

How fast is Sambanova?

Sambanova averages 328 output tokens per second across its catalog, with average latency of 1.08s. Per-model performance is shown in the Performance tab.

Whose models does Sambanova host?

Sambanova hosts models from Alibaba Cloud / Qwen Team and Meta. See the Models tab for the full catalog grouped by creator.

How do I start using Sambanova?

Sign up at https://sambanova.ai/ to get an API key, then call Sambanova's API directly from your application. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.