Question 1

What is Sambanova?

Accepted Answer

Sambanova is an API provider that hosts large language models. Active models: 1; From (input): $0.40 / 1M tok; Catalog throughput: 328 c/s.

Question 2

How many models does Sambanova offer?

Accepted Answer

Sambanova currently serves 1 active models out of 6 historical offerings on LLM Stats.

Question 3

What is Sambanova's API pricing?

Accepted Answer

Sambanova input pricing starts from $0.40 per 1M tokens, with the most expensive offering at $0.4 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

Question 4

Does Sambanova support function calling?

Accepted Answer

Yes. 1 of 1 models on Sambanova support function calling (tool use). The Capabilities tab lists which specific models accept tool definitions.

Question 5

Does Sambanova support JSON mode and structured output?

Accepted Answer

Yes. 1 of 1 models on Sambanova support structured output (JSON mode / schema-constrained generation). The Capabilities tab shows which specific models accept response_format or json_schema parameters.

Question 6

Does Sambanova offer batch inference?

Accepted Answer

Yes. 1 of 1 models on Sambanova support batch inference for cheaper, asynchronous workloads.

Question 7

Whose models does Sambanova host?

Accepted Answer

Sambanova hosts models from Alibaba Cloud / Qwen Team and Meta. See the Models tab for the full catalog grouped by creator.

Question 8

How do I start using Sambanova?

Accepted Answer

Sign up at https://sambanova.ai/ to get an API key, then call Sambanova's API directly from your application. Most clients work out of the box by pointing the OpenAI SDK at Sambanova's base URL with your key. Use the Models and Pricing tabs above to pick the right model for your latency, cost, and context-window requirements.

Sambanova: API pricing, speed & models

Catalog

Sambanovapricing, performance & catalog

Most affordable

Fastest median throughput

Largest context

FAQ