At a glance

Togetherpricing, performance & catalog

The citable facts about Together's 2 models — sourced from provider APIs and refreshed continuously.

Lowest price
Qwen3.6 Plus at $0.500 per 1M input tokens
Largest context
Qwen3.7 Max at 1.0M tokens
Catalog
2 active models from 4 organizations

Fastest

No throughput data yet.

FAQ

Common questions about Together.

What is Together?

Together is an API provider that hosts large language models. Active models: 2; From (input): $0.50 / 1M tok; Max context: 1.0M.

How many models does Together offer?

Together currently serves 2 active models out of 16 historical offerings on LLM Stats.

What is Together's API pricing?

Together input pricing starts from $0.50 per 1M tokens, with the most expensive offering at $2.5 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

Is Together OpenAI compatible?

Most providers expose an OpenAI-compatible /v1/chat/completions endpoint so you can switch from OpenAI to Together by changing only the base URL and API key. Check https://together.ai/ for the exact endpoint format and any provider-specific parameters.

Does Together support multimodal models?

Yes. Together's catalog includes 1 vision-capable models. See the Models and Capabilities tabs for the full per-model breakdown.

Whose models does Together host?

Together hosts models from Alibaba Cloud / Qwen Team, DeepSeek, Google, and Meta. See the Models tab for the full catalog grouped by creator.

How do I start using Together?

Sign up at https://together.ai/ to get an API key, then call Together's API directly from your application. Most clients work out of the box by pointing the OpenAI SDK at Together's base URL with your key. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.