Question 1

What is FriendliAI?

Accepted Answer

FriendliAI is an API provider that hosts large language models. Active models: 3; From (input): $0.14 / 1M tok; Max context: 262K.

Question 2

How many models does FriendliAI offer?

Accepted Answer

FriendliAI currently serves 3 active models out of 4 historical offerings on LLM Stats.

Question 3

What is FriendliAI's API pricing?

Accepted Answer

FriendliAI input pricing starts from $0.14 per 1M tokens, with the most expensive offering at $1.4 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

Question 4

Is FriendliAI OpenAI compatible?

Accepted Answer

Most providers expose an OpenAI-compatible /v1/chat/completions endpoint so you can switch from OpenAI to FriendliAI by changing only the base URL and API key. Check https://friendli.ai/ for the exact endpoint format and any provider-specific parameters.

Question 5

Does FriendliAI support multimodal models?

Accepted Answer

Yes. FriendliAI's catalog includes 1 vision-capable models. See the Models and Capabilities tabs for the full per-model breakdown.

Question 6

Whose models does FriendliAI host?

Accepted Answer

FriendliAI hosts models from Google, Zhipu AI, and LG AI Research. See the Models tab for the full catalog grouped by creator.

Question 7

How do I start using FriendliAI?

Accepted Answer

Sign up at https://friendli.ai/ to get an API key, then call FriendliAI's API directly from your application. Most clients work out of the box by pointing the OpenAI SDK at FriendliAI's base URL with your key. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.

Model	Input /M	Output /M	Throughput	Context	Capabilities
Gemma 4 31B	$0.140	$0.400	—	262K	Vision
Gemma 4 31B	$0.140	$0.400	—	262K	—

Model	Input /M	Output /M	Throughput	Context	Capabilities
Gemma 4 31B	$0.140	$0.400	—	262K	Vision
Gemma 4 31B	$0.140	$0.400	—	262K	—

FriendliAI: API pricing, performance & models

Catalog

FriendliAIpricing, performance & catalog

Most affordable

Fastest

Largest context

FAQ