Question 1

How much does llama-4-maverick-17b-128e-instruct cost per 1M tokens?

Accepted Answer

Input is priced at $0.00 per 1M tokens, output at $0.00 per 1M tokens. Billing is per token, no rounding to batch sizes.

Question 2

How do I access llama-4-maverick-17b-128e-instruct via API?

Accepted Answer

Send requests to the UnoRouter /v1/chat/completions endpoint with model=llama-4-maverick-17b-128e-instruct. Any OpenAI-compatible client library works. Authentication uses a standard Bearer token.

Question 3

What is the context window of llama-4-maverick-17b-128e-instruct?

Accepted Answer

llama-4-maverick-17b-128e-instruct supports a context window of 131.1K tokens, shared between your prompt and the model's response.

Input price	$0.00 · 1M tokens
Output price	$0.00 · 1M tokens
Context window	131.1K tokens
Compatible endpoints	openai
Vendor	Meta

llama-4-maverick-17b-128e-instruct

Performance

Pricing

Call llama-4-maverick-17b-128e-instruct from your code

Frequently asked questions

How much does llama-4-maverick-17b-128e-instruct cost per 1M tokens?

How do I access llama-4-maverick-17b-128e-instruct via API?

What is the context window of llama-4-maverick-17b-128e-instruct?

Similar models

Try llama-4-maverick-17b-128e-instruct now