Question 1

How much does qwen3-embedding-8b cost per 1M tokens?

Accepted Answer

Input and output are each priced at $0.16 per 1M tokens. Billing is per token, with no rounding up to batch sizes.
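As a rough illustration of per-token billing, the sketch below estimates a bill from a token count. The $0.16 rate comes from the answer above; the function name and the sample token count are made up for the example.

```python
PRICE_PER_MILLION_USD = 0.16  # rate quoted in the answer above


def estimate_cost_usd(tokens: int, price_per_million: float = PRICE_PER_MILLION_USD) -> float:
    """Return the estimated cost in USD for `tokens` tokens.

    Billing is per token, so the count is not rounded up to any batch size.
    """
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return tokens * price_per_million / 1_000_000


# Hypothetical workload: 2.5M tokens of text to embed.
print(f"${estimate_cost_usd(2_500_000):.4f}")
```

Since input and output share the same rate, a single price constant covers both.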

Question 2

How do I access qwen3-embedding-8b via API?

Accepted Answer

Send requests to the UnoRouter /v1/chat/completions endpoint with model=qwen3-embedding-8b. Any OpenAI-compatible client library works. Authentication uses a standard Bearer token.
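A minimal sketch of such a request, using only the Python standard library, might look like the following. The endpoint path, model name, and Bearer header come from the answer above. The base URL is a placeholder (the page does not state it), and the `input` body field in the usage line is an assumption borrowed from OpenAI's embeddings request format, so check UnoRouter's own documentation for the exact body it expects.

```python
import json
import urllib.request

# Placeholder only: the page does not give UnoRouter's base URL.
BASE_URL = "https://api.example.invalid"
ENDPOINT_PATH = "/v1/chat/completions"  # path named in the answer above
MODEL = "qwen3-embedding-8b"


def build_request(api_key: str, body_fields: dict,
                  base_url: str = BASE_URL,
                  path: str = ENDPOINT_PATH) -> urllib.request.Request:
    """Build (but do not send) a POST request with Bearer authentication."""
    payload = {"model": MODEL, **body_fields}
    return urllib.request.Request(
        url=base_url.rstrip("/") + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Assumed body shape: `input` as in OpenAI-style embeddings requests.
req = build_request("YOUR_API_KEY", {"input": "hello world"})
print(req.get_method(), req.full_url)
# To send it: urllib.request.urlopen(req), once BASE_URL points at the real host.
```

Keeping the path as a parameter makes it easy to switch routes if the provider serves embedding models from a different one.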

Question 3

What is the context window of qwen3-embedding-8b?

Accepted Answer

qwen3-embedding-8b supports a context window of 41K tokens. Because an embedding model returns a vector rather than generated text, the limit in practice applies to the input you send in a single request.
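Text longer than the window has to be split before it is sent. The sketch below splits an already tokenized sequence into windows that each fit the limit. It assumes "41K" means 41,000 tokens (the exact figure may differ) and that token IDs come from whatever tokenizer you use; the function name is made up for the example.

```python
from typing import List, Sequence

# Assumption: "41K" is read as 41,000 tokens; the exact limit may differ.
CONTEXT_TOKENS = 41_000


def split_to_windows(token_ids: Sequence[int], limit: int = CONTEXT_TOKENS) -> List[List[int]]:
    """Split a token sequence into consecutive chunks of at most `limit` tokens."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    return [list(token_ids[i:i + limit]) for i in range(0, len(token_ids), limit)]


# Hypothetical document of 100,000 tokens.
windows = split_to_windows(range(100_000))
print([len(w) for w in windows])  # [41000, 41000, 18000]
```

Each window can then be embedded in its own request.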

Input price	$0.16 per 1M tokens
Output price	$0.16 per 1M tokens
Context window	41K tokens
Compatible endpoints	OpenAI
Vendor	Alibaba
