AvailableOpenAIChat⚡Cache 50% off

gpt-oss-120b

Build next-gen apps with gpt-oss-120b

Pricing

TokenLab Price

Input $0.105 / Output $0.42

Per 1M Tokens

Discount: 30%

	Official PricePer 1M Tokens	TokenLab PricePer 1M Tokens	Discount
Input	$0.15	$0.105	30%
Output	$0.60	$0.42	30%

Prompt cache pricing

Cache Read

$0.075

$0.0525

30%

One-click test

Test gpt-oss-120b in Web Agent with a short request at /v1/chat/completions, then show request body, latency, and response.

API workbench

The default route for production. The code sample below uses this endpoint with the format you pick.

ChatOpenAI Compatible

POST/v1/chat/completions

curl https://api.tokenlab.sh/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxx" \
  -d '{
    "model": "gpt-oss-120b",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

Use cases

Best for

Reasoning

Multi-step reasoning, analysis, and research workflows

Agents and tools

Drive reasoning, support triage, tool calls, and multi-step task flows.

Developer workflows

Generate, review, or debug code without rewiring your stack.

Knowledge assistants

Ship chat, search, and retrieval with predictable cost and behavior.

Side-by-side test

Compare real response quality, latency, and price for production defaults.

Prompt examples

Write a concise support reply and list the assumptions behind it.

Review this API design and call out the top three integration risks.

Turn a long changelog into release notes a non-engineer would read.

Cost Calculator

Monthly Input Tokens1M

Monthly Output Tokens0.5M

Estimated Monthly Cost$0.32

FAQ

How much does gpt-oss-120b cost?

On TokenLab, gpt-oss-120b costs $0.105 per 1M input tokens and $0.42 per 1M output tokens. Cache and per-request prices show in the pricing table when they apply.

What is gpt-oss-120b best for?

gpt-oss-120b is a strong fit for JSON Mode, Prompt Cache, Reasoning. You can call it through TokenLab with one API key.

How do I call the gpt-oss-120b API?

Get a TokenLab API key, then send your request to https://api.tokenlab.sh/v1/chat/completions. The API workbench above has a recommended endpoint and copy-ready code.

Which endpoint should gpt-oss-120b use?

Use https://api.tokenlab.sh/v1/chat/completions as the default for gpt-oss-120b. If a provider-native format is supported, the API workbench shows that endpoint too.

Can I test gpt-oss-120b before integrating it?

Yes. Try in Web Agent opens a ready test for gpt-oss-120b and keeps your prompt after sign-in, so you don’t lose context.

Related Models

Grok

xAI · 22 models

Doubao

ByteDance · 18 models

Qwen Omni

Alibaba Cloud · 13 models

Qwen

Alibaba Cloud · 10 models

Qwen 3 / QwQ

Alibaba Cloud · 10 models