AvailableOpenAIChat
gpt-oss-120b

Build next-gen apps with gpt-oss-120b

API Code Example

Pricing

TokenLab Price

$0.045

Per Token

Discount: 70%

One-click test

Sign in once and Web Agent keeps this model, prompt, and request preset for you.

Test gpt-oss-120b in Web Agent with a short request at /v1/chat/completions, then show request body, latency, and response.

API workbench

The default route for production. The code sample below uses this endpoint with the format you pick.

chatOpenAI Compatible
POST/v1/chat/completions
curl https://api.tokenlab.sh/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxx" \
  -d '{
    "model": "gpt-oss-120b",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

Use cases

Best for

Vision

Reading images, parsing documents, and answering visual questions

01

Agents and tools

Drive reasoning, support triage, tool calls, and multi-step task flows.

02

Developer workflows

Generate, review, or debug code without rewiring your stack.

03

Knowledge assistants

Ship chat, search, and retrieval with predictable cost and behavior.

04

Side-by-side test

See real response quality, latency, and price before this becomes your production default.

Prompt examples

Write a concise support reply and list the assumptions behind it.

Review this API design and call out the top three integration risks.

Turn a long changelog into release notes a non-engineer would read.

Cost Calculator

1M
0.5M
Estimated Monthly Cost$0.14

FAQ

How much does gpt-oss-120b cost?

On TokenLab, gpt-oss-120b costs $0.0450 per 1M input tokens and $0.1800 per 1M output tokens. Cache and per-request prices show in the pricing table when they apply.

What is gpt-oss-120b best for?

gpt-oss-120b is a strong fit for Tool Use, Vision, JSON Mode. You can call it through TokenLab with one API key.

How do I call the gpt-oss-120b API?

Get a TokenLab API key, then send your request to https://api.tokenlab.sh/v1/chat/completions. The API workbench above has a recommended endpoint and copy-ready code.

Which endpoint should gpt-oss-120b use?

Use https://api.tokenlab.sh/v1/chat/completions as the default for gpt-oss-120b. If a provider-native format is supported, the API workbench shows that endpoint too.

Can I test gpt-oss-120b before integrating it?

Yes. Try in Web Agent opens a ready test for gpt-oss-120b and keeps your prompt after sign-in, so you don’t lose context.

Related Models