AvailableMetaChat

llama-4-scout

Build next-gen apps with llama-4-scout

Pricing

TokenLab Price

$0.28

Per Token

Discount: 30%

	Official Price	TokenLab Price	Discount
Input	$0.40	$0.28	30%
Output	$0.70	$0.49	30%

One-click test

Test llama-4-scout in Web Agent with a short request at /v1/chat/completions, then show request body, latency, and response.

API workbench

The default route for production. The code sample below uses this endpoint with the format you pick.

chatOpenAI Compatible

POST/v1/chat/completions

curl https://api.tokenlab.sh/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxx" \
  -d '{
    "model": "llama-4-scout",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

Use cases

Best for

Vision

Reading images, parsing documents, and answering visual questions

Agents and tools

Drive reasoning, support triage, tool calls, and multi-step task flows.

Developer workflows

Generate, review, or debug code without rewiring your stack.

Knowledge assistants

Ship chat, search, and retrieval with predictable cost and behavior.

Side-by-side test

See real response quality, latency, and price before this becomes your production default.

Prompt examples

Write a concise support reply and list the assumptions behind it.

Review this API design and call out the top three integration risks.

Turn a long changelog into release notes a non-engineer would read.

Cost Calculator

Monthly Input Tokens1M

Monthly Output Tokens0.5M

Estimated Monthly Cost$0.53

FAQ

How much does llama-4-scout cost?

On TokenLab, llama-4-scout costs $0.2800 per 1M input tokens and $0.4900 per 1M output tokens. Cache and per-request prices show in the pricing table when they apply.

What is llama-4-scout best for?

llama-4-scout is a strong fit for Vision, Tool Use, JSON Mode. You can call it through TokenLab with one API key.

How do I call the llama-4-scout API?

Get a TokenLab API key, then send your request to https://api.tokenlab.sh/v1/chat/completions. The API workbench above has a recommended endpoint and copy-ready code.

Which endpoint should llama-4-scout use?

Use https://api.tokenlab.sh/v1/chat/completions as the default for llama-4-scout. If a provider-native format is supported, the API workbench shows that endpoint too.

Can I test llama-4-scout before integrating it?

Yes. Try in Web Agent opens a ready test for llama-4-scout and keeps your prompt after sign-in, so you don’t lose context.

Related Models

Alibaba Cloud · 6 models