Build next-gen apps with llama-4-scout
Pricing
TokenLab Price
$0.28
Discount: 30%
| Official Price | TokenLab Price | Discount | |
|---|---|---|---|
| Input | $0.40 | $0.28 | 30% |
| Output | $0.70 | $0.49 | 30% |
One-click test
Sign in once and Web Agent keeps this model, prompt, and request preset for you.
Test llama-4-scout in Web Agent with a short request at /v1/chat/completions, then show request body, latency, and response.
API workbench
The default route for production. The code sample below uses this endpoint with the format you pick.
curl https://api.tokenlab.sh/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer sk-xxx" \
-d '{
"model": "llama-4-scout",
"messages": [
{"role": "user", "content": "Hello!"}
]
}'Use cases
Best forVision
Reading images, parsing documents, and answering visual questions
Agents and tools
Drive reasoning, support triage, tool calls, and multi-step task flows.
Developer workflows
Generate, review, or debug code without rewiring your stack.
Knowledge assistants
Ship chat, search, and retrieval with predictable cost and behavior.
Side-by-side test
See real response quality, latency, and price before this becomes your production default.
Prompt examples
Write a concise support reply and list the assumptions behind it.
Review this API design and call out the top three integration risks.
Turn a long changelog into release notes a non-engineer would read.
Cost Calculator
FAQ
How much does llama-4-scout cost?
On TokenLab, llama-4-scout costs $0.2800 per 1M input tokens and $0.4900 per 1M output tokens. Cache and per-request prices show in the pricing table when they apply.
What is llama-4-scout best for?
llama-4-scout is a strong fit for Vision, Tool Use, JSON Mode. You can call it through TokenLab with one API key.
How do I call the llama-4-scout API?
Get a TokenLab API key, then send your request to https://api.tokenlab.sh/v1/chat/completions. The API workbench above has a recommended endpoint and copy-ready code.
Which endpoint should llama-4-scout use?
Use https://api.tokenlab.sh/v1/chat/completions as the default for llama-4-scout. If a provider-native format is supported, the API workbench shows that endpoint too.
Can I test llama-4-scout before integrating it?
Yes. Try in Web Agent opens a ready test for llama-4-scout and keeps your prompt after sign-in, so you don’t lose context.