All prices are in USD per million tokens. Updated 2026-05-11.

Model                      Input   Output   Quant   Context
Qwen/Qwen3.5-4B            $0.04   $0.08    bf16    262,144
Qwen/Qwen2.5-7B-Instruct   $0.05   $0.10    bf16    8,192
Qwen/Qwen3.5-27B           $0.10   $0.30    fp8     8,192
hexgrad/Kokoro-82M         $0.62   Free     fp32    4,096
openai/gpt-oss-20b         $0.75   $4.00    mxfp4   131,072
openai/gpt-oss-120b        $1.50   $8.00    mxfp4   131,072

Pricing is the same whether requests stream or not. Failed requests (5xx, 429) are not billed.
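
As a rough illustration of the arithmetic, the sketch below estimates the cost of a single request from the listed per-million-token prices. The function names, the PRICES dictionary layout, and the status_code parameter are hypothetical and are not part of any Tera API; only the prices and the 5xx/429 rule come from this page. How other error codes are billed is not stated here, so the sketch does not try to model them.

```python
from decimal import Decimal

# (input, output) prices in USD per million tokens, copied from the table.
# Only the text models with a priced output column are listed here.
PRICES = {
    "Qwen/Qwen3.5-4B": (Decimal("0.04"), Decimal("0.08")),
    "Qwen/Qwen2.5-7B-Instruct": (Decimal("0.05"), Decimal("0.10")),
    "Qwen/Qwen3.5-27B": (Decimal("0.10"), Decimal("0.30")),
    "openai/gpt-oss-20b": (Decimal("0.75"), Decimal("4.00")),
    "openai/gpt-oss-120b": (Decimal("1.50"), Decimal("8.00")),
}

MILLION = Decimal(1_000_000)


def is_unbilled_failure(status_code: int) -> bool:
    """True for the failure codes this page says are not billed (5xx, 429)."""
    return status_code == 429 or 500 <= status_code <= 599


def request_cost(model: str, input_tokens: int, output_tokens: int,
                 status_code: int = 200) -> Decimal:
    """Estimate the cost in USD of one request (hypothetical helper)."""
    if is_unbilled_failure(status_code):
        return Decimal(0)
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / MILLION


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens on gpt-oss-20b.
    print(request_cost("openai/gpt-oss-20b", 10_000, 2_000))
    # The same request failing with a 429 costs nothing.
    print(request_cost("openai/gpt-oss-20b", 10_000, 2_000, status_code=429))
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding drift. Streaming does not appear as a parameter because the price is the same either way.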

How billing works

  • Input tokens are counted from the rendered prompt after applying the model’s chat template.
  • Output tokens include generated text. For reasoning models, reasoning_content tokens count toward output.
  • TTS (Kokoro) is billed per input character; the charge is reported as prompt token cost, and output is free.
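
The two less obvious rules above can be sketched as follows. This is a minimal illustration with hypothetical helper names, not an official client. It assumes that one input character is billed as one prompt token at the listed Kokoro input price, which this page implies but does not state outright. It also counts characters with Python's len, which counts Unicode code points; the service's own character count may differ.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)


def billable_output_tokens(answer_tokens: int, reasoning_tokens: int = 0) -> int:
    """Reasoning tokens (reasoning_content) count toward output."""
    return answer_tokens + reasoning_tokens


def output_cost(answer_tokens: int, reasoning_tokens: int,
                output_price_per_million: Decimal) -> Decimal:
    """Output-side cost in USD for a reasoning model (hypothetical helper)."""
    total = billable_output_tokens(answer_tokens, reasoning_tokens)
    return total * output_price_per_million / MILLION


def tts_cost(text: str,
             price_per_million: Decimal = Decimal("0.62")) -> Decimal:
    """Estimated Kokoro cost in USD.

    Assumption: each input character is billed as one prompt token.
    """
    return len(text) * price_per_million / MILLION


if __name__ == "__main__":
    # 300 visible answer tokens plus 1,200 reasoning tokens on gpt-oss-120b
    # ($8.00 per million output tokens): all 1,500 tokens are billed.
    print(output_cost(300, 1_200, Decimal("8.00")))
    # A 13-character TTS input.
    print(tts_cost("Hello, world!"))
```

The practical consequence is that a short visible answer from a reasoning model can still carry a sizeable output charge when the reasoning trace is long.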

Volume discounts

Reach out at hello@tera.gw for committed-use pricing.