Pricing

OfoxAI uses a pure pay-as-you-go model — you only pay for what you use. No subscriptions, no plans, no monthly fees, no minimum spend.

No subscriptions, no plans, no hidden fees. Top up and start using immediately. Balance never expires.

Billing Method

Billing Item	Description
Input Tokens	Tokens consumed by input (prompt)
Output Tokens	Tokens consumed by output (completion)
Cached Tokens	Cache-hit input tokens (discounted rate)

Type	Billing Method
Image generation	Per image
Embeddings	Per input token

Every API response includes a usage field for tracking consumption:


{
  "usage": {
    "prompt_tokens": 100,
    "completion_tokens": 50,
    "total_tokens": 150
  }
}

Choose the right model — Use lightweight models for simple tasks; you don’t need flagship models for every scenario
Enable Prompt Caching — Repeated system prompts can save 50-90% on input costs
Control max_tokens — Set reasonable output length limits to avoid unnecessary token consumption
Batch requests — Combine multiple short requests into one to reduce repeated prompt overhead

For real-time pricing of each model, check the Model Catalog .