Pricing
OfoxAI uses a pure pay-as-you-go model — you only pay for what you use. No subscriptions, no plans, no monthly fees, no minimum spend.
No subscriptions, no plans, no hidden fees. Top up and start using immediately. Balance never expires.
Billing Method
Text Models (Per Token)
| Billing Item | Description |
|---|---|
| Input Tokens | Tokens consumed by input (prompt) |
| Output Tokens | Tokens consumed by output (completion) |
| Cached Tokens | Cache-hit input tokens (discounted rate) |
Other Models
| Type | Billing Method |
|---|---|
| Image generation | Per image |
| Embeddings | Per input token |
How It Works
- Sign up — Register at the OfoxAI Console
- Top up — Add funds as needed; balance never expires
- Use — Charged by actual usage, settled in real time
Viewing Costs
Console
- Log in to the OfoxAI Console
- Go to the Billing page
- View real-time cost breakdown by model and by date
API Response
Every API response includes a usage field for tracking consumption:
{
"usage": {
"prompt_tokens": 100,
"completion_tokens": 50,
"total_tokens": 150
}
}Cost Optimization Tips
- Choose the right model — Use lightweight models for simple tasks; you don’t need flagship models for every scenario
- Enable Prompt Caching — Repeated system prompts can save 50-90% on input costs
- Control max_tokens — Set reasonable output length limits to avoid unnecessary token consumption
- Batch requests — Combine multiple short requests into one to reduce repeated prompt overhead
For real-time pricing of each model, check the Model Catalog .
Last updated on