Request-Based Pricing

Simple, transparent pricing

Transparent pricing with weighted fast requests for priority runs and a separate monthly token budget for total included compute.

MonthlyAnnual
Fast requests are your priority lane. Token budget is your monthly compute cap. Higher-cost models may use more fast-request credits. GPT-5.5 and Claude Opus 4.7 are active in the catalog, and paid plans continue in the slow queue after fast credits are exhausted.
Loading live pricing...

Available AI Models

Live model catalog with GPT-5.5 and Claude Opus 4.7 active, plus public API input and output pricing per 1M tokens.

ModelProviderAPI Price / 1MContext Window
No published models are available right now.

Need More Fast Credits?

Purchase extra fast-request credits anytime. Packs never expire.

No request packs are currently published.

Slow Queue β€” Never Blocked

When your fast-request credits run out, paid plans keep working through the slow queue.

Same Quality

Same models, same output quality. Only the wait time changes.

5-30s Delay

Requests are queued with a short delay depending on server load. Off-peak is nearly instant.

20 per Hour

Rate limited to 20 slow requests per hour. Need more? Buy a request pack for extra fast-request credits.

Compare Plans

See exactly what you get with each plan.

Plan comparison will appear here when live plans are available.

Team

Custom credits, team workspaces, admin dashboard, and shared billing for your development team.

  • Team workspaces & SSO
  • Admin dashboard
  • Unlimited devices
  • Priority support
Contact Sales

Enterprise

Dedicated infrastructure, SLA guarantees, on-premise deployment, and custom integrations.

  • Everything in Team
  • SLA guarantee
  • On-premise deployment
  • Dedicated support engineer
Contact Sales

Frequently Asked Questions

No FAQ entries are published right now.