LLM inference

Powerful models.
Crazy cheap pricing.

Access the best open-source models at the lowest prices on the market.

Platform
Everything you need to ship
A clean API, straightforward billing, and the models you actually want to use.

OpenAI-compatible

Drop-in replacement for the OpenAI SDK. Change your base URL and you're live.

Transparent pricing

Pay per token with no hidden costs, or subscribe for a flat daily request allowance.

Many models

A curated selection of top open-weight and proprietary LLMs, including vision and reasoning models.

API key management

Create named keys per project. Revoke individually without disrupting other integrations.

Usage dashboard

Monitor token usage by model, track credits, and manage your plan from one place.
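
One way to take advantage of named per-project keys is to have each project load its own key, so revoking one key never touches another integration. A minimal sketch, assuming keys are stored in environment variables named `CROFAI_KEY_<PROJECT>`; that naming convention and the `key_for_project` helper are illustrative assumptions, not something CrofAI prescribes.

```python
import os


def key_for_project(project, env=None):
    """Return the API key configured for one project.

    Hypothetical convention: the key for project "support-bot" lives in
    the environment variable CROFAI_KEY_SUPPORT_BOT.
    """
    env = os.environ if env is None else env
    var = "CROFAI_KEY_" + project.upper().replace("-", "_")
    if var not in env:
        raise KeyError(f"no API key configured for {project!r}; set {var}")
    return env[var]


# Demo with an in-memory mapping instead of the real environment.
demo_env = {"CROFAI_KEY_SUPPORT_BOT": "key-for-support-bot"}
print(key_for_project("support-bot", env=demo_env))  # prints "key-for-support-bot"
```

The returned value would then be passed as `api_key` when constructing the client.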

Integration
Three lines to your first call
If you've used the OpenAI SDK before, you already know how to use CrofAI.
Python
from openai import OpenAI

client = OpenAI(
    base_url="https://crof.ai/v1",
    api_key="your-api-key",
)

response = client.chat.completions.create(
    model="glm-5",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
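
Without the SDK installed, the same call can be sketched with the Python standard library. This assumes CrofAI accepts the request shape the OpenAI SDK sends: a POST to `/chat/completions` under the base URL, with a bearer token. The `build_chat_request` helper and the `CROFAI_API_KEY` variable name are illustrative assumptions, not part of CrofAI's documentation.

```python
import json
import os
import urllib.request

BASE_URL = "https://crof.ai/v1"  # base URL from the SDK example above


def build_chat_request(prompt, model="glm-5", api_key=None):
    """Build (but do not send) a chat completion request.

    Hypothetical helper: assumes the endpoint mirrors OpenAI's
    /chat/completions route and bearer-token authentication.
    """
    # CROFAI_API_KEY is an assumed environment variable name.
    key = api_key or os.environ.get("CROFAI_API_KEY", "your-api-key")
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_chat_request("Hello!")
print(request.get_method(), request.full_url)
```

Nothing is sent until the request is passed to `urllib.request.urlopen`, which makes the sketch safe to run without a valid key.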
Plans
Simple, predictable pricing
Start free with per-token billing. Move to a plan when you're ready.
Free: $0 per month, pay per token
Hobby: $5 per month, 500 requests / day
Pro: $10 per month, 1,000 requests / day
Scale: $50 per month, 7,500 requests / day
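
To compare the paid plans, the listed prices and daily allowances can be converted into a cost per 1,000 requests. A rough sketch, assuming a 30-day month and that the full daily allowance is used every day; both are assumptions made here for illustration and are not stated on this page.

```python
# Prices (USD per month) and daily request allowances from the plans above.
PLANS = {
    "Hobby": (5, 500),
    "Pro": (10, 1_000),
    "Scale": (50, 7_500),
}
DAYS_PER_MONTH = 30  # assumption: billing month length


def cost_per_thousand(price_usd, requests_per_day, days=DAYS_PER_MONTH):
    """Effective price per 1,000 requests if the allowance is fully used."""
    monthly_requests = requests_per_day * days
    return price_usd / monthly_requests * 1_000


for name, (price, per_day) in PLANS.items():
    print(f"{name}: ${cost_per_thousand(price, per_day):.3f} per 1,000 requests")
# Hobby: $0.333 per 1,000 requests
# Pro: $0.333 per 1,000 requests
# Scale: $0.222 per 1,000 requests
```

Under these assumptions Hobby and Pro cost the same per request, and Scale is about a third cheaper. Lighter usage raises the effective per-request price, which is where per-token billing on the Free tier may work out better.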

Ready to build?

Create a free account and make your first API call in minutes.

Get Started