Cerebras Provider

Cerebras runs AI models on wafer-scale custom chips, delivering extremely fast inference for open-source models. Their hardware is purpose-built for AI workloads.

Get an API Key

Go to cloud.cerebras.ai
Create an account or sign in
Navigate to API Keys
Create a new key and copy it

Cerebras offers a free tier with limited usage.

Configure VibeCody

Option 1: Environment variable (recommended)

export CEREBRAS_API_KEY="..."
vibecli --provider cerebras

Option 2: Config file (~/.vibecli/config.toml)

[cerebras]
enabled = true
api_key = "..."
model = "llama3.1-70b"

Model Selection

Model	Strengths	Best for
`llama3.1-70b`	Strong general coding	Daily coding tasks
`llama3.1-8b`	Ultra-fast	Quick completions, simple edits

Default: llama3.1-70b

Override from the CLI:

vibecli --provider cerebras --model llama3.1-8b

Best For

Ultra-fast inference – custom hardware delivers very low latency
Open-source models – access Llama models with blazing speed
Free tier – test fast inference without upfront costs

Verify Connection

vibecli --provider cerebras -c "Write a Python class for a binary search tree"

Troubleshooting

Rate limited

Error: 429 Too Many Requests

Free tier has usage limits
Wait and retry, or upgrade for higher limits

Model not available

Check cloud.cerebras.ai for current model availability

Provider: Cerebras