Cerebras Provider
Cerebras runs AI models on wafer-scale custom chips, delivering extremely fast inference for open-source models. Their hardware is purpose-built for AI workloads.
Get an API Key
- Go to cloud.cerebras.ai
- Create an account or sign in
- Navigate to API Keys
- Create a new key and copy it
Cerebras offers a free tier with limited usage.
Configure VibeCody
Option 1: Environment variable (recommended)
export CEREBRAS_API_KEY="..."
vibecli --provider cerebras
Option 2: Config file (~/.vibecli/config.toml)
[cerebras]
enabled = true
api_key = "..."
model = "llama3.1-70b"
Model Selection
| Model | Strengths | Best for |
|---|---|---|
llama3.1-70b |
Strong general coding | Daily coding tasks |
llama3.1-8b |
Ultra-fast | Quick completions, simple edits |
Default: llama3.1-70b
Override from the CLI:
vibecli --provider cerebras --model llama3.1-8b
Best For
- Ultra-fast inference – custom hardware delivers very low latency
- Open-source models – access Llama models with blazing speed
- Free tier – test fast inference without upfront costs
Verify Connection
vibecli --provider cerebras -c "Write a Python class for a binary search tree"
Troubleshooting
Rate limited
Error: 429 Too Many Requests
- Free tier has usage limits
- Wait and retry, or upgrade for higher limits
Model not available
- Check cloud.cerebras.ai for current model availability