Deploy VibeCody on macOS

Works on MacBook (Air/Pro) and Mac Mini/Studio. Apple Silicon gets Metal GPU acceleration for fast local inference.

Setup time: 2 minutes

Cost: Free

GPU: Metal (Apple Silicon)

Quick Start

curl -fsSL https://raw.githubusercontent.com/TuringWorks/vibecody/main/install.sh | sh
vibecli

Or use the guided setup:

cd vibecody/deploy/macos
./setup.sh --always-on

curl -fsSL https://raw.githubusercontent.com/TuringWorks/vibecody/main/install.sh | sh

brew install ollama
ollama serve &
ollama pull codellama:7b

Apple Silicon Macs run Ollama with Metal GPU acceleration — a MacBook Pro M3 can run 13B models at ~30 tokens/sec.

vibecli

Install VibeCody as a launchd service that starts at login:

./setup.sh --always-on
# Or manually:
vibecli --service install
vibecli --service start

Access at: http://localhost:7878

A Mac Mini with Apple Silicon is the best desktop “server” — silent, efficient, and Metal GPU runs 13B models at full speed.

# Headless setup with remote access
./setup.sh --always-on
vibecli --serve --tailscale  # Expose via Tailscale

Mac	RAM	Recommended Model	Performance
MacBook Air M1	8 GB	codellama:7b	~15 tok/s
MacBook Pro M3	18 GB	codellama:13b	~30 tok/s
Mac Mini M2 Pro	32 GB	codellama:34b	~20 tok/s
Mac Studio M2 Ultra	64 GB+	llama3:70b	~15 tok/s

./uninstall.sh
# Or: vibecli --service stop && rm ~/.local/bin/vibecli

Problem	Solution
“command not found”	Add `~/.local/bin` to PATH: `export PATH="$HOME/.local/bin:$PATH"`
Ollama slow	Ensure Ollama is using Metal — check with `ollama ps`
Service won’t start	Check logs: `cat /tmp/vibecody-stderr.log`
Intel Mac	Works fine but no GPU acceleration — use cloud providers for large models