Documentation

v1.0.0-beta · Apple Silicon · macOS 12+ · MIT License

Overview

your-local-agent puts a full AI coding assistant — model, inference engine, and agent loop — entirely on your Mac. One command to install. One command to start. Nothing leaves your machine. Ever.

Every AI assistant you've used lives somewhere else. Your prompts travel over the internet, get logged, filtered, and processed on hardware you'll never see. That's fine for most things.

But when you're debugging at 2am. Writing scripts you'd rather not explain. Asking the same question for the fifteenth time. Working offline. Or asking about anything that would get you flagged by a cloud provider. That's when you want it on your machine.

NOTE

Apple Silicon only. your-local-agent targets M1, M2, M3, and M4 chips. Intel Mac support is on the roadmap.

Requirements

Hard requirements

| Requirement | Minimum | Notes |
|---|---|---|
| macOS | 12 Monterey | Works on 13, 14, and 15 too |
| Chip | Apple Silicon (M1+) | M2/M3/M4 recommended |
| RAM | 8 GB | 16 GB+ for better models |
| Free disk | 8 GB | Up to 20 GB for the 32B model |
| Internet | Required once | Only for initial setup and updates |

Software prerequisites

The setup script installs most things automatically, but these must exist first:

1. Xcode Command Line Tools — needed to build llama.cpp

xcode-select --install
If you see "command line tools are already installed", you're good.

2. Python 3 — used to parse models and state files

python3 --version
python3 is available once the Xcode Command Line Tools are installed. If it's missing: brew install python3

3. curl — needed to fetch the setup script

curl --version
Comes pre-installed on macOS. If somehow missing: brew install curl

4. A supported shell — zsh or bash

echo $SHELL
Zsh is the default on macOS Catalina and later. Both are fully supported.
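
If you'd rather check everything in one pass before running setup, here's a quick preflight sketch (not part of the project, just standard macOS commands):

# Preflight sketch — uses only standard macOS commands
xcode-select -p >/dev/null 2>&1 \
  && echo "ok: command line tools" \
  || echo "missing: run xcode-select --install"
for cmd in python3 curl; do
  command -v "$cmd" >/dev/null && echo "ok: $cmd" || echo "missing: $cmd"
done
echo "shell: $SHELL"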

Install

One-liner (recommended)

Paste this in your terminal and press Enter:

curl -fsSL https://raw.githubusercontent.com/noelps-git/your-local-agent/main/setup.sh | bash

That's it. The script:

  1. Detects your RAM, chip, and disk space
  2. Installs Homebrew, llama.cpp, Node.js, and Aider (skips anything already installed)
  3. Downloads the right AI model for your exact hardware
  4. Configures everything and wires up your shell aliases
  5. Runs a live test to confirm the agent is working
TIME

Estimated time: 10–30 minutes depending on your internet speed and which model is selected. The model download is the slow part.
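
For the curious, step 1 boils down to standard sysctl and df queries. A minimal sketch of the idea (the real script's logic may differ):

# Sketch of hardware detection — the actual setup.sh may differ
chip=$(sysctl -n machdep.cpu.brand_string)          # e.g. "Apple M2"
ram_gb=$(( $(sysctl -n hw.memsize) / 1073741824 ))  # bytes -> GB
free_gb=$(df -g "$HOME" | awk 'NR==2 {print $4}')   # free space in GB
echo "Chip: $chip | RAM: ${ram_gb} GB | Free: ${free_gb} GB"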

From source

git clone https://github.com/noelps-git/your-local-agent.git
cd your-local-agent
bash setup.sh

First launch

After setup finishes, restart your terminal, then:

local-ai-start

Your local AI is now running. Open a project folder and start talking to it.

Commands

All commands are shell aliases installed into your ~/.zshrc (or ~/.bash_profile) by the setup script.

| Command | What it does |
|---|---|
| local-ai-start | Start the server and launch Aider in your current directory |
| local-ai-stop | Shut down the llama-server and exit Aider |
| local-ai-status | Check whether the server is running, and on which port |
| local-ai-info | Show the loaded model, port, and RAM usage |
| local-ai-update | Pull the latest scripts, update llama.cpp and Aider, and check for better models |
| local-ai-setup | Re-run model detection and configuration (safe to re-run) |
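
Since they're plain aliases, you can see exactly what setup wrote with:

grep "local-ai" ~/.zshrc    # or ~/.bash_profile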

Models

Setup reads your RAM and picks automatically. No config files, no manual decisions.

| RAM | Model | Size | Speed |
|---|---|---|---|
| 8 GB | Qwen3 4B | 3.2 GB | ~35 tok/s |
| 16 GB | Qwen3 8B | 5.0 GB | ~25 tok/s |
| 24 GB | Qwen3 14B | 9.0 GB | ~20 tok/s |
| 32 GB+ | Qwen3 32B | 19.5 GB | ~12 tok/s |

You can override the auto-selection if you know what you're doing — it'll warn you, then let you proceed.
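
The mapping is just the RAM thresholds from the table above. Conceptually (illustrative, not the script's actual code):

# Illustrative model selection — thresholds from the table above
ram_gb=$(( $(sysctl -n hw.memsize) / 1073741824 ))
if   [ "$ram_gb" -ge 32 ]; then model="Qwen3 32B"
elif [ "$ram_gb" -ge 24 ]; then model="Qwen3 14B"
elif [ "$ram_gb" -ge 16 ]; then model="Qwen3 8B"
else                            model="Qwen3 4B"
fi
echo "Selected: $model"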

Use cases

  - Coding
  - Security research
  - Learning

How it works

The stack is three components, all running locally:

Your Mac
│
├── llama-server          ← runs the AI model using your M-series GPU
│     └── Qwen3 (4B–32B)  ← automatically chosen for your RAM
│
└── Aider                 ← reads your files, talks to the model,
      └── your terminal     answers in your project context

No Docker. No Python environment hell. No .env file you'll accidentally push to GitHub.
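
llama-server exposes an OpenAI-compatible HTTP API on localhost, and that is all Aider needs. You can poke the same endpoint yourself (default port 8080, per the config below):

curl -s http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Say hello in one word."}]}'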

Files on disk

~/models/
  └── Qwen3-*.gguf           the model (3–20 GB depending on RAM)

~/.local-ai/
  ├── setup.log              full timestamped log of every setup step
  ├── server.log             llama-server runtime output
  ├── state.json             what's installed (used by local-ai-update)
  └── bin/
      └── llama-server       the inference binary

~/.aider/
  └── config.json            Aider pointed at localhost:8080
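
These are all plain text. For example, to pretty-print the state file with the Python that's already a prerequisite:

python3 -m json.tool ~/.local-ai/state.json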

Troubleshooting

Setup failed — where do I look?

cat ~/.local-ai/setup.log

Every step is logged with a timestamp. Scroll to the last error.
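
Or jump straight to the failures instead of scrolling:

grep -iE "error|fail" ~/.local-ai/setup.log | tail -n 5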

command not found: local-ai-start

Your shell aliases weren't loaded yet. Run:

source ~/.zshrc   # or source ~/.bash_profile

Then try local-ai-start again.

xcode-select: error: command line tools are already installed

Not actually an error — this message means the tools are present. Setup will continue normally.

Homebrew install fails or hangs

If you're behind a corporate proxy or a VPN is active, Homebrew's CDN can be blocked. Try:

# Disable proxy temporarily
unset HTTP_PROXY HTTPS_PROXY ALL_PROXY

# Then re-run setup
bash setup.sh

Model download stops or times out

Downloads are resumable. Just re-run:

bash setup.sh

Setup detects what's already done and skips it — only the download resumes.

llama-server: Bad CPU type in executable

You're on an Intel Mac. This build targets Apple Silicon only. Intel support is on the roadmap.

llama-server won't start / port 8080 in use

Something else is already using port 8080. Find and kill it:

lsof -i :8080
kill -9 <PID>
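
Or as a single command (-t makes lsof print just the PID):

kill -9 $(lsof -ti :8080)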

Then run local-ai-start again.

Aider can't connect to the model

Check the server is actually running:

local-ai-status
curl http://localhost:8080/health

If the server is down, start it: local-ai-start

permission denied running setup.sh

chmod +x setup.sh
bash setup.sh

Out of disk space mid-download

df -h ~          # check how much free space you have
bash setup.sh    # resumes the download
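
If you need to free space first, the model directory is usually the biggest consumer:

du -sh ~/models/*    # size of each downloaded model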

Security

The setup script was written with the following in mind:

WARN

curl | bash — inherent to one-liner installers (Homebrew does this too). Mitigated: HTTPS only, GitHub raw source. Clone and inspect before running if you prefer.
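
To do exactly that, download the script, read it, then run it:

curl -fsSL https://raw.githubusercontent.com/noelps-git/your-local-agent/main/setup.sh -o setup.sh
less setup.sh    # read it first
bash setup.sh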

Updating

local-ai-update

Pulls the latest scripts, updates llama.cpp and Aider, and checks whether a better model is available for your RAM. If one is, it asks before downloading and removes the old model afterward to free up space.

Roadmap

Contributing

Most useful PRs:

Test on your own machine before submitting.

View repository on GitHub →