tutorial How to Run OpenAI Codex CLI on a VPS: Setup and Auth Guide TL;DR You can run OpenAI Codex CLI on a headless VPS, but the OAuth flow requires a browser — you solve this by SSH port-forwarding the callback URL back to your local machine. Once authenticated, use tmux to keep sessions alive across disconnects. If you want a single always-on environment
agent-oversight Configure Claude Code Approval Gates by Project Risk Level You want Claude Code to keep moving on throwaway work and stay careful on production. The same global setting can't do both. Here's the three-tier config that matches gate strictness to actual project risk.
mobile-coding-agents What Breaks When You Run Claude Code Over SSH — And Three Fixes SSH into your remote server, start Claude Code, and half the UI is gone — diff viewer, task panels, changed-files sidebar, all of it. Here's why the terminal UI collapses over SSH and how Leo, Claudette, and tmux each solve it differently.
agent-oversight Automated Quality Gates for Agent Code: Beyond Passing Tests Your agent's PR passed CI. Tests are green. But hardcoded secrets, hallucinated imports, and convention drift all survive standard checks. Here's the three-layer pipeline that catches what tests miss.
ai-dev-environments The venv for Claude Code: Reproducible Team Environments Your teammate cloned the repo, ran Claude Code, and the agent did something yours never would. The tool versions differ by two majors and the settings.json was never committed. Here's the fix.
agent-oversight Catch Agent Mistakes Before They Execute: Agent Verifier + Conduct Your agent is about to write an API key to disk. You have 10 seconds to catch it. Here's how to automate that check — before the damage is done.
mobile-coding-agents Cut Claude Code Token Usage 98% with Purpose-Built MCPs Claude Code reads entire files. On a large codebase or an 80K-token SEC filing, that's a budget-destroying habit. Two open-source MCPs cut token usage by 98% — here's the exact wiring.
agent-oversight Where to Gate Your AI Coding Agent: A 3-Checkpoint Framework Most developers run zero approval gates on their AI coding agents. The other extreme — gating every tool call — just rebuilds a slow human workflow. Here's the minimal 3-checkpoint architecture that covers real risk without the noise.
agent-oversight How to Build Human-in-the-Loop Approval Gates for AI Coding Agents Your agent just ran something you didn't ask for. Here's the three-pattern stack — PreToolUse hooks, ThumbGate blocklists, and mobile approval forwarding — that keeps agents fast without giving them a blank check.
tutorial Run Multiple Coding Agents in Parallel with Git Worktrees Parallel agents feel productive right up until one silently overwrites the other's work. Here's the isolation and oversight framework that stops it from happening.
agent-oversight How to Audit What Your AI Agent Actually Did After the Session Your AI agent finished the session. But did it stay on task? Here's the post-run audit to catch silent scope creep — before it compounds into something you can't easily reverse.
mobile-coding-agents How to Use Claude Code --teleport to Hand Off Sessions to Your Phone You close your laptop — your Claude Code session dies. Here's how to hand off long-running tasks to your phone using --teleport, Remote Control, and always-on cloud VMs.
tutorial How to Authenticate Claude Code and Codex on a Headless VPS How to SSH into a remote Linux server, install Claude Code, keep it running persistently with tmux, and optionally connect to the session from your phone without a terminal.
tutorial Keep Claude Code Running After SSH Disconnects (tmux Guide) Claude Code exits when you close your terminal because it receives SIGHUP. tmux is the fix — here's how to set it up, detach, and reconnect, with screen and nohup as alternatives.
tutorial How to Run Claude Code with tmux on a VPS A complete tmux workflow for Claude Code — named sessions per project, multiple parallel agents in separate windows or panes, detach and reattach, and session recovery.
tutorial How to Review Your Agent's Code Changes from Your Phone Your agent just finished a feature build. Grass lets you review every file it touched — diffs, syntax highlighting, line numbers — right from your phone.