A native macOS AI agent that controls your apps, writes code, automates workflows, and runs tasks from your iPhone via iMessage. All powered by the AI provider of your choice.
- Download Agent! and drag to Applications
- Open Agent! -- it sets up everything automatically
- Pick your AI -- Settings β choose a provider β enter API key
- Type a task and press Enter
π‘ No API key? Use Ollama with GLM-5 -- completely free, runs offline, no account needed. Requires 32GB+ RAM.
"Play my Workout playlist in Music" "Build the Xcode project and fix any errors" "Take a photo with Photo Booth" "Send an iMessage to Mom saying I'll be home at 6" "Open Safari and search for flights to Tokyo" "Refactor this class into smaller files" "What calendar events do I have today?"
Just type what you want. Agent! figures out how and makes it happen.
Full coding environment built in. Reads codebases, edits files with precision, runs shell commands, builds Xcode projects, manages git, and auto-enables coding mode to focus the AI on development tools. Replaces Claude Code, Cursor, and Cline -- no terminal, no IDE plugins, no monthly fee.
Control any Mac app through the Accessibility API. Click buttons, type into fields, navigate menus, scroll, drag -- all programmatically. Powered by AXorcist for reliable, fuzzy-matched element finding.
| Provider | Cost | Best For |
|---|---|---|
| GLM-5 / GLM-5.1 (Ollama) | Low Cost | Recommended starting point |
| Claude (Anthropic) | Paid | Complex tasks |
| ChatGPT (OpenAI) | Paid | General purpose |
| Apple Intelligence | Free | On-device, private |
| DeepSeek | Paid | Budget cloud AI |
| Local Ollama | Free | Full privacy, offline |
| LM Studio | Free | Easy local setup |
| Hugging Face | Varies | Open-source models |
| Z.ai | Paid | Fast, versatile |
| Ollama Cloud / vLLM | Varies | Self-hosted |
Click the microphone and speak. Agent! transcribes in real time and executes your request.
Text your Mac from your iPhone:
Agent! What song is playing?
Agent! Check my email
Agent! Next Song
Your Mac runs the task and texts back the result. Only approved contacts can send commands.
Drives Safari hands-free -- search Google, click links, fill forms, read pages, extract information.
For complex tasks, Agent! creates a step-by-step plan, works through each step, and checks them off in real time.
Work on multiple tasks simultaneously. Each tab has its own project folder and conversation history.
Take screenshots or paste images. Vision-capable AI models analyze what they see -- describe content, read text, spot UI issues.
Agent! includes built-in Safari web automation via JavaScript and AppleScript. Search Google, click links, fill forms, read page content, and execute JavaScript -- all hands-free.
To enable: Open Safari β Settings β Advanced β check "Show features for web developers". Then go to Developer menu β check "Allow JavaScript from Apple Events".
Full cross-browser automation via Microsoft Playwright MCP. Click, type, screenshot, and navigate any website in Chrome, Firefox, or WebKit -- all controlled by the AI.
Setup (one-time):
# 1. Install Node.js (if not already installed)
brew install node
# 2. Install Playwright MCP server globally
npm install -g @playwright/mcp@latest
# 3. Install browser binaries (pick one or all)
npx playwright install chromium # Chrome (~165MB)
npx playwright install firefox # Firefox (~97MB)
npx playwright install webkit # Safari/WebKit (~75MB)
npx playwright install # All browsersConfigure in Agent!:
Go to Settings β MCP Servers β Add Server, paste this JSON:
{
"mcpServers": {
"playwright": {
"command": "npx",
"args": ["@playwright/mcp"],
"transport": "stdio"
}
}
}Note: If
npxis not found, use the full path: runwhich npxin Terminal and replace"npx"with the result (e.g."/opt/homebrew/bin/npx").
Toggle ON and Playwright tools appear automatically. The AI can now control browsers directly.
- Your data stays on your Mac. Files, screen contents, and personal data are never uploaded.
- Cloud AI only sees your prompt text. Use local AI to stay 100% offline.
- You're in control. Agent! shows everything it does and logs every action.
- Built on Apple's security model. macOS permissions protect your system.
| Shortcut | Action |
|---|---|
Enter |
Run task |
β R |
Run current task |
β . |
Stop task |
Escape |
Cancel active task |
β D |
Toggle LLM output panel |
β T |
New tab |
β W |
Close tab |
β 1-9 |
Switch to tab |
β [ / β ] |
Previous / next tab |
β F |
Search activity log |
β L |
Clear conversation |
β H |
Task history |
β , |
Settings |
β V |
Paste image |
β / β |
Prompt history |
Do I need to know how to code? No. Just type what you want in plain English.
Is it safe? Yes. Standard macOS automation, full activity logging, you approve permissions.
How much does it cost? Agent! is free (MIT License). Cloud AI providers charge for API usage. Local models are free.
What Mac do I need? macOS 26+. Apple Silicon recommended. 32GB+ RAM for local models.
How is this different from Siri? Siri answers questions. Agent! performs actions -- controls apps, manages files, builds code, automates workflows.
- Technical Architecture -- Tools, scripting, developer details
- Comparisons -- vs Claude Code, Cursor, Cline, OpenClaw
- Security Model -- XPC architecture, privilege separation
- FAQ -- Common questions
Agent! supports MCP servers for extended capabilities. Configure in Settings β MCP Servers.
Connect Agent! directly to Xcode for project-aware operations:
{
"mcpServers" : {
"xcode" : {
"command" : "xcrun",
"args" : [
"mcpbridge"
],
"transport" : "stdio"
}
}
}Xcode MCP provides:
- Project-aware file operations (read/write/edit/delete)
- Build and test integration
- SwiftUI Preview rendering
- Code snippet execution
- Apple Developer Documentation search
- Real-time issue tracking
MIT - free and open source.