Agent Mint
Welcome to the definitive guide for Mint. This document covers everything from your first installation to advanced autonomous engineering workflows.
1. Installation
Option A: Global NPM (Recommended)
Install Mint globally to use the mint command anywhere in your terminal.
$ npm install -g @pheem49/mint@latest
Option B: Manual Build (For Developers)
Perfect for contributing or testing the latest experimental branch.
$ git clone https://github.com/Pheem49/Mint.git
$ cd Mint
$ npm install
2. Initial Setup (The Onboarding Wizard)
Configuring a powerful AI assistant can be complex, so Mint includes a built-in Onboarding Wizard to handle everything in one go. Run it by typing:
$ mint onboard
Step 1: Core AI Activation (Gemini)
Mint is powered primarily by Google Gemini. You will be prompted for:
- API Key: Paste your key from Google AI Studio. This key is stored locally and never shared.
- Model Selection: Choose between flash (fast & cheap) or pro (highly intelligent). We recommend gemini-3.1-flash-lite-preview for daily tasks.
Step 2: QuickStart Provider Selection
This is where you choose which "plugins" or "channels" Mint should connect to. The menu uses an interactive checkbox system:
- ↑ / ↓ : Navigate through the list.
- Space : Toggle (Select/Deselect) an item. Selected items show a ◉.
- a : Select all items.
- i : Invert your current selection.
- Enter : Confirm and move to configuration.
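The key bindings above amount to a handful of operations on a set of selected items. The following Python sketch models that behaviour; it is not Mint's actual implementation. The `Checklist` class, the provider names, and the choice to clamp the cursor at the ends of the list (rather than wrap around) are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Checklist:
    """Hypothetical model of an interactive checkbox menu."""
    items: list[str]
    cursor: int = 0
    selected: set[str] = field(default_factory=set)

    def move(self, step: int) -> None:
        # Up/Down: move the cursor, clamped to the ends of the list (assumed).
        self.cursor = max(0, min(len(self.items) - 1, self.cursor + step))

    def toggle(self) -> None:
        # Space: select the item under the cursor, or deselect it.
        self.selected ^= {self.items[self.cursor]}

    def select_all(self) -> None:
        # a: select every item.
        self.selected = set(self.items)

    def invert(self) -> None:
        # i: selected items become unselected and vice versa.
        self.selected = set(self.items) - self.selected

    def confirm(self) -> list[str]:
        # Enter: return the selection in menu order.
        return [item for item in self.items if item in self.selected]


menu = Checklist(["Telegram", "Discord", "Ollama", "Brave Search"])
menu.toggle()       # select "Telegram"
menu.move(+2)
menu.toggle()       # select "Ollama"
menu.invert()       # "Discord" and "Brave Search" are now selected instead
chosen = menu.confirm()
```

Inverting after picking two of four items leaves the other two selected, which is why `i` is convenient when you want everything except a few entries.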
Step 3: Categorized Services
- Connect Mint to Telegram, Discord, WhatsApp, Slack, or LINE. This allows you to talk to your desktop agent from your phone!
- Enable support for Anthropic (Claude), OpenAI, or Hugging Face if you prefer their models for specific coding tasks.
- Privacy first. Connect to Ollama or LM Studio running on your machine to use Mint without sending data to the cloud.
- Enable Google Search or Brave Search to give Mint real-time internet browsing capabilities for technical research.
Step 4: Dynamic Detail Entry
Based on your selections in Step 2, the wizard will now ask for the specific details needed for each service (e.g., Discord Bot Tokens, Search Engine IDs, or Local API URLs). Simply follow the prompts until you see the ✅ Configuration saved successfully! message.
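One way to picture this step is as a lookup from each selected service to the details it needs. The Python sketch below is illustrative only: the `REQUIRED_FIELDS` table and the `prompts_for` helper are hypothetical, and the field names are taken loosely from the examples in the paragraph above rather than from Mint's real configuration schema.

```python
# Hypothetical mapping from a selected service to the details it requires.
REQUIRED_FIELDS = {
    "Discord": ["Bot Token"],
    "Google Search": ["API Key", "Search Engine ID"],
    "Ollama": ["Local API URL"],
}


def prompts_for(selected: list[str]) -> list[tuple[str, str]]:
    """Return (service, field) pairs to ask for, in selection order."""
    return [
        (service, field_name)
        for service in selected
        for field_name in REQUIRED_FIELDS.get(service, [])
    ]


questions = prompts_for(["Discord", "Ollama"])
for service, field_name in questions:
    print(f"{service}: please enter {field_name}")
```

Only the services chosen earlier produce prompts, so an unselected provider never asks for credentials.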
3. Primary CLI Commands
Use these global commands to interact with Mint from your terminal:
- Starts the unified interactive agent UI. This is your main gateway to talking with Mint.
- mint code: Executes a specific autonomous coding task in the current directory (Workspace Mode).
- Queues a long-running background task for the headless agent to perform autonomously.
- mint mcp: Primary command for managing Model Context Protocol (MCP) servers and tools.
- Displays an exhaustive list of all available features, tools, and terminal commands.
4. The Agent Workflow
Mint operates on a Plan-Act-Observe loop. Complex tasks are broken into a sequence of steps, and the result of each step informs the next one.
Thinking & Tool Selection
When given a task, Mint plans its moves, selects the appropriate tool (like web search or file editing), executes it, and then observes the results to decide the next action.
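The loop described above can be sketched in a few lines of Python. This is a simplified illustration, not Mint's internal code: `run_agent`, `toy_plan`, and the stand-in tools are hypothetical, and a real planner would be a model call rather than a hand-written function.

```python
from typing import Callable, Optional

Action = tuple[str, str]  # (tool name, argument)
Planner = Callable[[str, list[str]], Optional[Action]]


def run_agent(task: str, plan: Planner, tools: dict[str, Callable[[str], str]],
              max_steps: int = 10) -> list[str]:
    """Repeat plan -> act -> observe until the planner returns None."""
    observations: list[str] = []
    for _ in range(max_steps):
        action = plan(task, observations)      # Plan: choose the next tool
        if action is None:                     # The planner decided it is done
            break
        tool_name, argument = action
        result = tools[tool_name](argument)    # Act: run the chosen tool
        observations.append(result)            # Observe: record the result
    return observations


def toy_plan(task: str, observations: list[str]) -> Optional[Action]:
    # Search first, then save what was found, then stop.
    if not observations:
        return ("web_search", task)
    if len(observations) == 1:
        return ("write_file", observations[0])
    return None


# Stand-in tools that only return strings; they touch neither network nor disk.
toy_tools = {
    "web_search": lambda query: f"results for {query!r}",
    "write_file": lambda text: f"wrote {len(text)} chars",
}

history = run_agent("mint docs", toy_plan, toy_tools)
```

The `max_steps` cap is a design choice in this sketch: it stops a planner that never finishes from looping forever.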
5. Coding Agent & Workspace
The specialized engineering mode (mint code) is optimized for technical integrity and idiomatic code standards.
Autonomous Safety
Before any destructive change (shell commands or file edits), Mint will prompt for your y/n approval, ensuring you remain in control of your machine.
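An approval gate of this kind can be modelled as a wrapper that refuses to run an action until the user answers yes. The sketch below is an assumption about the general pattern, not Mint's code; `confirm` and `run_with_approval` are hypothetical names, and the `ask` parameter exists only so the prompt can be scripted in the example.

```python
from typing import Callable, Optional


def confirm(question: str, ask: Callable[[str], str] = input) -> bool:
    """Ask a y/n question; anything other than y/yes counts as a refusal."""
    return ask(f"{question} [y/n] ").strip().lower() in ("y", "yes")


def run_with_approval(description: str, action: Callable[[], str],
                      ask: Callable[[str], str] = input) -> Optional[str]:
    """Run `action` only after the user approves it; return None if declined."""
    if not confirm(f"Allow {description}?", ask):
        return None
    return action()


# Scripted answers stand in for a real terminal prompt in this example.
approved = run_with_approval("run_shell: rm -rf build", lambda: "done", ask=lambda _: "y")
declined = run_with_approval("run_shell: rm -rf build", lambda: "done", ask=lambda _: "n")
```

Treating every answer except an explicit yes as a refusal means a stray keypress or empty line never triggers a destructive change.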
6. Slash Commands In-Depth
In the interactive TUI, typing / opens the command menu. The commands in this menu let you control the agent's state directly.
chat, code, reviewer, or custom agents.
add, list, remove, or use to switch project contexts.
7. Autonomous Toolset
- web_search: Real-time internet access for up-to-date answers.
- open_url: Launches any website or URL in your system's default browser.
- open_app: Opens local applications (e.g., VS Code, Spotify, Slack) by name.
- open_file / open_folder: Opens files or directories using your system's default apps (e.g., Nautilus, Finder, Explorer).
- read_file / write_file: High-speed I/O operations with line range support.
- apply_patch: Surgical, non-destructive code edits using exact matching.
- run_shell: Executes terminal commands (Requires your y/n approval).
- find_path: Advanced fuzzy search to locate files and folders by name.
- system_automation: Controls hardware like Volume, Brightness, and Power.
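To make "line range support" and "exact matching" concrete, here is a minimal Python sketch of both ideas. It is not the code behind read_file or apply_patch; the helper names are hypothetical, and the rule that a patch must match exactly once is an assumption chosen to keep edits unambiguous.

```python
def read_lines(text: str, start: int, end: int) -> str:
    """Return lines start..end of `text` (1-indexed, inclusive)."""
    return "\n".join(text.splitlines()[start - 1:end])


def apply_exact_patch(text: str, old: str, new: str) -> str:
    """Replace `old` with `new` only when `old` occurs exactly once."""
    count = text.count(old)
    if count != 1:
        # Refuse to guess: zero matches or several matches leave the text untouched.
        raise ValueError(f"expected exactly one match, found {count}")
    return text.replace(old, new, 1)


source = "def greet():\n    print('helo')\n\ngreet()\n"
snippet = read_lines(source, 1, 2)
patched = apply_exact_patch(source, "print('helo')", "print('hello')")
```

Because the patch is refused unless the target text is found exactly once, a stale or ambiguous edit fails loudly instead of changing the wrong place.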
8. Desktop GUI Features
Screen Vision
Let Mint "see" your screen to analyze errors, translate UI elements, or provide context-aware help based on your active windows.
Proactive Engine
Background monitoring that offers suggestions before you even ask, identifying task optimizations in real-time.
9. Advanced MCP (Extensions)
The Model Context Protocol (MCP) is the universal standard for connecting AI agents to data sources and tools. Mint acts as a host that can orchestrate multiple MCP servers simultaneously.
Adding a New Server
Use the following template to connect any MCP-compatible server to Mint:
# Template
mint mcp add <name> <command> --args <args...> --env <KEY=VALUE>
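If you register servers from a script, the template above can be assembled programmatically. The Python sketch below only builds the command string and does not run it; `mcp_add_command` is a hypothetical helper, and it assumes the flags behave exactly as the template shows.

```python
import shlex
from typing import Mapping, Optional, Sequence


def mcp_add_command(name: str, command: str, args: Sequence[str] = (),
                    env: Optional[Mapping[str, str]] = None) -> str:
    """Build a `mint mcp add` command line following the template above."""
    argv = ["mint", "mcp", "add", name, command]
    if args:
        argv += ["--args", *args]
    if env:
        argv += ["--env", *(f"{key}={value}" for key, value in env.items())]
    # shlex.join quotes any token that contains spaces or shell metacharacters.
    return shlex.join(argv)


line = mcp_add_command(
    "my-db", "npx",
    args=["-y", "@modelcontextprotocol/server-sqlite", "/path/to/your/database.db"],
)
print(line)
# mint mcp add my-db npx --args -y @modelcontextprotocol/server-sqlite /path/to/your/database.db
```

Building the command from a list rather than by string concatenation keeps paths with spaces intact when the line is pasted into a shell.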
Common Extension Examples
Copy and run these commands in your terminal to extend Mint's capabilities:
Allows Mint to perform deep web searches beyond its basic internal browser.
$ mint mcp add google-search npx --args -y @modelcontextprotocol/server-google-search --env GOOGLE_API_KEY=YOUR_KEY GOOGLE_SEARCH_ENGINE_ID=YOUR_ID
Gives Mint the ability to query and analyze local SQLite databases.
$ mint mcp add my-db npx --args -y @modelcontextprotocol/server-sqlite /path/to/your/database.db
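Connects the MCP reference 'everything' server, which is handy for checking that an MCP connection works.
$ mint mcp add weather npx --args -y @modelcontextprotocol/server-everything
*The 'everything' server is a test server that exercises MCP features; it is not a dedicated weather service, so list its tools before relying on it for weather data.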
Allows Mint to perform advanced Git operations like branching and merging across folders.
$ mint mcp add git npx --args -y @modelcontextprotocol/server-git /path/to/repo
- mint mcp list: View all connected servers and their tools.
- mint mcp remove <name>: Disconnect a specific server.
- mint mcp clear: Wipe all extensions and start fresh.