hub / github.com/superagent-ai/grok-cli

github.com/superagent-ai/grok-cli @main sqlite

1,243 symbols 3,149 edges 146 files 4 documented · 0%

README

grok-cli: an open-source coding agent for the Grok API

Disclaimer: This project is community-built, open-source, and not affiliated with, endorsed by, or sponsored by xAI Corp. "Grok" is a trademark of xAI Corp. This tool uses the publicly available Grok API.

An open-source terminal coding agent that connects to xAI’s Grok API — real-time X search, web search, the full Grok model lineup, sub-agents on by default, remote control via Telegram (pair once, drive the agent from your phone while the CLI runs), and a terminal UI built with Bun and OpenTUI.

https://github.com/user-attachments/assets/7ca4f6df-50ca-4e9c-91b2-d4abad5c66cb

Install

curl -fsSL https://raw.githubusercontent.com/superagent-ai/grok-cli/main/install.sh | bash

Alternative installs (requires Bun on PATH):

bun add -g grok-dev

Self-management (script-installed only):

grok update
grok uninstall
grok uninstall --dry-run
grok uninstall --keep-config

Prerequisites: a Grok API key from x.ai and a modern terminal emulator for the interactive OpenTUI experience. Headless --prompt mode does not depend on terminal UI support. If you want host desktop automation via the built-in computer sub-agent, also enable Accessibility permission for your terminal app on macOS.

Run it

Interactive (default) — launches the OpenTUI coding agent:

grok

Supported terminals

For the most reliable interactive OpenTUI experience, use a modern terminal emulator. We currently document and recommend:

WezTerm (cross-platform)
Alacritty (cross-platform)
Ghostty (macOS and Linux)
Kitty (macOS and Linux)

Other modern terminals may work, but these are the terminal apps we currently recommend and document for interactive use.

Pick a project directory:

grok -d /path/to/your/repo

Headless — one prompt, then exit (scripts, CI, automation):

grok --prompt "run the test suite and summarize failures"
grok -p "show me package.json" --directory /path/to/project
grok --prompt "refactor X" --max-tool-rounds 30
grok --prompt "summarize the repo state" --format json
grok --prompt "review the repo overnight" --batch-api
grok --verify

--batch-api uses xAI's Batch API for lower-cost unattended runs. It is a good fit for scripts, CI, schedules, and other non-interactive workflows where a delayed result is fine.

Continue a saved session:

grok --session latest
grok -s <session-id>

Works in interactive mode too—same flag.

Structured headless output:

grok --prompt "summarize the repo state" --format json

--format json emits a newline-delimited JSON event stream instead of the default human-readable text output. Events are semantic, step-level records such as step_start, text, tool_use, step_finish, and error.

Computer sub-agent

Grok ships a built-in **computer** sub-agent backed by [agent-desktop](https://github.com/lahfir/agent-desktop) for host desktop automation on macOS.

Ask for it in natural language, for example:

grok "Use the computer sub-agent to take a screenshot of my host desktop and tell me what is open."
grok "Use the computer sub-agent to launch Google Chrome, snapshot the UI, and tell me which refs correspond to the address bar and tabs."

Notes:

Screenshots are saved under **.grok/computer/** by default.
The primary workflow is snapshot -> refs -> action -> snapshot using agent-desktop accessibility snapshots and stable refs like @e1.
computer_screenshot is available for visual confirmation, but the preferred path is computer_snapshot plus ref-based actions such as computer_click, computer_type, and computer_scroll.
macOS requires System Settings → Privacy & Security → Accessibility access for the terminal app running grok.
agent-desktop currently targets macOS.
If Bun blocks the native binary download during install, run:

node ./node_modules/agent-desktop/scripts/postinstall.js

Scheduling

Schedules let Grok run a headless prompt on a recurring schedule or once. Ask for it in natural language, for example:

Create a schedule named daily-changelog-update that runs every weekday at 9am
and updates CHANGELOG.md from the latest merged commits.

Recurring schedules require the background daemon:

grok daemon --background

Use /schedule in the TUI to browse saved schedules. One-time schedules start immediately in the background; recurring schedules keep running as long as the daemon is active.

List Grok models and pricing hints:

grok models

Pass an opening message without another prompt:

grok fix the flaky test in src/foo.test.ts

Generate images or short videos from chat:

grok "Generate a retro-futuristic logo for my CLI called Grok Forge"
grok "Edit ./assets/hero.png into a watercolor poster"
grok "Animate ./assets/cover.jpg into a 6 second cinematic push-in"

Image and video generation are exposed as agent tools inside normal chat sessions. You keep using a text model for the session, and Grok saves generated media under .grok/generated-media/ by default unless you ask for a specific output path.

What you actually get

Thing	What it means
Built for the Grok API	Defaults tuned for the xAI API; models like `grok-4.3`, `grok-4.20-non-reasoning`, `grok-4.20-multi-agent-0309`, plus current flagship and multi-agent variants—run `grok models` for the full menu.
X + web search	`search_x` and `search_web` tools—live posts and docs without pretending the internet stopped in 2023.
Media generation	Built-in `generate_image` and `generate_video` tools for text-to-image, image editing, text-to-video, and image-to-video flows. Generated files are saved locally so you can reuse them after the xAI URLs expire.
Sub-agents (default behavior)	Foreground `task` delegation (e.g. explore, general, or computer) plus background `delegate` for read-only deep dives—parallelize like you mean it.
Verify	`/verify` or `--verify` — inspects your app, builds, tests, boots it, and runs browser smoke checks in a sandboxed environment. Screenshots and video included.
Computer use	Built-in `computer` sub-agent for host desktop automation via `agent-desktop`. It prefers semantic accessibility snapshots and stable refs, with screenshots saved under `.grok/computer/` when requested.
Custom sub-agents	Define named agents with `subAgents` in `~/.grok/user-settings.json` and manage them from the TUI with `/agents`.
Remote control	Pair Telegram from the TUI (`/remote-control` → Telegram): DM your bot, `/pair`, approve the code in-terminal. Keep the CLI running while you ping it from your phone.
No “mystery meat” UI	OpenTUI React terminal UI—fast, keyboard-driven, not whatever glitchy thing you’re thinking of.
Skills	Agent Skills under `.agents/skills/<name>/SKILL.md` (project) or `~/.agents/skills/` (user). Use `/skills` in the TUI to list what’s installed.
MCPs	Extend with Model Context Protocol servers—configure via `/mcps` in the TUI or `.grok/settings.json` (`mcpServers`).
Sessions	Conversations persist; `--session latest` picks up where you left off.
Headless	`--prompt` / `-p` for non-interactive runs—pipe it, script it, bench it.
Hackable	TypeScript, clear agent loop, bash-first tools—fork it, shamelessly.

Coming soon

Deeper autonomous agent testing — persistent sandbox sessions, richer browser workflows, and stronger "prove it works" evidence.

API key (pick one)

Environment (good for CI):

export GROK_API_KEY=your_key_here

**.env** in the project (see .env.example if present):

GROK_API_KEY=your_key_here

CLI once:

grok -k your_key_here

Saved in user settings — ~/.grok/user-settings.json:

{ "apiKey": "your_key_here" }

Optional **subAgents** — custom foreground sub-agents. Each entry needs **name**, **model**, and **instruction**:

{
  "subAgents": [
    {
      "name": "security-review",
      "model": "grok-4.3",
      "instruction": "Prioritize security implications and suggest concrete fixes."
    }
  ]
}

Names cannot be general, explore, vision, verify, or computer because those are reserved for the built-in sub-agents.

Optional: **GROK_BASE_URL** (default https://api.x.ai/v1), **GROK_MODEL**, **GROK_MAX_TOKENS**.

Telegram (remote control) — short version

Create a bot with @BotFather, copy the token.
Set **TELEGRAM_BOT_TOKEN** or add **telegram.botToken** in ~/.grok/user-settings.json (the TUI **/remote-control** flow can save it).
Start **grok**, open **/remote-control** → Telegram if needed, then in Telegram DM your bot: **/pair, enter the 6-character code** in the terminal when asked.
First user must be approved once; after that, it’s remembered. Keep the CLI process running while you use the bot (long polling lives in that process).

Voice & audio messages

Send a voice note or audio attachment in Telegram and Grok will transcribe it with the Grok Speech-to-Text API (POST https://api.x.ai/v1/stt) before passing the text to the agent. The endpoint accepts Telegram's OGG/Opus voice notes and common audio containers (MP3, WAV, M4A, FLAC, AAC) directly — no local model download, whisper-cli, or ffmpeg required.

Prerequisites

A valid GROK_API_KEY (the same key used for the agent). Transcription reuses the CLI's apiKey / baseURL resolution, so if the agent can reach xAI, transcription will too.

Configure in `~/.grok/user-settings.json`

{
  "telegram": {
    "botToken": "YOUR_BOT_TOKEN",
    "audioInput": {
      "enabled": true,
      "language": "en"
    }
  }
}

Setting	Default	Description
`enabled`	`true`	Set to `false` to ignore voice/audio messages entirely.
`language`	`en`	Language code forwarded to `/v1/stt`. Enables Inverse Text Normalization (numbers, currencies, units → written form).

Optional headless flow when you do not want the TUI open:

grok telegram-bridge

Treat the bot token like a password.

Hooks

Hooks execute shell commands at key agent lifecycle events — enforce policies, run linters, trigger tests, or log activity.

Configure in ~/.grok/user-settings.json:

{
  "hooks": {
    "PreToolUse": [
      {
        "matcher": "bash",
        "hooks": [
          {
            "type": "command",
            "command": "./scripts/lint-before-edit.sh",
            "timeout": 10
          }
        ]
      }
    ]
  }
}

Hook commands receive JSON on stdin (event details) and can return JSON on stdout. Exit code 0 = success, 2 = block the action, other = non-blocking error.

Supported events: `PreToo

Extension points exported contracts — how you extend this code

SQLiteDatabase (Interface)

(no doc) [4 implementers]

src/storage/db.ts

McpToolBundle (Interface)

(no doc) [2 implementers]

src/mcp/runtime.ts

SQLiteQuery (Interface)

(no doc) [1 implementers]

src/bun-sqlite.d.ts

VerifyAgentLike (Interface)

(no doc) [1 implementers]

src/verify/orchestrator.ts

AudioTranscriptionEngine (Interface)

(no doc) [1 implementers]

src/audio/stt/engine.ts

TelegramBridgeOptions (Interface)

(no doc)

src/telegram/bridge.ts

GrepParams (Interface)

(no doc)

src/tools/grep.ts

LspClientOptions (Interface)

(no doc)

src/lsp/client.ts

Core symbols most depended-on inside this repo

src/verify/orchestrator.ts

getSandboxMutationBlockReason

src/utils/settings.ts

Shape

Function 762

Interface 237

Method 206

Class 38

Languages

TypeScript100%

Modules by API surface

src/agent/agent.ts114 symbols

src/ui/app.tsx77 symbols

src/tools/schedule.ts56 symbols

src/utils/settings.ts48 symbols

src/tools/computer.ts39 symbols

src/utils/install-manager.ts36 symbols

src/tools/bash.ts33 symbols

src/grok/batch.ts32 symbols

src/verify/recipes.ts28 symbols

src/lsp/client.ts28 symbols

src/agent/delegations.ts28 symbols

src/types/index.ts26 symbols

Dependencies from manifests, versioned

@ai-sdk/mcp1.0.25 · 1×

@ai-sdk/provider-utils4.0.21 · 1×

@ai-sdk/xai3.0.67 · 1×

@biomejs/biome2.4.8 · 1×

@coinbase/agentkit0.10.4 · 1×

@modelcontextprotocol/sdk1.27.1 · 1×

@npmcli/arborist9.4.2 · 1×

@opentui/core0.1.88 · 1×

@opentui/react0.1.88 · 1×

@types/diff8.0.0 · 1×

@types/node22.19.15 · 1×

@types/npmcli__arborist6.3.3 · 1×

For agents

$ claude mcp add grok-cli \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact

github.com/superagent-ai/grok-cli @main sqlite

grok-cli: an open-source coding agent for the Grok API

Install

Run it

Supported terminals

Computer sub-agent

Scheduling

What you actually get

Coming soon

API key (pick one)

Telegram (remote control) — short version

Voice & audio messages

Prerequisites

Configure in ~/.grok/user-settings.json

Hooks

Extension points exported contracts — how you extend this code

Core symbols most depended-on inside this repo

Shape

Languages

Modules by API surface

Dependencies from manifests, versioned

For agents

Configure in `~/.grok/user-settings.json`