Pi Coding Agent 扩展开发完全指南

Pi Coding Agent 扩展开发完全指南#

Pi Coding Agent 的扩展系统是其”自修改性”理念的核心载体。通过 TypeScript 扩展，你可以注册自定义工具、拦截事件流、定制 UI、修改系统提示——几乎可以对 Pi 的每一个环节进行编程控制。本文将全面介绍扩展系统的方方面面。

扩展是什么#

扩展是 TypeScript 模块，通过导出一个默认工厂函数来接收 ExtensionAPI 对象。Pi 使用 jiti 进行运行时加载，因此 TypeScript 无需预编译。

1
import type { ExtensionAPI } from "@earendil-works/pi-coding-agent";
2

3
export default function (pi: ExtensionAPI) {
4
  // 你的扩展逻辑
5
}

工厂函数也可以是异步的——Pi 会等待 async 初始化完成后再继续启动：

1
export default async function (pi: ExtensionAPI) {
2
  const response = await fetch("http://localhost:1234/v1/models");
3
  const payload = await response.json();
4
  pi.registerProvider("local", { /* ... */ });
5
}

核心能力一览#

能力	API	说明
自定义工具	`pi.registerTool()`	LLM 可调用的工具
事件拦截	`pi.on()`	拦截工具调用、修改消息、控制流程
自定义命令	`pi.registerCommand()`	注册 `/mycommand` 命令
快捷键	`pi.registerShortcut()`	注册键盘快捷键
CLI 标志	`pi.registerFlag()`	注册自定义命令行参数
提供商注册	`pi.registerProvider()`	注册或覆盖模型提供商
UI 组件	`ctx.ui.*`	状态栏、Widget、对话框、自定义编辑器等
消息注入	`pi.sendMessage()` / `pi.sendUserMessage()`	向会话注入消息

快速开始#

创建 ~/.pi/agent/extensions/my-extension.ts：

1
import type { ExtensionAPI } from "@earendil-works/pi-coding-agent";
2
import { Type } from "typebox";
3

4
export default function (pi: ExtensionAPI) {
5
  // 1. 监听会话启动事件
6
  pi.on("session_start", async (_event, ctx) => {
7
    ctx.ui.notify("扩展已加载！", "info");
8
  });
9

10
  // 2. 拦截危险命令
11
  pi.on("tool_call", async (event, ctx) => {
12
    if (event.toolName === "bash" && event.input.command?.includes("rm -rf")) {
13
      const ok = await ctx.ui.confirm("危险操作", "允许执行 rm -rf？");
14
      if (!ok) return { block: true, reason: "用户拒绝" };
15
    }
16
  });
17

18
  // 3. 注册自定义工具
19
  pi.registerTool({
20
    name: "greet",
21
    label: "打招呼",
22
    description: "按名字向某人问好",
23
    parameters: Type.Object({
24
      name: Type.String({ description: "名字" }),
25
    }),
26
    async execute(toolCallId, params, signal, onUpdate, ctx) {
27
      return {
28
        content: [{ type: "text", text: `你好, ${params.name}!` }],
29
        details: {},
30
      };
31
    },
32
  });
33

34
  // 4. 注册命令
35
  pi.registerCommand("hello", {
36
    description: "打个招呼",
37
    handler: async (args, ctx) => {
38
      ctx.ui.notify(`你好 ${args || "世界"}!`, "info");
39
    },
40
  });
41
}

测试运行：

1
# 方式一：命令行直接加载
2
pi -e ./my-extension.ts
3

4
# 方式二：放到自动发现目录（推荐，支持热重载）
5
cp my-extension.ts ~/.pi/agent/extensions/

扩展文件组织#

方式	目录结构	适用场景
单文件	`extensions/my-ext.ts`	简单扩展
目录	`extensions/my-ext/index.ts`	多文件扩展
带依赖	`extensions/my-ext/package.json` + `src/index.ts`	需要 npm 包

扩展存放位置#

位置	作用域
`~/.pi/agent/extensions/*.ts`	全局
`~/.pi/agent/extensions/*/index.ts`	全局（子目录）
`.pi/extensions/*.ts`	项目级
`.pi/extensions/*/index.ts`	项目级（子目录）

也可以在 settings.json 中指定额外路径：

1
{
2
  "extensions": ["/path/to/local/extension.ts"]
3
}

可用导入#

包	用途
`@earendil-works/pi-coding-agent`	扩展类型、事件、工具函数
`typebox`	工具参数的 Schema 定义
`@earendil-works/pi-ai`	`StringEnum`（Google API 兼容枚举）
`@earendil-works/pi-tui`	TUI 组件
`node:fs`, `node:path` 等	Node.js 内置模块

事件系统#

事件系统是扩展的核心机制。通过 pi.on(event, handler) 订阅事件，可以对 Pi 的每一步操作进行拦截、修改或响应。

生命周期概览#

1
pi 启动
2
  ├─► session_start { reason: "startup" }
3
  └─► resources_discover { reason: "startup" }
4

5
用户发送 prompt ───────────────────────────────┐
6
  │                                             │
7
  ├─► input (可拦截/转换/完全处理)               │
8
  ├─► before_agent_start (可注入消息/修改系统提示) │
9
  ├─► agent_start                               │
10
  │                                              │
11
  │   ┌── turn (循环直到 LLM 不再调用工具) ──┐    │
12
  │   │                                      │    │
13
  │   │  turn_start                          │    │
14
  │   │  context (可修改发送给 LLM 的消息)    │    │
15
  │   │  before_provider_request             │    │
16
  │   │                                      │    │
17
  │   │  LLM 响应，可能调用工具：             │    │
18
  │   │    tool_call (可阻止/修改参数)        │    │
19
  │   │    tool_result (可修改结果)           │    │
20
  │   │                                      │    │
21
  │   │  turn_end                            │    │
22
  │   └──────────────────────────────────────┘    │
23
  │                                              │
24
  └─► agent_end                                 │
25
                                                 │
26
用户再次发送 prompt ◄───────────────────────────┘

会话事件#

session_start#

会话启动、加载或重载时触发：

1
pi.on("session_start", async (event, ctx) => {
2
  // event.reason: "startup" | "reload" | "new" | "resume" | "fork"
3
  ctx.ui.notify(`会话已加载 (${event.reason})`, "info");
4
});

session_shutdown#

会话关闭前触发，用于清理工作：

1
pi.on("session_shutdown", async (event, ctx) => {
2
  // event.reason: "quit" | "reload" | "new" | "resume" | "fork"
3
  connection?.close();
4
});

session_before_switch / session_before_fork#

会话切换或分叉前触发，可取消：

1
pi.on("session_before_switch", async (event, ctx) => {
2
  if (event.reason === "new") {
3
    const ok = await ctx.ui.confirm("确认？", "清除所有消息？");
4
    if (!ok) return { cancel: true };
5
  }
6
});

Agent 事件#

before_agent_start#

每次用户提交 prompt 后、agent 循环开始前触发。可注入消息和修改系统提示：

1
pi.on("before_agent_start", async (event, ctx) => {
2
  return {
3
    // 注入持久化消息（存储在会话中，发送给 LLM）
4
    message: {
5
      customType: "my-extension",
6
      content: "额外的上下文信息",
7
      display: true,
8
    },
9
    // 修改系统提示（跨扩展链式修改）
10
    systemPrompt: event.systemPrompt + "\n\n额外的指令...",
11
  };
12
});

agent_start / agent_end#

每次用户 prompt 对应一个 agent_start / agent_end 对：

1
pi.on("agent_end", async (event, ctx) => {
2
  // event.messages - 本次 prompt 产生的所有消息
3
});

工具事件#

tool_call#

工具执行前触发。可阻止执行、可修改参数：

1
import { isToolCallEventType } from "@earendil-works/pi-coding-agent";
2

3
pi.on("tool_call", async (event, ctx) => {
4
  // 类型安全的参数访问
5
  if (isToolCallEventType("bash", event)) {
6
    // event.input.command 是类型安全的
7
    if (event.input.command.includes("rm -rf")) {
8
      return { block: true, reason: "危险命令被阻止" };
9
    }
10
    // 修改参数（原地修改）
11
    event.input.command = `source ~/.profile\n${event.input.command}`;
12
  }
13
});

关键行为保证：

对 event.input 的修改会影响实际工具执行
后续 tool_call 处理器能看到前面处理器的修改
不会在修改后重新校验

tool_result#

工具执行完成后触发。可修改结果，支持链式中间件模式：

1
pi.on("tool_result", async (event, ctx) => {
2
  // 可以对结果做后处理
3
  return { content: [...], details: {...}, isError: false };
4
});

上下文事件#

context#

每次 LLM 调用前触发，可以非破坏性地修改发送给 LLM 的消息：

1
pi.on("context", async (event, ctx) => {
2
  const filtered = event.messages.filter(m => !shouldPrune(m));
3
  return { messages: filtered };
4
});

输入事件#

input#

用户输入到达时触发（在扩展命令检查之后、技能/模板展开之前）：

1
pi.on("input", async (event, ctx) => {
2
  // 转换输入
3
  if (event.text.startsWith("?quick "))
4
    return { action: "transform", text: `简短回答: ${event.text.slice(7)}` };
5

6
  // 完全处理（不经过 LLM）
7
  if (event.text === "ping") {
8
    ctx.ui.notify("pong", "info");
9
    return { action: "handled" };
10
  }
11

12
  return { action: "continue" }; // 默认：传递给后续处理
13
});

处理结果：

continue — 原样传递（默认）
transform — 修改文本/图片后继续
handled — 跳过 agent 处理（第一个返回此值的处理器获胜）

模型事件#

model_select#

模型切换时触发（/model、Ctrl+P、会话恢复）：

1
pi.on("model_select", async (event, ctx) => {
2
  // event.model, event.previousModel, event.source ("set" | "cycle" | "restore")
3
});

thinking_level_select#

思考级别变化时触发（仅通知，返回值被忽略）：

1
pi.on("thinking_level_select", async (event, ctx) => {
2
  ctx.ui.setStatus("thinking", `思考级别: ${event.level}`);
3
});

自定义工具#

自定义工具是扩展最强大的能力之一。通过 pi.registerTool() 注册的工具会出现在系统提示中，LLM 可以像调用内置工具一样调用它们。

完整工具定义#

1
import { Type } from "typebox";
2
import { StringEnum } from "@earendil-works/pi-ai";
3

4
pi.registerTool({
5
  name: "my_tool",
6
  label: "我的工具",
7
  description: "工具描述（LLM 可见）",
8
  promptSnippet: "一句话描述工具功能",              // 出现在系统提示的 Available tools 中
9
  promptGuidelines: [                               // 工具级指引
10
    "当用户需要 X 时使用 my_tool 而不是直接编辑文件"
11
  ],
12
  parameters: Type.Object({
13
    action: StringEnum(["list", "add"] as const),   // 必须用 StringEnum！
14
    text: Type.Optional(Type.String()),
15
  }),
16
  prepareArguments(args) {
17
    // 可选：在 schema 校验前转换参数（用于向后兼容）
18
    return args;
19
  },
20
  async execute(toolCallId, params, signal, onUpdate, ctx) {
21
    // 检查取消
22
    if (signal?.aborted) {
23
      return { content: [{ type: "text", text: "已取消" }] };
24
    }
25

26
    // 流式进度更新
27
    onUpdate?.({
28
      content: [{ type: "text", text: "处理中..." }],
29
      details: { progress: 50 },
30
    });
31

32
    // 执行操作
33
    const result = await pi.exec("some-cmd", [], { signal });
34

35
    return {
36
      content: [{ type: "text", text: "完成" }],  // 发送给 LLM
37
      details: { data: result },                    // 用于 UI 渲染和状态持久化
38
      terminate: true,                              // 可选：提示跳过后续 LLM 调用
39
    };
40
  },
41

42
  // 可选：自定义渲染
43
  renderCall(args, theme, context) { /* ... */ },
44
  renderResult(result, options, theme, context) { /* ... */ },
45
});

重要注意事项#

1. 使用 StringEnum 而非 Type.Union#

1
// ✅ 正确 — 兼容所有 provider（包括 Google）
2
action: StringEnum(["list", "add"] as const)
3

4
// ❌ 错误 — Google API 不支持
5
action: Type.Union([Type.Literal("list"), Type.Literal("add")])

2. 错误处理：抛出异常而非返回值#

1
// ✅ 正确：抛出异常标记为错误
2
async execute(toolCallId, params) {
3
  if (!isValid(params.input)) {
4
    throw new Error(`无效输入: ${params.input}`);
5
  }
6
  return { content: [{ type: "text", text: "OK" }], details: {} };
7
}

3. 输出截断（必须！）#

工具输出超过 50KB / 2000 行会导致上下文溢出：

1
import { truncateHead, DEFAULT_MAX_BYTES, DEFAULT_MAX_LINES } from "@earendil-works/pi-coding-agent";
2

3
const output = await runCommand();
4
const truncation = truncateHead(output, {
5
  maxLines: DEFAULT_MAX_LINES,  // 2000
6
  maxBytes: DEFAULT_MAX_BYTES,  // 50KB
7
});
8

9
let result = truncation.content;
10
if (truncation.truncated) {
11
  result += `\n\n[输出已截断，完整内容保存在: ${tempFile}]`;
12
}

4. 文件变更安全：withFileMutationQueue#

当工具修改文件时，使用 withFileMutationQueue() 避免并行工具的竞态条件：

1
import { withFileMutationQueue } from "@earendil-works/pi-coding-agent";
2

3
async execute(_toolCallId, params, _signal, _onUpdate, ctx) {
4
  const absolutePath = resolve(ctx.cwd, params.path);
5

6
  return withFileMutationQueue(absolutePath, async () => {
7
    const current = await readFile(absolutePath, "utf8");
8
    const next = current.replace(params.oldText, params.newText);
9
    await writeFile(absolutePath, next, "utf8");
10
    return {
11
      content: [{ type: "text", text: `已更新 ${params.path}` }],
12
      details: {},
13
    };
14
  });
15
}

覆盖内置工具#

注册同名工具即可覆盖内置的 read、bash、edit、write、grep、find、ls：

1
pi.registerTool({
2
  name: "read",  // 同名覆盖
3
  label: "Read",
4
  description: "带日志记录的文件读取",
5
  parameters: Type.Object({ path: Type.String() }),
6
  async execute(toolCallId, params, signal, onUpdate, ctx) {
7
    console.log(`[读取] ${params.path}`);
8
    // 你的自定义实现...
9
    return { content: [{ type: "text", text: "..." }], details: {} };
10
  },
11
});

渲染是按槽位继承的——如果覆盖时省略了 renderCall，内置的渲染器仍然生效。

远程执行#

内置工具支持可插拔的操作接口，可以委托给远程系统：

1
import { createReadTool, createBashTool } from "@earendil-works/pi-coding-agent";
2

3
const remoteRead = createReadTool(cwd, {
4
  operations: {
5
    readFile: (path) => sshExec(remote, `cat ${path}`),
6
    access: (path) => sshExec(remote, `test -r ${path}`).then(() => {}),
7
  }
8
});

Bash 工具还支持 spawn hook，可以在执行前调整命令、工作目录和环境变量：

1
const bashTool = createBashTool(cwd, {
2
  spawnHook: ({ command, cwd, env }) => ({
3
    command: `source ~/.profile\n${command}`,
4
    cwd: `/mnt/sandbox${cwd}`,
5
    env: { ...env, CI: "1" },
6
  }),
7
});

自定义渲染#

工具可以提供 renderCall 和 renderResult 来自定义在终端中的显示效果：

1
import { Text } from "@earendil-works/pi-tui";
2

3
pi.registerTool({
4
  name: "my_tool",
5
  // ...
6

7
  renderCall(args, theme, context) {
8
    const text = (context.lastComponent as Text | undefined) ?? new Text("", 0, 0);
9
    let content = theme.fg("toolTitle", theme.bold("my_tool "));
10
    content += theme.fg("muted", args.action);
11
    text.setText(content);
12
    return text;
13
  },
14

15
  renderResult(result, { expanded, isPartial }, theme, context) {
16
    if (isPartial) {
17
      return new Text(theme.fg("warning", "处理中..."), 0, 0);
18
    }
19
    let text = theme.fg("success", "✓ 完成");
20
    if (expanded && result.details?.items) {
21
      for (const item of result.details.items) {
22
        text += "\n  " + theme.fg("dim", item);
23
      }
24
    }
25
    return new Text(text, 0, 0);
26
  },
27
});

自定义 UI#

扩展可以通过 ctx.ui 提供的方法与用户交互，并自定义消息/工具的渲染方式。

对话框#

1
// 选择
2
const choice = await ctx.ui.select("选择一项:", ["A", "B", "C"]);
3

4
// 确认
5
const ok = await ctx.ui.confirm("删除？", "此操作不可撤销");
6

7
// 文本输入
8
const name = await ctx.ui.input("名字:", "默认值");
9

10
// 多行编辑
11
const text = await ctx.ui.editor("编辑内容:", "预填文本");
12

13
// 通知（非阻塞）
14
ctx.ui.notify("操作完成！", "info");  // "info" | "warning" | "error"

带倒计时的对话框#

1
// 5 秒后自动取消
2
const confirmed = await ctx.ui.confirm(
3
  "限时确认",
4
  "此对话框将在 5 秒后自动取消。确认吗？",
5
  { timeout: 5000 }
6
);
7

8
if (confirmed) {
9
  // 用户确认
10
} else {
11
  // 用户取消或超时
12
}

1
// 底部状态栏（持续显示直到清除）
2
ctx.ui.setStatus("my-ext", "处理中...");
3
ctx.ui.setStatus("my-ext", undefined);  // 清除
4

5
// 工作指示器（流式输出时显示）
6
ctx.ui.setWorkingIndicator({
7
  frames: [
8
    ctx.ui.theme.fg("dim", "·"),
9
    ctx.ui.theme.fg("muted", "•"),
10
    ctx.ui.theme.fg("accent", "●"),
11
    ctx.ui.theme.fg("muted", "•"),
12
  ],
13
  intervalMs: 120,
14
});
15

16
// 编辑器上方 Widget
17
ctx.ui.setWidget("my-widget", ["状态行 1", "状态行 2"]);
18

19
// 编辑器下方 Widget
20
ctx.ui.setWidget("my-widget", ["行 1", "行 2"], { placement: "belowEditor" });
21

22
// 清除 Widget
23
ctx.ui.setWidget("my-widget", undefined);

完全替换内置的底部状态栏：

1
ctx.ui.setFooter((tui, theme) => ({
2
  render(width) {
3
    return [theme.fg("dim", `自定义 Footer | 宽度: ${width}`)];
4
  },
5
  invalidate() {},
6
}));
7

8
// 恢复内置 Footer
9
ctx.ui.setFooter(undefined);

自定义编辑器#

可以用自定义实现替换主输入编辑器（例如实现 Vim 模式）：

1
import { CustomEditor, type ExtensionAPI } from "@earendil-works/pi-coding-agent";
2

3
class VimEditor extends CustomEditor {
4
  private mode: "normal" | "insert" = "insert";
5

6
  handleInput(data: string): void {
7
    if (this.mode === "insert" && data === "escape") {
8
      this.mode = "normal";
9
      return;
10
    }
11
    if (this.mode === "normal" && data === "i") {
12
      this.mode = "insert";
13
      return;
14
    }
15
    super.handleInput(data);  // 保留 app 快捷键
16
  }
17
}
18

19
export default function (pi: ExtensionAPI) {
20
  pi.on("session_start", (_event, ctx) => {
21
    ctx.ui.setEditorComponent((_tui, theme, keybindings) =>
22
      new VimEditor(theme, keybindings)
23
    );
24
  });
25
}

也可以包装已有的自定义编辑器：

1
const previous = ctx.ui.getEditorComponent();
2
ctx.ui.setEditorComponent((tui, theme, keybindings) =>
3
  new MyEditor(tui, theme, keybindings, {
4
    base: previous?.(tui, theme, keybindings)
5
  })
6
);

自定义组件#

对于复杂的 UI 交互，使用 ctx.ui.custom() 临时替换编辑器：

1
import { Text } from "@earendil-works/pi-tui";
2

3
const result = await ctx.ui.custom<boolean>((tui, theme, keybindings, done) => {
4
  const text = new Text("按 Enter 确认，Escape 取消", 1, 1);
5

6
  text.onKey = (key) => {
7
    if (key === "return") done(true);
8
    if (key === "escape") done(false);
9
    return true;
10
  };
11

12
  return text;
13
});

还支持 Overlay 覆盖层模式（浮在现有内容之上，不清屏）：

1
const result = await ctx.ui.custom<string | null>(
2
  (tui, theme, keybindings, done) => new MyOverlayComponent({ onClose: done }),
3
  { overlay: true }
4
);

自定义消息渲染#

注册自定义渲染器来控制特定 customType 消息的显示：

1
pi.registerMessageRenderer("my-extension", (message, options, theme) => {
2
  const { expanded } = options;
3
  let text = theme.fg("accent", `[${message.customType}] `);
4
  text += message.content;
5

6
  if (expanded && message.details) {
7
    text += "\n" + theme.fg("dim", JSON.stringify(message.details, null, 2));
8
  }
9

10
  return new Text(text, 0, 0);
11
});

自动补全#

可以在内置的斜杠命令和路径补全之上叠加自定义补全逻辑：

1
ctx.ui.addAutocompleteProvider((current) => ({
2
  async getSuggestions(lines, cursorLine, cursorCol, options) {
3
    const line = lines[cursorLine] ?? "";
4
    const beforeCursor = line.slice(0, cursorCol);
5

6
    // 匹配 #1234 格式的 GitHub issue
7
    const match = beforeCursor.match(/(?:^|[ \t])#([^\s#]*)$/);
8
    if (!match) {
9
      return current.getSuggestions(lines, cursorLine, cursorCol, options);
10
    }
11

12
    return {
13
      prefix: `#${match[1] ?? ""}`,
14
      items: [
15
        { value: "#2983", label: "#2983", description: "扩展 API 示例" },
16
        { value: "#2753", label: "#2753", description: "重载资源配置" },
17
      ],
18
    };
19
  },
20
  applyCompletion(lines, cursorLine, cursorCol, item, prefix) {
21
    return current.applyCompletion(lines, cursorLine, cursorCol, item, prefix);
22
  },
23
  shouldTriggerFileCompletion(lines, cursorLine, cursorCol) {
24
    return current.shouldTriggerFileCompletion?.(lines, cursorLine, cursorCol) ?? true;
25
  },
26
}));

主题颜色#

所有渲染函数都会收到 theme 对象：

1
// 前景色
2
theme.fg("toolTitle", text)   // 工具名称
3
theme.fg("accent", text)      // 高亮
4
theme.fg("success", text)     // 成功（绿色）
5
theme.fg("error", text)       // 错误（红色）
6
theme.fg("warning", text)     // 警告（黄色）
7
theme.fg("muted", text)       // 次要文本
8
theme.fg("dim", text)         // 三级文本
9

10
// 文本样式
11
theme.bold(text)
12
theme.italic(text)
13
theme.strikethrough(text)

代码语法高亮：

1
import { highlightCode, getLanguageFromPath } from "@earendil-works/pi-coding-agent";
2

3
const lang = getLanguageFromPath("/path/to/file.rs");  // "rust"
4
const highlighted = highlightCode(code, lang, theme);

ExtensionAPI 速查#

核心方法#

方法	用途
`pi.on(event, handler)`	订阅事件
`pi.registerTool(def)`	注册自定义工具
`pi.registerCommand(name, opts)`	注册 `/命令`
`pi.registerShortcut(key, opts)`	注册快捷键
`pi.registerFlag(name, opts)`	注册 CLI 标志
`pi.registerProvider(name, config)`	注册/覆盖模型提供商
`pi.unregisterProvider(name)`	移除提供商

消息与状态#

方法	用途
`pi.sendMessage(msg, opts)`	注入自定义消息到会话
`pi.sendUserMessage(content, opts)`	发送用户消息
`pi.appendEntry(type, data)`	持久化扩展状态（不发给 LLM）
`pi.setSessionName(name)`	设置会话名称

运行时控制#

方法	用途
`pi.exec(cmd, args, opts)`	执行 shell 命令
`pi.getActiveTools()`	获取当前活跃工具列表
`pi.getAllTools()`	获取所有可用工具
`pi.setActiveTools(names)`	设置活跃工具
`pi.setModel(model)`	切换模型
`pi.getThinkingLevel()`	获取思考级别
`pi.setThinkingLevel(level)`	设置思考级别
`pi.events`	扩展间事件总线

消息投递模式#

sendMessage 和 sendUserMessage 支持 deliverAs 选项：

模式	行为
`"steer"`	当前 turn 结束后立即投递（默认）
`"followUp"`	等 agent 完全结束后投递
`"nextTurn"`	排队等待下一次用户 prompt

ExtensionContext#

所有事件处理器都会收到 ctx: ExtensionContext，提供运行时上下文：

属性/方法	说明
`ctx.ui`	UI 交互方法
`ctx.hasUI`	是否有 UI（打印/JSON 模式下为 false）
`ctx.cwd`	当前工作目录
`ctx.sessionManager`	只读的会话状态访问
`ctx.modelRegistry` / `ctx.model`	模型信息
`ctx.signal`	当前 agent 的 AbortSignal
`ctx.isIdle()`	agent 是否空闲
`ctx.abort()`	中断当前操作
`ctx.getContextUsage()`	获取上下文使用情况
`ctx.compact()`	触发上下文压缩
`ctx.getSystemPrompt()`	获取当前系统提示
`ctx.shutdown()`	请求优雅退出

命令专有方法#

命令处理器额外拥有 ExtensionCommandContext：

1
// 等待 agent 空闲
2
await ctx.waitForIdle();
3

4
// 创建新会话
5
await ctx.newSession({
6
  parentSession,
7
  setup: async (sm) => { /* 初始化新会话 */ },
8
  withSession: async (ctx) => { /* 在新会话中工作 */ },
9
});
10

11
// 从特定节点分叉
12
await ctx.fork("entry-id-123", { position: "before" });
13

14
// 导航到树中其他节点
15
await ctx.navigateTree("entry-id-456", { summarize: true });
16

17
// 切换到其他会话
18
await ctx.switchSession("/path/to/session.jsonl");
19

20
// 重载运行时
21
await ctx.reload();

状态管理#

扩展的状态应存储在工具结果的 details 中，以支持会话分支：

1
export default function (pi: ExtensionAPI) {
2
  let items: string[] = [];
3

4
  // 从会话恢复状态
5
  pi.on("session_start", async (_event, ctx) => {
6
    items = [];
7
    for (const entry of ctx.sessionManager.getBranch()) {
8
      if (entry.type === "message" && entry.message.role === "toolResult") {
9
        if (entry.message.toolName === "my_tool") {
10
          items = entry.message.details?.items ?? [];
11
        }
12
      }
13
    }
14
  });
15

16
  pi.registerTool({
17
    name: "my_tool",
18
    async execute(toolCallId, params, signal, onUpdate, ctx) {
19
      items.push("新项目");
20
      return {
21
        content: [{ type: "text", text: "已添加" }],
22
        details: { items: [...items] },  // 持久化到会话
23
      };
24
    },
25
  });
26
}

也可以使用 pi.appendEntry() 存储不参与 LLM 上下文的状态：

1
// 存储
2
pi.appendEntry("my-state", { count: 42 });
3

4
// 恢复
5
pi.on("session_start", async (_event, ctx) => {
6
  for (const entry of ctx.sessionManager.getEntries()) {
7
    if (entry.type === "custom" && entry.customType === "my-state") {
8
      // 从 entry.data 重建状态
9
    }
10
  }
11
});

自定义提供商#

扩展可以动态注册或覆盖模型提供商：

1
pi.registerProvider("my-proxy", {
2
  name: "My Proxy",
3
  baseUrl: "https://proxy.example.com",
4
  apiKey: "$PROXY_API_KEY",  // 环境变量引用
5
  api: "anthropic-messages",
6
  models: [
7
    {
8
      id: "claude-sonnet-4-20250514",
9
      name: "Claude 4 Sonnet (proxy)",
10
      reasoning: false,
11
      input: ["text", "image"],
12
      cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
13
      contextWindow: 200000,
14
      maxTokens: 16384,
15
    }
16
  ]
17
});

还可以覆盖已有提供商的 baseUrl，或注册带 OAuth 支持的提供商。

Skill 技能系统#

Skill 是 Markdown 格式的按需能力包，遵循 Agent Skills 标准。与扩展不同，Skill 不需要编写代码——它是给 LLM 的指令文档。

Skill 存放位置#

位置	作用域
`~/.pi/agent/skills/`	全局
`~/.agents/skills/`	全局
`.pi/skills/`	项目级
`.agents/skills/`（cwd 及父目录）	项目级

Skill 结构#

1
my-skill/
2
├── SKILL.md              # 必须：前置元数据 + 指令
3
├── scripts/              # 辅助脚本（可选）
4
└── references/           # 详细文档（可选）

SKILL.md 格式#

1
---
2
name: my-skill
3
description: 这个技能做什么以及何时使用。要具体描述。
4
---
5

6
# 我的技能
7

8
## 环境准备
9

10
首次使用前运行：
11
\`\`\`bash
12
cd /path/to/skill && npm install
13
\`\`\`
14

15
## 使用方法
16

17
\`\`\`bash
18
./scripts/process.sh <input>
19
\`\`\`

前置元数据字段#

字段	必填	说明
`name`	✅	1-64字符，小写字母+数字+连字符
`description`	✅	最多1024字符，描述用途和触发条件
`license`	❌	许可证
`compatibility`	❌	环境要求
`disable-model-invocation`	❌	`true` 时需手动 `/skill:name` 加载

渐进披露#

Skill 的关键设计是渐进披露：

只有 description 常驻系统提示上下文
完整的 SKILL.md 内容按需加载（Agent 用 read 工具读取）
这减少了 token 开销

使用 Skill#

1
# 命令行触发
2
/skill:my-skill
3

4
# 带参数
5
/skill:my-skill 参数1 参数2
6

7
# Agent 也会根据描述自动判断何时加载

通过 settings.json 配置 Skill 路径#

1
{
2
  "skills": ["~/.claude/skills", "/path/to/custom-skill"]
3
}

思考级别#

Pi 支持 6 个思考级别，控制模型在回答前的”深度思考”程度：

级别	说明
`off`	关闭思考
`minimal`	最少思考
`low`	低度思考
`medium`	中度思考
`high`	高度思考
`xhigh`	超高度思考

配置方式#

交互模式：按 Shift+Tab 循环切换

命令行：

1
pi --thinking high "解决这个复杂问题"
2
pi --model sonnet:high "复杂任务"  # 模型+思考级别简写

配置文件：

1
{
2
  "defaultThinkingLevel": "high",
3
  "thinkingBudgets": {
4
    "minimal": 1024,
5
    "low": 4096,
6
    "medium": 10240,
7
    "high": 32768
8
  }
9
}

通过扩展：

1
pi.setThinkingLevel("high");

非推理模型（如 GPT-4o）不支持思考，始终为 off。

设置系统#

Pi 使用 JSON 配置文件，项目级覆盖全局：

位置	作用域
`~/.pi/agent/settings.json`	全局
`.pi/settings.json`	项目级（覆盖全局，嵌套对象合并）

关键配置项#

1
{
2
  "defaultProvider": "anthropic",
3
  "defaultModel": "claude-sonnet-4-20250514",
4
  "defaultThinkingLevel": "medium",
5
  "theme": "dark",
6
  "packages": ["pi-skills"],
7
  "extensions": ["/path/to/extension.ts"],
8
  "skills": ["~/.claude/skills"],
9
  "compaction": {
10
    "enabled": true,
11
    "reserveTokens": 16384,
12
    "keepRecentTokens": 20000
13
  },
14
  "enabledModels": ["claude-*", "gpt-4o"]
15
}

60+ 实战示例索引#

Pi 内置了丰富的示例扩展，覆盖几乎所有使用场景。

工具类#

示例	说明	关键 API
`hello.ts`	最简工具注册	`registerTool`
`question.ts`	带用户交互的工具	`registerTool`, `ui.select`
`questionnaire.ts`	多步向导工具	`registerTool`, `ui.custom`
`todo.ts`	有状态工具 + 持久化	`registerTool`, `appendEntry`, `renderResult`
`dynamic-tools.ts`	运行时动态注册工具	`registerTool`, `session_start`
`structured-output.ts`	终止型工具	`registerTool`, `terminate: true`
`truncated-tool.ts`	输出截断	`registerTool`, `truncateHead`
`tool-override.ts`	覆盖内置工具	`registerTool`（同名）
`ssh.ts`	SSH 远程执行	`registerFlag`, 工具操作
`subagent/`	子代理	`registerTool`, `exec`

命令与 UI 类#

示例	说明	关键 API
`pirate.ts`	修改系统提示	`registerCommand`, `before_agent_start`
`summarize.ts`	对话摘要	`registerCommand`, `ui.custom`
`handoff.ts`	跨 provider 交接	`registerCommand`, `ctx.newSession`
`qna.ts`	Q&A 交互	`registerCommand`, `ui.custom`, `setEditorText`
`custom-footer.ts`	自定义 Footer	`registerCommand`, `setFooter`
`custom-header.ts`	自定义头部	`session_start`, `setHeader`
`modal-editor.ts`	Vim 模态编辑器	`setEditorComponent`, `CustomEditor`
`widget-placement.ts`	Widget 放置	`setWidget`
`overlay-test.ts`	Overlay 组件	`ui.custom`, overlay options
`github-issue-autocomplete.ts`	GitHub Issue 补全	`addAutocompleteProvider`

事件与安全类#

示例	说明	关键 API
`permission-gate.ts`	阻止危险命令	`on("tool_call")`, `ui.confirm`
`protected-paths.ts`	保护文件路径	`on("tool_call")`
`confirm-destructive.ts`	确认破坏性操作	`on("session_before_*")`
`dirty-repo-guard.ts`	Git 脏仓库警告	`on("session_before_*")`, `exec`
`input-transform.ts`	输入转换	`on("input")`
`model-status.ts`	模型切换状态	`on("model_select")`, `setStatus`

Git 集成类#

示例	说明	关键 API
`git-checkpoint.ts`	每轮 git stash	`on("turn_start")`, `exec`
`auto-commit-on-exit.ts`	退出时自动提交	`on("session_shutdown")`, `exec`
`git-merge-and-resolve.ts`	合并并解决冲突	`on("agent_end")`, `sendUserMessage`

复杂扩展#

示例	说明	关键 API
`plan-mode/`	完整 Plan Mode 实现	全部事件类型、命令、快捷键、标志
`preset.ts`	可保存的预设	`setModel`, `setActiveTools`, `setThinkingLevel`
`snake.ts`	贪吃蛇游戏	`ui.custom`, 键盘处理
`space-invaders.ts`	太空侵略者	`ui.custom`
`doom-overlay/`	Doom 覆盖层	`ui.custom`, overlay
`sandbox/`	沙箱执行	工具操作
`custom-provider-anthropic/`	自定义 Anthropic 代理	`registerProvider`
`custom-provider-gitlab-duo/`	GitLab Duo 集成	`registerProvider`, OAuth

常见模式#

模式一：权限控制#

在工具执行前弹出确认对话框：

1
pi.on("tool_call", async (event, ctx) => {
2
  if (isToolCallEventType("bash", event)) {
3
    const cmd = event.input.command;
4
    if (cmd.includes("rm -rf") || cmd.includes("sudo")) {
5
      const ok = await ctx.ui.confirm("危险操作", `允许执行: ${cmd}?`);
6
      if (!ok) return { block: true, reason: "用户拒绝" };
7
    }
8
  }
9
});

模式二：系统提示增强#

每轮动态修改系统提示：

1
pi.on("before_agent_start", async (event, ctx) => {
2
  const branch = ctx.sessionManager.getBranch();
3
  const turnCount = branch.filter(e => e.type === "message" && e.message?.role === "user").length;
4

5
  return {
6
    systemPrompt: event.systemPrompt + `\n\n当前对话轮次: ${turnCount}`,
7
  };
8
});

模式三：工具动态切换#

根据条件启用或禁用工具：

1
pi.on("agent_start", async (_event, ctx) => {
2
  const usage = ctx.getContextUsage();
3
  if (usage && usage.tokens > 80_000) {
4
    // 上下文快满了，只保留必要工具
5
    pi.setActiveTools(["read", "bash", "edit"]);
6
  }
7
});

模式四：自动提交#

Agent 结束时自动 git commit：

1
pi.on("agent_end", async (event, ctx) => {
2
  const status = await pi.exec("git", ["status", "--porcelain"]);
3
  if (status.stdout.trim()) {
4
    await pi.exec("git", ["add", "-A"]);
5
    const lastMsg = event.messages.findLast(m => m.role === "assistant");
6
    const msg = lastMsg?.content?.[0]?.text?.slice(0, 72) || "auto: agent changes";
7
    await pi.exec("git", ["commit", "-m", msg]);
8
    ctx.ui.notify("已自动提交更改", "info");
9
  }
10
});

错误处理#

场景	处理方式
扩展错误	记录日志，agent 继续
`tool_call` 错误	阻止工具执行（fail-safe）
工具 `execute` 错误	必须用 `throw` 信号化；错误被捕获后以 `isError: true` 报告给 LLM

非交互模式行为#

模式	UI 方法	说明
交互模式	完整 TUI	正常操作
RPC (`--mode rpc`)	JSON 协议	客户端处理 UI
JSON (`--mode json`)	无操作	事件流输出到 stdout
打印 (`-p`)	无操作	扩展运行但无法弹窗

在非交互模式下，使用 ctx.hasUI 检查是否有 UI 可用。

总结#

Pi 的扩展系统遵循其”自修改性”核心理念——Pi 不内置的功能（Plan Mode、权限控制、子代理、MCP、Todo……），都可以通过扩展来实现。20+ 个生命周期钩子、完整的 UI 组件系统、按需加载的 Skill 机制，加上 TypeScript 无编译热重载的开发体验，让 Pi 成为一个真正”可编程”的编码 Agent。

正如 Pi 作者 Mario Zechner 所说：

“如果我不需要它，它就不会被构建。”

但这并不意味着你不能拥有它——只需要一个 TypeScript 文件，就能让 Pi 变成你想要的任何样子。