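TypeScript for AI Agents

In AI agent development, TypeScript is becoming the language of choice for application-layer work. This article looks at TypeScript's advantages in AI agent applications, how to use the core libraries, type-safe tool calling, and practical architecture design.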

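1. Why AI Agent Projects Need TypeScript + Python

Modern AI agent projects often adopt a two-language architecture, a division of labor that plays to the strengths of each language.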

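Python's Role: The Model Layer

Python dominates the model layer of AI agents thanks to its rich ecosystem:

AI/ML ecosystem: frameworks such as PyTorch, TensorFlow, transformers, and LangChain provide a solid foundation for model training and inference
Data processing: list comprehensions, NumPy, and Pandas make data processing concise and efficient
Glue language: Python integrates easily with lower-level languages such as C/C++ and Rust, so it can call high-performance computing libraries
Gentle learning curve: Python has a low barrier to entry for AI researchers
Most complete API SDKs: mainstream platforms such as OpenAI and Hugging Face support Python first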

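TypeScript's Role: The Application Layer

TypeScript plays a role at the application layer that is hard to replace:

Type safety: static type checking and IDE autocompletion catch errors at compile time and reduce runtime exceptions
npm ecosystem: libraries such as the Vercel AI SDK and LangChain.js provide mature tooling
Frontend/backend unification: sharing type definitions, rather than coupling business logic, keeps types consistent without adding complexity
Node.js runtime: a mature asynchronous I/O model that suits concurrent tool calls well
VS Code ecosystem: backed by Microsoft, with good Copilot support and a strong refactoring and debugging experience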

┌────────────────────────────────────────────────────────────────────────────────────┐
│                               AI Agent Architecture                                │
├────────────────────────────────────────────────────────────────────────────────────┤
│  Presentation  │  Frontend UI, web services                  │  TypeScript         │
├────────────────────────────────────────────────────────────────────────────────────┤
│  Application   │  Tool calls, state management, type safety  │  TypeScript         │
├────────────────────────────────────────────────────────────────────────────────────┤
│  Control       │  Agent logic, tool orchestration            │  Python/TypeScript  │
├────────────────────────────────────────────────────────────────────────────────────┤
│  Model         │  LLM calls, RAG, vector retrieval           │  Python             │
└────────────────────────────────────────────────────────────────────────────────────┘

2. Core TypeScript Libraries

2.1 LangChain.js / LangGraph.js

LangChain.js is the TypeScript version of LangChain and provides a rich set of abstractions for building agents.

import { createReactAgent } from '@langchain/langgraph/prebuilt';
import { ChatAnthropic } from '@langchain/anthropic';

// webSearchTool and calculatorTool are assumed to be LangChain tools defined elsewhere.
// Create a ReAct agent; replace the model name with an ID available to your account.
const agent = createReactAgent({
  llm: new ChatAnthropic({ model: 'claude-3-5-sonnet' }),
  tools: [webSearchTool, calculatorTool],
});

// Stream the output
const stream = await agent.stream({
  messages: [{ role: 'user', content: "Check today's weather for me" }],
});

for await (const chunk of stream) {
  console.log(chunk);
}

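LangGraph.js, in turn, offers finer-grained state management, which suits complex multi-step agent flows.

2.2 Vercel AI SDK

The Vercel AI SDK is a TypeScript-first framework designed for modern AI applications, with a clean abstraction for tool calling.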

import { anthropic } from '@ai-sdk/anthropic';
import { generateText, tool } from 'ai';
import { z } from 'zod';

const { text, toolCalls } = await generateText({
  model: anthropic('claude-3-5-sonnet'),
  prompt: 'Check the weather in Beijing for me',
  tools: {
    getWeather: tool({
      description: 'Get weather information',
      // Written against the AI SDK 4.x API; newer major versions
      // rename this key to `inputSchema`.
      parameters: z.object({
        city: z.string(),
        unit: z.enum(['celsius', 'fahrenheit']).optional().default('celsius'),
      }),
      execute: async ({ city, unit }) => {
        // Call a real weather API here; this returns placeholder data
        return { temp: 22, weather: 'sunny', unit };
      },
    }),
  },
});

console.log(text);
console.log(toolCalls);

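The core strengths of the Vercel AI SDK are its concise API design and out-of-the-box streaming support.

3. Type-Safe Tool Calling

Tool calls are the bridge between an agent and the outside world, so type safety matters a great deal at this step.

3.1 Defining Tool Schemas with Zod

Zod is a TypeScript-first schema validation library that works well with TypeScript's type inference.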

import { z } from 'zod';

// Define tool schemas; types are inferred automatically
const tools = {
  search: {
    description: 'Search the web',
    parameters: z.object({
      query: z.string(),
      limit: z.number().optional().default(10),
      site: z.string().optional(),
    }),
  },
  calculator: {
    description: 'Evaluate a math expression',
    parameters: z.object({
      expression: z.string().describe('The math expression to evaluate'),
    }),
  },
  weather: {
    description: 'Get the weather for a city',
    parameters: z.object({
      city: z.string(),
      days: z.number().min(1).max(7).optional().default(1),
    }),
  },
} as const;

// Infer types from the schemas
type ToolName = keyof typeof tools;
type ToolParams<T extends ToolName> = z.infer<typeof tools[T]['parameters']>;

// Execute a tool with IDE autocompletion and type checking.
// The generic parameter ties `params` to the tool selected by `name`.
// performSearch, evaluateExpression, and fetchWeather are assumed to be defined elsewhere.
async function executeTool<T extends ToolName>(name: T, params: ToolParams<T>) {
  // Validate with the matching tool's schema so each branch gets a precise type
  switch (name) {
    case 'search':
      return await performSearch(tools.search.parameters.parse(params));
    case 'calculator': {
      const { expression } = tools.calculator.parameters.parse(params);
      return await evaluateExpression(expression);
    }
    case 'weather': {
      const { city, days } = tools.weather.parameters.parse(params);
      return await fetchWeather(city, days);
    }
    default:
      throw new Error(`Unknown tool: ${String(name)}`);
  }
}
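
To make the idea behind `z.infer` more concrete, the following dependency-free sketch shows how a schema object can carry both a runtime validator and a static type. It is a simplified stand-in and not how Zod is implemented; `Schema`, `Infer`, `str`, `num`, and `obj` are hypothetical names introduced only for this illustration.

```typescript
// A schema pairs a runtime parser with a static output type T.
interface Schema<T> {
  parse(input: unknown): T;
}

// Extract the static type carried by a schema (the role z.infer plays in Zod).
type Infer<S> = S extends Schema<infer T> ? T : never;

// String schema: rejects anything that is not a string.
const str = (): Schema<string> => ({
  parse(input) {
    if (typeof input !== 'string') throw new Error('Expected string');
    return input;
  },
});

// Number schema with an optional default used when the field is missing.
const num = (fallback?: number): Schema<number> => ({
  parse(input) {
    if (input === undefined && fallback !== undefined) return fallback;
    if (typeof input !== 'number' || Number.isNaN(input)) {
      throw new Error('Expected number');
    }
    return input;
  },
});

// Object schema built from a record of field schemas.
function obj<F extends Record<string, Schema<unknown>>>(
  fields: F,
): Schema<{ [K in keyof F]: Infer<F[K]> }> {
  const fieldSchemas: Record<string, Schema<unknown>> = fields;
  return {
    parse(input) {
      if (typeof input !== 'object' || input === null) {
        throw new Error('Expected object');
      }
      const source = input as Record<string, unknown>;
      const out: Record<string, unknown> = {};
      for (const key of Object.keys(fieldSchemas)) {
        out[key] = fieldSchemas[key].parse(source[key]);
      }
      return out as { [K in keyof F]: Infer<F[K]> };
    },
  };
}

const searchParams = obj({ query: str(), limit: num(10) });
type SearchParams = Infer<typeof searchParams>; // { query: string; limit: number }

const parsed: SearchParams = searchParams.parse({ query: 'typescript agents' });
console.log(parsed.limit); // 10, filled in from the default
```

The point of the design is that the type is derived from the schema rather than written twice, so the validator and the type cannot drift apart.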

3.2 Typed Handling of LLM Tool Calls

When the LLM returns tool call requests, we need to convert the raw output into a type-safe structure.

// A raw tool call as returned by the LLM; nothing is validated yet
interface ToolCallRequest {
  name: string;
  params: unknown;
}

interface ToolCallResult {
  request: ToolCallRequest;
  result: unknown;
  error?: string;
}

// Handle the tool calls returned by the LLM
async function handleToolCalls(
  llmOutput: unknown
): Promise<ToolCallResult[]> {
  const results: ToolCallResult[] = [];

  // Assumes llmOutput contains a tool_calls array; extractToolCalls is an
  // assumed helper that returns ToolCallRequest[]
  const toolCalls = extractToolCalls(llmOutput);

  for (const call of toolCalls) {
    try {
      // Type guard: make sure the tool name is valid
      if (!isValidToolName(call.name)) {
        throw new Error(`Unknown tool: ${call.name}`);
      }

      // The cast only satisfies the compiler; executeTool validates params with Zod
      const result = await executeTool(call.name, call.params as ToolParams<ToolName>);
      results.push({ request: call, result });
    } catch (error) {
      results.push({
        request: call,
        result: null,
        error: error instanceof Error ? error.message : 'Unknown error',
      });
    }
  }

  return results;
}

// Type guard; Object.hasOwn avoids matching inherited keys such as "toString"
function isValidToolName(name: string): name is ToolName {
  return Object.hasOwn(tools, name);
}

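This type-safe design brings three main benefits: compile-time checks reduce runtime errors, IDE autocompletion speeds up development, and refactoring surfaces every affected reference automatically.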
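
One way to see the refactoring benefit is an exhaustive `switch` over a discriminated union. The sketch below is a minimal illustration, assuming a hand-written `ToolCall` union and a hypothetical `assertNever` helper; adding a fourth tool to the union without handling it makes the `default` branch fail to compile.

```typescript
// Each tool call is one variant of a discriminated union keyed by `name`.
type ToolCall =
  | { name: 'search'; params: { query: string; limit: number } }
  | { name: 'calculator'; params: { expression: string } }
  | { name: 'weather'; params: { city: string; days: number } };

// Accepts only `never`, so passing an unhandled variant is a compile-time error.
function assertNever(value: never): never {
  throw new Error(`Unhandled tool call: ${JSON.stringify(value)}`);
}

function describeCall(call: ToolCall): string {
  switch (call.name) {
    case 'search':
      return `search "${call.params.query}" (top ${call.params.limit})`;
    case 'calculator':
      return `evaluate ${call.params.expression}`;
    case 'weather':
      return `weather in ${call.params.city} for ${call.params.days} day(s)`;
    default:
      // Reached only if a new variant is added and not handled above.
      return assertNever(call);
  }
}

const description = describeCall({
  name: 'weather',
  params: { city: 'Beijing', days: 3 },
});
console.log(description); // weather in Beijing for 3 day(s)
```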

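4. Multi-Agent Orchestration

Complex AI agent systems often need several specialized agents working together, which raises the problem of multi-agent orchestration.

4.1 Agent Definition and Registration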

import { z } from 'zod';

interface Tool {
  name: string;
  description: string;
  parameters: z.ZodType;
  execute: (params: unknown) => Promise<unknown>;
}

interface SubAgent {
  name: string;
  description: string;
  systemPrompt: string;
  tools: Tool[];
  model?: string;
}

class AgentRegistry {
  private agents: Map<string, SubAgent> = new Map();

  register(agent: SubAgent): void {
    if (this.agents.has(agent.name)) {
      throw new Error(`Agent ${agent.name} already registered`);
    }
    this.agents.set(agent.name, agent);
  }

  get(name: string): SubAgent | undefined {
    return this.agents.get(name);
  }

  list(): SubAgent[] {
    return Array.from(this.agents.values());
  }

  // Returns all agents ranked from most to least relevant
  findBestMatch(userInput: string): SubAgent[] {
    // Rank by description similarity or by an LLM's judgment;
    // calculateRelevance is an assumed scoring helper defined elsewhere
    return this.list().sort((a, b) =>
      calculateRelevance(userInput, b.description) -
      calculateRelevance(userInput, a.description)
    );
  }
}
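
The registry above leaves `calculateRelevance` undefined. As a rough sketch of what such a helper could look like, the version below scores keyword overlap (Jaccard similarity) between the user input and an agent description. This is an assumption made for illustration: it only handles ASCII text, and a production system would more likely use embeddings or an LLM call. The `tokenize` helper and the sample agents are hypothetical.

```typescript
// Lowercase the text and split it into a set of alphanumeric tokens.
function tokenize(text: string): Set<string> {
  return new Set(text.toLowerCase().split(/[^a-z0-9]+/).filter(Boolean));
}

// Jaccard similarity: shared tokens divided by the size of the union (0 to 1).
function calculateRelevance(userInput: string, description: string): number {
  const inputTokens = tokenize(userInput);
  const descTokens = tokenize(description);
  if (inputTokens.size === 0 || descTokens.size === 0) return 0;

  let shared = 0;
  inputTokens.forEach(token => {
    if (descTokens.has(token)) shared++;
  });
  return shared / (inputTokens.size + descTokens.size - shared);
}

const sampleAgents = [
  { name: 'math', description: 'Solves math and arithmetic problems' },
  { name: 'weather', description: 'Answers weather forecast questions' },
];

// Copy before sorting so the original list keeps its order.
const ranked = sampleAgents
  .slice()
  .sort(
    (a, b) =>
      calculateRelevance('what is the weather forecast', b.description) -
      calculateRelevance('what is the weather forecast', a.description),
  );

console.log(ranked.map(agent => agent.name));
```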

4.2 Orchestrator Implementation

interface OrchestrationResult {
  agentName: string;
  response: string;
  toolCalls: ToolCallResult[];
  reasoning: string;
}

// Shape of an agent's reply, inferred from how it is used below
interface AgentResponse {
  content: string;
  toolCalls: ToolCallResult[];
}

// Hypothetical supervisor interface: any LLM wrapper that returns a structured
// routing decision. It stands in for a concrete library class.
interface SupervisorDecision {
  agentName: string;
  reasoning: string;
}

interface Supervisor {
  predictMessages(
    messages: { role: 'user'; content: string }[]
  ): Promise<SupervisorDecision>;
}

class AgentOrchestrator {
  private registry: AgentRegistry;
  private supervisor: Supervisor;

  constructor(registry: AgentRegistry, supervisor: Supervisor) {
    this.registry = registry;
    this.supervisor = supervisor;
  }

  async route(userInput: string): Promise<OrchestrationResult> {
    // 1. Ask the supervisor which agent to use
    const decision = await this.supervisor.predictMessages([{
      role: 'user',
      content: `User input: ${userInput}\nAvailable agents: ${this.listAgents()}`,
    }]);

    // 2. Resolve the decision to a registered agent
    const targetAgent = this.parseDecision(decision);

    // 3. Call the target agent
    const response = await this.callAgent(targetAgent, userInput);

    return {
      agentName: targetAgent.name,
      response: response.content,
      toolCalls: response.toolCalls,
      reasoning: decision.reasoning,
    };
  }

  private listAgents(): string {
    return this.registry.list()
      .map(a => `- ${a.name}: ${a.description}`)
      .join('\n');
  }

  private parseDecision(decision: SupervisorDecision): SubAgent {
    // Look up the agent named in the supervisor's decision
    const agent = this.registry.get(decision.agentName);

    if (!agent) {
      throw new Error(`Agent not found: ${decision.agentName}`);
    }

    return agent;
  }

  // Public so that helpers outside the class can call an agent directly
  async callAgent(
    agent: SubAgent,
    userInput: string
  ): Promise<AgentResponse> {
    const context = await this.buildContext(agent, userInput);
    return this.executeAgent(agent, context);
  }

  // Placeholders: context building and execution depend on the chosen LLM client
  protected async buildContext(agent: SubAgent, userInput: string): Promise<unknown> {
    throw new Error('Not implemented');
  }

  protected async executeAgent(agent: SubAgent, context: unknown): Promise<AgentResponse> {
    throw new Error('Not implemented');
  }
}

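4.3 Concurrent Agent Calls

In some scenarios we may need to call several agents at the same time and combine their results.

// `registry` and `orchestrator` are assumed to be module-level instances of
// AgentRegistry and AgentOrchestrator, and callAgent must be publicly accessible.
async function concurrentAgents(
  userInput: string,
  agentNames: string[]
): Promise<OrchestrationResult[]> {
  // Skip names that are not registered
  const agents = agentNames
    .map(name => registry.get(name))
    .filter((a): a is SubAgent => a !== undefined);

  // Run all agents concurrently; Promise.all rejects as soon as any agent fails
  const results = await Promise.all(
    agents.map(agent => orchestrator.callAgent(agent, userInput))
  );

  // Combine the results, preserving the input order
  return results.map((response, index) => ({
    agentName: agents[index].name,
    response: response.content,
    toolCalls: response.toolCalls,
    reasoning: 'Concurrent execution',
  }));
}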
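
Because `Promise.all` rejects on the first failure, one slow or broken agent can discard the results of all the others. A possible alternative, sketched here under the assumption that each agent can be reduced to a plain async function, uses `Promise.allSettled` to keep partial results. `AgentFn`, `AgentOutcome`, and `runAgentsSettled` are hypothetical names and not part of the orchestrator above.

```typescript
// Simplified stand-in for an agent: takes the user input, resolves to a reply.
type AgentFn = (input: string) => Promise<string>;

interface AgentOutcome {
  agentName: string;
  ok: boolean;
  response?: string;
  error?: string;
}

async function runAgentsSettled(
  agents: Record<string, AgentFn>,
  userInput: string,
): Promise<AgentOutcome[]> {
  const names = Object.keys(agents);

  // allSettled waits for every promise and never rejects itself.
  const settled = await Promise.allSettled(
    names.map(agentName => agents[agentName](userInput)),
  );

  // Results come back in input order, so indexes line up with `names`.
  return settled.map((result, index): AgentOutcome =>
    result.status === 'fulfilled'
      ? { agentName: names[index], ok: true, response: result.value }
      : {
          agentName: names[index],
          ok: false,
          error:
            result.reason instanceof Error
              ? result.reason.message
              : String(result.reason),
        },
  );
}

// Usage: one agent succeeds, one fails, and both outcomes are reported.
const outcomesPromise = runAgentsSettled(
  {
    echo: async input => `echo: ${input}`,
    broken: async () => {
      throw new Error('boom');
    },
  },
  'hello',
);
outcomesPromise.then(outcomes => console.log(outcomes));
```

Whether partial results are acceptable depends on the task; when every agent's answer is required, failing fast with `Promise.all` may still be the better choice.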

5. Memory and State Management

An agent's memory system is key to continuous conversation and context understanding.

5.1 Memory Type Definitions

type MemoryType = 'short_term' | 'long_term' | 'vector';

interface MemoryMetadata {
  timestamp: Date;
  importance: number;      // 0-1, importance score of the memory
  accessCount: number;     // number of times the memory was accessed
  source: 'user' | 'agent' | 'tool';
  tags?: string[];
}

interface MemoryEntry {
  id: string;
  type: MemoryType;
  content: string;
  embedding?: number[];    // used for vector retrieval
  metadata: MemoryMetadata;
}

interface ConversationContext {
  sessionId: string;
  messages: Message[];     // Message is assumed to come from the project's message types
  memories: MemoryEntry[];
  createdAt: Date;
  updatedAt: Date;
}
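
The `short_term` memory type implies a store that keeps only the most recent entries. A minimal sketch of such a store is shown below, assuming simple oldest-first eviction; `BoundedBuffer` is a hypothetical name, and the method names are chosen to mirror how a short-term buffer would typically be used (`add`, `getRecent`, `getAll`, `clear`).

```typescript
// Fixed-capacity buffer that evicts the oldest item when full.
class BoundedBuffer<T> {
  private items: T[] = [];
  private readonly limit: number;

  constructor(limit: number) {
    if (!Number.isInteger(limit) || limit <= 0) {
      throw new Error('limit must be a positive integer');
    }
    this.limit = limit;
  }

  add(item: T): void {
    this.items.push(item);
    if (this.items.length > this.limit) {
      this.items.shift(); // drop the oldest entry
    }
  }

  // Return up to `count` of the newest items, oldest first.
  getRecent(count: number): T[] {
    // slice(-0) would return everything, so guard non-positive counts.
    return count <= 0 ? [] : this.items.slice(-count);
  }

  // Return a copy so callers cannot mutate internal state.
  getAll(): T[] {
    return this.items.slice();
  }

  clear(): void {
    this.items = [];
  }
}

const buffer = new BoundedBuffer<string>(3);
['a', 'b', 'c', 'd'].forEach(entry => buffer.add(entry));
console.log(buffer.getAll());     // [ 'b', 'c', 'd' ]
console.log(buffer.getRecent(2)); // [ 'c', 'd' ]
```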

5.2 Layered Memory Implementation

// MemoryBuffer, VectorStore, MemoryConfig, and generateId are assumed to be
// defined elsewhere in the project.
class AgentMemory {
  private shortTerm: MemoryBuffer;
  private longTerm: VectorStore;
  private workingMemory: Map<string, unknown>;

  constructor(config: MemoryConfig) {
    this.shortTerm = new MemoryBuffer(config.shortTermLimit);
    this.longTerm = new VectorStore(config.vectorDimensions);
    this.workingMemory = new Map();
  }

  // Add a memory
  async add(
    content: string,
    type: MemoryType,
    metadata: Partial<MemoryMetadata> = {}
  ): Promise<MemoryEntry> {
    const entry: MemoryEntry = {
      id: generateId(),
      type,
      content,
      // Everything that goes into the vector store needs an embedding to be searchable
      embedding: type === 'short_term' ? undefined : await this.embed(content),
      metadata: {
        timestamp: new Date(),
        importance: metadata.importance ?? 0.5,
        accessCount: 0,
        source: metadata.source ?? 'agent',
        tags: metadata.tags,
      },
    };

    if (type === 'short_term') {
      this.shortTerm.add(entry);
    } else {
      // 'long_term' and 'vector' entries share the vector store
      await this.longTerm.add(entry);
    }

    return entry;
  }

  // Retrieve memories
  async retrieve(
    query: string,
    limit: number = 5,
    type?: MemoryType
  ): Promise<MemoryEntry[]> {
    if (type === 'short_term') {
      return this.shortTerm.getRecent(limit);
    }

    // 'long_term', 'vector', or no type: search the vector store by similarity
    const queryEmbedding = await this.embed(query);
    return this.longTerm.similaritySearch(queryEmbedding, limit);
  }

  // Forget low-importance memories
  async prune(minImportance: number = 0.3): Promise<number> {
    const toRemove = await this.longTerm.query({
      filter: { importance: { $lt: minImportance } },
    });

    for (const entry of toRemove) {
      await this.longTerm.delete(entry.id);
    }

    return toRemove.length;
  }

  // Summarize short-term memories into a long-term memory
  async summarize(): Promise<void> {
    const recent = this.shortTerm.getAll();
    const summary = await this.summarizeContent(recent.map(e => e.content));

    await this.add(summary, 'long_term', {
      importance: 0.8,
      source: 'agent',
      tags: ['summary'],
    });

    this.shortTerm.clear();
  }

  // Placeholder: call an embedding model here
  protected async embed(text: string): Promise<number[]> {
    throw new Error('Not implemented');
  }

  // Placeholder: call an LLM to summarize the given contents
  protected async summarizeContent(contents: string[]): Promise<string> {
    throw new Error('Not implemented');
  }
}
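
The `similaritySearch` call above is delegated to a vector store. To show what that step involves, here is a small in-memory sketch that ranks stored vectors by cosine similarity. It is an assumption for illustration only: `StoredVector` and the free function `similaritySearch` are hypothetical, and a real vector database would use an approximate index instead of scoring every entry.

```typescript
interface StoredVector {
  id: string;
  embedding: number[];
}

// Cosine similarity: dot product divided by the product of the vector norms.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error('Dimension mismatch');
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denominator = Math.sqrt(normA) * Math.sqrt(normB);
  return denominator === 0 ? 0 : dot / denominator;
}

// Brute-force search: score every entry, sort descending, keep the top `limit`.
function similaritySearch(
  entries: StoredVector[],
  queryEmbedding: number[],
  limit: number,
): StoredVector[] {
  return entries
    .map(entry => ({
      entry,
      score: cosineSimilarity(entry.embedding, queryEmbedding),
    }))
    .sort((x, y) => y.score - x.score)
    .slice(0, limit)
    .map(scored => scored.entry);
}

const vectorEntries: StoredVector[] = [
  { id: 'a', embedding: [1, 0] },
  { id: 'b', embedding: [0, 1] },
  { id: 'c', embedding: [0.9, 0.1] },
];

const nearest = similaritySearch(vectorEntries, [1, 0], 2);
console.log(nearest.map(entry => entry.id)); // [ 'a', 'c' ]
```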

5.3 State Persistence

interface PersistedState {
  version: number;
  sessions: Record<string, ConversationContext>;
  memories: MemoryEntry[];
  lastUpdated: string;
}

// Assumed storage abstraction (file, database, or key-value store)
interface StateStorage {
  load(): Promise<PersistedState | null>;
  save(state: PersistedState): Promise<void>;
}

class PersistentAgentState {
  private storage: StateStorage;
  private currentSession: ConversationContext | null = null;

  constructor(storage: StateStorage) {
    this.storage = storage;
  }

  // Load the saved state, or start from an empty one
  private async loadState(): Promise<PersistedState> {
    return (await this.storage.load()) ?? {
      version: 1,
      sessions: {},
      memories: [],
      lastUpdated: new Date().toISOString(),
    };
  }

  async saveSession(session: ConversationContext): Promise<void> {
    const state = await this.loadState();
    state.sessions[session.sessionId] = session;
    state.lastUpdated = new Date().toISOString();
    await this.storage.save(state);
  }

  // Note: if the storage serializes to JSON, Date fields come back as strings
  async loadSession(sessionId: string): Promise<ConversationContext | null> {
    const state = await this.loadState();
    return state.sessions[sessionId] ?? null;
  }

  async createSession(): Promise<ConversationContext> {
    const session: ConversationContext = {
      sessionId: generateId(),
      messages: [],
      memories: [],
      createdAt: new Date(),
      updatedAt: new Date(),
    };

    await this.saveSession(session);
    this.currentSession = session;
    return session;
  }
}
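
One detail that the persistence code has to handle is that `Date` fields do not survive a JSON round trip: `JSON.stringify` turns them into ISO strings, and `JSON.parse` leaves them as strings. The sketch below shows one way to convert explicitly in both directions, assuming a JSON-backed store. `SessionTimestamps`, `SerializedSession`, `serializeSession`, and `deserializeSession` are hypothetical names, and only the date-bearing fields are modeled.

```typescript
// In-memory shape: dates are Date objects.
interface SessionTimestamps {
  sessionId: string;
  createdAt: Date;
  updatedAt: Date;
}

// On-disk shape: dates are ISO 8601 strings.
interface SerializedSession {
  sessionId: string;
  createdAt: string;
  updatedAt: string;
}

function serializeSession(session: SessionTimestamps): string {
  const plain: SerializedSession = {
    sessionId: session.sessionId,
    createdAt: session.createdAt.toISOString(),
    updatedAt: session.updatedAt.toISOString(),
  };
  return JSON.stringify(plain);
}

function deserializeSession(json: string): SessionTimestamps {
  const raw = JSON.parse(json) as SerializedSession;
  const createdAt = new Date(raw.createdAt);
  const updatedAt = new Date(raw.updatedAt);
  // new Date('garbage') yields an invalid date whose time value is NaN.
  if (Number.isNaN(createdAt.getTime()) || Number.isNaN(updatedAt.getTime())) {
    throw new Error('Invalid date in persisted session');
  }
  return { sessionId: raw.sessionId, createdAt, updatedAt };
}

const originalSession: SessionTimestamps = {
  sessionId: 's1',
  createdAt: new Date('2024-01-02T03:04:05.000Z'),
  updatedAt: new Date('2024-01-02T04:00:00.000Z'),
};

const restoredSession = deserializeSession(serializeSession(originalSession));
console.log(restoredSession.createdAt instanceof Date); // true
```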

6. Project Structure

A typical TypeScript AI agent project should follow a clear, modular architecture.

src/
├── agents/
│   ├── base.ts              # Agent base class, defines the common interface
│   ├── react-agent.ts       # ReAct pattern implementation
│   ├── planner-agent.ts     # Planner agent
│   ├── executor-agent.ts    # Executor agent
│   └── index.ts             # Barrel export
├── tools/
│   ├── registry.ts          # Tool registry
│   ├── web-search.ts        # Web search tool
│   ├── calculator.ts        # Calculator tool
│   ├── weather.ts           # Weather lookup tool
│   └── types.ts             # Tool type definitions
├── memory/
│   ├── buffer.ts            # Short-term memory buffer
│   ├── vector-store.ts      # Vector store implementation
│   ├── persistent.ts        # Persistent storage
│   └── index.ts
├── orchestration/
│   ├── router.ts            # Agent routing logic
│   ├── supervisor.ts        # Supervisor
│   └── concurrent.ts        # Concurrent orchestration
├── types/
│   ├── messages.ts          # Message type definitions
│   ├── tools.ts             # Tool type definitions
│   └── agent.ts             # Agent-related types
├── utils/
│   ├── logger.ts            # Logging utilities
│   ├── embedding.ts         # Embedding utilities
│   └── validation.ts        # Validation utilities
├── config/
│   └── index.ts             # Configuration management
├── index.ts                 # Entry point
└── app.ts                   # Application startup

Core Modules

agents/base.ts provides the abstract base class for all agents:

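// Tool, AgentMemory, Context, AgentResponse, and Action are assumed to be
// imported from the project's type modules.
abstract class BaseAgent {
  protected name: string;
  protected description: string;
  protected tools: Tool[];
  protected memory: AgentMemory;

  constructor(name: string, description: string, tools: Tool[], memory: AgentMemory) {
    this.name = name;
    this.description = description;
    this.tools = tools;
    this.memory = memory;
  }

  abstract execute(input: string, context?: Context): Promise<AgentResponse>;

  protected async think(input: string): Promise<string> {
    // The agent's reasoning logic; subclasses override this
    throw new Error('Not implemented');
  }

  protected async act(input: string): Promise<Action> {
    // The agent's action logic; subclasses override this
    throw new Error('Not implemented');
  }
}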

tools/registry.ts implements type-safe tool registration:

import { z } from 'zod';

class ToolRegistry {
  private tools: Map<string, Tool> = new Map();

  register<T extends z.ZodType>(
    name: string,
    description: string,
    schema: T,
    execute: (params: z.infer<T>) => Promise<unknown>
  ): void {
    this.tools.set(name, {
      name,
      description,
      parameters: schema,
      // Validate untyped input at the boundary before calling the typed handler
      execute: async (params: unknown) => execute(schema.parse(params)),
    });
  }

  get(name: string): Tool | undefined {
    return this.tools.get(name);
  }

  list(): Tool[] {
    return Array.from(this.tools.values());
  }

  getSchema(name: string): z.ZodType | undefined {
    return this.tools.get(name)?.parameters;
  }
}

7. Python vs. TypeScript: The Agent Control Layer

At the agent control layer, Python and TypeScript each have strengths and weaknesses, and the choice depends on the scenario.

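Aspect	Python	TypeScript
Complex state management	Errors surface at runtime	Errors surface at compile time
Tool schema changes	Can fail silently	Flagged by the compiler
Multi-agent routing	Handled manually	Type guards
Frontend integration	Extra HTTP layer	Direct calls with shared types
IDE and refactoring	Moderate support	Strong VS Code support
Concurrent tool calls	asyncio is more verbose	Promise.all is concise
Runtime performance	Excellent (native ML libraries)	Good
Startup speed	Slower	Fast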

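When Python Fits

Experimental work related to model training and fine-tuning
Deep customization of LangChain/LangGraph internals
Tasks dominated by data processing and feature engineering
Teams that work mainly in Python and have no frontend background

When TypeScript Fits

Scenarios that need unified frontend and backend types
Complex multi-agent systems that need compile-time checks
Production environments that need long-term maintenance and refactoring
Integration with frontend frameworks such as React or Vue
Projects that emphasize developer experience and code reliability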

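8. Frontend/Backend Unification Is Not Coupling

TypeScript's "unification" is often misread as coupling the frontend to the backend, a misconception worth clearing up.

The Right Way to Think About It

Frontend/backend unification means sharing type definitions, not coupling business logic: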

Traditional architecture:
┌──────────────┐       HTTP        ┌──────────────┐
│   Frontend   │ ────────────────→ │   Backend    │
│  (TS/React)  │ ←──────────────── │   (Python)   │
└──────────────┘                   └──────────────┘
                                          ↓
                     Type definitions must be maintained twice

Unified TypeScript architecture:
┌──────────────┐       HTTP        ┌──────────────┐
│   Frontend   │ ────────────────→ │   Backend    │
│  (TS/React)  │ ←──────────────── │  (TS/Node)   │
└──────────────┘                   └──────────────┘
       │                                  │
       └─── shared types @shared/types ───┘
            (npm package only, no business logic)

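Implementation

packages/
└── @shared/
    └── types/
        ├── agent.ts       # Agent-related types
        ├── tool.ts        # Tool type definitions
        ├── message.ts     # Message type definitions
        └── index.ts       # Barrel export

# frontend and backend are deployed independently
# they communicate only over HTTP/API
# the shared types package keeps the interfaces consistent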

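The frontend and backend remain independent services with independent deployments; they simply both use TypeScript and share type definitions through an npm package.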
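
As a small illustration of sharing types without sharing logic, the sketch below puts a request/response contract in one place and uses it from both sides. Everything here is hypothetical (`ChatRequest`, `ChatResponse`, `isChatRequest`, `handleChat`, `buildChatRequest`), and the three parts are shown in one file only for brevity; in a real project the contract would live in the shared package while the handler and the client code stay in their own services.

```typescript
// --- shared types package: contract only, no business logic ---
interface ChatRequest {
  sessionId: string;
  message: string;
}

interface ChatResponse {
  reply: string;
  toolCalls: { name: string; durationMs: number }[];
}

// Runtime guard: types vanish at runtime, so the backend still validates input.
function isChatRequest(value: unknown): value is ChatRequest {
  if (typeof value !== 'object' || value === null) return false;
  const candidate = value as Record<string, unknown>;
  return (
    typeof candidate.sessionId === 'string' &&
    typeof candidate.message === 'string'
  );
}

// --- backend service: handles a parsed HTTP body ---
function handleChat(body: unknown): ChatResponse {
  if (!isChatRequest(body)) {
    throw new Error('Invalid ChatRequest');
  }
  return { reply: `You said: ${body.message}`, toolCalls: [] };
}

// --- frontend: builds the request body against the same contract ---
function buildChatRequest(sessionId: string, message: string): string {
  const request: ChatRequest = { sessionId, message };
  return JSON.stringify(request);
}

// Simulate the HTTP hop with a JSON round trip.
const chatResponse = handleChat(JSON.parse(buildChatRequest('s1', 'hello')));
console.log(chatResponse.reply); // You said: hello
```

If a field in `ChatRequest` is renamed, both `buildChatRequest` and `handleChat` stop compiling until they are updated, which is the consistency guarantee the shared package is meant to provide.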

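9. Summary and Recommendations

Technology Selection

Scenario	Recommended approach	Rationale
All-Python team	Python (LangGraph)	No learning cost, complete ecosystem
Frontend involved, type safety needed	TypeScript	Unified frontend and backend, compile-time checks
Rapid prototyping to validate ideas	Python	Fast development, quick iteration
Production, safe refactoring needed	TypeScript	Compile-time checks, confidence when refactoring
Complex multi-agent system, strong typing	TypeScript	Type guards, multi-agent coordination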

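Best Practices

Type safety first: define every tool schema with Zod and make full use of TypeScript's type inference
Modular design: separate agents, tools, memory, and orchestration logic by responsibility so they are easy to test and maintain
Layered memory: short-term memory handles the current session, and long-term memory supports knowledge that accumulates across sessions
Error boundaries: add error handling around every tool call and agent execution to avoid cascading failures
Observability: integrate logging and tracing to make agent behavior easier to debug and problems easier to locate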
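
The last two practices can be combined in a single wrapper. The sketch below, a minimal illustration rather than a full tracing setup, runs a tool inside an error boundary, records how long it took, and returns a result object instead of throwing. `ToolResult` and `safeExecute` are hypothetical names, and the `log` parameter defaults to `console.log` only as a placeholder for a real logger.

```typescript
// Result type: callers must check `ok` before reading `value` or `error`.
type ToolResult<T> =
  | { ok: true; value: T; durationMs: number }
  | { ok: false; error: string; durationMs: number };

async function safeExecute<T>(
  toolName: string,
  run: () => Promise<T>,
  log: (line: string) => void = console.log,
): Promise<ToolResult<T>> {
  const start = Date.now();
  try {
    const value = await run();
    const durationMs = Date.now() - start;
    log(`[tool:${toolName}] ok in ${durationMs}ms`);
    return { ok: true, value, durationMs };
  } catch (err) {
    // The failure is contained here, so one bad tool cannot crash the agent loop.
    const durationMs = Date.now() - start;
    const error = err instanceof Error ? err.message : String(err);
    log(`[tool:${toolName}] failed in ${durationMs}ms: ${error}`);
    return { ok: false, error, durationMs };
  }
}

// Usage: collect log lines in memory to show what would be emitted.
const logLines: string[] = [];
const collect = (line: string) => {
  logLines.push(line);
};

const demoRun = Promise.all([
  safeExecute('calculator', async () => 1 + 1, collect),
  safeExecute('weather', async (): Promise<number> => {
    throw new Error('upstream timeout');
  }, collect),
]);
demoRun.then(results => console.log(results, logLines));
```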

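Future Trends

As AI agent applications mature, TypeScript's advantages in this space are likely to become more pronounced:

Large companies such as Microsoft are pushing TypeScript adoption in AI
More AI-native TypeScript libraries are emerging
Demand for deep integration between frontends and agents is growing
Type-safe tool calling is becoming an industry norm

TypeScript is not only a choice for frontend development; it is also an important foundation for building reliable, maintainable AI agent applications.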