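Context Catch

Context Catch is a context-management technique in Claude Code. When the context window is close to full, it compresses older content selectively so that the important information is kept.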

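Background

Claude Code context window limit

Token limit (for example, 200K tokens)

Used:
├───────────────────────────────────────────────────────────────────────────┤
│ System prompt │ Message history │ Code context │ Tool output │ Free space │
└───────────────────────────────────────────────────────────────────────────┘
                                                               ↑
                                                         Almost full!

Keep adding content → the window overflows → early context is lost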

How Context Catch works

┌────────────────────────────────────────────────────────────────┐
│                     Context Catch workflow                     │
├────────────────────────────────────────────────────────────────┤
│                                                                │
│  Step 1: Measure context usage                                 │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │  When usage exceeds 80%, trigger the catch strategy      │  │
│  └──────────────────────────────────────────────────────────┘  │
│                          ↓                                     │
│  Step 2: Identify compressible content                         │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │  ❌ Tool execution results (can be re-run)               │  │
│  │  ❌ Repeated log output                                  │  │
│  │  ❌ Minor intermediate steps                             │  │
│  │  ✅ Key decision points                                  │  │
│  │  ✅ Explicit user requirements                           │  │
│  │  ✅ Core content of code changes                         │  │
│  └──────────────────────────────────────────────────────────┘  │
│                          ↓                                     │
│  Step 3: Compress / summarize                                  │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │  "Search results saved; moving to the next step..."      │  │
│  │  "Completed code changes: [File A, File B]..."          │  │
│  └──────────────────────────────────────────────────────────┘  │
│                          ↓                                     │
│  Step 4: Free up space and continue working                    │
│                                                                │
└────────────────────────────────────────────────────────────────┘
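The four steps above can be sketched roughly as follows. This is only an illustration under stated assumptions: the `ContextEntry` shape, the `EntryKind` labels, and the fixed 5-token cost of a placeholder are hypothetical and do not come from Claude Code itself.

```typescript
// Hypothetical types for illustration; none of these names come from Claude Code.
type EntryKind =
  | "tool_result"
  | "log"
  | "intermediate"
  | "decision"
  | "user_requirement"
  | "code_change";

interface ContextEntry {
  kind: EntryKind;
  text: string;
  tokens: number;
}

// Step 2: the kinds marked as compressible in the workflow.
const COMPRESSIBLE: ReadonlySet<EntryKind> = new Set<EntryKind>([
  "tool_result",
  "log",
  "intermediate",
]);

// Step 1: trigger once usage exceeds 80% of the window.
function shouldCatch(entries: ContextEntry[], windowTokens: number): boolean {
  const used = entries.reduce((sum, e) => sum + e.tokens, 0);
  return used > windowTokens * 0.8;
}

// Steps 3-4: keep protected entries verbatim and replace each run of
// compressible entries with a single short placeholder.
function catchContext(entries: ContextEntry[]): ContextEntry[] {
  const result: ContextEntry[] = [];
  let run = 0;
  const flush = (): void => {
    if (run > 0) {
      // The 5-token cost of a placeholder is an arbitrary assumption.
      result.push({ kind: "intermediate", text: `[${run} entries summarized]`, tokens: 5 });
      run = 0;
    }
  };
  for (const entry of entries) {
    if (COMPRESSIBLE.has(entry.kind)) {
      run += 1;
    } else {
      flush();
      result.push(entry);
    }
  }
  flush();
  return result;
}
```

A real summarizer would write a meaningful summary instead of a counter; the point here is only the split between content that is dropped and content that survives.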

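Effect in practice

Before compression (raw record)

[11:30] User: Help me build a user signup feature
[11:31] Claude: I'll build this feature...
[11:31] Tool: search existing user-related code
[11:31] Tool: found 5 related files
[11:32] Tool: read user.ts
[11:32] Tool: read auth.ts
[11:32] Tool: read database.ts
[11:33] Claude: Analyzed the existing code structure
[11:33] Claude: Starting the implementation...
[11:34] Tool: create user-service.ts
[11:35] Tool: update user.ts to add new fields
[11:36] Claude: The user signup feature is done

After Context Catch compression

[11:30-11:36] User signup feature completed
- Created: user-service.ts
- Modified: user.ts
- Key requirement: username, email, and password fields
- Status: runnable, not yet tested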
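As a rough sketch of how a record like the one above could be folded into a few summary lines, the function below groups tool events by action. The `ToolEvent` shape and the `summarizeEvents` name are assumptions made for this example; a real summary would also need the requirement and status lines, which cannot be derived from tool events alone.

```typescript
// Hypothetical event shape for illustration; not a Claude Code data structure.
interface ToolEvent {
  time: string; // "HH:MM"
  action: "search" | "read" | "create" | "update";
  target: string;
}

// Fold a chronological list of tool events into a header line plus
// one line per kind of file change. Reads and searches are dropped.
function summarizeEvents(title: string, events: ToolEvent[]): string[] {
  const first = events[0];
  const last = events[events.length - 1];
  if (first === undefined || last === undefined) return [];

  const uniqueTargets = (action: ToolEvent["action"]): string[] =>
    Array.from(new Set(events.filter((e) => e.action === action).map((e) => e.target)));

  const created = uniqueTargets("create");
  const modified = uniqueTargets("update");

  const lines = [`[${first.time}-${last.time}] ${title}`];
  if (created.length > 0) lines.push(`- Created: ${created.join(", ")}`);
  if (modified.length > 0) lines.push(`- Modified: ${modified.join(", ")}`);
  return lines;
}
```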

Memory layers in Claude Code

┌────────────────────────────────────────────────────────────────┐
│                   Claude Code memory system                    │
├────────────────────────────────────────────────────────────────┤
│                                                                │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          Working Memory (current context)                │  │
│  │  • Full context of the current session                   │  │
│  │  • All tool calls and results                            │  │
│  │  • Code change history                                   │  │
│  └──────────────────┬───────────────────────────────────────┘  │
│                     │  Context Catch                           │
│                     ↓  Compress / summarize                    │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          Compressed Context                              │  │
│  │  • Summary of key decision points                        │  │
│  │  • Summary of completed and in-progress work             │  │
│  │  • State of important files                              │  │
│  └──────────────────┬───────────────────────────────────────┘  │
│                     │                                          │
│                     ↓                                          │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          Persistent Memory                               │  │
│  │  • Project knowledge (README, architecture)              │  │
│  │  • User preferences                                      │  │
│  │  • Long-term project context                             │  │
│  └──────────────────────────────────────────────────────────┘  │
│                                                                │
└────────────────────────────────────────────────────────────────┘

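Context Catch strategies

Strategy	Description
Tool output compression	Compress tool results into a short "executed" summary
Duplicate detection	Keep only one of several consecutive similar operations
Importance scoring	Score content by user feedback, decision points, and similar signals
Tiered retention	Key information > intermediate steps > detailed logs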
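The tiered-retention row can be read as a budgeted selection: fill the available tokens with the highest tier first. The sketch below is one possible reading, not a description of Claude Code's internals; the tier names, the weights, and `retainWithinBudget` are all assumptions.

```typescript
// Hypothetical tiers matching "key information > intermediate steps > detailed logs".
type Tier = "key" | "intermediate" | "log";

interface ScoredItem {
  text: string;
  tier: Tier;
  tokens: number;
}

// Assumed weights; only their order matters.
const TIER_WEIGHT: Record<Tier, number> = { key: 3, intermediate: 2, log: 1 };

// Greedily keep items from the highest tier down while they fit the budget,
// then return the survivors in their original order.
function retainWithinBudget(items: ScoredItem[], budgetTokens: number): ScoredItem[] {
  const ranked = items
    .map((item, index) => ({ item, index }))
    .sort((a, b) => TIER_WEIGHT[b.item.tier] - TIER_WEIGHT[a.item.tier] || a.index - b.index);

  const kept: { item: ScoredItem; index: number }[] = [];
  let used = 0;
  for (const entry of ranked) {
    if (used + entry.item.tokens <= budgetTokens) {
      kept.push(entry);
      used += entry.item.tokens;
    }
  }
  return kept.sort((a, b) => a.index - b.index).map((entry) => entry.item);
}
```

Restoring the original order at the end keeps the retained context readable as a timeline rather than as a ranking.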

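Usage example

After a long conversation, Claude compresses the context automatically.

User: Continue the work from earlier
Claude: (Context Catch detects that the context has been compressed)

Here is what I have from the summary:

Currently implementing the user signup feature

Finished user-service.ts and the changes to user.ts

Still to do: validation logic + tests

Let me continue…

Summary: Context Catch is the mechanism by which Claude Code, when the context is nearly full, automatically compresses or summarizes earlier content while keeping the key information, so that important context is not lost.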

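In depth: Context Catch and the Claude Code memory system

For your use case in the AI-Interview project, here is a fuller description of the architecture together with a code implementation.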

Overall architecture: a three-layer memory system

┌────────────────────────────────────────────────────────────────┐
│   Three-layer architecture of the Claude Code memory system    │
├────────────────────────────────────────────────────────────────┤
│                                                                │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          L1: Working Context (active layer)              │  │
│  │                                                          │  │
│  │  • Full context of the current session                   │  │
│  │  • Live messages, tool calls, code changes               │  │
│  │  • Largest token consumer (triggers a catch)             │  │
│  │                                                          │  │
│  │  Capacity: ~200K tokens                                  │  │
│  │  Eviction: Context Catch runs when capacity is exceeded  │  │
│  └──────────────────────────────────────────────────────────┘  │
│                               │                                │
│                       ▼ Context Catch ▼                        │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          L2: Compressed Context (compressed layer)       │  │
│  │                                                          │  │
│  │  • Compressed session summary                            │  │
│  │  • Key decisions (user choices, design decisions)        │  │
│  │  • Progress snapshot (what is done, what remains)        │  │
│  │  • File change summary                                   │  │
│  │                                                          │  │
│  │  Capacity: ~20K tokens (10% of L1)                       │  │
│  │  Eviction: session ends or stays inactive for long       │  │
│  └──────────────────────────────────────────────────────────┘  │
│                               │                                │
│                               ▼                                │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          L3: Persistent Memory (persistent layer)        │  │
│  │                                                          │  │
│  │  • Project knowledge (README, arch docs, code standards) │  │
│  │  • User preferences (language, comment style)            │  │
│  │  • Long-lived state across sessions                      │  │
│  │                                                          │  │
│  │  Capacity: unlimited (stored on disk)                    │  │
│  │  Eviction: user deletes it or the project ends           │  │
│  └──────────────────────────────────────────────────────────┘  │
│                                                                │
└────────────────────────────────────────────────────────────────┘
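The capacities in the diagram can be written down as a small configuration, which makes the 10% relationship between L1 and L2 explicit. The `LayerConfig` shape and the `fits` helper are assumptions for illustration; only the capacity figures and eviction rules come from the diagram.

```typescript
// Hypothetical configuration shape; the figures are taken from the diagram above.
interface LayerConfig {
  name: string;
  capacityTokens: number | null; // null means no fixed limit
  eviction: string;
}

const L1_CAPACITY = 200_000;

const LAYERS: Record<"l1" | "l2" | "l3", LayerConfig> = {
  l1: {
    name: "Working Context",
    capacityTokens: L1_CAPACITY,
    eviction: "Context Catch runs when capacity is exceeded",
  },
  l2: {
    name: "Compressed Context",
    capacityTokens: L1_CAPACITY / 10, // about 10% of L1
    eviction: "session ends or stays inactive for a long time",
  },
  l3: {
    name: "Persistent Memory",
    capacityTokens: null, // stored on disk
    eviction: "user deletes it or the project ends",
  },
};

// True when content of the given size fits in the layer.
function fits(layer: LayerConfig, tokens: number): boolean {
  return layer.capacityTokens === null || tokens <= layer.capacityTokens;
}
```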

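L1 → L2 Context Catch compression (code example)

// Raw data before compression (assume about 1,500 tokens)
interface Message {
  role: "user" | "assistant" | "system" | "tool";
  content: string;
}

const rawSession: { messages: Message[] } = {
  messages: [
    { role: "user", content: "Help me implement user login" },
    { role: "assistant", content: "I'll help you implement it..." },
    { role: "system", content: "Searching existing code..." },
    { role: "tool", content: "Found 5 related files: user.ts, auth.ts..." },
    { role: "tool", content: "Finished reading user.ts (200 lines)" },
    { role: "tool", content: "Finished reading auth.ts (150 lines)" },
    { role: "assistant", content: "Analysis done, starting the implementation..." },
    { role: "tool", content: "Modified user.ts: +50 lines" },
    { role: "tool", content: "Created auth-service.ts: +80 lines" },
    { role: "assistant", content: "The login feature is done" },
  ],
};

// Shape of the snapshot produced by Context Catch
interface CompressedSnapshot {
  sessionId: string;
  startTime: Date;
  endTime: Date;
  duration: string; // e.g. "15 minutes"
  goal: string;
  status: string;
  decisions: { point: string; timestamp: string }[];
  changes: { created: string[]; modified: string[]; deleted: string[] };
  todo: string[];
  keySnippets: string[];
  compressedTokens: number;
}

// Example values after compression (session metadata omitted)
const snapshot: Omit<CompressedSnapshot, "sessionId" | "startTime" | "endTime" | "duration"> = {
  goal: "Implement user login",
  status: "done",
  decisions: [
    { point: "Use JWT for authentication", timestamp: "11:35" },
    { point: "Hash passwords with bcrypt", timestamp: "11:36" },
  ],
  changes: {
    created: ["auth-service.ts", "jwt-util.ts"],
    modified: ["user.ts", "auth.ts"],
    deleted: [],
  },
  todo: ["Add unit tests", "Update the API docs"],
  keySnippets: ["function login(): Promise<{token, user}> { ... }"],
  compressedTokens: 300,
};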

Concrete Context Catch strategies (pseudocode)

// 1. Tool output compression
const toolOutputCompression = {
  before: "Finished reading /src/user.ts\n200 lines total:\nline 1: import {...}\n...",
  after: "[read] user.ts (200 lines)",
  // Keep the full output when it reports an error or a search hit
  preserveIf: (output: string): boolean =>
    output.includes("ERROR") || output.includes("Found"),
};

// 2. Duplicate message collapsing
const duplicateCollapse = {
  before: ["Searching...", "Searching...", "Searching..."],
  after: "[repeated operation collapsed] search x3",
};

// 3. Intermediate step summarization
const processSummarization = {
  before: ["Analyzing code structure...", "Analyzing dependencies...", "Choosing an implementation plan..."],
  after: "Finished code analysis and solution design",
};

// 4. Decision point preservation
const decisionPreservation = {
  preserve: [
    "User confirmed TypeScript",
    "User chose a REST API style",
    "User asked for a logging middleware",
  ],
  priority: "HIGH",
};
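The first two strategies above are described only by before/after samples. A minimal runnable version might look like the following; the function names and the exact output format are assumptions, and the preservation rule simply mirrors the `preserveIf` condition.

```typescript
// Replace a tool output with a one-line marker unless it must be preserved.
// `label` is a short name for the output, such as a file name (hypothetical parameter).
function compressToolOutput(output: string, label: string): string {
  const mustPreserve = output.includes("ERROR") || output.includes("Found");
  if (mustPreserve) return output;
  const lineCount = output.split("\n").length;
  return `[done] ${label} (${lineCount} lines)`;
}

// Collapse runs of identical consecutive messages into "message xN".
function collapseDuplicates(messages: string[]): string[] {
  const result: string[] = [];
  let i = 0;
  while (i < messages.length) {
    const current = messages[i] as string;
    let j = i + 1;
    while (j < messages.length && messages[j] === current) j += 1;
    const count = j - i;
    result.push(count > 1 ? `${current} x${count}` : current);
    i = j;
  }
  return result;
}
```

Only consecutive duplicates are collapsed, so a message that recurs later in the session stays visible at the point where it happened.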

Application design for the AI-Interview project (based on Context Catch)

Interview memory system architecture

┌────────────────────────────────────────────────────────────────┐
│               AI-Interview memory system design                │
├────────────────────────────────────────────────────────────────┤
│                                                                │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          User Session layer                              │  │
│  │  • Live context of the interview in progress             │  │
│  │  • Candidate's answer history                            │  │
│  │  • Scores and notes                                      │  │
│  │  Context Catch triggers:                                 │  │
│  │    - Token usage > 80%                                   │  │
│  │    - User pauses or interrupts                           │  │
│  │    - No interaction for a long time (> 30 minutes)       │  │
│  └──────────────────────────────────────────────────────────┘  │
│                               │                                │
│                               ▼                                │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          Interview Snapshot layer                        │  │
│  │  Saved when the user interrupts the interview:           │  │
│  │  {                                                       │  │
│  │    sessionId: "interview_2024_001",                      │  │
│  │    candidateName: "Zhang San",                           │  │
│  │    position: "Senior Backend Engineer",                  │  │
│  │    currentQuestion: 5,                                   │  │
│  │    totalQuestions: 10,                                   │  │
│  │    answeredQuestions: [1, 2, 3, 4],                      │  │
│  │    scores: { q1: 4, q2: 3, q3: 5, q4: 2 },               │  │
│  │    keyInsights: [                                        │  │
│  │      "Deep understanding of distributed systems",        │  │
│  │      "Weaker on algorithms",                             │  │
│  │      "Project experience lacks specifics"                │  │
│  │    ],                                                    │  │
│  │    partialAnswer: {                                      │  │
│  │      questionId: 5,                                      │  │
│  │      content: "For performance tuning, I would first..." │  │
│  │    },                                                    │  │
│  │    overallImpression: "80% recommend next round",        │  │
│  │    nextActions: "Continue Q5, system design part"        │  │
│  │  }                                                       │  │
│  └──────────────────────────────────────────────────────────┘  │
│                               │                                │
│                               ▼                                │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │          Candidate Profile layer                         │  │
│  │  • Candidate basics (resume, education)                  │  │
│  │  • Past interview records (if any)                       │  │
│  │  • Skill assessment radar chart                          │  │
│  │  • Interview preference settings                         │  │
│  │  Uses:                                                   │  │
│  │    - Fast warm-up for the next interview                 │  │
│  │    - Tracking candidate growth                           │  │
│  │    - Sharing candidate info among interviewers           │  │
│  └──────────────────────────────────────────────────────────┘  │
└────────────────────────────────────────────────────────────────┘
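The three trigger conditions listed for the User Session layer can be checked in one place. The sketch below assumes a hypothetical `SessionState` record; the 80% and 30-minute thresholds are the ones from the diagram.

```typescript
// Hypothetical session state; field names are assumptions for this example.
interface SessionState {
  usedTokens: number;
  maxTokens: number;
  pausedByUser: boolean;
  lastInteractionAt: Date;
}

type CatchReason = "token_usage" | "user_pause" | "idle";

const IDLE_LIMIT_MS = 30 * 60 * 1000; // 30 minutes

// Return the first trigger that applies, or null when no catch is needed.
function catchReason(state: SessionState, now: Date): CatchReason | null {
  if (state.usedTokens > state.maxTokens * 0.8) return "token_usage";
  if (state.pausedByUser) return "user_pause";
  if (now.getTime() - state.lastInteractionAt.getTime() > IDLE_LIMIT_MS) return "idle";
  return null;
}
```

Returning the reason instead of a plain boolean lets the caller decide, for example, to persist the snapshot on a pause but only compress in memory on high token usage.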

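Core TypeScript implementation

interface InterviewSnapshot {
  sessionId: string;
  candidateInfo: {
    name: string;
    position: string;
    interviewId: string;
  };
  progress: {
    currentPhase: 'introduction' | 'coding' | 'system-design' | 'behavioral' | 'qna';
    currentQuestionIndex: number;
    totalQuestions: number;
    answeredQuestions: number[];
  };
  evaluation: {
    questionScores: Record<string, number>;
    strengthAreas: string[];
    improvementAreas: string[];
    overallScore?: number;
  };
  keyInsights: {
    candidateStrengths: string[];
    candidateWeaknesses: string[];
    technicalDepth: string[];
    redFlags: string[];
  };
  currentContext: {
    lastQuestion: Question | null;
    partialAnswer?: string;
    ongoingDiscussion?: string;
  };
  summary: string; // written by compress(); the field was missing from the interface
  createdAt: Date;
  updatedAt: Date;
  expiresAt?: Date;
}

// Supporting types. The original leaves these undefined, so the shapes below
// are assumptions inferred from how the class uses them.
interface Question {
  id: number;
  topic: string;
  answer?: { depth: number }; // depth score, assumed to be on a 0-10 scale
}

interface InterviewContext {
  sessionId: string;
  candidateInfo: InterviewSnapshot['candidateInfo'];
  progress: InterviewSnapshot['progress'];
  evaluation: InterviewSnapshot['evaluation'] & {
    technicalStrengths?: string[];
    communicationStrengths?: string[];
    technicalWeaknesses?: string[];
    areasForImprovement?: string[];
    redFlags?: string[];
  };
  messages: { content: string; timestamp: Date }[];
  questions: Question[];
  currentQuestion: Question | null;
  currentAnswer?: string;
  keyInsights?: InterviewSnapshot['keyInsights'];
  partialAnswer?: string;
  recoveryMessage?: string;
}

// Token counting depends on the model's tokenizer; assumed to be provided elsewhere.
declare function calculateTokens(context: InterviewContext): number;

class InterviewContextCatch {
  private maxTokens = 200000; // token limit of the context window
  private maxMessages = 100;
  private maxSilenceMs = 30 * 60 * 1000; // 30 minutes without interaction

  shouldCompress(context: InterviewContext): boolean {
    const tokenUsage = calculateTokens(context);
    return (
      tokenUsage > this.maxTokens * 0.8 ||
      context.messages.length > this.maxMessages ||
      this.hasLongSilence(context)
    );
  }

  compress(context: InterviewContext): InterviewSnapshot {
    const now = new Date();
    return {
      sessionId: context.sessionId,
      candidateInfo: context.candidateInfo,
      progress: context.progress,
      evaluation: this.compressEvaluation(context.evaluation),
      keyInsights: this.extractKeyInsights(context),
      currentContext: {
        lastQuestion: context.currentQuestion,
        // Keep only the last 500 characters of an unfinished answer
        partialAnswer: context.currentAnswer?.slice(-500),
      },
      summary: this.generateSummary(context),
      createdAt: now,
      updatedAt: now,
    };
  }

  extractKeyInsights(context: InterviewContext): InterviewSnapshot['keyInsights'] {
    const evaluation = context.evaluation;
    return {
      candidateStrengths: this.deduplicate([
        ...(evaluation.technicalStrengths ?? []),
        ...(evaluation.communicationStrengths ?? []),
      ]),
      candidateWeaknesses: this.deduplicate([
        ...(evaluation.technicalWeaknesses ?? []),
        ...(evaluation.areasForImprovement ?? []),
      ]),
      // Unanswered questions count as depth 0 instead of comparing undefined > 7
      technicalDepth: context.questions
        .filter(q => (q.answer?.depth ?? 0) > 7)
        .map(q => q.topic),
      redFlags: evaluation.redFlags ?? [],
    };
  }

  async restore(snapshot: InterviewSnapshot): Promise<InterviewContext> {
    return {
      sessionId: snapshot.sessionId,
      candidateInfo: snapshot.candidateInfo,
      progress: snapshot.progress,
      evaluation: snapshot.evaluation,
      keyInsights: snapshot.keyInsights,
      // The raw transcript is not stored in the snapshot, so it restarts empty
      messages: [],
      questions: [],
      currentQuestion: snapshot.currentContext.lastQuestion,
      partialAnswer: snapshot.currentContext.partialAnswer,
      recoveryMessage: this.generateRecoveryMessage(snapshot),
    };
  }

  generateRecoveryMessage(snapshot: InterviewSnapshot): string {
    const { overallScore } = snapshot.evaluation;
    const score = overallScore !== undefined ? `${overallScore}/10` : 'pending';
    const questionNumber = snapshot.progress.currentQuestionIndex + 1;
    const pending = snapshot.currentContext.partialAnswer
      ? '1 partial answer still to be completed'
      : 'No partial answer pending';
    return `
The interview was paused and is now being resumed.

Candidate: ${snapshot.candidateInfo.name}
Position: ${snapshot.candidateInfo.position}
Progress: question ${questionNumber} of ${snapshot.progress.totalQuestions}

Evaluation so far:
- Overall score: ${score}
- Strengths: ${snapshot.keyInsights.candidateStrengths.join(', ')}
- Weaknesses: ${snapshot.keyInsights.candidateWeaknesses.join(', ')}

Pending:
${pending}
Next step: continue with question ${questionNumber}

Please continue the interview.
    `.trim();
  }

  private hasLongSilence(context: InterviewContext): boolean {
    const last = context.messages[context.messages.length - 1];
    return last !== undefined && Date.now() - last.timestamp.getTime() > this.maxSilenceMs;
  }

  // Keep only the fields that belong in the snapshot
  private compressEvaluation(evaluation: InterviewContext['evaluation']): InterviewSnapshot['evaluation'] {
    const { questionScores, strengthAreas, improvementAreas, overallScore } = evaluation;
    return { questionScores, strengthAreas, improvementAreas, overallScore };
  }

  private generateSummary(context: InterviewContext): string {
    const { answeredQuestions, totalQuestions } = context.progress;
    return `${context.candidateInfo.name}: ${answeredQuestions.length}/${totalQuestions} questions answered`;
  }

  private deduplicate(items: string[]): string[] {
    return Array.from(new Set(items));
  }
}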

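Snapshot persistence

import { promises as fs } from 'fs';
import * as path from 'path';

class InterviewSnapshotStorage {
  private snapshotDir = './snapshots';

  // One JSON file per session. This layout is an assumption; the original omits the helper.
  private getSnapshotPath(sessionId: string): string {
    return path.join(this.snapshotDir, `${sessionId}.json`);
  }

  async save(snapshot: InterviewSnapshot): Promise<void> {
    await fs.mkdir(this.snapshotDir, { recursive: true });
    await fs.writeFile(this.getSnapshotPath(snapshot.sessionId), JSON.stringify({
      ...snapshot,
      createdAt: snapshot.createdAt.toISOString(),
      updatedAt: snapshot.updatedAt.toISOString(),
      expiresAt: snapshot.expiresAt?.toISOString(),
    }, null, 2));
    // The original also calls saveToUserLocalStorage(snapshot) here. localStorage exists
    // only in the browser, so that copy has to be written by client-side code instead.
  }

  async load(sessionId: string): Promise<InterviewSnapshot | null> {
    try {
      const data = await fs.readFile(this.getSnapshotPath(sessionId), 'utf-8');
      return this.deserialize(JSON.parse(data));
    } catch (err) {
      // fs.promises has no exists(); a missing file surfaces as ENOENT instead.
      if ((err as NodeJS.ErrnoException).code === 'ENOENT') return null;
      throw err;
    }
  }

  async list(candidateName?: string): Promise<InterviewSnapshot[]> {
    const files = await fs.readdir(this.snapshotDir);
    const loaded = await Promise.all(
      files
        .filter(f => f.endsWith('.json'))
        .map(f => this.load(path.basename(f, '.json')))
    );
    // Drop failed loads first so the name filter never sees null.
    const snapshots = loaded.filter((s): s is InterviewSnapshot => s !== null);
    return candidateName
      ? snapshots.filter(s => s.candidateInfo.name.includes(candidateName))
      : snapshots;
  }

  async delete(sessionId: string): Promise<void> {
    await fs.rm(this.getSnapshotPath(sessionId), { force: true });
  }

  // Remove every snapshot whose expiry time has passed.
  async cleanup(): Promise<void> {
    const now = new Date();
    for (const snapshot of await this.list()) {
      if (snapshot.expiresAt && snapshot.expiresAt < now) {
        await this.delete(snapshot.sessionId);
      }
    }
  }

  // Turn the ISO strings written by save() back into Date objects.
  private deserialize(raw: any): InterviewSnapshot {
    return {
      ...raw,
      createdAt: new Date(raw.createdAt),
      updatedAt: new Date(raw.updatedAt),
      expiresAt: raw.expiresAt ? new Date(raw.expiresAt) : undefined,
    };
  }
}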
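One detail that is easy to miss when persisting snapshots as JSON is that `Date` fields come back as strings. A small round-trip sketch, using a trimmed-down `StoredSnapshot` type that is an assumption for this example, shows one way to restore them with a `JSON.parse` reviver:

```typescript
// Trimmed-down snapshot with only the date fields that matter here (hypothetical type).
interface StoredSnapshot {
  sessionId: string;
  createdAt: Date;
  updatedAt: Date;
  expiresAt?: Date;
}

const DATE_FIELDS: ReadonlySet<string> = new Set(["createdAt", "updatedAt", "expiresAt"]);

// JSON.stringify calls Date.prototype.toJSON, which emits an ISO 8601 string.
function serializeSnapshot(snapshot: StoredSnapshot): string {
  return JSON.stringify(snapshot);
}

// The reviver converts the known date fields back into Date objects.
function deserializeSnapshot(json: string): StoredSnapshot {
  return JSON.parse(json, (key: string, value: unknown) =>
    DATE_FIELDS.has(key) && typeof value === "string" ? new Date(value) : value
  ) as StoredSnapshot;
}
```

An `expiresAt` that is `undefined` is omitted from the JSON entirely, so it is still `undefined` after the round trip.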

Full flow: user interruption → recovery

┌────────────────────────────────────────────────────────────────┐
│               AI-Interview session recovery flow               │
├────────────────────────────────────────────────────────────────┤
│  Scenario: user closes the browser midway through an interview │
│                                                                │
│  Step 1: Detect the interruption                               │
│  beforeunload fires → save snapshot → localStorage + server    │
│                                                                │
│  Step 2: Build the compressed snapshot                         │
│  ContextCatch.compress(interviewContext) → InterviewSnapshot   │
│  Keeps key insights, progress, scores; compresses the rest     │
│                                                                │
│  Step 3: Persist                                               │
│  Save to IndexedDB (browser) + server DB; expires in 7 days    │
│                                                                │
│  ---------------- user reopens the application ----------------│
│                                                                │
│  Step 4: Detect an unfinished interview                        │
│  App start → check localStorage → found → ask whether to resume│
│                                                                │
│  Step 5: Restore the context                                   │
│  snapshot = load(snapshotId); context = restore(snapshot)      │
│  Generate the recovery message; show the summary to confirm    │
│                                                                │
│  Step 6: Continue the interview                                │
│  "Last session stopped at question 5; the candidate was        │
│   answering a system design question about caching..."         │
│  User confirms → interview continues                           │
└────────────────────────────────────────────────────────────────┘
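The client-side half of this flow (steps 1 and 4) can be sketched against a minimal key-value interface, which keeps the logic testable outside a browser. Everything named here, including `KeyValueStore`, `PendingInterview`, and the storage key, is a hypothetical choice for illustration.

```typescript
// Minimal storage interface; the browser's localStorage satisfies it structurally.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
  removeItem(key: string): void;
}

// Small marker saved on interruption; field names are assumptions.
interface PendingInterview {
  sessionId: string;
  currentQuestion: number;
  totalQuestions: number;
}

const PENDING_KEY = "ai-interview:pending"; // hypothetical storage key

// Step 1: called when the interruption is detected.
function saveOnInterrupt(store: KeyValueStore, pending: PendingInterview): void {
  store.setItem(PENDING_KEY, JSON.stringify(pending));
}

// Step 4: called at startup; returns null when nothing usable is stored.
function findPending(store: KeyValueStore): PendingInterview | null {
  const raw = store.getItem(PENDING_KEY);
  if (raw === null) return null;
  try {
    return JSON.parse(raw) as PendingInterview;
  } catch {
    store.removeItem(PENDING_KEY); // discard a corrupted entry
    return null;
  }
}

// Text for the confirmation prompt shown before restoring.
function resumePrompt(pending: PendingInterview): string {
  return `Resume interview ${pending.sessionId} at question ${pending.currentQuestion} of ${pending.totalQuestions}?`;
}
```

In a browser, `window.localStorage` can be passed as the store from a `beforeunload` handler. Writes to `localStorage` are synchronous, which matters there because asynchronous work started during page unload may not complete.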

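Key design points

Point	Description
Priority	Key insights > scores > progress > intermediate steps
Compression ratio	L1 → L2 typically compresses to 10-15% of the original size
Retained decisions	Interviewer evaluations and judgments of candidate performance must be kept
Expiry	Snapshots carry an expiry time and are cleaned up periodically
User confirmation	Show the summary before restoring, and continue once the user confirms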
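The expiry rule is simple enough to pin down in code. The sketch below assumes the 7-day lifetime mentioned in the recovery flow as the default; the function names are hypothetical.

```typescript
const SEVEN_DAYS_MS = 7 * 24 * 60 * 60 * 1000;

// Expiry time for a snapshot created at `createdAt`.
function expiryFor(createdAt: Date, ttlMs: number = SEVEN_DAYS_MS): Date {
  return new Date(createdAt.getTime() + ttlMs);
}

// A snapshot without an expiry time never expires.
function isExpired(expiresAt: Date | undefined, now: Date): boolean {
  return expiresAt !== undefined && expiresAt.getTime() < now.getTime();
}
```

Passing `now` in explicitly, rather than reading the clock inside the function, keeps the cleanup logic deterministic under test.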

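Comparison

Item	Without Context Catch	With Context Catch
Long interviews	❌ Limited by the token window	✅ Not bounded by a single context window
Interruption recovery	❌ Everything is lost	✅ Key context is retained
Multi-round interviews	❌ Starts from scratch each time	✅ Candidate profile accumulates
Token consumption	Grows linearly	Stays bounded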
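The token consumption row can be illustrated with a toy simulation. It assumes a fixed number of tokens per turn, the 80% trigger, and compression down to 10% of current usage; all three figures are simplifications, and `simulateUsage` is a hypothetical helper.

```typescript
// Token usage after each turn, with or without compression at the 80% mark.
function simulateUsage(
  turns: number,
  tokensPerTurn: number,
  windowTokens: number,
  withCatch: boolean
): number[] {
  const usage: number[] = [];
  let used = 0;
  for (let turn = 0; turn < turns; turn += 1) {
    used += tokensPerTurn;
    if (withCatch && used > windowTokens * 0.8) {
      used = Math.round(used * 0.1); // assumed compression to 10%
    }
    usage.push(used);
  }
  return usage;
}
```

Without compression, usage grows by the same amount every turn and eventually passes the window size. With compression it follows a sawtooth pattern: usage climbs toward the 80% mark, drops after each catch, and never reaches the limit.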