🤖 fix: enforce ask_user_question in Plan Mode (#1169)

ThomasK33 · web-flow · commit 107ce63a9b7c · 2025-12-15T11:16:39.000Z
This tightens Plan Mode guidance so agents:

- use `ask_user_question` for unresolved questions (instead of an “Open
Questions” section / inline questions)
- avoid duplicating plan content and plan file paths after calling
`propose_plan` (the Plan UI already renders both)

Validation:
- `make static-check`

---

&lt;details&gt;
&lt;summary&gt;📋 Implementation Plan&lt;/summary&gt;

# Reinforce `ask_user_question` usage when the agent has open questions
(Plan Mode)

## Goal

Ensure that in **Plan Mode**, if the model has unanswered questions that
materially affect the plan, it **uses the `ask_user_question` tool**
(instead of outputting an “Open questions” section or asking inline in
chat).

## Recommended approach (A): tighten the Plan Mode system instruction
(primary)

**Why here:** `getPlanModeInstruction()` is injected into the system
message for *every* Plan Mode turn, so it’s the most reliable place to
enforce behavior across models.

**Change**: update `src/common/utils/ui/modeUtils.ts` →
`getPlanModeInstruction()` to add an explicit “no open questions in the
plan; use tool” rule.

**Proposed wording to insert** (exact text; adjust line wraps as
needed):

```text
If you need clarification from the user before you can finalize the plan, you MUST use the ask_user_question tool.
- Do not ask questions in a normal chat message.
- Do not include an "Open Questions" section in the plan.
- Ask up to 4 questions at a time (each with 2–4 options; "Other" is always available for free-form input).
- After you receive answers, update the plan file and only then call propose_plan.
- After calling propose_plan, do not repeat/paste the plan contents in chat; the UI already renders the full plan.
- After calling propose_plan, do not say “the plan is ready at &lt;path&gt;” or otherwise mention the plan file location; it’s already shown in the Plan UI.
```

**Net LoC (product code)**: +8 to +15 (string expansion only).

## Reinforcement (B): strengthen the tool description (secondary)

**Why:** tool descriptions are a strong hint for many models, and this
also helps when users add `Tool: ask_user_question` overrides (they’ll
see a better base description).

**Change**: update `src/common/utils/tools/toolDefinitions.ts` →
`ask_user_question.description`.

**Proposed wording tweak** (replace “should be used…” with a requirement
+ anti-pattern callout):

```text
This tool is intended for plan mode and MUST be used when you need user clarification to complete the plan.
Do not output a list of open questions; ask them via this tool instead.
```

**Net LoC (product code)**: ~0 to +3 (string edit).

## Tests

Add small unit tests so this behavior doesn’t regress during refactors.

1. `src/common/utils/ui/modeUtils.test.ts`
- Assert that `getPlanModeInstruction("/tmp/plan.md", false)` contains
`ask_user_question` and `MUST`.
- Assert that it includes the “don’t repeat/paste plan contents” +
“don’t mention plan file location” guidance (to prevent UI clutter).

2. `src/common/utils/tools/toolDefinitions.test.ts`
- Import the tool definitions map and assert
`ask_user_question.description` contains the “MUST be used …” phrasing.

**Net LoC (product code)**: 0 (tests only).

## Optional docs follow-up (not required for behavior)

If we want the user-facing docs to match the new behavior:

- `docs/plan-mode.mdx`: change “may call `ask_user_question` …” →
“should/must call … when it needs clarification before finalizing a
plan.”

**Net LoC (product code)**: 0 (docs only).

## Validation checklist (Exec Mode)

- Run `make typecheck`.
- Run targeted unit tests:
  - `bun test src/common/utils/ui/modeUtils.test.ts`
  - `bun test src/common/utils/tools/toolDefinitions.test.ts`
- Manual sanity check in mux:
  - Enter Plan Mode and ask for a plan on an underspecified task.
- Confirm the agent calls `ask_user_question` instead of emitting an
“Open questions” section.

---

&lt;details&gt;
&lt;summary&gt;Alternative considered: backend enforcement / linting&lt;/summary&gt;

**Idea:** detect “Open Questions” patterns in the assistant output and
block `propose_plan` unless `ask_user_question` was called.

**Why not (for now):**
- Hard to do safely without false positives/negatives.
- The backend can’t reliably synthesize the multiple-choice options the
tool requires.

This can be revisited if prompt-only enforcement proves insufficient.

**Net LoC (product code)**: ~50–150.

&lt;/details&gt;

&lt;/details&gt;

---
_Generated with `mux` • Model: openai:gpt-5.2 • Thinking: xhigh_
&lt;!-- mux-attribution: model=openai:gpt-5.2 thinking=xhigh --&gt;
diff --git a/src/common/utils/tools/toolDefinitions.test.ts b/src/common/utils/tools/toolDefinitions.test.ts
@@ -0,0 +1,17 @@
+import { TOOL_DEFINITIONS } from "./toolDefinitions";
+
+describe("TOOL_DEFINITIONS", () => {
+  it("asks for clarification via ask_user_question (instead of emitting open questions)", () => {
+    expect(TOOL_DEFINITIONS.ask_user_question.description).toContain(
+      "MUST be used when you need user clarification"
+    );
+    expect(TOOL_DEFINITIONS.ask_user_question.description).toContain(
+      "Do not output a list of open questions"
+    );
+  });
+
+  it("discourages repeating plan contents or plan file location after propose_plan", () => {
+    expect(TOOL_DEFINITIONS.propose_plan.description).toContain("do not paste the plan contents");
+    expect(TOOL_DEFINITIONS.propose_plan.description).toContain("plan file path");
+  });
+});
diff --git a/src/common/utils/tools/toolDefinitions.ts b/src/common/utils/tools/toolDefinitions.ts
@@ -228,15 +228,17 @@ export const TOOL_DEFINITIONS = {
   ask_user_question: {
     description:
       "Ask 1–4 multiple-choice questions (with optional multi-select) and wait for the user's answers. " +
-      "This tool is intended for plan mode and should be used when proceeding requires clarification. " +
+      "This tool is intended for plan mode and MUST be used when you need user clarification to complete the plan. " +
+      "Do not output a list of open questions; ask them via this tool instead. " +
       "Each question must include 2–4 options; an 'Other' choice is provided automatically.",
     schema: AskUserQuestionToolArgsSchema,
   },
   propose_plan: {
     description:
       "Signal that your plan is complete and ready for user approval. " +
       "This tool reads the plan from the plan file you wrote. " +
-      "You must write your plan to the plan file before calling this tool.",
+      "You must write your plan to the plan file before calling this tool. " +
+      "After calling this tool, do not paste the plan contents or mention the plan file path; the UI already shows the full plan.",
     schema: z.object({}),
   },
   todo_write: {
diff --git a/src/common/utils/ui/modeUtils.test.ts b/src/common/utils/ui/modeUtils.test.ts
@@ -0,0 +1,14 @@
+import { getPlanModeInstruction } from "./modeUtils";
+
+describe("getPlanModeInstruction", () => {
+  it("includes instructions to use ask_user_question (and avoid post-propose_plan clutter)", () => {
+    const instruction = getPlanModeInstruction("/tmp/plan.md", false);
+
+    expect(instruction).toContain("MUST use the ask_user_question tool");
+    expect(instruction).toContain('Do not include an "Open Questions" section');
+
+    // UI already renders the plan + plan file location, so the agent should not repeat them in chat.
+    expect(instruction).toContain("do not repeat/paste the plan contents");
+    expect(instruction).toContain('do not say "the plan is ready at <path>"');
+  });
+});
diff --git a/src/common/utils/ui/modeUtils.ts b/src/common/utils/ui/modeUtils.ts
@@ -17,6 +17,14 @@ NOTE that this is the only file you are allowed to edit - other than this you ar
 
 Keep the plan crisp and focused on actionable recommendations. Put historical context, alternatives considered, or lengthy rationale into collapsible \`<details>/<summary>\` blocks so the core plan stays scannable.
 
+If you need clarification from the user before you can finalize the plan, you MUST use the ask_user_question tool.
+- Do not ask questions in a normal chat message.
+- Do not include an "Open Questions" section in the plan.
+- Ask up to 4 questions at a time (each with 2–4 options; "Other" is always available for free-form input).
+- After you receive answers, update the plan file and only then call propose_plan.
+- After calling propose_plan, do not repeat/paste the plan contents in chat; the UI already renders the full plan.
+- After calling propose_plan, do not say "the plan is ready at <path>" or otherwise mention the plan file location; it's already shown in the Plan UI.
+
 When you have finished writing your plan and are ready for user approval, call the propose_plan tool.
 Do not make other edits in plan mode. You may have tools like bash but only use them for read-only operations.