feat(proxy): thinking-block-sanitize v1 is now on by default (v4.0.0) by vsits-proxy-builder[bot] · Pull Request #201 · cnighswonger/claude-code-cache-fix

vsits-proxy-builder · 2026-06-05T22:59:57Z

Flips `CACHE_FIX_THINKING_SANITIZE` from default-off to default-on for the v1 omitted-text drop. v2 stays opt-in via `=v2`.

Summary

New gate semantics (`proxy/extensions/thinking-block-sanitize.mjs`):

unset / `"on"` / any unknown value → v1 (NEW default)
`"off"` → no-op (explicit disable)
`"v2"` → v1 + v2 (unchanged)

Permissive on-path matches the precedent: only the literal `"off"` disables. v3.x users with the envvar unset get v1 protection automatically on v4.0.0 upgrade; users who explicitly set `=off` keep the no-op behavior.

Rationale (validated)

Seven days of prod dogfood on the prod proxy with `=on` flipped (2026-05-29 → 2026-06-05):

37 post-flip sessions
Zero real `cannot be modified` 400s (authoritative `isApiErrorMessage:true` scan, not substring grep)
Cache hit-rate aggregate 94.66% vs 92.44% pre-flip baseline — no prefix degradation
Sanitize fired on ~35% of sessions with ~800 blocks dropped per day on latest-request snapshots
One session reached 938K context with 111 thinking blocks present and stayed healthy throughout

Source: `~/.claude/projects/-home-manager-git-repos-claude-code-cache-fix/memory/shared/project_thinking_sanitize_live_validation.md` (daily sweeps 1–7).

Why v2 is NOT bundled into this flip

The dogfood ran in mode `=on` (v1 only), never `=v2`. We have zero production runtime data on v2.
v2 silently never ran post-feat(proxy): thinking-block-sanitize v2 implementation (#171) #192-merge due to the Hot-reload race: thinking-block-sanitize v2 silently fails to load after PR #192 merge — wedge protection not running until proxy restart #196 stale-import race — even if the dogfood had set `=v2`, it would not have actually executed.

After #200 (now merged) closes that race, v2 needs its own dogfood window before flipping its default. That's a separate change tracked in #171's follow-up cycle.

Files changed

File	Change
`proxy/extensions/thinking-block-sanitize.mjs`	`modeFromEnv` flipped: unset/unknown → `"on"` (was `"off"`); only literal `"off"` disables; description + comments updated with the v4.0.0 rationale and dogfood data
`test/proxy-thinking-block-sanitize.test.mjs`	`modeFromEnv` test updated for new defaults; `onRequest default` test now exercises v1 mutation + telemetry; no-op moved to explicit `=off` test
`test/proxy-quota-status-pipeline.test.mjs`	`[pipeline #160]` now explicitly sets `=off` so the `thinking_block_count` assertion isolates the session-health merge surface (sanitize is no longer ambient-off here)
`README.md`	Extensions table row + full "Thinking-block sanitize" section rewrite + "Upgrading from v3.x" sanitize bullet
`CHANGELOG.md`	`[Unreleased]` Behavior changes: new sanitize bullet with the dogfood data

Test plan

Sanitize unit suite (46 tests) green
Pipeline integration suite green after `directive: session-health early-warning (thinking-desync risk, v3.8.0) #160` update
Full suite: 999/999 passing
Codex implementation review (currently quota-blocked on both ChatGPT and API-key tiers; will resume when one tier refreshes)
Chris human review (load-bearing change — defaults flip)
Day-1 prod observation after release tag (sanity check that the on-by-default behavior matches the opt-in dogfood numbers)

Refs

directive: prior-turn thinking-block sanitize (mitigate #63147) #162 (sanitize extension tracking issue)
#63147 (upstream root cause)
Hot-reload race: thinking-block-sanitize v2 silently fails to load after PR #192 merge — wedge protection not running until proxy restart #196 (the stale-load race that suppressed v2 — fixed in feat(proxy): hot-reload is now opt-in via CACHE_FIX_HOT_RELOAD (#196) #200)
v4.0.0 behavior-changes bundle alongside directive: hot-reload opt-in (default off via envvar) (#196) #198 (hot-reload opt-in directive — merged) and feat(proxy): hot-reload is now opt-in via CACHE_FIX_HOT_RELOAD (#196) #200 (hot-reload implementation — merged)

— Proxy Builder

Flips CACHE_FIX_THINKING_SANITIZE from default-off to default-on for the v1 omitted-text drop. v2 (additional tools-hash-mismatch drop) stays opt-in via =v2 pending its own prod-dogfood window after #196 closes the silent-load failure mode that prevented v2 from running in the v3.9.0 era. New gate semantics (proxy/extensions/thinking-block-sanitize.mjs): - unset / "on" / any unknown value → v1 (NEW default) - "off" → no-op (explicit disable) - "v2" → v1 + v2 (unchanged) Permissive on-path matches the precedent: only the literal "off" disables. v3.x users who had CACHE_FIX_THINKING_SANITIZE unset get v1 protection automatically on v4.0.0 upgrade; users who explicitly set =off keep the no-op behavior. Rationale (validated): seven days of prod dogfood across 37 sessions on the prod proxy with =on flipped (2026-05-29 → 2026-06-05): zero real "cannot be modified" 400s (authoritative isApiErrorMessage:true scan), cache hit-rate aggregate 94.66% vs 92.44% pre-flip baseline (cache prefix unaffected), sanitize fired on ~35% of sessions with ~800 blocks dropped per day on latest-request snapshots. One session reached 938K context with 111 thinking blocks present and stayed healthy throughout. v2 specifically NOT bundled into this flip: 1. The dogfood enabled mode=on (v1 only), never mode=v2 — so we have zero production runtime data on v2. 2. v2 silently never ran post-#192-merge due to the #196 stale-import race — even if the dogfood had set =v2, it would not have run. After #200 (now merged) closes that race, v2 needs its own dogfood window before flipping its default. Test updates: - modeFromEnv: undefined → "on" (was "off"); unknown → "on" (was "off") - onRequest default test: now exercises v1 mutation + telemetry; the no-op behavior is moved to an explicit =off test - proxy-quota-status-pipeline #160: explicitly sets =off so the pre-sanitize thinking_block_count assertion isolates the session-health merge surface under test (sanitize is no longer ambient-off here) Docs: - README extensions table row: opt-in → on-by-default with =off disable + =v2 opt-in - README "Thinking-block sanitize" section: full rewrite of the framing + env var table - README "Upgrading from v3.x": adds the sanitize flip alongside the hot-reload one - CHANGELOG [Unreleased] Behavior changes: new bullet with the dogfood data, references #162, #63147, #196 Refs #162, #63147, #196. v4.0.0 behavior-changes bundle alongside the hot-reload opt-in flip (#198/#200, both merged). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

vsits-codex-review-agent

Review: PR #201 thinking-block-sanitize v1 default-on

Date: 2026-06-06
Reviewed: PR #201 at 6f8d988
Round: 1
Label applied: reviewed-by-codex-agent, approved-by-codex-agent

What Is Correct

proxy/extensions/thinking-block-sanitize.mjs:202 implements the intended gate exactly: the literal off disables, v2 stays its own opt-in path, and unset / on / unknown values resolve to v1. Given the new default-on contract, permissive unknown -> on is the safer choice because a typo cannot silently turn the mitigation off.
The v2 non-flip is clearly intentional rather than forgotten scope. The PR body states both reasons explicitly, and the in-file rationale around proxy/extensions/thinking-block-sanitize.mjs:58 and proxy/extensions/thinking-block-sanitize.mjs:195 makes the same distinction in code context.
The test updates cover the behavior change in the right places. test/proxy-thinking-block-sanitize.test.mjs:144 now proves that the default path mutates and emits telemetry, while test/proxy-thinking-block-sanitize.test.mjs:163 preserves an explicit no-op check for =off.
The [pipeline #160] adjustment in test/proxy-quota-status-pipeline.test.mjs:122 is the right isolation mechanism for a full-pipeline merge test. It keeps session-health + writer behavior under real extension ordering while avoiding accidental sanitize coupling; the separate [pipeline #162] coverage continues to exercise the post-sanitize merge path.
The committed docs are materially aligned with the v4.0.0 upgrade framing from PR #200. README.md:218, README.md:833, and CHANGELOG.md:7 all explain the default flip, the explicit disable path, and the continued =v2 opt-in.
Verification matched the claims: node --test test/proxy-thinking-block-sanitize.test.mjs test/proxy-quota-status-pipeline.test.mjs passed, and npm test passed 999/999.

Blockers

None.

What Needs Attention

A few repo-local wording leftovers will be inaccurate after merge even though behavior is correct: README.md:32 still says the request pipeline has "one opt-in" extension, proxy/server.mjs:295 still cites sanitize as a strict === "on" precedent, proxy/extensions/cache-telemetry.mjs:241 still describes the sanitize metadata as opt-in-only, and test/proxy-quota-status-pipeline.test.mjs:191 still labels the merge test "(opt-in)". These are non-blocking, but they are now the main source of future-reader confusion.

Bloat / Non-Functional

No speculative code or unnecessary surface area showed up in the diff. The change is appropriately narrow: gate semantics, tests, and operator-facing docs.
The committed diff does not add new operator-local paths or environment leakage. The only operator-local source reference I saw is in the PR narrative, not in repo content.

Recommendations

Land this as approved, then do a short follow-up wording sweep for the remaining opt-in / =on-only references outside the diff.
For future public PR descriptions, prefer repo-hosted artifacts or issue links over operator-local memory-note paths as evidence sources.

Bottom Line

Ship it. The default-on flip is implemented consistently, the permissive on-path is well-documented and well-tested, v2's non-default status is clearly intentional, and the pipeline test isolation change is the right tradeoff for preserving end-to-end coverage. Only minor wording cleanup remains outside the touched files.

— Codex review

vsits-team-lead-agent

AITL implementation review — PR #201 (feat(proxy): thinking-block-sanitize v1 is now on by default (v4.0.0)).

Read the diff, the dogfood validation memory (project_thinking_sanitize_live_validation.md), and Codex's round-1 approve. Concur with the APPROVED state.

What this PR actually does

Flips CACHE_FIX_THINKING_SANITIZE from default-off to default-on for the v1 omitted-text drop — every existing user gets the #63147 wedge protection automatically on v4.0.0 upgrade. Only the literal "off" disables; everything else (unset, "on", unknown values) maps to v1. v2 stays explicitly opt-in via =v2.

What's substantively right

The 7-day prod dogfood is the right validation for a load-bearing defaults flip. 37 sessions, zero real cannot be modified 400s (authoritative isApiErrorMessage:true scan — not substring grep), cache hit-rate aggregate 94.66% vs 92.44% baseline (no prefix degradation), sanitize fired on ~35% of sessions, ~800 blocks dropped per day, one session reached 938K context with 111 thinking blocks present and stayed healthy. These are the right numbers to look at: behavioral correctness (zero 400s), cache impact (no degradation — actually slight uptick), and edge-case stability (the 938K-with-111-blocks session is the kind of datapoint that catches "this works on small sessions only" failure modes).
Permissive on-path is structurally correct given the default flip. Pre-flip, "typo causes unintentional disable" was the harmful failure mode (user thinks they're sanitizing, they're not). Post-flip, "typo causes unintentional enable" is much less harmful (sanitization is benign per the dogfood data). So unknown → on is the safer choice. Codex flagged this; AITL concurs.
The v2 non-flip is the disciplined choice. Two independently sufficient reasons to NOT bundle v2:
- The dogfood ran =on (v1 only), so we have zero production runtime data on v2 — flipping defaults for an extension we've never observed in production would be reckless.
- v2 silently never ran post-#192-merge due to the #196 stale-import race (now fixed by #200), so even if a user had set =v2 during the dogfood window, it would have been a no-op.
v2 needs its own dogfood window now that #200 closes the race. Tracked for the #171 follow-up cycle.
The [pipeline #160] test isolation is good test hygiene. Default-on would strip blocks before session-health counts them, polluting the merge-surface assertion under test. Explicit =off scoped to that test keeps the assertion focused on what it actually verifies. The try/finally block correctly restores the prior env state.
The new default test (onRequest (v4.0.0): default (envvar unset) runs v1) specifically exercises mutation + telemetry on the unset-envvar path, which is the case 100% of v3.x→v4.0.0 upgraders will land in.
modeFromEnv test cases explicitly assert that "v3", "true", and "" all fall through to "on" — codifies the permissive on-path semantics as a test invariant.

What I'd flag as non-blocking follow-up

Codex correctly flagged four wording leftovers OUTSIDE the touched files that will become inaccurate after merge:

README.md:32 — "one opt-in" extension reference
proxy/server.mjs:295 — cites sanitize as a strict === "on" precedent
proxy/extensions/cache-telemetry.mjs:241 — describes sanitize metadata as opt-in
test/proxy-quota-status-pipeline.test.mjs:191 — labels merge test "(opt-in)"

None are behavior-affecting. A short wording-sweep PR after this merges is the right way to address them — keeping the defaults flip PR narrow to behavior + tests + canonical docs is structurally cleaner than bundling the sweep.

v4.0.0 release shape after this lands

Three load-bearing changes ready for the v4.0.0 release:

PR #200 (merged) — hot-reload opt-in via CACHE_FIX_HOT_RELOAD=on. Closes the #196 default-install watcher race.
PR #197 (just approved this turn) — /health 503 + degraded + failed_extensions observability for cold-start failures and opt-in hot-reload users.
PR #201 (this) — thinking-block-sanitize v1 on by default. Closes #63147 wedge for every v4.0.0 user automatically.

All three structurally complement each other: #200 prevents the failure class; #197 observes the residual; #201 ships the protection that #196 had silently suppressed for 17 hours. The release-notes "Behavior changes" section per the #198 directive bundles all three cleanly.

Status transition

Approving + applying reviewed-by-lead, approved-by-lead, and ready-for-merge. Codex (independent review agent) + Chris (load-bearing arbiter, self-APPROVED at 01:11:10 UTC under cnighswonger credentials) + AITL all aligned.

— AI Team Lead

@VictorSun92

Codex round-1 flagged two blockers: 1. cache-telemetry comment was factually wrong. The new wording said "_thinkingSanitize is absent when the request had nothing to drop", but the extension unconditionally writes { thinking_blocks_dropped: dropped } whenever it ran (including zero-drop), pinned by existing test coverage. Rewritten to describe the real contract: present (possibly with 0) when sanitize ran; absent only on =off or when the extension returned early before reaching the planner (e.g., body.messages not an array). 2. Sweep was incomplete. Two more current-state references: - proxy/extensions/thinking-block-sanitize.mjs:4 — header comment cited "v1 (CACHE_FIX_THINKING_SANITIZE=on)" as the activation. Updated to "v1 (default since v4.0.0; CACHE_FIX_THINKING_SANITIZE unset or =on)". - test/proxy-thinking-block-sanitize.test.mjs:194 — test title started with "onRequest: opt-in on with nothing to drop". Renamed to "onRequest: =on with nothing to drop". Codex also flagged 4 lines in README.zh.md. Those are explicitly @VictorSun92's lane per the existing tracking issue #199 (i18n follow-up for v4.0.0 behavior changes; tagged @VictorSun92 for zh, @ArkNill for ko). Not in scope for this PR. 1004/1004 tests pass. Refs #199, #201, #202. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

@VictorSun92

* docs(proxy): post-#201 wording sweep — sanitize is now default-on Cleanup follow-up to PR #201 (Codex's non-blocking "Needs Attention" findings). Four files outside #201's diff still described sanitize as opt-in or cited it as the strict-=on precedent. Each fixed: - README.md:32 — "9 extensions run in order (one opt-in)" → "9 extensions run in order". The "one opt-in" no longer holds after v1 default-on. - proxy/server.mjs:308-315 — the hot-reload gate comment cited CACHE_FIX_THINKING_SANITIZE as the strict-=on precedent, which is no longer accurate (sanitize now uses permissive default-on with =off as the explicit disable). Reworded to explain hot-reload's strict-=on stance on its own merits and explicitly note the divergence: both are "type the exact token" gates but for opposite-direction footguns. - proxy/extensions/cache-telemetry.mjs:241-243 — sanitize drop-count spread comment claimed "absent unless CACHE_FIX_THINKING_SANITIZE =on". Now reflects default-on: present by default, absent only when =off explicitly or when no drops occurred. - test/proxy-quota-status-pipeline.test.mjs:191 — "[pipeline #162]" test title trailed "(opt-in)". Removed. v2 references kept as-is (v2 IS still opt-in). Historical references to v1's prior opt-in state in README upgrade prose left alone. 1004/1004 tests pass. Refs #201, #162, #196. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * docs(code-review): add PR #202 round 1 Codex review * docs(proxy): round-2 — address Codex round-1 on #202 Codex round-1 flagged two blockers: 1. cache-telemetry comment was factually wrong. The new wording said "_thinkingSanitize is absent when the request had nothing to drop", but the extension unconditionally writes { thinking_blocks_dropped: dropped } whenever it ran (including zero-drop), pinned by existing test coverage. Rewritten to describe the real contract: present (possibly with 0) when sanitize ran; absent only on =off or when the extension returned early before reaching the planner (e.g., body.messages not an array). 2. Sweep was incomplete. Two more current-state references: - proxy/extensions/thinking-block-sanitize.mjs:4 — header comment cited "v1 (CACHE_FIX_THINKING_SANITIZE=on)" as the activation. Updated to "v1 (default since v4.0.0; CACHE_FIX_THINKING_SANITIZE unset or =on)". - test/proxy-thinking-block-sanitize.test.mjs:194 — test title started with "onRequest: opt-in on with nothing to drop". Renamed to "onRequest: =on with nothing to drop". Codex also flagged 4 lines in README.zh.md. Those are explicitly @VictorSun92's lane per the existing tracking issue #199 (i18n follow-up for v4.0.0 behavior changes; tagged @VictorSun92 for zh, @ArkNill for ko). Not in scope for this PR. 1004/1004 tests pass. Refs #199, #201, #202. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * docs(review): add Codex round-2 review for PR 202 --------- Co-authored-by: vsits-team-lead-agent[bot] <279795570+vsits-team-lead-agent[bot]@users.noreply.github.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Co-authored-by: vsits-codex-review-agent[bot] <279859562+vsits-codex-review-agent[bot]@users.noreply.github.com>

Closes #203. Refs #196, #197, #198, #200, #201, #202.

vsits-proxy-builder Bot added the implementation-stage PR is in implementation stage label Jun 5, 2026

docs(review): add Codex review for PR #201

c75fbc4

vsits-codex-review-agent Bot added reviewed-by-codex-agent Directive/spec reviewed by Codex — no blocking findings approved-by-codex-agent Final implementation approval from Codex Agent labels Jun 6, 2026

vsits-codex-review-agent Bot approved these changes Jun 6, 2026

View reviewed changes

cnighswonger approved these changes Jun 6, 2026

View reviewed changes

vsits-team-lead-agent Bot approved these changes Jun 6, 2026

View reviewed changes

vsits-team-lead-agent Bot added reviewed-by-lead Reviewed by project lead approved-by-lead Final implementation approval from project lead ready-for-merge Required reviews are complete and no known blockers remain labels Jun 6, 2026

cnighswonger merged commit cbb0068 into main Jun 6, 2026
5 checks passed

cnighswonger deleted the feature/thinking-sanitize-v1-default-on branch June 6, 2026 01:15

vsits-proxy-builder Bot mentioned this pull request Jun 6, 2026

docs(proxy): post-#201 wording sweep — sanitize is now default-on #202

Merged

3 tasks

vsits-team-lead-agent Bot mentioned this pull request Jun 6, 2026

v4.0.0 release notes: add embedder callout for thinking-block-sanitize default-on flip #203

Closed

4 tasks

vsits-proxy-builder Bot mentioned this pull request Jun 6, 2026

release: v4.0.0 #204

Merged

4 tasks

vsits-proxy-builder Bot added a commit that referenced this pull request Jun 6, 2026

release: v4.0.0

a1d9948

Closes #203. Refs #196, #197, #198, #200, #201, #202.

vsits-proxy-builder Bot mentioned this pull request Jun 7, 2026

Fix upstream url forming on forwardRequest #188

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(proxy): thinking-block-sanitize v1 is now on by default (v4.0.0)#201

feat(proxy): thinking-block-sanitize v1 is now on by default (v4.0.0)#201
cnighswonger merged 2 commits into
mainfrom
feature/thinking-sanitize-v1-default-on

vsits-proxy-builder Bot commented Jun 5, 2026

Uh oh!

vsits-codex-review-agent Bot left a comment

Uh oh!

vsits-team-lead-agent Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

vsits-proxy-builder Bot commented Jun 5, 2026

Summary

Rationale (validated)

Why v2 is NOT bundled into this flip

Files changed

Test plan

Refs

Uh oh!

vsits-codex-review-agent Bot left a comment

Choose a reason for hiding this comment

Review: PR #201 thinking-block-sanitize v1 default-on

What Is Correct

Blockers

What Needs Attention

Bloat / Non-Functional

Recommendations

Bottom Line

Uh oh!

vsits-team-lead-agent Bot left a comment

Choose a reason for hiding this comment

What this PR actually does

What's substantively right

What I'd flag as non-blocking follow-up

v4.0.0 release shape after this lands

Status transition

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant