Peter Steinberger
07527e22ce
refactor(auth-profiles): centralize active-window logic + strengthen regression coverage
2026-02-22 13:23:19 +01:00
Peter Steinberger
1152b25866
fix(gateway): guard trim crashes in subagent flow
2026-02-22 13:21:26 +01:00
Peter Steinberger
0d0f4c6992
refactor(exec): centralize safe-bin policy checks
2026-02-22 13:18:25 +01:00
Artale
51e9c54f09
fix(agents): skip bootstrap files with undefined path ( #22698 )
...
* fix(agents): skip bootstrap files with undefined path
buildBootstrapContextFiles() called file.path.replace() without checking
that path was defined. If a hook pushed a bootstrap file using 'filePath'
instead of 'path', the function threw TypeError and crashed every agent
session — not just the misconfigured hook.
Fix: add a null-guard before the path.replace() call. Files with undefined
path are skipped with a warning so one bad hook can't take down all agents.
Also adds a test covering the undefined-path case.
Fixes #22693
* fix: harden bootstrap path validation and report guards (#22698 ) (thanks @arosstale)
---------
Co-authored-by: Peter Steinberger <steipete@gmail.com>
2026-02-22 13:17:07 +01:00
Peter Steinberger
7c3c406a35
fix: keep auth-profile cooldown windows immutable in-window ( #23536 ) (thanks @arosstale)
2026-02-22 13:14:02 +01:00
artale
dc69610d51
fix(auth-profiles): never shorten cooldown deadline on retry
...
When the backoff saturates at 60 min and retries fire every 30 min
(e.g. cron jobs), each failed request was resetting cooldownUntil to
now+60m. Because now+60m < existing deadline, the window kept getting
renewed and the profile never recovered without manually clearing
usageStats in auth-profiles.json.
Fix: only write a new cooldownUntil (or disabledUntil for billing) when
the new deadline is strictly later than the existing one. This lets the
original window expire naturally while still allowing genuine backoff
extension when error counts climb further.
Fixes #23516
[AI-assisted]
2026-02-22 13:14:02 +01:00
Peter Steinberger
47c3f742b6
fix(exec): require explicit safe-bin profiles
2026-02-22 12:58:55 +01:00
Peter Steinberger
29cc7f431f
test: share runtime scan filters and cached test scans
2026-02-22 12:44:44 +01:00
Peter Steinberger
3a65e4b523
test: make snapshot env override assertion independent of host env
2026-02-22 12:40:30 +01:00
Peter Steinberger
a4607277a9
test: consolidate sessions_spawn and guardrail helpers
2026-02-22 12:34:55 +01:00
Peter Steinberger
c343132dbb
fix(agents): harden bash tool and reply directive handling
2026-02-22 11:29:31 +00:00
Peter Steinberger
401106b963
fix: harden flaky tests and cover native google thought signatures ( #23457 ) (thanks @echoVic)
2026-02-22 12:24:53 +01:00
echoVic
9176571ec1
fix(gemini): sanitize thoughtSignatures for native Google provider
...
Native Google Gemini provider was accumulating 2K-8K tokens of Base64
thoughtSignature blobs per turn, causing premature context overflow.
The sanitizer was only enabled for OpenRouter Gemini, not native Google.
Fixes #23392
2026-02-22 12:24:53 +01:00
Peter Steinberger
78c3c2a542
fix: stabilize flaky tests and sanitize directive-only chat tags
2026-02-22 12:19:33 +01:00
Peter Steinberger
97eb4af01e
test: harden models-config env isolation list
2026-02-22 10:34:23 +00:00
Peter Steinberger
744df0fbe7
test: reclassify models-config suites from e2e to unit lane
2026-02-22 10:34:23 +00:00
Peter Steinberger
740fd7ae35
test: reclassify skills suites from e2e to unit lane
2026-02-22 10:34:23 +00:00
Peter Steinberger
c56ab39da5
perf(test): reduce bash e2e wait windows
2026-02-22 10:28:43 +00:00
Peter Steinberger
abff3f0f61
test: reclassify sessions_spawn lifecycle suite as unit test
2026-02-22 10:28:43 +00:00
Peter Steinberger
0b7c7ee1aa
perf(test): speed up sessions_spawn lifecycle suite setup
2026-02-22 10:28:43 +00:00
Peter Steinberger
c962bcba37
test: reclassify sandbox merge and exec path suites as unit tests
2026-02-22 10:28:43 +00:00
Peter Steinberger
9ab7b85a66
perf(test): tighten background abort timing windows
2026-02-22 10:28:43 +00:00
Peter Steinberger
c995f9be07
test: reclassify mocked announce and sandbox suites as unit tests
2026-02-22 10:28:43 +00:00
Peter Steinberger
27f0d7ebcc
test: reclassify auth-profile-rotation suite as unit test
2026-02-22 10:28:43 +00:00
Peter Steinberger
c0b1c10a08
test: reclassify mocked runner/safe-bins suites as unit tests
2026-02-22 10:28:43 +00:00
Peter Steinberger
a9b26d83de
perf(test): narrow pi-embedded runner e2e import path
2026-02-22 10:28:42 +00:00
Peter Steinberger
2b0ca9447c
perf(test): trim bash e2e sleep and poll windows
2026-02-22 10:28:42 +00:00
Peter Steinberger
c348a13640
perf(test): lower subagent fast-mode wait floors
2026-02-22 10:28:42 +00:00
Peter Steinberger
54e0786ba6
perf(test): reduce subagent announce fast-mode polling waits
2026-02-22 10:28:42 +00:00
Peter Steinberger
a96139e18c
perf(test): mock compact module in auth rotation e2e
2026-02-22 10:28:42 +00:00
Peter Steinberger
eda941f395
perf(test): remove flaky transport timeout and dedupe safeBins checks
2026-02-22 10:28:42 +00:00
Peter Steinberger
d72b4ead18
perf(test): lower fast-mode nested output wait floor to 70ms
2026-02-22 10:28:42 +00:00
Peter Steinberger
7ccf62fb4c
test(agents): remove dead shell-timeout override in safeBins suite
2026-02-22 10:28:42 +00:00
Peter Steinberger
60773c124e
perf(test): lower fast-mode nested output wait floor to 80ms
2026-02-22 10:28:42 +00:00
Peter Steinberger
36375f121f
perf(test): trim nested subagent output wait floor in fast mode
2026-02-22 10:28:42 +00:00
Peter Steinberger
2900eb5456
perf(test): trim background abort settle waits and dedupe cmd fixture
2026-02-22 10:28:42 +00:00
Peter Steinberger
7d13227d41
test(agents): dedupe auth profile rotation fixture setup
2026-02-22 10:28:42 +00:00
Peter Steinberger
6b5c20055b
perf(test): speed subagent announce retry polling in fast mode
2026-02-22 10:28:42 +00:00
Peter Steinberger
1b327da6e3
fix: harden exec sandbox fallback semantics ( #23398 ) (thanks @bmendonca3)
2026-02-22 11:12:01 +01:00
Brian Mendonca
c76a47cce2
Exec: fail closed when sandbox host is unavailable
2026-02-22 11:12:01 +01:00
Peter Steinberger
35d5bd4e07
perf(test): shrink subagent announce fast-mode settle waits
2026-02-22 09:29:04 +00:00
Peter Steinberger
703f7213b6
test(agents): simplify subagent announce suite imports and call assertions
2026-02-22 09:29:04 +00:00
Peter Steinberger
6c2e999776
refactor(security): unify secure id paths and guard weak patterns
2026-02-22 10:16:19 +01:00
Peter Steinberger
c3e13175d2
perf(test): bypass queue debounce in fast mode and tighten announce defaults
2026-02-22 09:13:01 +00:00
Peter Steinberger
833d7574e7
test(agents): consolidate repeated announce deferral and fallback matrices
2026-02-22 09:05:56 +00:00
Peter Steinberger
4985fb7f05
test(agents): remove overflow compaction mock reset dependency
2026-02-22 09:02:24 +00:00
Peter Steinberger
d9a7b447f5
test(agents): use lightweight clear for active-run announce mock
2026-02-22 09:01:55 +00:00
Peter Steinberger
15657dd48d
test(agents): collapse repeated announce direct-send scenarios
2026-02-22 08:57:39 +00:00
Peter Steinberger
53a7afe238
test(agents): unify hook thread-target announce assertions
2026-02-22 08:55:11 +00:00
Peter Steinberger
d625f888a9
test(core): dedupe command gating and trim announce reset overhead
2026-02-22 08:54:11 +00:00