Operator Tuner Report

Window: last 24h (extended 72h for recurrence). Runs: 48. Total actions: 426. Avg actions/run: 7.88.

Action mix (recent window)

    63  recent_task_failure
    48  run_start
    48  disk_worktree_snapshot
    48  memory_snapshot
    48  done_audit_summary
    48  run_complete
    44  blocked_lane_classification_summary
    15  create_operator_ticket
    14  repo_dirty_observed
    10  operator_dev_loop_throttled
     8  operator_dev_loop_finished
     8  needs_input_runtime_failure_reset
     7  active_lane_snapshot
     4  promote_dead_zone_ticket
     3  todo_backlog_observed
     2  terminal_ticket_runtime_closed
     1  worktree_reaper_ran
     1  blocked_lane_completed_marked_done
     1  needs_input_reclassified_in_review
     1  repo_dirty_all_known_generated
     1  reopen_weak_done_ticket
     1  stale_runtime_observed
     1  stale_runtime_marked
     1  stale_runtime_summary

Signals worth tuning

Recurring operator-ticket markers (>=3 in 72h): {'[operator:worktree-manual-review]': 3, '[operator:active-lane-backlog]': 6, '[operator:dirty-repo:workspace]': 41, '[operator:dirty-repo:PKA]': 4, '[operator:dirty-repo:mission-control]': 10}
Weak-done reopens: 1
Stale-runtime marks (extended window): 55
Dev-loop throttle ratio: 0.56 (10 throttled / 8 finished)
Repo-dirty observations: 14 (generated-only noise: 1)

Reopened-ticket outcomes (current MC state)

MC-3129: status=done updated=2026-05-11T18:48:09.860011+02:00 failure=—

Proposed patches

Operator Tuner Review — 2026-05-12

1. `_should_retry_misrouted_needs_input` unconditionally returns `True`

Problem: The method's final return True (line ~610) makes it always return True regardless of whether failure evidence was found. Every needs_input ticket without an unanswered question gets reset to todo — 8 resets in 24h, likely feeding the active-lane congestion that produces 6 [operator:active-lane-backlog] markers.

Location: _should_retry_misrouted_needs_input, the last line return True.

Change: Replace return True with return False. Only tickets whose haystack matches the crash/runtime-failure regex should be retried.

Expected effect: needs_input_runtime_failure_reset count drops to only genuinely failed sessions; active-lane backlog signal should soften.

2. Dev-loop throttle window too short

Problem: Throttle is 2 hours (timedelta(hours=2) in _operator_dev_loop_throttled, ~line 465). With the operator running every 30 min, the loop can only fire once every 4 runs, producing a throttle ratio of 0.56 (10 throttled vs 8 finished). Most non-critical triggers are being suppressed.

Location: timedelta(hours=2) inside _operator_dev_loop_throttled.

Change: Increase to timedelta(hours=4).

Expected effect: Throttle ratio drops; dev loops finish before being re-throttled, reducing stale re-trigger cycles.

3. `\blog\b` in `STRONG_EVIDENCE_PATTERNS` is too broad

Problem: The pattern re.compile(r"\b(screenshot|artifact|report|document|dashboard|log)\b", re.I) matches "log" in any context (e.g., "checked the log," "saw nothing in the log"). This falsely classifies weak completions as strongly evidenced, suppressing valid reopens. Combined with only 1 reopen in 24h and 500 audited done tickets, the operator may be missing weak completions.

Location: STRONG_EVIDENCE_PATTERNS, the tuple entry containing "log".

Change: Remove \blog\b from the alternation, or narrow to \blog\s*(file|entry|line|output|stream)\b.

Expected effect: Slight increase in legitimate reopen_weak_done_ticket actions; fewer false-strong classifications.

4. `[operator:dirty-repo:workspace]` re-created cyclically (41 markers / 72h)

Problem: _ensure_operator_ticket only checks for open (non-terminal) tickets. A dev loop or manual action resolves the ticket → next operator run sees the still-dirty repo → creates a new ticket. This cycle accounts for 41 markers in 72h and is pure noise for a persistent condition the system already knows about.

Location: _ensure_operator_ticket → the SELECT query checking status NOT IN ('done', 'cancelled').

Change: Extend the query to also match tickets resolved in the last 4 hours (e.g., OR (status IN ('done','cancelled') AND updated_at >= datetime('now','-4 hours'))). If a recently-resolved match exists, record operator_ticket_recently_resolved and return without creating a new ticket.

Expected effect: create_operator_ticket count for dirty-repo:workspace drops from ~15/day to near-zero until the dirty file set actually changes; reduces MC ticket churn significantly.

Skipped (too thin to act on)

Stale-runtime marks (55): Volume is high but the 6-hour cutoff and _runtime_safe_to_stale guardrails look intentional. No evidence sessions are being killed prematurely.
TODO_BACKLOG_MINUTES_TRIGGER = 15: Only fired 3 times; the scheduler and pickup already handle this. Not noisy enough to change.
\bssh\b in the _should_retry_misrouted_needs_input regex: After fix #1 this matters more, but alone the signal is thin — skip unless #1 lands and false resets persist.

Operator Tuner Report — 2026-05-12