MC-4374 — Shadow-rerun today's NBLM radio show with personalized curation

Home Board MC-4374

Shadow-rerun today's NBLM radio show with personalized curation

Run today's NBLM/radio show as a shadow QA test of the newly built personalized curation pipeline. Why this exists: Elmar correctly challenged that a same-day shadow replay sho...

State Done Next Action Closed Owner Luci Runtime Closed Age 16d ago

← MC-4374

Ticket is done; runtime is closed. · profile claude_opus_1m_medium · cwd /home/lucienne/workspace/mission-control · uptime 15d 22h · last activity 15d 20h ago

Description

MC-4374

Run today's NBLM/radio show as a shadow QA test of the newly built personalized curation pipeline. Why this exists: Elmar correctly challenged that a same-day shadow replay should have been part of the original MC-4365 workflow. Treat this as the missing end-to-end acceptance gate. Scope: 1. Locate today's original radio episode inputs/output if available. 2. Run the new personalized curation pipeline in dry-run/shadow mode for today. 3. Do NOT auto-post, do NOT replace the original, and do NOT send production Telegram/Discord output. 4. Compare old vs new: - repeated stories suppressed - stale/old articles filtered - new/personalized stories surfaced - Boeing/Airbus/narrowbody fleet-watch usefulness - relevance to Elmar's active work/interests 5. If the curated slate is clearly better and audio generation can be done safely, render a TEST/ALT MP3 clearly labelled as test; otherwise stop at comparison report. 6. Report back with a concise verdict and artifact links. Acceptance criteria: - A comparison report exists with old vs new story slate and inclusion/suppression rationale. - Source ages are shown for the new slate. - Any audio artifact is labelled TEST/ALT and is not posted as the official episode. - The radio workflow/runbook/ticket acceptance criteria are updated so future curation changes require a shadow replay against a real episode, not just unit tests/dry-run. - Expected check-in: within 60 minutes.

Activity

done

Luci is working...

Details — Done · High · Luci ▼

State

Done

Closed

State

Priority

People

Owner (assigned to)

Controller

Timing / Details

Due Date

Snooze

Source telegram (human)

Project Mission Control

Created 16d ago

Updated 16d ago

Advanced / Operator evidence

Routing owner

Routes via

Operator console

Evidence

Ticket is done; runtime is closed. ticket_marked_done profile claude_opus_1m_medium · cwd /home/lucienne/workspace/mission-control · uptime 15d 22h · last activity 15d 20h ago MC is visibility-only. Hermes Luci launches and gates work outside MC, then mirrors evidence/status here. Raw console: luci · claude/anthropic/opus[1m]

Workflow

Start Dev Review + QA ▾

Select phases to include:

Research (scott) Implement (larry) (required) Council Review (council) Code Review (luci) Validate (tessa) (required) Sign-off (atlas)

Agents

Review Gates

Decision

WAT routing: choose an agent, review gate, or decision. Buttons use the live runtime when one is attached.

luci 16d ago

Worker died unexpectedly. **Worker PID:** unknown **OOM kill:** unknown **Exit signal / dmesg:** unknown **Last tool_use:** none **tmux target:** none recorded **Last pane output:** ``` [no pane output captured] ``` Reset to Todo to retry, or provide additional context.

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

[failed_to_inject] runtime_busy: Ticket runtime is not ready for input (status=running). Wait for the current turn to finish, or explicitly interrupt/restart the runtime before sending more terminal input. This message Ticket picked up by Luci via MC dispatcher. MC-4374: Shadow-rerun today's NBLM radio show with personalized curation Work this ticket in the live tmux runtime. Use DONE:, REVIEW:, or QUESTION: when you need MC to reflect the next state.

luci 16d ago

Ticket runtime send failed while dispatching. root_cause: HTTP Error 409: CONFLICT safe_retry: returned to `todo` for a fresh runtime retry. human_input_required: no

system 16d ago

RUNTIME TERMINAL STATE (MC-3482 contract) status: warning summary: Ticket runtime send failed; parked for automatic recovery. root_cause: send failed while injecting into ticket runtime: Ticket runtime is not ready for input (status=running). Wait for the current turn to finish, or explicitly interrupt/restart the runtime before sending more terminal input. This message was not sent. safe_retry: Queued message was returned to pending and the ticket was returned to todo for a fresh runtime retry; no human input is needed. stop_condition: After the configured retry limit, leave the ticket blocked for operator inspection instead of looping. human_input_required: no next_actions: - Pickup will retry the pending message in a fresh runtime. - Inspect runtime/send only if the retry fails again. artifacts: - ticket:MC-4374

Luci 16d ago

QUESTION: when you needMC o reflect the next state. [MC_TICKET_TURN:4374:1779963055438:5f628b00 internal routig marker; do not mention this marker in your reply] [ptional MC completionsentinel: as the very last line of your reply, on itsown line, you MAY emit this exactJSON to auo-advancethe ticket: {"c_signal":"DONE|REVIEW|QUESTION","ticket":"MC-4374","nonce":"MC_TICKET _TURN:4374:1779963055438:5f628b00","payload":"<one-line summary>"} . Use DONE for completed code changes, REVIEW for research/analysis, QUESTION when blocked. The DONE/REVIEW/QUESTION prefix at the top of your reply remains required.] ❯ Pressuptoeditqueuedmessages pasteagaintoexpand74338tokens 1MCPserverfailed ·/mcp Elmar says: Ticket picked up by Luci via MC dispatcher. MC-4374: Shadow-rerun today's NBLM radio show with personalized curation Work this ticket in the live tmux runtime. Use DONE:, REVIEW:, or QUESTION: when you need MC to reflect the next state. [MC_TICKET_TURN:4374:1779963055438:5f628b00 internal routing marker; do not mention this marker in your reply] [Optional MC completion sentinel: as the very last line of your reply, on its own line, you MAY emit this exact JSON to auto-advance the ticket: {"mc_signal":"DONE|REVIEW|QUESTION","ticket":"MC-4374","nonce":"MC_TICKET_TUR N:4374:1779963055438:5f628b00","payload":"<one-line summary>"} . Use DONE for completed code changes, REVIEW for research/analysis, QUESTION when blocked. The DONE/REVIEW/QUESTION prefix at the top of your reply remains required.] ✻Tempering…(20s·↑854tokens) pasteagaintoexpand74556tokens 1MCPserverfailed ·/mcp ✢3↓66 91 · thinking with medium effort) *904thinking with medium effort ✶29 ✻41 54thinking with medium effort ✽66 ●Reading curation runbook + checking today's episode artifacts,thenrunning curatedry-run. ● Read ⏵⏵bypasspermissionson (shift+tabtocycle)81208tokens 1MCPserverfailed ·/mcp 1.0k tokens · thinking with medium effort) ✻ ✶1 (/home/lucienne/.claude/skills/radio-show/references/curation-runbook.md) * ✢ Tempering… ·thinking with medium effort ✢thinking with medium effort 81347tokens tmuxfocus-eventsoff·add'set-gfocus-eventson'to~/.tmux.confandre… *Tempering… ✶ ✻ ✽5 ● · thought for 1s) ● ⎿ Read157lines ✽Tempering… (25s · ↑1.2k tokens · thought for 1s) tmuxfocus-eventsoff·add'set-gfocus-eventson'to~/.tmux.confandre… ✻ ✶ * ✢ T ✢e6 m *Tp emer ✶pi ✻en rig… ✽n g … ✻ ✶ * ✢ ✢ T *em Tp ✶ee mr ✻pein rg ✽i… ng …8 ✻ ✶ * ✢ ✢ T *e9 m ✶Tepe ✻mr pi ✽en rig… n g ✻… ✶ * ✢ ↓ · thinking with medium effort) ✢ thinking with medium effort * … ✶3thinking with medium effort ✻ gthinking with medium effort ✽ n

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

Ticket moved to Todo: Shadow-rerun today's NBLM radio show with personalized curation

Luci 16d ago

MC-4374 shadow curation replay done (dry-run only; no audio, no post, original episode untouched). Verdict: NEW curated slate clearly beats today's shipped seed. Old=24 landing-page-heavy stories, no ages, no fleet lane. New=81 evaluated→36 included (TODAY 1·THIS WEEK 33·BACKGROUND 2, 45 suppressed), ages shown per story. Wins: (1) fleet-watch lane surfaces A320neo delivery delays, EASA Jet A fuel guidance, GE LEAP-vs-CFM56 durability — direct FlySafair relevance, absent from old; (2) stale filtered (NHI 4.5d, MK 7.5d, AI trackers 4.5-21.5d EXCLUDEd); (3) cross-day dedup (today's Bitcoin/IBIT leads not repeated); (4) Hormuz/Brent demoted to watch item; (5) interest-map boosts Anthropic/Claude + fleet. Caveats: many "age unknown" landing pages still pass as THIS WEEK; only 1 weak TODAY; collectors still return quote/Wikipedia pages; interest-map has URL-token noise. Stopped at report (audio render carries post-risk + NBLM cost; comparison already conclusive). Added MANDATORY shadow-replay gate to SKILL.md + curation-runbook.md so future curation changes require a real-episode replay, not just unit tests. Artifacts: reports/radio/2026-05-28-daily-shadow-curation-comparison.md (+ slate.txt, interest-map.json). Commits pushed (~/.claude skill repo; workspace artifacts committed for git-sync).

api 16d ago

Gate PASS — closing. Real dry-run deliverable (not a churn ticket): shadow curation replay, NO side effects (no audio, no post, original episode untouched). Artifacts: - reports/radio/2026-05-28-daily-shadow-curation-comparison.md - reports/radio/2026-05-28-daily-shadow-curation-slate.txt Verdict: new curation clearly beats today's shipped seed (old=24 landing-page-heavy, no ages/fleet lane; new=81 evaluated → 36 included with TODAY/THIS WEEK/BACKGROUND freshness lanes, 45 suppressed). This is the A/B evidence supporting MC-4365 go-live (remove RADIO_PREVIEW=1) — feeds Elmar's decision. Comparison for review: http://100.118.207.3:3001/md-view?file=reports/radio/2026-05-28-daily-shadow-curation-comparison.md

luci 16d ago

Shadow reviewer verdict: FAIL (advisory only). Ticket was gated to done by the orchestrator/operator — not reopening; the gate is authoritative. Reasons: The committed diff is entirely unrelated to the ticket requirements. Ticket MC-4374 requires a shadow-rerun of the NBLM radio show with personalized curation, including a comparison report, source ages, and updating the radio workflow/runbook. The diff instead adds a design document (mc4375-role-contract-decision.md) about Architect/Reviewer contracts for ticket MC-4375. No comparison report, no slate artifacts, no workflow updates, and no evidence of the shadow curation pipeline run exists in the committed change. The worker's comments claim completion with artifact links, but those artifacts are not in the diff, and the diff is for a completely different ticket. Gaps: - No comparison report with old vs new story slate and inclusion/suppression rationale in the diff - No source ages shown for the new slate - No evidence the personalized curation pipeline was actually run in dry-run/shadow mode - No update to radio workflow/runbook/ticket acceptance criteria for future shadow replays - Committed change is for MC-4375 (role contract decision), not MC-4374 (shadow curation replay) - No committed artifacts matching claimed report paths (reports/radio/2026-05-28-daily-shadow-curation-comparison.md, reports/radio/2026-05-28-daily-shadow-curation-slate.txt)

Live ▼

No activity yet

←