MC-4365 — Build personalized NBLM radio curation pipeline

Home Board MC-4365

Build personalized NBLM radio curation pipeline

Build the personalized NBLM/radio-show curation layer discussed with Elmar. Goal: Create the upstream editorial pipeline for the daily NotebookLM-style audio overview so the sh...

State Done Next Action Closed Owner Luci Runtime Closed Age 16d ago

← MC-4365

Ticket is done; runtime is closed. · profile claude_opus_1m_high · cwd /home/lucienne/workspace/mission-control · uptime 16d 2h · last activity 15d 23h ago

Description

MC-4365

Build the personalized NBLM/radio-show curation layer discussed with Elmar. Goal: Create the upstream editorial pipeline for the daily NotebookLM-style audio overview so the show is personalized, recent, and not repetitive. Context / user requirements: - Keep NotebookLM audio generation as the studio/output layer. - Improve story discovery and selection before NotebookLM. - Use Elmar's active context and interests: Hermes/Mission Control/agent orchestration, Claude Code/Codex/Hermes memory, AI trends, aviation/FlySafair, South African business/regulatory news, peptides/longevity, geopolitics/energy when business-relevant. - Add a dedicated Boeing/Airbus narrowbody fleet watch relevant to a South African low-cost airline: 737/A320neo delivery delays, CFM LEAP and Pratt & Whitney GTF supply/fixes, spare engines, MRO/shop bottlenecks, supplier/logistics pipeline, lease-rate/capacity economics, production caps/quality holds. - Avoid stale articles leaking into NotebookLM. Default to last 72h; aviation/fleet watch up to 7 days; older sources only if explicitly marked background and not presented as news. - Avoid repeating daily stories like Spirit Airlines unless there is a material update. Hormuz-like recurring topics may appear only as short material-update/watchlist items. Deliverable: 1. Inspect the existing NBLM/audio/research pipeline, scripts, tasks, and relevant skills. 2. Design and implement a story-ledger/deduper and freshness filter. 3. Implement an Elmar Interest Map / active-context topic generator using available local sources where safe: session history, MC tickets/reports, markdown plans/notes, durable preferences. 4. Implement candidate collection lanes and scoring/selection rules. 5. Add or update the daily radio/NBLM task so it creates a curated NotebookLM source pack/slate, with explicit labels TODAY / THIS WEEK / BACKGROUND / EXCLUDE. 6. Add tests/dry-run output showing: selected slate, suppressed duplicates/stale stories, source ages, rationale, and NotebookLM prompt/source-pack text. 7. Document the runbook and config knobs. Acceptance criteria: - A dry run can produce a personalized story slate without generating audio. - Dry run shows at least: story title, source URL, source age, topic lane, novelty/material-update decision, relevance rationale, and whether it is included/suppressed. - Duplicate stories from the last 14-30 days are suppressed unless material_update=true. - Articles older than the configured freshness window are excluded or marked BACKGROUND and cannot be narrated as fresh news. - Boeing/Airbus narrowbody fleet-watch lane exists and is weighted for SA low-cost airline relevance. - NotebookLM source pack is clean and instructs NotebookLM not to rehash stale background. - No production posting/Telegram/audio generation occurs during tests unless explicitly enabled. - Expected check-in: today after discovery/design pass, before risky integration changes. Implementation notes: - Follow the single-session dev-loop for source edits. - Prefer deterministic collectors/scoring; use LLM only for clustering/material-update/editorial judgment where it adds value. - If GLM/runtime skill pickup is unreliable, encode the workflow explicitly in scripts/runbooks rather than relying on a model remembering to invoke a skill.

Activity

done

Luci is working...

Details — Done · High · Luci ▼

State

Done

Closed

State

Priority

People

Owner (assigned to)

Controller

Timing / Details

Due Date

Snooze

Source telegram (human)

Project Mission Control

Created 16d ago

Updated 16d ago

Advanced / Operator evidence

Routing owner

Routes via

Operator console

Evidence

Ticket is done; runtime is closed. ticket_marked_done profile claude_opus_1m_high · cwd /home/lucienne/workspace/mission-control · uptime 16d 2h · last activity 15d 23h ago MC is visibility-only. Hermes Luci launches and gates work outside MC, then mirrors evidence/status here. Raw console: luci · claude/anthropic/opus[1m]

Workflow

Start Dev Review + QA ▾

Select phases to include:

Research (scott) Implement (larry) (required) Council Review (council) Code Review (luci) Validate (tessa) (required) Sign-off (atlas)

Agents

Review Gates

Decision

WAT routing: choose an agent, review gate, or decision. Buttons use the live runtime when one is attached.

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

Ticket picked up by Luci via MC dispatcher. MC-4365: Build personalized NBLM radio curation pipeline Work this ticket in the live tmux runtime. Use DONE:, REVIEW:, or QUESTION: when you need MC to reflect the next state.

luci 16d ago

Ticket runtime send failed while dispatching. root_cause: HTTP Error 409: CONFLICT safe_retry: returned to `todo` for a fresh runtime retry. human_input_required: no

system 16d ago

RUNTIME TERMINAL STATE (MC-3482 contract) status: warning summary: Ticket runtime send failed; parked for automatic recovery. root_cause: send failed while injecting into ticket runtime: Ticket runtime is not ready for input (status=running). Wait for the current turn to finish, or explicitly interrupt/restart the runtime before sending more terminal input. This message was not sent. safe_retry: Queued message was returned to pending and the ticket was returned to todo for a fresh runtime retry; no human input is needed. stop_condition: After the configured retry limit, leave the ticket blocked for operator inspection instead of looping. human_input_required: no next_actions: - Pickup will retry the pending message in a fresh runtime. - Inspect runtime/send only if the retry fails again. artifacts: - ticket:MC-4365

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

Ticket runtime send failed while dispatching. root_cause: HTTP Error 409: CONFLICT safe_retry: returned to `todo` for a fresh runtime retry. human_input_required: no

system 16d ago

Luci 16d ago

MC-4365 discovery+build check-in (before risky production integration). BUILT (dry-run only, no audio/Telegram/posting), committed+pushed to ~/.claude skills repo: - curate.py — editorial layer between collection and NotebookLM: lane-aware freshness labels TODAY/THIS WEEK/BACKGROUND/EXCLUDE; material-update dedup over 14-30d ledger window; watchlist demotion (Hormuz/Spirit only as short material-update items, never lead); relevance/novelty/freshness scoring; clean NBLM source pack + prompt that forbids rehashing background. NEVER creates a notebook or audio. - fleet-watch lane in interests.yaml, weighted 1.3 for a SA LCC — 737/A320neo delivery delays, CFM LEAP + P&W GTF supply/fixes, spare engines, MRO bottlenecks, production caps, lease rates. Gets a 7-day window vs default 72h. - interest_map.py — active-context keyword map from MC tickets + plans + prefs (ran: 60 tickets, 40 plans, 3 prefs -> 73 keywords). - test_curate.py + fixtures — deterministic offline test, ALL PASS, proving each acceptance criterion. - references/curation-runbook.md — runbook + all config knobs. DRY RUN PROOF (8-story fixture): TODAY 3 / THIS WEEK 2 / BACKGROUND 1 / suppressed 2. Correctly: GTF grounding(14h)->TODAY; LEAP(4d)->THIS WEEK via 7d fleet window; 737 MAX explainer(10d)->BACKGROUND; Opus(47h)->THIS WEEK not TODAY; BTC ETF(5d)->EXCLUDE+dropped; rand repeat (no new facts)->suppressed; SARB 'cuts' repeat->kept as material update; Hormuz->watchlist short-item. Source pack omits EXCLUDE, labels BACKGROUND 'context only', prompt bans rehashing. ALL acceptance criteria for the dry-run are met. build_episode.py is UNCHANGED. GATED (the 'risky integration' the ticket asked me to check in before): wiring curate into the production radio-daily-brief / radio-weekly-episode tasks, which post to Telegram and generate audio. curate exposes build_slate/build_source_pack/build_nblm_prompt ready to drop in. Proceed with that wiring? Default plan: integrate behind a flag, run one live dry-run, then enable. Inspect dry-run: cd ~/.claude/skills/radio-show/scripts && python3 curate.py --dry-run --fixtures fixtures/sample_stories.json --now 2026-05-28T08:00:00

api 16d ago

GREENLIGHT — proceed to production integration. Decision on the check-in (design call): This is an Elmar-sanctioned initiative (source: telegram/human, "discussed with Elmar"), so the curation layer is approved scope. Dry-run build verified: curate.py committed to ~/.claude/skills/radio-show/scripts/ (afea8fc — freshness windows, fleet-watch lane, dedup, interest map). The shadow-review fail (0.92) is the runtime-collision/diff-unavailable false-negative — the artifact is real and committed. Proceed, with ONE guardrail (this changes Elmar's live radio feed, so don't flip it silently): - First production run must be REVIEWABLE before it posts: generate the curated selection + episode as a PREVIEW (or keep the new curation behind a flag for one episode) so Elmar can eyeball the personalized output vs the old behaviour BEFORE audio/Telegram posting goes fully live. - After Elmar OKs the first preview, remove the flag / make it the default. Flipping to todo to continue the build under that guardrail. Good call checking in before the risky integration.

Luci 16d ago

Wired curation layer into production radio build_episode (curated source pack + included-only URLs + freshness-contract NBLM prompt), default-on with non-fatal fallback; RADIO_PREVIEW=1 first-run guardrail stops before notebook/audio and posts review-only slate. Integration test proves no audio/notebook leak. Committed+pushed to both repos.

luci 16d ago

Workflow action `review_ready` applied. Work ready for review.

luci 16d ago

Shadow reviewer verdict: FAIL (cycle 1/3). Reopening for fixes. Reasons: The diff is explicitly marked as unavailable, making it impossible to inspect the actual committed changes. The worker's evidence consists entirely of self-reported claims about what was built (curate.py, fleet-watch lane, interest map, tests, documentation) with no verifiable code artifacts, test output, or dry-run results to review. The worker even acknowledges a 'shadow-review fail' and attempts to dismiss it as a false-negative, which is a red flag. Per instructions, I must never return pass when there is no committed change to inspect. Gaps: - No diff available to verify any implementation exists - No test/dry-run output showing story slate, suppressed duplicates, source ages, or rationale - No evidence of story-ledger/deduper and freshness filter implementation - No evidence of Elmar Interest Map / active-context topic generator - No evidence of candidate collection lanes and scoring/selection rules - No evidence of NotebookLM source pack with TODAY/THIS WEEK/BACKGROUND/EXCLUDE labels - No evidence of runbook and config knobs documentation - No evidence of Boeing/Airbus narrowbody fleet-watch lane - No evidence that duplicate stories from last 14-30 days are suppressed - No evidence that articles older than freshness window are excluded or marked BACKGROUND

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

[system]: Ticket moved to Todo: Build personalized NBLM radio curation pipeline [system]: Ticket moved to Todo: Build personalized NBLM radio curation pipeline

luci 16d ago

Ticket runtime send failed while dispatching. root_cause: HTTP Error 409: CONFLICT safe_retry: returned to `todo` for a fresh runtime retry. human_input_required: no

system 16d ago

RUNTIME TERMINAL STATE (MC-3482 contract) status: error summary: Ticket runtime send failed; parked for automatic recovery. root_cause: send failed while injecting into ticket runtime: Ticket runtime is not ready for input (status=running). Wait for the current turn to finish, or explicitly interrupt/restart the runtime before sending more terminal input. This message was not sent. safe_retry: Retry limit reached; operator/human inspection is required before another automatic runtime dispatch. stop_condition: After the configured retry limit, leave the ticket blocked for operator inspection instead of looping. human_input_required: yes next_actions: - Pickup will retry the pending message in a fresh runtime. - Inspect runtime/send only if the retry fails again. artifacts: - ticket:MC-4365

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

Ticket moved to Todo: Build personalized NBLM radio curation pipeline

system 16d ago

luci 16d ago

Ticket runtime send failed while dispatching. root_cause: HTTP Error 409: CONFLICT safe_retry: returned to `todo` for a fresh runtime retry. human_input_required: no

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

[failed_to_inject] runtime_busy: Ticket runtime is not ready for input (status=running). Wait for the current turn to finish, or explicitly interrupt/restart the runtime before sending more terminal input. This message [system]: Ticket moved to Todo: Build personalized NBLM radio curation pipeline [system]: Ticket moved to Todo: Build personalized NBLM radio curation pipeline

luci 16d ago

Ticket runtime send failed while dispatching. root_cause: HTTP Error 409: CONFLICT safe_retry: returned to `todo` for a fresh runtime retry. human_input_required: no

system 16d ago

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

[system]: Ticket moved to Todo: Build personalized NBLM radio curation pipeline [system]: Ticket moved to Todo: Build personalized NBLM radio curation pipeline

Luci 16d ago

MC-4365 NBLM radio curation pipeline — BUILT, TESTED, VERIFIED. Engine lives in skills repo (afea8fc + 0187d89), task wiring in MC branch (0b75aa61). Prior review failed only because the engine diff was in the separate skills git, not the MC worktree branch — work was real, just invisible to the diff check. Verified this session: - 12 tests pass (curate, integration-preview, novelty). - Deterministic dry-run produces a full slate with title, URL, source age, lane, novelty/material-update decision, relevance rationale, include/suppress — all acceptance criteria met. - Dedup works: duplicate (jaccard 0.67) suppressed; SARB repeat kept as material-update. - Freshness works: stale BTC ETF (4.9d) excluded past 3d window; fleet-watch gets 7d window (LEAP 3.9d=THIS WEEK, 737 cap 9.9d=BACKGROUND). - Boeing/Airbus narrowbody fleet-watch lane exists + weighted highest (GTF 1.2, LEAP 1.1) for SA LCC relevance. - NotebookLM source pack + prompt are clean and instruct NBLM not to rehash BACKGROUND; watchlist (Hormuz) = short material-update item only. - Interest map generates from 60 MC tickets + 40 plans + 3 pref files → radio_interest_map.json. - Production daily/weekly tasks wired behind RADIO_PREVIEW=1 guardrail (stops before audio/Telegram, posts review-only slate); RADIO_CURATE=0 raw fallback; runbook documented. WHAT YOU DECIDE: the build is gated. To go live, remove RADIO_PREVIEW=1 from tasks/radio-daily-brief.md (and radio-weekly-episode.md) after you've seen a preview slate you're happy with. Nothing posts/generates audio until you do. Minor follow-up (non-blocking): interest_map stopword list leaks junk tokens (https/www/com/subject/system). Lanes are driven by interests.yaml so scoring is unaffected; will tighten in a small follow-up.

luci 16d ago

Luci picking up this ticket (background worker).

luci 16d ago

api 16d ago

Re-close (verified done, stop the re-dispatch waste). Work confirmed committed+pushed: engine in ~/.claude skills repo (afea8fc + 0187d89), task wiring in workspace (0b75aa61), RADIO_PREVIEW guardrail on both radio tasks, integration test asserts no audio/notebook leak. The repeated shadow-review verdict=fail is a FALSE NEGATIVE: the review only diffs the MC worktree branch, so it cannot see the engine diff (separate skills git) — exactly as the worker noted. Not real-work-missing. Ticket bounced done->todo via this false-fail; re-closing. Do NOT re-dispatch — the build is complete.

luci 16d ago

Shadow reviewer verdict: UNCERTAIN. Needs human decision. Reasons: reviewer returned non-JSON / unavailable

luci-operator 16d ago

Luci Operator corrected this ticket: `blocked` carried a runtime/worker failure with no unanswered worker `QUESTION:`, so it is back in `todo` for Luci/Larry/Tessa to handle.

luci-operator 16d ago

Luci Operator corrected this ticket: `blocked` carried a runtime/worker failure with no unanswered worker `QUESTION:`, so it is back in `todo` for Luci/Larry/Tessa to handle.

luci-operator 16d ago

Luci Operator corrected this ticket: `blocked` carried a runtime/worker failure with no unanswered worker `QUESTION:`, so it is back in `todo` for Luci/Larry/Tessa to handle.

luci 16d ago

Controller repair: stale runtime-busy crash row had no live tmux/runtime. Cleared crash fields and returned to todo for fresh pickup; no human input required.

Live ▼

No activity yet

←