
runtime not set · chat:549:session · failed
Luci
Task enabled, schedule `10 * * * *` correct. Dig scheduler state + task_runs + last error.
Luci
Manual cmd works (EXIT=0, writes snapshot). Scheduler alive (many tasks ran 08:52). But crypto-paper-snapshot absent from recent runs. Check schema + scheduler.log for why skipped.
Luci
REVIEW: crypto-paper-snapshot NOT at fault. Scheduler-wide stall caused the miss; already self-healed.

**Findings**
- Task healthy: `enabled: true`, schedule `10 * * * *` correct, `run_as: shell`. Manual run clean: EXIT=0, wrote snapshot (total=$10,512.68). No lock/prereq in the task itself.
- Root cause = scheduler lock contention. task_runs gap query: **0 runs between 07:01:15 and 08:51:03** (~1h50). Then a catch-up burst 08:51–08:53. crypto-paper-snapshot's missed :10 fires (07:10, 08:10) were collateral, not specific.
- Mechanism: scheduler = systemd timer firing `flock -n /tmp/scheduler.lock python3 scheduler.py tick` every minute. One tick **hung ~1h50 holding the lock**; every subsequent minute's tick hit `flock -n`, failed to acquire, exited → nothing scheduled for the whole window. When the stuck tick released ~08:51, ticks resumed and `is_due` catch-up (one missed fire/cycle) drained the backlog. crypto-paper-snapshot fired at **08:53:03 (completed)**. Gap already backfilled.
- Current state: `luci-scheduler.service` activating since 08:53, running crypto-trader-brief (heavy child, normal). Healthy; did NOT restart (would kill live brief; not needed).

**Verdict:** Transient, self-healed. No task config or code change. Watchdog fired correctly.

**Systemic risk (recommend follow-up ticket):** a single stuck child task stalls ALL scheduling for as long as it hangs, because `flock -n` silently bails with no hang-guard. Suggest a watchdog that kills a tick holding `/tmp/scheduler.lock` past N min (e.g. flock `--timeout` + reap of overrun child PIDs). Decide if you want it built.

{"mc_signal":"REVIEW","ticket":"MC-4433","nonce":"MC_TICKET_TURN:4437:1780123954250:0f4e5a3d","payload":"crypto-paper-snapshot miss = ~1h50 scheduler lock stall (hung tick held /tmp/scheduler.lock, flock -n blocked all ticks); task healthy, self-healed at 08:53. Recommend hang-guard follow-up."}
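
Hang-guard sketch for the follow-up ticket, if it gets built. This is not how `scheduler.py` works today; it is a minimal illustration under assumptions. `run_guarded_tick`, `MAX_TICK_SECONDS`, and the status strings are hypothetical names; only the lock path and the tick command come from the findings above. It keeps the `flock -n` behaviour (skip if the lock is held) but bounds how long one tick may hold the lock, then kills the tick's whole process group. One caveat on the suggestion above: `flock --timeout` limits how long a caller waits to acquire the lock, not how long the holder may keep it, so the hold-time limit has to come from elsewhere, as here.

```python
import fcntl
import os
import signal
import subprocess
import sys

LOCK_PATH = "/tmp/scheduler.lock"   # lock file named in the review
MAX_TICK_SECONDS = 600              # hypothetical limit, tune per workload


def run_guarded_tick(cmd, lock_path=LOCK_PATH, max_seconds=MAX_TICK_SECONDS):
    """Run one tick under an exclusive lock with a hard time limit.

    Returns "skipped" if another tick holds the lock, "ok" or "failed"
    based on the exit code, or "killed" if the tick overran.
    """
    # Append mode creates the file if missing without truncating it.
    with open(lock_path, "a") as lock_file:
        try:
            # Same semantics as `flock -n`: fail immediately if held.
            fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            return "skipped"

        # A new session gives the tick its own process group, so any
        # children it spawns can be reaped together with it.
        proc = subprocess.Popen(cmd, start_new_session=True)
        try:
            exit_code = proc.wait(timeout=max_seconds)
        except subprocess.TimeoutExpired:
            try:
                os.killpg(proc.pid, signal.SIGKILL)
            except ProcessLookupError:
                pass  # group already gone
            proc.wait()
            return "killed"
        return "ok" if exit_code == 0 else "failed"
    # Closing lock_file releases the lock for the next tick.


if __name__ == "__main__":
    status = run_guarded_tick([sys.executable, "scheduler.py", "tick"])
    print(f"tick status: {status}")
```

Trade-off to decide before building: a hard kill also kills legitimately long children (crypto-trader-brief was described as heavy but normal), so the limit needs to sit above the slowest healthy task, or heavy tasks need to run detached from the tick.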