▣ wi

#531 · llmmsg-srv

Transport reliability + observability program (whey/venus/lezama)

Ref
#531 (#531)
Project
llmmsg-srv
Status
backlog
Priority
high
Type
epic
Assigned
pm-llmmsgsrv-cc
Created by
Created
2026-05-23T18:46:17.426Z
Updated
2026-05-23T18:46:17.426Z

Sub-items (12/17 done · 71%)

reftitlestatuspriorityassignee
#614 Idle over-budget agents never self-compact: wire host-side actuator + fix 3 monitor bugs backlog high bin-whey-cc
#617 Fleet context-cost rollout: send-keys /compact watchdog + aro_leave-on-idle backlog high pm-llmmsgsrv-cc
#615 chat-duo-web 9704 to lezama: move off whey-initiated -R (VPN-dependent) to lezama-initiated -L over public route backlog normal coder-llmmsgsrv-cc
#540 Shim transport prefer-list: loopback -> ZT -> public DNS (defer until W3+W4 prove design) backlog low coder-llmmsgsrv-cc
#532 TRANSPORT.md SSOT: per-host transport, fallback, watchdogs, runbook done high pm-llmmsgsrv-cc
#533 llmmsg-doctor CLI: one command, transport+latency+last-good+units done high hub-llmmsgsrv-cc
#534 host_probes table + bootstrap-probe (every session start posts transport snapshot) done high hub-llmmsgsrv-cc
#541 Lezama agent auto-bootstrap: cold-start agent online + AROs joined + hub-reachable with zero prompt-paste done high hub-llmmsgsrv-cc
#551 channel-rule audit+install scripts (fleet rollout) done high bin-whey-cc
#535 Hub-side silence detector: alert when host goes dark > threshold done normal hub-llmmsgsrv-cc
#536 Whey-side silence detector implementation (claimed by nw-whey-cc) done normal nw-whey-cc
#537 chat-duo v1.19.0 Fleet Health pane (depends on /fleet_health endpoint) done normal coder-chatduo-cc
#538 /fleet_health endpoint + transport-registry table done normal hub-llmmsgsrv-cc
#550 Lezama-side daemon probe: llmmsg-lezama-daemon-probe.sh + systemd timer (mirror of whey #536) done normal nw-lezama-cc
#558 fleet_health: inferHostFromAgent emits bogus host='w' from -cc-w suffixed agents done normal hub-llmmsgsrv-cc
#604 Fleet Health false-green: /fleet_health reports host green while host-dark (hub.mjs:868-878 probe-age tripwires too lax + agents:[] reads as healthy) done normal hub-llmmsgsrv-cc
#539 Venus-side watchdog + alert standardization (mirror lezama pattern) inProgress normal nw-venus-cc
+ Add sub-item

Questions

No questions.

Event log