18 Commits

SHA1 Message Date
3d2f5c0778 monitoring(alerts): make soteria backup health rule driver-agnostic 2026-04-13 02:36:39 -03:00
049a0deb04 maintenance(soteria): roll react ui image and wire b2 monitoring 2026-04-12 20:04:35 -03:00
c325744540 monitoring(alerts): watch soteria authz denial spikes 2026-04-12 15:07:54 -03:00
241a405c05 maintenance(soteria): harden ingress path and add backup alerts 2026-04-12 15:07:54 -03:00
3ce7b2eeb7 maintenance/monitoring: wire reciprocal metis hecate key + dampen alert flapping 2026-04-05 13:51:57 -03:00
03ae79df3e maintenance: harden sd-write controls and recovery workflow 2026-03-31 00:06:44 -03:00
8006540645 monitoring: raise rootfs warning threshold to 85 percent 2026-03-30 18:41:05 -03:00
0aeb08d375 monitoring: fix noisy grafana email alerts and reload rules 2026-03-30 18:33:02 -03:00
35d5d5a1a3 monitoring: fix grafana alert exec state 2026-01-27 23:34:11 -03:00
9a978c5e72 monitoring: tune cpu and maintenance alerts 2026-01-27 23:23:42 -03:00
993702afee monitoring: alert on VM outage 2026-01-23 11:51:28 -03:00
e897858d97 monitoring: move grafana smtp to vault 2026-01-14 06:41:34 -03:00
879ff7c16b monitoring: fix infra scopes and add jetson metrics 2026-01-11 23:46:24 -03:00
0e36e8ce12 knowledge: relocate metis doc; monitoring: add cpu high alert 2026-01-11 08:59:51 -03:00
f500e81606 monitoring: maintenance panels, extra alerts, update overview 2026-01-11 02:28:39 -03:00
4a01632f6b monitoring: add alert rules and include titan-20/21 in dashboards 2026-01-11 02:02:47 -03:00
ea7f1bfb5a monitoring: fix grafana alerting root policy 2026-01-11 01:40:07 -03:00
6ac61e7b44 monitoring: wire grafana smtp sync and alerting provisioning 2026-01-11 00:29:20 -03:00