|
|
be92017f4d
|
maintenance: harden sd-write controls and recovery workflow
|
2026-03-31 00:06:44 -03:00 |
|
|
|
2d7b51d3b3
|
monitoring: raise rootfs warning threshold to 85 percent
|
2026-03-30 18:41:05 -03:00 |
|
|
|
57375a81ad
|
monitoring: fix noisy grafana email alerts and reload rules
|
2026-03-30 18:33:02 -03:00 |
|
|
|
b34f2abefd
|
monitoring: fix grafana alert exec state
|
2026-01-27 23:34:11 -03:00 |
|
|
|
c5a7eece35
|
monitoring: tune cpu and maintenance alerts
|
2026-01-27 23:23:42 -03:00 |
|
|
|
a988af3262
|
monitoring: alert on VM outage
|
2026-01-23 11:51:28 -03:00 |
|
|
|
b8e50bb0a6
|
monitoring: move grafana smtp to vault
|
2026-01-14 06:41:34 -03:00 |
|
|
|
fcc0a49369
|
monitoring: fix infra scopes and add jetson metrics
|
2026-01-11 23:46:24 -03:00 |
|
|
|
c13b161171
|
knowledge: relocate metis doc; monitoring: add cpu high alert
|
2026-01-11 08:59:51 -03:00 |
|
|
|
54358df569
|
monitoring: maintenance panels, extra alerts, update overview
|
2026-01-11 02:28:39 -03:00 |
|
|
|
734a537a28
|
monitoring: add alert rules and include titan-20/21 in dashboards
|
2026-01-11 02:02:47 -03:00 |
|
|
|
1ffcb28be5
|
monitoring: fix grafana alerting root policy
|
2026-01-11 01:40:07 -03:00 |
|
|
|
b53c7d4a1c
|
monitoring: wire grafana smtp sync and alerting provisioning
|
2026-01-11 00:29:20 -03:00 |
|