15 Commits

SHA1 Message Date
b0996e9a4f monitoring: refine jobs/overview panels 2026-01-21 14:31:11 -03:00
343d41ecc7 monitoring: add glue dashboard and tag cronjobs 2026-01-18 02:50:07 -03:00
13df82e07a monitoring: treat cert-manager as infrastructure 2026-01-12 00:26:46 -03:00
fb2c7b22d5 monitoring: regenerate dashboards with expanded infra namespaces 2026-01-11 23:55:43 -03:00
fcc0a49369 monitoring: fix infra scopes and add jetson metrics 2026-01-11 23:46:24 -03:00
33b89c7dc2 monitoring: remove titan-16 and add titan-20/21 to worker dashboards 2026-01-11 02:20:47 -03:00
734a537a28 monitoring: add alert rules and include titan-20/21 in dashboards 2026-01-11 02:02:47 -03:00
a14726350c monitoring: add titan-jh control plane node 2026-01-06 09:50:40 -03:00
5093f77c0a monitoring: per-panel namespace share filters 2026-01-01 14:44:33 -03:00
24376594ff atlas dashboards: use threshold colors for stats 2025-12-12 20:44:20 -03:00
bf6179f907 atlas internal dashboards: add SLO/burn and api health panels 2025-12-12 18:00:43 -03:00
a3dc9391ee monitoring: polish dashboards and folders 2025-12-02 14:41:39 -03:00
f06be37f44 monitoring: refine network metrics and control-plane allowance 2025-11-18 16:18:52 -03:00
8f5781d3cf monitoring: rebuild atlas dashboards 2025-11-17 16:27:38 -03:00
a41f25e66d monitoring: restructure grafana dashboards 2025-11-17 14:22:46 -03:00