60 Commits

Author SHA1 Message Date
9409c037c9 monitoring: restart grafana for alerting reload 2026-01-27 23:29:46 -03:00
ca7a08e791 monitoring: fix grafana smtp from address 2026-01-27 22:28:37 -03:00
029e4d4ca6 monitoring: send grafana alerts via postmark 2026-01-27 22:00:19 -03:00
a988af3262 monitoring: alert on VM outage 2026-01-23 11:51:28 -03:00
8b35ab0292 monitoring: refresh jobs dashboards 2026-01-21 13:37:36 -03:00
98b063f2dd grafana: allow email-based oauth user lookup 2026-01-21 11:45:11 -03:00
0eb526c907 monitoring: label cronjob metrics and move grafana to arm64 2026-01-18 12:20:45 -03:00
c70054a30e monitoring: add atlas testing dashboard folder 2026-01-18 12:07:45 -03:00
a5bec3e543 monitoring: avoid titan-22 for core pods 2026-01-18 11:43:28 -03:00
6e3faeb9fd monitoring: restore grafana persistence 2026-01-18 11:37:01 -03:00
0b15007e2c monitoring: disable grafana persistence to recover 2026-01-18 09:55:28 -03:00
1fb3d179ef monitoring: add testing dashboard and switch postmark apikey 2026-01-18 09:21:33 -03:00
bb1bf3c017 fix ingress tls routing 2026-01-16 01:40:50 -03:00
de6665c450 smtp: use mail.bstein.dev for app relays 2026-01-15 04:04:50 -03:00
e6210644c2 smtp: point services at mailu relay 2026-01-15 03:58:03 -03:00
0b21c8f40d vault: fix hyphenated key templates 2026-01-14 22:37:18 -03:00
c38f77302f vault: inject comms and grafana secrets 2026-01-14 22:29:27 -03:00
b1f9df4d83 vault: sync harbor pulls 2026-01-14 10:07:31 -03:00
8fa38268d9 chore: refresh knowledge catalog headers 2026-01-14 01:08:05 -03:00
bcc15c3e0a monitoring: allow grafana upgrade remediation 2026-01-13 21:18:42 -03:00
0b5dcde3a3 monitoring: align victoria-metrics PVC size 2026-01-13 21:15:10 -03:00
b53c7d4a1c monitoring: wire grafana smtp sync and alerting provisioning 2026-01-11 00:29:20 -03:00
0b78ec663d logging: remove loki and backfill to opensearch 2026-01-09 18:08:39 -03:00
1027fe5ce5 logging: add loki and fluent-bit 2026-01-08 22:31:45 -03:00
a14726350c monitoring: add titan-jh control plane node 2026-01-06 09:50:40 -03:00
9be25e16fe monitoring: add Postmark mail dashboard 2026-01-05 21:55:59 -03:00
32f1532508 monitoring: dual-provision overview orgs 2026-01-01 18:20:40 -03:00
353f2e9210 monitoring: recreate grafana rollouts 2026-01-01 18:00:07 -03:00
100a11e0de monitoring: split overview org 2026-01-01 17:54:01 -03:00
0db786c343 grafana,jitsi: enable pkce and tcp fallback 2025-12-24 18:15:25 -03:00
39a8e551eb grafana: allow public overview via oidc 2025-12-24 17:43:07 -03:00
1a161b4d3c monitoring: longer data history 2025-12-14 14:47:20 -03:00
580d1731f9 monitoring: drop duplicate titan-db scrape job 2025-12-12 21:48:03 -03:00
4def298b83 monitoring: scrape titan-db node_exporter 2025-12-12 21:38:10 -03:00
8d5e6c267c auth: wire oauth2-proxy and enable grafana oidc 2025-12-07 02:01:21 -03:00
a3dc9391ee monitoring: polish dashboards and folders 2025-12-02 14:41:39 -03:00
eed67b3db0 monitoring: regen dashboards with gpu details 2025-12-02 13:16:00 -03:00
a1e731e929 monitoring: fix hottest stats and titan-db scrape 2025-11-17 19:38:40 -03:00
8f5781d3cf monitoring: rebuild atlas dashboards 2025-11-17 16:27:38 -03:00
a41f25e66d monitoring: restructure grafana dashboards 2025-11-17 14:22:46 -03:00
0b1437b77c monitoring: refresh grafana dashboards 2025-11-15 21:03:11 -03:00
46b6b1f3b8 grafana: set datasource uid 2025-11-15 11:35:27 -03:00
683dc84289 grafana: use atlas metrics hostname 2025-11-15 11:18:40 -03:00
d0b6fbe763 victoria-metrics: revert storageclass change 2025-11-15 11:16:37 -03:00
3cfe639387 monitoring: fix domain 2025-11-14 19:13:40 -03:00
418329e173 monitoring: fix ingress and env formats 2025-11-14 08:51:09 -03:00
394fcf2ee4 grafana: use string host format 2025-11-14 08:37:46 -03:00
465103a57e grafana: fix dashboard provider list 2025-11-14 08:33:53 -03:00
c2cb901102 monitoring: fix grafana values 2025-11-14 08:29:59 -03:00
06337f2b9d monitoring: add grafana and alertmanager 2025-11-14 00:02:59 -03:00