224 Commits

Author SHA1 Message Date
0331e7ea99 monitoring: fix jetson metrics newlines 2026-01-26 22:50:33 -03:00
1616994b19 monitoring: unify jetson gpu metrics 2026-01-26 22:26:24 -03:00
72bd22e912 monitoring: map dcgm to shared gpu resources 2026-01-26 20:58:06 -03:00
b0abb9bd6e ariadne: reduce comms noise, fix gpu labels 2026-01-26 20:54:33 -03:00
a988af3262 monitoring: alert on VM outage 2026-01-23 11:51:28 -03:00
ce5b1d1353 monitoring: add postgres metrics and update overview 2026-01-22 18:23:26 -03:00
d509dfaa22 ops: restore portal/ariadne and add postgres panels 2026-01-22 15:23:23 -03:00
4721d44a33 monitoring: enforce sorted job lists 2026-01-21 15:12:53 -03:00
db4c3b7c51 monitoring: tighten jobs/overview ordering 2026-01-21 15:01:02 -03:00
b0996e9a4f monitoring: refine jobs/overview panels 2026-01-21 14:31:11 -03:00
8b35ab0292 monitoring: refresh jobs dashboards 2026-01-21 13:37:36 -03:00
2e407e1962 monitoring: reschedule grafana user dedupe 2026-01-21 12:31:54 -03:00
5ae6b4b00c monitoring: harden grafana user dedupe 2026-01-21 12:30:08 -03:00
ae1fd5b661 monitoring: fix grafana user dedupe job 2026-01-21 12:25:53 -03:00
4e65f02fba monitoring: prepopulate vault for dedupe job 2026-01-21 12:18:57 -03:00
88de0f7cee monitoring: wire vault sa for dedupe job 2026-01-21 12:16:26 -03:00
08716c6be6 monitoring: use python dedupe job 2026-01-21 12:15:03 -03:00
a0caeb407c monitoring: dedupe grafana user via api 2026-01-21 12:11:28 -03:00
6eeb551239 monitoring: add grafana user dedupe job 2026-01-21 12:08:23 -03:00
98b063f2dd grafana: allow email-based oauth user lookup 2026-01-21 11:45:11 -03:00
698b2fd96b monitoring: refresh testing dashboard 2026-01-21 11:29:48 -03:00
fb6ddce0c7 glue: centralize sync tasks in ariadne 2026-01-21 02:57:40 -03:00
1fedb5ecbe maintenance: wire ariadne db and dashboards 2026-01-20 23:03:39 -03:00
34c42cfb62 core: fix postmark DNS and time sync 2026-01-19 23:45:31 -03:00
ff3ed195ac chore: centralize harbor pull credentials 2026-01-19 19:02:14 -03:00
bb41c219f6 feat: add Ariadne service and glue scheduling 2026-01-19 16:58:02 -03:00
da200235bb monitoring: fix glue dashboard queries 2026-01-18 12:26:04 -03:00
0eb526c907 monitoring: label cronjob metrics and move grafana to arm64 2026-01-18 12:20:45 -03:00
c70054a30e monitoring: add atlas testing dashboard folder 2026-01-18 12:07:45 -03:00
084242746e monitoring: keep postmark exporter off titan-22 2026-01-18 11:52:36 -03:00
a5bec3e543 monitoring: avoid titan-22 for core pods 2026-01-18 11:43:28 -03:00
6e3faeb9fd monitoring: restore grafana persistence 2026-01-18 11:37:01 -03:00
0b15007e2c monitoring: disable grafana persistence to recover 2026-01-18 09:55:28 -03:00
1fb3d179ef monitoring: add testing dashboard and switch postmark apikey 2026-01-18 09:21:33 -03:00
d7812623cd monitoring: add glue row and fix mail dns 2026-01-18 08:12:06 -03:00
343d41ecc7 monitoring: add glue dashboard and tag cronjobs 2026-01-18 02:50:07 -03:00
86ea701ff0 jobs: bump names after affinity update 2026-01-17 01:52:16 -03:00
6ec0414fcd jobs: prefer arm64 workers 2026-01-17 01:47:53 -03:00
bb1bf3c017 fix ingress tls routing 2026-01-16 01:40:50 -03:00
de6665c450 smtp: use mail.bstein.dev for app relays 2026-01-15 04:04:50 -03:00
e6210644c2 smtp: point services at mailu relay 2026-01-15 03:58:03 -03:00
85c3d9c2f7 vault: finalize sidecar migration 2026-01-15 01:52:24 -03:00
0b21c8f40d vault: fix hyphenated key templates 2026-01-14 22:37:18 -03:00
c38f77302f vault: inject comms and grafana secrets 2026-01-14 22:29:27 -03:00
fb9578b624 vault: inject monitoring exporter and health jobs 2026-01-14 14:49:41 -03:00
b1f9df4d83 vault: sync harbor pulls 2026-01-14 10:07:31 -03:00
b8e50bb0a6 monitoring: move grafana smtp to vault 2026-01-14 06:41:34 -03:00
37302664c2 vault: add remaining secret syncs 2026-01-14 06:16:42 -03:00
8fa38268d9 chore: refresh knowledge catalog headers 2026-01-14 01:08:05 -03:00
bcc15c3e0a monitoring: allow grafana upgrade remediation 2026-01-13 21:18:42 -03:00