|
|
029e4d4ca6
|
monitoring: send grafana alerts via postmark
|
2026-01-27 22:00:19 -03:00 |
|
|
|
38c8d08ab4
|
monitoring: fix gpu idle label
|
2026-01-27 21:46:58 -03:00 |
|
|
|
ba16f5119b
|
monitoring: unify gpu namespace usage
|
2026-01-27 21:43:37 -03:00 |
|
|
|
51bf01a8fd
|
monitoring: keep idle label in gpu share
|
2026-01-27 18:44:58 -03:00 |
|
|
|
1b04e6cb00
|
monitoring: fix gpu idle share
|
2026-01-27 17:51:13 -03:00 |
|
|
|
5f32dff73b
|
monitoring: fix tegrastats regexes
|
2026-01-27 16:44:00 -03:00 |
|
|
|
dfb295e5f0
|
monitoring: expose jetson scrape line length
|
2026-01-27 16:38:09 -03:00 |
|
|
|
a7f3d49fea
|
monitoring: read tegrastats per scrape
|
2026-01-27 16:34:31 -03:00 |
|
|
|
246ed6617e
|
monitoring: read jetson stats on demand
|
2026-01-27 16:27:45 -03:00 |
|
|
|
1951291090
|
monitoring: refresh jetson stats on scrape
|
2026-01-27 16:23:23 -03:00 |
|
|
|
62a423f32c
|
monitoring: fix jetson gpu metrics
|
2026-01-27 16:19:54 -03:00 |
|
|
|
9ea338b121
|
monitoring: restart jetson exporter
|
2026-01-26 22:51:41 -03:00 |
|
|
|
0331e7ea99
|
monitoring: fix jetson metrics newlines
|
2026-01-26 22:50:33 -03:00 |
|
|
|
1616994b19
|
monitoring: unify jetson gpu metrics
|
2026-01-26 22:26:24 -03:00 |
|
|
|
72bd22e912
|
monitoring: map dcgm to shared gpu resources
|
2026-01-26 20:58:06 -03:00 |
|
|
|
b0abb9bd6e
|
ariadne: reduce comms noise, fix gpu labels
|
2026-01-26 20:54:33 -03:00 |
|
|
|
a988af3262
|
monitoring: alert on VM outage
|
2026-01-23 11:51:28 -03:00 |
|
|
|
ce5b1d1353
|
monitoring: add postgres metrics and update overview
|
2026-01-22 18:23:26 -03:00 |
|
|
|
d509dfaa22
|
ops: restore portal/ariadne and add postgres panels
|
2026-01-22 15:23:23 -03:00 |
|
|
|
4721d44a33
|
monitoring: enforce sorted job lists
|
2026-01-21 15:12:53 -03:00 |
|
|
|
db4c3b7c51
|
monitoring: tighten jobs/overview ordering
|
2026-01-21 15:01:02 -03:00 |
|
|
|
b0996e9a4f
|
monitoring: refine jobs/overview panels
|
2026-01-21 14:31:11 -03:00 |
|
|
|
8b35ab0292
|
monitoring: refresh jobs dashboards
|
2026-01-21 13:37:36 -03:00 |
|
|
|
2e407e1962
|
monitoring: reschedule grafana user dedupe
|
2026-01-21 12:31:54 -03:00 |
|
|
|
5ae6b4b00c
|
monitoring: harden grafana user dedupe
|
2026-01-21 12:30:08 -03:00 |
|
|
|
ae1fd5b661
|
monitoring: fix grafana user dedupe job
|
2026-01-21 12:25:53 -03:00 |
|
|
|
4e65f02fba
|
monitoring: prepopulate vault for dedupe job
|
2026-01-21 12:18:57 -03:00 |
|
|
|
88de0f7cee
|
monitoring: wire vault sa for dedupe job
|
2026-01-21 12:16:26 -03:00 |
|
|
|
08716c6be6
|
monitoring: use python dedupe job
|
2026-01-21 12:15:03 -03:00 |
|
|
|
a0caeb407c
|
monitoring: dedupe grafana user via api
|
2026-01-21 12:11:28 -03:00 |
|
|
|
6eeb551239
|
monitoring: add grafana user dedupe job
|
2026-01-21 12:08:23 -03:00 |
|
|
|
98b063f2dd
|
grafana: allow email-based oauth user lookup
|
2026-01-21 11:45:11 -03:00 |
|
|
|
698b2fd96b
|
monitoring: refresh testing dashboard
|
2026-01-21 11:29:48 -03:00 |
|
|
|
fb6ddce0c7
|
glue: centralize sync tasks in ariadne
|
2026-01-21 02:57:40 -03:00 |
|
|
|
1fedb5ecbe
|
maintenance: wire ariadne db and dashboards
|
2026-01-20 23:03:39 -03:00 |
|
|
|
34c42cfb62
|
core: fix postmark DNS and time sync
|
2026-01-19 23:45:31 -03:00 |
|
|
|
ff3ed195ac
|
chore: centralize harbor pull credentials
|
2026-01-19 19:02:14 -03:00 |
|
|
|
bb41c219f6
|
feat: add Ariadne service and glue scheduling
|
2026-01-19 16:58:02 -03:00 |
|
|
|
da200235bb
|
monitoring: fix glue dashboard queries
|
2026-01-18 12:26:04 -03:00 |
|
|
|
0eb526c907
|
monitoring: label cronjob metrics and move grafana to arm64
|
2026-01-18 12:20:45 -03:00 |
|
|
|
c70054a30e
|
monitoring: add atlas testing dashboard folder
|
2026-01-18 12:07:45 -03:00 |
|
|
|
084242746e
|
monitoring: keep postmark exporter off titan-22
|
2026-01-18 11:52:36 -03:00 |
|
|
|
a5bec3e543
|
monitoring: avoid titan-22 for core pods
|
2026-01-18 11:43:28 -03:00 |
|
|
|
6e3faeb9fd
|
monitoring: restore grafana persistence
|
2026-01-18 11:37:01 -03:00 |
|
|
|
0b15007e2c
|
monitoring: disable grafana persistence to recover
|
2026-01-18 09:55:28 -03:00 |
|
|
|
1fb3d179ef
|
monitoring: add testing dashboard and switch postmark apikey
|
2026-01-18 09:21:33 -03:00 |
|
|
|
d7812623cd
|
monitoring: add glue row and fix mail dns
|
2026-01-18 08:12:06 -03:00 |
|
|
|
343d41ecc7
|
monitoring: add glue dashboard and tag cronjobs
|
2026-01-18 02:50:07 -03:00 |
|
|
|
86ea701ff0
|
jobs: bump names after affinity update
|
2026-01-17 01:52:16 -03:00 |
|
|
|
6ec0414fcd
|
jobs: prefer arm64 workers
|
2026-01-17 01:47:53 -03:00 |
|