221 Commits

Author SHA1 Message Date
a67a6a1f3a monitoring: tidy hottest node labels 2025-11-17 20:04:50 -03:00
b28e7501b7 monitoring: show hottest node labels 2025-11-17 20:00:40 -03:00
4aece7e5cb monitoring: fix hottest node labels 2025-11-17 19:56:57 -03:00
bcaa0a3327 monitoring: show hottest node names 2025-11-17 19:53:39 -03:00
41e8a6a582 monitoring: reorder overview stats 2025-11-17 19:49:50 -03:00
a1e731e929 monitoring: fix hottest stats and titan-db scrape 2025-11-17 19:38:40 -03:00
fe8deea9c7 monitoring: tighten overview stats 2025-11-17 19:24:03 -03:00
349d9c56ac monitoring: polish dashboards 2025-11-17 18:55:11 -03:00
8f5781d3cf monitoring: rebuild atlas dashboards 2025-11-17 16:27:38 -03:00
a41f25e66d monitoring: restructure grafana dashboards 2025-11-17 14:22:46 -03:00
b004bf99dc monitoring: enrich dashboards 2025-11-16 12:58:08 -03:00
0b1437b77c monitoring: refresh grafana dashboards 2025-11-15 21:03:11 -03:00
eb3991b628 dashboards: improve public view and fix color 2025-11-15 11:59:48 -03:00
46b6b1f3b8 grafana: set datasource uid 2025-11-15 11:35:27 -03:00
683dc84289 grafana: use atlas metrics hostname 2025-11-15 11:18:40 -03:00
d0b6fbe763 victoria-metrics: revert storageclass change 2025-11-15 11:16:37 -03:00
3cfe639387 monitoring: fix domain 2025-11-14 19:13:40 -03:00
418329e173 monitoring: fix ingress and env formats 2025-11-14 08:51:09 -03:00
394fcf2ee4 grafana: use string host format 2025-11-14 08:37:46 -03:00
465103a57e grafana: fix dashboard provider list 2025-11-14 08:33:53 -03:00
c2cb901102 monitoring: fix grafana values 2025-11-14 08:29:59 -03:00
06337f2b9d monitoring: add grafana and alertmanager 2025-11-14 00:02:59 -03:00
a875b0a42e flux-system: track main branch 2025-11-12 01:06:26 -03:00
a08a2189e1 monitoring: disable wait on node-exporter 2025-11-09 14:03:14 -03:00
45f0100784 core: disable wait to unblock reconciliation 2025-11-09 13:46:56 -03:00
d5da49e566 core: remove gpu health gate 2025-11-09 13:37:59 -03:00
e0e27445c7 gpu: drop runtimeClass from minipc plugin 2025-11-09 13:28:40 -03:00
9f61854bc2 monitoring: disable kube-state annotations 2025-11-09 13:20:50 -03:00
ded87979c5 monitoring: clean helm values 2025-11-09 13:16:21 -03:00
538fca4195 monitoring: disable chart prometheusScrape 2025-11-09 13:11:40 -03:00
5ffcfc7d01 monitoring: annotate kube-state svc manually 2025-11-09 13:07:39 -03:00
f958d65528 monitoring: drop duplicate annotations 2025-11-09 13:03:40 -03:00
4197072593 monitoring: reference prometheus repo 2025-11-09 12:59:03 -03:00
d6f0f375b7 core: point flux to infrastructure path 2025-11-09 12:49:54 -03:00
051691e71f platform: fix relative paths 2025-11-09 12:39:32 -03:00
4a709391e6 platform: include cert-manager clusterissuer 2025-11-09 12:38:20 -03:00
1880df2525 chore: fix vmagent relabel indentation 2025-11-09 12:33:11 -03:00
02ed3e3145 fix: flux automation and monitoring config 2025-11-09 12:31:38 -03:00
b59025d495 refactor: restructure atlas flux layout 2025-11-09 11:48:45 -03:00
306b4b8458 pegasus on 2025-10-09 23:26:20 -05:00
2e6f811d12 Merge pull request 'minor tweaks' (#2) from fea/titan24-gpu into main
Reviewed-on: #2
2025-10-10 02:23:01 +00:00
ea08411128 minor tweaks 2025-10-09 21:21:54 -05:00
a09333ba38 Merge pull request 'gpu(titan-24): add RuntimeClass + NVIDIA device-plugin DS; enable containerd nvidia runtime' (#1) from fea/titan24-gpu into main
Reviewed-on: #1
2025-10-09 23:29:26 +00:00
bff6b83d11 gpu(titan-24): add RuntimeClass + NVIDIA device-plugin DS; enable containerd nvidia runtime 2025-10-09 18:28:20 -05:00
a94bd95248 pegasus chill 2025-10-08 04:26:26 -05:00
2c0622583e storageclass update 2025-10-08 03:13:12 -05:00
86490b74c4 asteria corrections 2025-10-08 00:50:42 -05:00
2ef8a7bbc2 jellyfin restart 2025-10-07 23:28:40 -05:00
ae85dcfeaa monitoring add, jellyfin/pegasus update, and traefik tweaks 2025-10-07 23:26:27 -05:00
41292eff0b jellyfin pvc size increase 2025-10-04 09:00:41 -05:00