Compare commits

..

190 Commits

Author SHA1 Message Date
319b515882 zot: restore main branch config 2025-12-11 17:26:15 -03:00
cb2b2ec1cd zot: revert to unauthenticated registry 2025-12-11 17:22:16 -03:00
20cd185c0b vault: drop traefik basicauth 2025-12-11 17:09:05 -03:00
2f368f6975 zot,vault: remove oauth2-proxy sso 2025-12-11 17:04:19 -03:00
6c62d42f7a longhorn/vault: gate via oauth2-proxy 2025-12-07 19:44:02 -03:00
a7e9f1f7d8 auth: remove error middleware to allow redirect 2025-12-07 13:19:45 -03:00
ceb692f7ee oauth2-proxy: drop groups scope to avoid invalid_scope 2025-12-07 13:09:29 -03:00
24fbaad040 auth: forward-auth via external auth host (svc traffic flaky) 2025-12-07 13:03:29 -03:00
04aa32a762 oauth2-proxy: schedule on worker rpis 2025-12-07 12:49:38 -03:00
25ee698021 oauth2-proxy: ensure error middleware on auth ingress 2025-12-07 12:03:14 -03:00
4a089876ba auth: use internal oauth2-proxy svc for forward-auth 2025-12-07 11:25:29 -03:00
20bb776625 auth: add 401 redirect middleware to oauth2-proxy 2025-12-07 11:14:25 -03:00
5e59f20bc3 auth: point forward-auth to external auth host 2025-12-07 11:09:09 -03:00
dbede55ad4 oauth2-proxy: temporarily drop group restriction 2025-12-07 10:42:13 -03:00
27e5c9391c auth: add namespace-local forward-auth middlewares 2025-12-07 10:25:44 -03:00
8d5e6c267c auth: wire oauth2-proxy and enable grafana oidc 2025-12-07 02:01:21 -03:00
a55502fe27 add oauth2-proxy for SSO forward-auth 2025-12-06 14:42:24 -03:00
598bdfc727 keycloak: restrict to worker rpis with titan-24 fallback 2025-12-06 01:44:23 -03:00
88c7a1c2aa keycloak: require rpi nodes with titan-24 fallback 2025-12-06 01:40:24 -03:00
f4da27271e keycloak: prefer rpi nodes, avoid titan-24 2025-12-06 01:36:33 -03:00
141c05b08f keycloak: honor xforwarded headers and hostname url 2025-12-06 01:23:07 -03:00
f0a8f6d35e keycloak: enable health/metrics management port 2025-12-06 00:51:47 -03:00
1b01052eda keycloak: set fsGroup for data volume 2025-12-06 00:49:17 -03:00
1d346edd28 keycloak: remove optimized flag for first start 2025-12-06 00:43:24 -03:00
b14a9dcb98 chore: drop AGENTS.md from repo 2025-12-06 00:43:17 -03:00
47caf08885 notes: capture GPU share change and flux branch 2025-12-03 12:28:45 -03:00
0db149605d monitoring: show GPU share over dashboard range 2025-12-02 20:28:35 -03:00
f64e60c5a2 flux: add keycloak kustomization 2025-12-02 18:10:20 -03:00
61c5db5c99 flux: track feature/sso 2025-12-02 18:00:49 -03:00
2db550afdd keycloak: add raw manifests backed by shared postgres 2025-12-02 17:58:19 -03:00
65d389193f Merge pull request 'feature/atlas-monitoring' (#3) from feature/atlas-monitoring into main
Reviewed-on: #3
2025-12-02 20:52:35 +00:00
e80505a773 notes: add postgres centralization guidance 2025-12-02 17:36:37 -03:00
762aa7bb0f notes: add sso plan sketch 2025-12-02 17:14:45 -03:00
839fb94836 notes: update monitoring and next steps 2025-12-02 17:01:32 -03:00
6eba26b359 monitoring: show top12 root disks 2025-12-02 15:21:02 -03:00
ace383bedd monitoring: expand worker/control/root rows 2025-12-02 15:15:21 -03:00
b93636ecb9 monitoring: shrink hottest node row height 2025-12-02 15:12:16 -03:00
5df94a7937 monitoring: fix gpu share query and root bar labels 2025-12-02 14:56:36 -03:00
a3dc9391ee monitoring: polish dashboards and folders 2025-12-02 14:41:39 -03:00
eed67b3db0 monitoring: regen dashboards with gpu details 2025-12-02 13:16:00 -03:00
f1d0970aa0 monitoring: mirror dcgm-exporter as multi-arch 2025-12-02 12:36:24 -03:00
e26ef44d1a monitoring: run dcgm-exporter with nvidia runtime 2025-12-02 12:25:30 -03:00
a18c3e6f67 monitoring: always pull dcgm-exporter tag 2025-12-02 12:19:16 -03:00
ee923df567 monitoring: add registry pull secret for dcgm-exporter 2025-12-02 12:07:11 -03:00
d87a1dbc47 monitoring: allow dcgm rollout with unavailable node 2025-12-02 11:59:55 -03:00
5b89b0533e monitoring: use mirrored dcgm-exporter tag 2025-12-02 11:54:53 -03:00
d99bb06eeb monitoring: reenable dcgm exporter 2025-11-20 13:11:13 -03:00
75f6a59316 traefik: use responding timeouts only 2025-11-18 20:01:16 -03:00
630f1f2a81 traefik: extend upload timeouts 2025-11-18 19:43:19 -03:00
e4f93e85d2 monitoring: control-plane stat and namespace share tweaks 2025-11-18 17:09:13 -03:00
f06be37f44 monitoring: refine network metrics and control-plane allowance 2025-11-18 16:18:52 -03:00
c7b7bc7a6d monitoring: adjust overview spacing and net panels 2025-11-18 15:55:24 -03:00
7b2a69cfe3 monitoring: disable dcgm exporter 2025-11-18 15:10:58 -03:00
909cb4ff26 flux: disable wait for monitoring 2025-11-18 15:04:18 -03:00
5a2575d54e flux: scope monitoring health checks 2025-11-18 14:33:24 -03:00
46410c9a9d monitoring: fix dcgm image 2025-11-18 14:19:23 -03:00
ff056551c7 monitoring: refresh overview dashboards 2025-11-18 14:08:33 -03:00
8e6c0a3cfe monitoring: rework gpu share + gauges 2025-11-18 12:11:47 -03:00
497164a1ad monitoring: clean namespace gpu share and layout 2025-11-18 11:42:24 -03:00
fab5552039 monitoring: resolve pie errors and network data 2025-11-18 11:30:33 -03:00
7009a4f9ff monitoring: fix namespace gpu share and network stats 2025-11-18 11:12:03 -03:00
d7e4bcd533 monitoring: add gpu node fallback 2025-11-18 10:47:24 -03:00
ec76563a86 monitoring: source gpu pie from limits and node nets 2025-11-18 01:01:10 -03:00
5144bbe1f2 monitoring: fix gpu pie data and network panels 2025-11-18 00:31:51 -03:00
ac62387e07 monitoring: stabilize namespace pies and labels 2025-11-18 00:19:45 -03:00
2ba642d49f monitoring: add gpu pie and tidy net panels 2025-11-18 00:11:39 -03:00
beb3243839 Revert GPU pie chart additions 2025-11-17 23:42:55 -03:00
aef3176c1c monitoring: fix hottest stats and gpu share 2025-11-17 23:40:22 -03:00
f4dd1de43f monitoring: reorder namespace pies and add gpu data 2025-11-17 23:18:53 -03:00
0708522b28 monitoring: add namespace gpu share 2025-11-17 23:12:16 -03:00
c53c518301 monitoring: express namespace share as cluster percent 2025-11-17 22:58:57 -03:00
442a89d327 monitoring: fix pie colors & thresholds 2025-11-17 22:39:50 -03:00
255e014e0a monitoring: color namespace pies 2025-11-17 22:36:50 -03:00
cc62f497e9 monitoring: fix namespace share percentages 2025-11-17 22:19:01 -03:00
37e51b361b monitoring: normalize namespace share 2025-11-17 22:06:06 -03:00
be6052c47c monitoring: unify namespace share panels 2025-11-17 21:57:40 -03:00
b59677615c monitoring: worker/control-plane splits 2025-11-17 21:48:12 -03:00
76d3dc6ae2 monitoring: restore top1 hottest stats 2025-11-17 21:20:19 -03:00
53427cc8fa monitoring: fix net/io legend labels 2025-11-17 20:19:20 -03:00
b8998a3c6a monitoring: attach nodes to net/io stats 2025-11-17 20:14:11 -03:00
a67a6a1f3a monitoring: tidy hottest node labels 2025-11-17 20:04:50 -03:00
b28e7501b7 monitoring: show hottest node labels 2025-11-17 20:00:40 -03:00
4aece7e5cb monitoring: fix hottest node labels 2025-11-17 19:56:57 -03:00
bcaa0a3327 monitoring: show hottest node names 2025-11-17 19:53:39 -03:00
41e8a6a582 monitoring: reorder overview stats 2025-11-17 19:49:50 -03:00
a1e731e929 monitoring: fix hottest stats and titan-db scrape 2025-11-17 19:38:40 -03:00
fe8deea9c7 monitoring: tighten overview stats 2025-11-17 19:24:03 -03:00
349d9c56ac monitoring: polish dashboards 2025-11-17 18:55:11 -03:00
8f5781d3cf monitoring: rebuild atlas dashboards 2025-11-17 16:27:38 -03:00
a41f25e66d monitoring: restructure grafana dashboards 2025-11-17 14:22:46 -03:00
b004bf99dc monitoring: enrich dashboards 2025-11-16 12:58:08 -03:00
0b1437b77c monitoring: refresh grafana dashboards 2025-11-15 21:03:11 -03:00
eb3991b628 dashboards: improve public view and fix color 2025-11-15 11:59:48 -03:00
46b6b1f3b8 grafana: set datasource uid 2025-11-15 11:35:27 -03:00
683dc84289 grafana: use atlas metrics hostname 2025-11-15 11:18:40 -03:00
d0b6fbe763 victoria-metrics: revert storageclass change 2025-11-15 11:16:37 -03:00
3cfe639387 monitoring: fix domain 2025-11-14 19:13:40 -03:00
418329e173 monitoring: fix ingress and env formats 2025-11-14 08:51:09 -03:00
394fcf2ee4 grafana: use string host format 2025-11-14 08:37:46 -03:00
465103a57e grafana: fix dashboard provider list 2025-11-14 08:33:53 -03:00
c2cb901102 monitoring: fix grafana values 2025-11-14 08:29:59 -03:00
06337f2b9d monitoring: add grafana and alertmanager 2025-11-14 00:02:59 -03:00
a875b0a42e flux-system: track main branch 2025-11-12 01:06:26 -03:00
a08a2189e1 monitoring: disable wait on node-exporter 2025-11-09 14:03:14 -03:00
45f0100784 core: disable wait to unblock reconciliation 2025-11-09 13:46:56 -03:00
d5da49e566 core: remove gpu health gate 2025-11-09 13:37:59 -03:00
e0e27445c7 gpu: drop runtimeClass from minipc plugin 2025-11-09 13:28:40 -03:00
9f61854bc2 monitoring: disable kube-state annotations 2025-11-09 13:20:50 -03:00
ded87979c5 monitoring: clean helm values 2025-11-09 13:16:21 -03:00
538fca4195 monitoring: disable chart prometheusScrape 2025-11-09 13:11:40 -03:00
5ffcfc7d01 monitoring: annotate kube-state svc manually 2025-11-09 13:07:39 -03:00
f958d65528 monitoring: drop duplicate annotations 2025-11-09 13:03:40 -03:00
4197072593 monitoring: reference prometheus repo 2025-11-09 12:59:03 -03:00
d6f0f375b7 core: point flux to infrastructure path 2025-11-09 12:49:54 -03:00
051691e71f platform: fix relative paths 2025-11-09 12:39:32 -03:00
4a709391e6 platform: include cert-manager clusterissuer 2025-11-09 12:38:20 -03:00
1880df2525 chore: fix vmagent relabel indentation 2025-11-09 12:33:11 -03:00
02ed3e3145 fix: flux automation and monitoring config 2025-11-09 12:31:38 -03:00
b59025d495 refactor: restructure atlas flux layout 2025-11-09 11:48:45 -03:00
306b4b8458 pegasus on 2025-10-09 23:26:20 -05:00
2e6f811d12 Merge pull request 'minor tweaks' (#2) from fea/titan24-gpu into main
Reviewed-on: #2
2025-10-10 02:23:01 +00:00
ea08411128 minor tweaks 2025-10-09 21:21:54 -05:00
a09333ba38 Merge pull request 'gpu(titan-24): add RuntimeClass + NVIDIA device-plugin DS; enable containerd nvidia runtime' (#1) from fea/titan24-gpu into main
Reviewed-on: #1
2025-10-09 23:29:26 +00:00
bff6b83d11 gpu(titan-24): add RuntimeClass + NVIDIA device-plugin DS; enable containerd nvidia runtime 2025-10-09 18:28:20 -05:00
a94bd95248 pegasus chill 2025-10-08 04:26:26 -05:00
2c0622583e storageclass update 2025-10-08 03:13:12 -05:00
86490b74c4 asteria corrections 2025-10-08 00:50:42 -05:00
2ef8a7bbc2 jellyfin restart 2025-10-07 23:28:40 -05:00
ae85dcfeaa monitoring add, jellyfin/pegasus update, and traefik tweaks 2025-10-07 23:26:27 -05:00
41292eff0b jellyfin pvc size increase 2025-10-04 09:00:41 -05:00
a69bd45455 fixed jellyfin pv issue 2025-10-04 08:50:56 -05:00
a3a5b1a9bd jellyfin and pegasus in same group 2025-09-18 10:12:08 -05:00
938f6b336c jellyfin and pegasus in same group 2025-09-18 09:55:00 -05:00
3c97a02fa7 jellyfin and pegasus in same group 2025-09-18 09:38:46 -05:00
980892a5b4 jellyfin and pegasus in same group 2025-09-18 08:52:58 -05:00
adf7d7eb31 pegasus 1.2.32 2025-09-18 02:33:37 -05:00
2fe8f7ea6a gavilon to gavilan 2025-09-17 19:12:03 -05:00
c00b760976 added gavilon to account for pegasus 2025-09-17 18:29:33 -05:00
d78fc77825 pegasus 1.2.31 2025-09-17 18:08:49 -05:00
a6ab2b44af pegasus 1.2.31 2025-09-17 09:38:49 -05:00
3a207c7d94 pegasus 1.2.30 2025-09-17 09:09:24 -05:00
d45cf950ec pegasus 1.2.29 2025-09-17 09:00:52 -05:00
193c820fc6 pegasus 1.2.28 2025-09-17 08:52:11 -05:00
c3524cec3d pegasus 1.2.27 2025-09-17 08:21:51 -05:00
f214e394d0 pegasus 1.2.26 2025-09-17 07:57:36 -05:00
07cffbeec0 pegasus 1.2.25 2025-09-17 07:46:48 -05:00
576221c47d pegasus 1.2.24 2025-09-17 07:24:10 -05:00
f63d39e5aa pegasus 1.2.22 2025-09-17 01:33:11 -05:00
48bce52660 pegasus 1.2.22 2025-09-17 01:02:33 -05:00
5b1a209d9a pegasus 1.2.21 2025-09-17 00:08:18 -05:00
5437b985e8 pegasus 1.2.20 2025-09-16 23:10:58 -05:00
f49e341445 pegasus 1.2.17 2025-09-16 22:45:15 -05:00
8c64a4b067 pegasus 1.2.17 2025-09-16 20:08:50 -05:00
7b5001c581 pegasus 1.2.17 2025-09-16 18:02:55 -05:00
fc0c5c1250 pegasus 1.2.16 2025-09-16 17:18:42 -05:00
39fc2aacde pegasus 1.2.15 2025-09-16 16:56:49 -05:00
33f0d67b34 pegasus 1.2.14 2025-09-16 09:53:26 -05:00
48a2a53023 pegasus 1.2.13 2025-09-16 09:12:41 -05:00
269b6cd7ad pegasus 1.2.12 2025-09-16 08:54:32 -05:00
b06b5d7612 pegasus 1.2.11 2025-09-16 08:29:47 -05:00
0f1994c384 pegasus 1.2.10 2025-09-16 07:19:54 -05:00
3df06948a9 pegasus 1.2.9 2025-09-16 05:33:36 -05:00
30ac7e5ac1 pegasus 1.2.8 2025-09-16 04:09:10 -05:00
0b8e4f012a pegasus 1.2.7 - json fix 2025-09-16 03:35:12 -05:00
2eecba7f55 pegasus 1.2.6 - json fix 2025-09-16 03:05:50 -05:00
bd5f1b3a67 mapping to list 2025-09-16 02:36:43 -05:00
9ff70673e3 pegasus updates 1.2.5 2025-09-16 01:55:36 -05:00
755c54f26b pegasus updates 1.2.4 2025-09-16 01:01:23 -05:00
f4588b4304 pegasus updates 2025-09-16 00:06:26 -05:00
e36f7059ea pegasus updates 2025-09-15 22:52:58 -05:00
6deefc514e pegasus updates 2025-09-15 22:40:00 -05:00
33ff3d20aa pegasus updates 2025-09-15 19:55:20 -05:00
65de7602c9 pegasus: pin image digest + command + probes + tls 2025-09-15 13:00:39 -05:00
9b77a89b0d pegasus flux'd 2025-09-15 12:32:52 -05:00
6a86590484 pegasus flux'd 2025-09-15 12:28:56 -05:00
8cc80f695f pegasus fix 2025-09-15 12:09:24 -05:00
50c25b1b92 pegasus on 2025-09-15 02:45:22 -05:00
a85fac9002 zot fix 2025-09-15 02:15:27 -05:00
5bfeffe31f zot fix 2025-09-15 01:03:32 -05:00
8459ea7058 zot middleware add 2025-09-09 11:27:42 -05:00
6efe79819f zot middleware add 2025-09-09 01:43:13 -05:00
33d07dcf5c zot simplification 2025-09-09 01:16:33 -05:00
7257762c45 zot simplification 2025-09-09 00:22:24 -05:00
bff64dba65 zot configmap update 2025-09-08 23:08:32 -05:00
f72dc43f76 zot version pin 2025-09-08 22:52:41 -05:00
47a73af27e zot troubleshooting 2025-09-08 22:25:41 -05:00
1ee60d9534 zot middleware fix 2025-09-08 21:58:50 -05:00
63d82af268 jitsi corrections 2025-09-07 14:31:53 -05:00
47cbc9b9f6 pegasus corrections 2025-09-07 13:34:06 -05:00
001e9c36fe jitsi setup 2025-09-07 13:20:49 -05:00

View File

@ -0,0 +1,11 @@
# services/jitsi/secret.yaml
apiVersion: v1
kind: Secret
metadata:
name: jitsi-internal-secrets
namespace: jitsi
type: Opaque
data:
JICOFO_COMPONENT_SECRET: bEg5Y09hZFJBem5PUFliQlp4RHkwRTRP
JICOFO_AUTH_PASSWORD: VVkyUmczaVRDWUZ0MzdQdmN3UDN1SFc5
JVB_AUTH_PASSWORD: d0M5aWJ4dWlPTnhFak9lRHJqSHdYa0g5