1994 Commits

SHA1 Message Date
a4631dee81 maintenance: migrate metis ssh key names to ananke 2026-04-07 04:36:42 -03:00
525a0f9e71 harbor/bootstrap: pin via dynamic host label managed by recovery script 2026-04-06 21:32:43 -03:00
d168f02c7f harbor/recovery: remove fixed titan-05 pin and auto-select ready arm64 node 2026-04-06 21:27:23 -03:00
5e387e8e4d maintenance/metis: remove legacy hecate ssh key vars 2026-04-06 19:43:16 -03:00
1ccb04a18a maintenance/metis: default missing ananke ssh keys to empty 2026-04-06 19:36:01 -03:00
25ea022c2e maintenance/metis: migrate ssh key vars to ananke 2026-04-06 19:28:44 -03:00
a5f405432b hecate: add bootstrap bundle manifests and helper build scripts 2026-04-06 05:01:17 -03:00
e269829dc6 hecate: add controlled drill checklist to runbook 2026-04-06 04:59:49 -03:00
d880fac673 hecate: harden titan-24 cleanup and ups telemetry 2026-04-06 04:55:54 -03:00
fc4093a910 logging: raise opensearch heap headroom 2026-04-06 02:04:07 -03:00
7619bba5d9 traefik: define cluster ingress class 2026-04-06 02:00:22 -03:00
816d0cca65 traefik: isolate custom rbac from k3s cleanup 2026-04-06 01:57:34 -03:00
801dde8242 maintenance: harden k3s traefik disable cleanup 2026-04-06 01:47:32 -03:00
1e891de7e8 recovery: load colocated kubeconfig on remote hosts 2026-04-06 01:05:18 -03:00
b7f6317fd2 recovery: make cluster power console self-contained 2026-04-06 01:02:30 -03:00
aa447e6996 harbor: restore internal arm64 image refs for recovery bootstrap 2026-04-06 00:50:29 -03:00
99bd68f61b recovery: unblock harbor cold start and add power console 2026-04-06 00:22:54 -03:00
a097c36718 core: decouple coredns image from harbor for bootstrap recovery 2026-04-05 18:33:21 -03:00
2a9485d9e0 maintenance: disable ariadne vault auth/oidc policy sync cron 2026-04-05 17:40:40 -03:00
2799b54b08 maintenance: pin metis to available image tag 2026-04-05 17:05:31 -03:00
3ce7b2eeb7 maintenance/monitoring: wire reciprocal metis hecate key + dampen alert flapping 2026-04-05 13:51:57 -03:00
8d1be9672c maintenance/metis: bump runner tags to 0.1.0-23 2026-04-05 11:41:02 -03:00
deb52c424b maintenance/vault: move Metis runtime secrets to Vault 2026-04-05 11:31:05 -03:00
0828f0cf9e maintenance: inject metis SSH keys directly from Vault 2026-04-05 10:31:20 -03:00
e84399d0b1 maintenance: source metis SSH keys from Vault 2026-04-05 10:25:29 -03:00
1c9716d855 maintenance: pass bastion key into metis env 2026-04-05 10:18:13 -03:00
0fc5ac3041 maintenance/metis: read optional ssh pubkeys from secret env 2026-04-05 10:07:09 -03:00
a1cd9fe7a7 flux: order gitea/keycloak after vault and postgres 2026-04-05 01:16:26 -03:00
168e390f20 nextcloud: pin workload to worker rpi5 nodes 2026-04-04 16:07:17 -03:00
96bc93670b monitoring(power): rename hecate UPS peers to Pyrphoros and Statera 2026-04-04 05:54:16 -03:00
82e1b87b8f monitoring(overview): refine ups-climate row and climate/fan stat display 2026-04-04 04:40:22 -03:00
1b682cc60f monitoring(grafana): restart to pick up latest overview layout 2026-04-04 04:35:26 -03:00
5059d2918d monitoring(overview): swap jobs and power rows; tighten climate/fan display 2026-04-04 04:34:18 -03:00
d5fc6c89c4 monitoring(grafana): bump restart revision for overview dashboard reload 2026-04-04 01:34:36 -03:00
55b96c0675 monitoring(overview): place six power/climate panels on one row and fix test/job data 2026-04-04 01:33:15 -03:00
cdc3c081f5 monitoring(overview): replace power/climate summary row with six-panel layout 2026-04-03 22:16:02 -03:00
7ef4c895ba monitoring(grafana): bump restart revision to reload provisioned dashboards 2026-04-03 20:54:12 -03:00
69a02a3352 monitoring(power): implement six-panel UPS and climate layout 2026-04-03 20:45:40 -03:00
4167f0f988 monitoring(power): add UPS status snapshot table and climate placeholders 2026-04-03 17:53:42 -03:00
fd71c6644b monitoring(power): wire generated power dashboard and split per-UPS panels 2026-04-03 17:49:09 -03:00
7ae4746d10 monitoring: scope hecate power queries to hecate-power job 2026-04-03 15:23:27 -03:00
bc9bf0310a monitoring: add power dashboard and reorder atlas overview rows 2026-04-03 14:55:16 -03:00
e418183f56 maintenance(metis): roll deployment after config update 2026-04-02 01:27:23 -03:00
8e711c4666 maintenance(metis): raise media size ceiling to 1TB 2026-04-02 01:26:38 -03:00
0c2f769875 maintenance(metis): re-enable titan-24 flash host list 2026-04-02 01:22:49 -03:00
1e3352b94a maintenance: roll metis runtime to 0.1.0-21 2026-04-01 13:16:04 -03:00
db357dca2c maintenance: roll metis runtime to 0.1.0-20 2026-04-01 13:00:59 -03:00
7d0c474f4d maintenance: enable jetson sources and roll metis to 0.1.0-19 2026-04-01 12:53:50 -03:00
5ec3b2b992 maintenance: roll metis runtime to 0.1.0-18 2026-04-01 12:47:27 -03:00
aacaee2052 maintenance: roll metis runtime to 0.1.0-17 2026-04-01 12:30:04 -03:00