Commit Graph

  • 60446ee830 testing(ci): centralize quality gate contract Brad Stein 2026-04-10 17:06:53 -03:00
  • 9f088577b1 ci: publish titan-iac tests and seed ananke/lesavka jobs Brad Stein 2026-04-10 16:38:55 -03:00
  • b723382ff4 dashboards: unify suite pass-rate metrics on platform counters Brad Stein 2026-04-10 15:35:20 -03:00
  • c38b6c5e27 ci: publish titan-iac tests and seed ananke/lesavka jobs Brad Stein 2026-04-10 16:38:55 -03:00
  • 9419c4b26b dashboards: unify suite pass-rate metrics on platform counters Brad Stein 2026-04-10 15:35:20 -03:00
  • 9485541d2c logging/ci: make data-prepper image publish opt-in Brad Stein 2026-04-10 06:53:14 -03:00
  • c7cd723ee8 chore(bstein-dev-home): automated image update flux-bot 2026-04-10 09:46:06 +00:00
  • 4e177873c2 chore(bstein-dev-home): automated image update flux-bot 2026-04-10 09:44:05 +00:00
  • 434b586970 logging/ci: default push_latest when scm build omits params Brad Stein 2026-04-10 06:42:25 -03:00
  • 34e0183b48 logging/ci: publish data-prepper smoke test metrics Brad Stein 2026-04-10 06:24:51 -03:00
  • 3bc1a7eb40 jellyfin: remove retired oidc pipeline artifact Brad Stein 2026-04-10 06:07:59 -03:00
  • 2ca43b9c8b chore(maintenance): automated image update flux-bot 2026-04-10 09:04:45 +00:00
  • 9ca75d3fb3 jenkins: trigger data-prepper rerun on main Brad Stein 2026-04-10 06:03:39 -03:00
  • fb510e89ee jenkins: retire unused ci-demo and jellyfin-oidc jobs Brad Stein 2026-04-10 05:59:52 -03:00
  • 301d084695 jenkins: make jellyfin pipeline shell flags POSIX-safe Brad Stein 2026-04-10 05:45:33 -03:00
  • 628e204fc5 jenkins: fix pipeline portability and jellyfin option compatibility Brad Stein 2026-04-10 05:41:00 -03:00
  • 914a48e4f5 jenkins: trigger pipeline runs after controller refresh Brad Stein 2026-04-10 05:37:51 -03:00
  • bebe91d39b jenkins: add scm polling trigger for jellyfin oidc pipeline Brad Stein 2026-04-10 05:31:41 -03:00
  • d65b8e7a32 logging: fix groovy-safe awk matcher in data-prepper metrics Brad Stein 2026-04-10 05:23:35 -03:00
  • 62aae6ffb2 jenkins: wire full quality-gate metrics across platform jobs Brad Stein 2026-04-10 05:19:25 -03:00
  • 32b6e55467 monitoring: use CI-only series for platform test success panels Brad Stein 2026-04-10 04:52:57 -03:00
  • 99eda351df monitoring/jenkins: add pegasus CI job and separate health probe suite Brad Stein 2026-04-10 03:26:51 -03:00
  • 5f4641553c monitoring: replace failure table with 24h suite pass snapshot Brad Stein 2026-04-09 20:16:44 -03:00
  • 530f440679 monitoring: add suite probe metrics and align fan labels Brad Stein 2026-04-09 20:10:52 -03:00
  • 5e3aadc640 monitoring: set overview platform test panel to 7d Brad Stein 2026-04-09 20:05:10 -03:00
  • 12b85f4597 monitoring: add platform quality push gateway for test metrics Brad Stein 2026-04-09 19:30:16 -03:00
  • ad1cbd6f85 monitoring: make test panel point-based and failure-by-suite Brad Stein 2026-04-09 19:27:48 -03:00
  • 5cf9a16d97 monitoring: align overview panels with jobs and point-based suite rates Brad Stein 2026-04-09 16:35:14 -03:00
  • f8c1243dfd monitoring: add generic suite metric slots for platform tests Brad Stein 2026-04-09 16:16:35 -03:00
  • 7b0e9acbb1 monitoring: make suite pass rate 30d rolling for sparse tests Brad Stein 2026-04-09 16:14:26 -03:00
  • 0273727cb4 monitoring: make platform test success one line per suite Brad Stein 2026-04-09 15:21:59 -03:00
  • 09fa3e716c monitoring/atlas: merge top rows and fix platform test pass-rate panel Brad Stein 2026-04-09 14:56:43 -03:00
  • 293cd83999 monitoring/atlas: resize test/ops rows and source overview tests from atlas-jobs Brad Stein 2026-04-09 13:39:55 -03:00
  • 764bfe189e monitoring/recovery: harden ananke checks and OIDC-gated service validation Brad Stein 2026-04-09 01:41:02 -03:00
  • e0b124ca4e monitoring: switch power telemetry to ananke metrics Brad Stein 2026-04-08 23:33:17 -03:00
  • cf6252c55a docs: restore titan-iac README scope and split ananke guidance Brad Stein 2026-04-08 19:22:56 -03:00
  • cfdd5a377d atlasbot: keep retrying MAS login during transient Synapse outages Brad Stein 2026-04-07 13:09:36 -03:00
  • fa160f5f9b ananke: harden recovery checks and finalize naming migration Brad Stein 2026-04-07 12:30:28 -03:00
  • cc316c472b ananke: harden recovery checks and finalize naming migration (feature/atlasbot-ananke-recovery) Brad Stein 2026-04-07 12:30:28 -03:00
  • 9a07aa9be9 keycloak: make metis ssh db key optional during migration Brad Stein 2026-04-07 04:40:56 -03:00
  • a4631dee81 maintenance: migrate metis ssh key names to ananke Brad Stein 2026-04-07 04:34:39 -03:00
  • 525a0f9e71 harbor/bootstrap: pin via dynamic host label managed by recovery script Brad Stein 2026-04-06 21:32:43 -03:00
  • d168f02c7f harbor/recovery: remove fixed titan-05 pin and auto-select ready arm64 node Brad Stein 2026-04-06 21:27:23 -03:00
  • 5e387e8e4d maintenance/metis: remove legacy hecate ssh key vars Brad Stein 2026-04-06 19:43:16 -03:00
  • 1ccb04a18a maintenance/metis: default missing ananke ssh keys to empty Brad Stein 2026-04-06 19:36:01 -03:00
  • 25ea022c2e maintenance/metis: migrate ssh key vars to ananke Brad Stein 2026-04-06 19:28:44 -03:00
  • a5f405432b hecate: add bootstrap bundle manifests and helper build scripts Brad Stein 2026-04-06 05:01:17 -03:00
  • e269829dc6 hecate: add controlled drill checklist to runbook Brad Stein 2026-04-06 04:59:37 -03:00
  • c1dc50cace hecate: add controlled drill checklist to runbook Brad Stein 2026-04-06 04:59:37 -03:00
  • d880fac673 hecate: harden titan-24 cleanup and ups telemetry Brad Stein 2026-04-06 04:47:05 -03:00
  • 65de56b2ac hecate: harden titan-24 cleanup and ups telemetry Brad Stein 2026-04-06 04:47:05 -03:00
  • 31f5709929 hecate: add cluster power recovery tooling Brad Stein 2026-04-06 04:21:04 -03:00
  • fc4093a910 logging: raise opensearch heap headroom Brad Stein 2026-04-06 02:04:07 -03:00
  • 7619bba5d9 traefik: define cluster ingress class Brad Stein 2026-04-06 02:00:22 -03:00
  • 816d0cca65 traefik: isolate custom rbac from k3s cleanup Brad Stein 2026-04-06 01:57:34 -03:00
  • 801dde8242 maintenance: harden k3s traefik disable cleanup Brad Stein 2026-04-06 01:47:14 -03:00
  • 1e891de7e8 recovery: load colocated kubeconfig on remote hosts Brad Stein 2026-04-06 01:05:18 -03:00
  • b7f6317fd2 recovery: make cluster power console self-contained Brad Stein 2026-04-06 01:02:30 -03:00
  • aa447e6996 harbor: restore internal arm64 image refs for recovery bootstrap Brad Stein 2026-04-06 00:50:29 -03:00
  • 99bd68f61b recovery: unblock harbor cold start and add power console Brad Stein 2026-04-06 00:22:54 -03:00
  • a097c36718 core: decouple coredns image from harbor for bootstrap recovery Brad Stein 2026-04-05 18:33:21 -03:00
  • 2a9485d9e0 maintenance: disable ariadne vault auth/oidc policy sync cron Brad Stein 2026-04-05 17:40:40 -03:00
  • 2799b54b08 maintenance: pin metis to available image tag Brad Stein 2026-04-05 17:05:31 -03:00
  • 3ce7b2eeb7 maintenance/monitoring: wire reciprocal metis hecate key + dampen alert flapping Brad Stein 2026-04-05 13:51:57 -03:00
  • 8d1be9672c maintenance/metis: bump runner tags to 0.1.0-23 Brad Stein 2026-04-05 11:41:02 -03:00
  • deb52c424b maintenance/vault: move Metis runtime secrets to Vault Brad Stein 2026-04-05 11:31:05 -03:00
  • 0828f0cf9e maintenance: inject metis SSH keys directly from Vault Brad Stein 2026-04-05 10:31:20 -03:00
  • e84399d0b1 maintenance: source metis SSH keys from Vault Brad Stein 2026-04-05 10:25:29 -03:00
  • 1c9716d855 maintenance: pass bastion key into metis env Brad Stein 2026-04-05 10:18:13 -03:00
  • 0fc5ac3041 maintenance/metis: read optional ssh pubkeys from secret env Brad Stein 2026-04-05 10:07:09 -03:00
  • a1cd9fe7a7 flux: order gitea/keycloak after vault and postgres Brad Stein 2026-04-05 01:16:26 -03:00
  • 168e390f20 nextcloud: pin workload to worker rpi5 nodes Brad Stein 2026-04-04 16:07:17 -03:00
  • adef1df856 monitoring(power): rename hecate UPS peers to Pyrphoros and Statera Brad Stein 2026-04-04 05:54:16 -03:00
  • 96bc93670b monitoring(power): rename hecate UPS peers to Pyrphoros and Statera Brad Stein 2026-04-04 05:54:16 -03:00
  • d8bcd24dac monitoring(overview): refine ups-climate row and climate/fan stat display Brad Stein 2026-04-04 04:40:22 -03:00
  • 82e1b87b8f monitoring(overview): refine ups-climate row and climate/fan stat display Brad Stein 2026-04-04 04:40:22 -03:00
  • 8b9c5ee5ed monitoring(grafana): restart to pick up latest overview layout Brad Stein 2026-04-04 04:35:26 -03:00
  • 1b682cc60f monitoring(grafana): restart to pick up latest overview layout Brad Stein 2026-04-04 04:35:26 -03:00
  • 12ce0f8e2a monitoring(overview): swap jobs and power rows; tighten climate/fan display Brad Stein 2026-04-04 04:34:18 -03:00
  • 5059d2918d monitoring(overview): swap jobs and power rows; tighten climate/fan display Brad Stein 2026-04-04 04:34:18 -03:00
  • 3c21b470ed monitoring(grafana): bump restart revision for overview dashboard reload Brad Stein 2026-04-04 01:34:36 -03:00
  • d5fc6c89c4 monitoring(grafana): bump restart revision for overview dashboard reload Brad Stein 2026-04-04 01:34:36 -03:00
  • ab903e5619 monitoring(overview): place six power/climate panels on one row and fix test/job data Brad Stein 2026-04-04 01:33:15 -03:00
  • 55b96c0675 monitoring(overview): place six power/climate panels on one row and fix test/job data Brad Stein 2026-04-04 01:33:15 -03:00
  • c5acc3dc13 monitoring(overview): replace power/climate summary row with six-panel layout Brad Stein 2026-04-03 22:16:02 -03:00
  • cdc3c081f5 monitoring(overview): replace power/climate summary row with six-panel layout Brad Stein 2026-04-03 22:16:02 -03:00
  • 07a4515a6f monitoring(grafana): bump restart revision to reload provisioned dashboards Brad Stein 2026-04-03 20:54:12 -03:00
  • 7ef4c895ba monitoring(grafana): bump restart revision to reload provisioned dashboards Brad Stein 2026-04-03 20:54:12 -03:00
  • 758654c9df monitoring(power): implement six-panel UPS and climate layout Brad Stein 2026-04-03 20:45:40 -03:00
  • 69a02a3352 monitoring(power): implement six-panel UPS and climate layout Brad Stein 2026-04-03 20:45:40 -03:00
  • e199d20a3e monitoring(power): add UPS status snapshot table and climate placeholders Brad Stein 2026-04-03 17:53:42 -03:00
  • 4167f0f988 monitoring(power): add UPS status snapshot table and climate placeholders Brad Stein 2026-04-03 17:53:42 -03:00
  • 0eb4cc6550 monitoring(power): wire generated power dashboard and split per-UPS panels Brad Stein 2026-04-03 17:49:09 -03:00
  • fd71c6644b monitoring(power): wire generated power dashboard and split per-UPS panels Brad Stein 2026-04-03 17:49:09 -03:00
  • c406cba89d monitoring: scope hecate power queries to hecate-power job Brad Stein 2026-04-03 15:23:27 -03:00
  • 7ae4746d10 monitoring: scope hecate power queries to hecate-power job Brad Stein 2026-04-03 15:23:27 -03:00
  • 40dce5ee49 monitoring: add power dashboard and reorder atlas overview rows Brad Stein 2026-04-03 14:55:16 -03:00
  • bc9bf0310a monitoring: add power dashboard and reorder atlas overview rows Brad Stein 2026-04-03 14:55:16 -03:00
  • 65da91ae42 maintenance(metis): roll deployment after config update Brad Stein 2026-04-02 01:27:23 -03:00
  • e418183f56 maintenance(metis): roll deployment after config update Brad Stein 2026-04-02 01:27:23 -03:00