1838 Commits

Author SHA1 Message Date
2509d8876a maintenance(ariadne): default jenkins cleanup to safe dry-run 2026-04-12 12:32:20 -03:00
8331411b93 jenkins: use emptyDir workspace for default agents 2026-04-12 05:54:26 -03:00
369e738cf3 jenkins: run default jnlp agent as root and fix default label 2026-04-12 05:47:06 -03:00
4da6b0da3a jenkins: run data-prepper from main and default publish repo to monitoring 2026-04-12 05:26:45 -03:00
6276bd037c maintenance/ariadne: add jenkins workspace cleanup schedule and RBAC 2026-04-12 04:47:19 -03:00
4f6ca60521 jenkins: add atlasbot and soteria pipeline jobs 2026-04-12 04:35:12 -03:00
3774b600ee scheduling: keep app workloads off control-plane 2026-04-12 04:27:43 -03:00
246424762e maintenance: remove pi-usb-scratch guard rollout 2026-04-12 01:04:23 -03:00
63e89b51f0 maintenance(pi-usb-scratch): disable rollout jitter for initial cutover 2026-04-11 12:00:46 -03:00
3cb1582adc maintenance(pi-usb-scratch): fix false mount conflict detection 2026-04-11 11:58:00 -03:00
3ea296b552 maintenance: enforce Astraios + tmpfs /tmp on worker Pis 2026-04-11 11:55:49 -03:00
5e39164fcd maintenance: add worker pi usb scratch rollout 2026-04-11 11:55:49 -03:00
52d4709dd9 jenkins: run titan-iac glue tests with jenkins SA and vm fqdn 2026-04-10 17:28:29 -03:00
15dfbb728c jenkins: add direct titan-iac pipeline job on main 2026-04-10 17:12:46 -03:00
370ece5b60 testing(ci): centralize quality gate contract 2026-04-10 17:11:02 -03:00
9f088577b1 ci: publish titan-iac tests and seed ananke/lesavka jobs 2026-04-10 16:40:05 -03:00
b723382ff4 dashboards: unify suite pass-rate metrics on platform counters 2026-04-10 16:39:55 -03:00
9485541d2c logging/ci: make data-prepper image publish opt-in 2026-04-10 06:53:14 -03:00
434b586970 logging/ci: default push_latest when scm build omits params 2026-04-10 06:42:25 -03:00
34e0183b48 logging/ci: publish data-prepper smoke test metrics 2026-04-10 06:24:51 -03:00
3bc1a7eb40 jellyfin: remove retired oidc pipeline artifact 2026-04-10 06:07:59 -03:00
fb510e89ee jenkins: retire unused ci-demo and jellyfin-oidc jobs 2026-04-10 05:59:52 -03:00
301d084695 jenkins: make jellyfin pipeline shell flags POSIX-safe 2026-04-10 05:45:33 -03:00
628e204fc5 jenkins: fix pipeline portability and jellyfin option compatibility 2026-04-10 05:41:00 -03:00
bebe91d39b jenkins: add scm polling trigger for jellyfin oidc pipeline 2026-04-10 05:31:41 -03:00
d65b8e7a32 logging: fix groovy-safe awk matcher in data-prepper metrics 2026-04-10 05:23:35 -03:00
62aae6ffb2 jenkins: wire full quality-gate metrics across platform jobs 2026-04-10 05:19:25 -03:00
32b6e55467 monitoring: use CI-only series for platform test success panels 2026-04-10 04:52:57 -03:00
99eda351df monitoring/jenkins: add pegasus CI job and separate health probe suite 2026-04-10 03:26:51 -03:00
5f4641553c monitoring: replace failure table with 24h suite pass snapshot 2026-04-09 20:16:44 -03:00
530f440679 monitoring: add suite probe metrics and align fan labels 2026-04-09 20:10:52 -03:00
5e3aadc640 monitoring: set overview platform test panel to 7d 2026-04-09 20:05:10 -03:00
12b85f4597 monitoring: add platform quality push gateway for test metrics 2026-04-09 19:30:16 -03:00
ad1cbd6f85 monitoring: make test panel point-based and failure-by-suite 2026-04-09 19:27:48 -03:00
5cf9a16d97 monitoring: align overview panels with jobs and point-based suite rates 2026-04-09 16:35:14 -03:00
f8c1243dfd monitoring: add generic suite metric slots for platform tests 2026-04-09 16:16:35 -03:00
7b0e9acbb1 monitoring: make suite pass rate 30d rolling for sparse tests 2026-04-09 16:14:26 -03:00
0273727cb4 monitoring: make platform test success one line per suite 2026-04-09 15:21:59 -03:00
09fa3e716c monitoring/atlas: merge top rows and fix platform test pass-rate panel 2026-04-09 14:56:43 -03:00
293cd83999 monitoring/atlas: resize test/ops rows and source overview tests from atlas-jobs 2026-04-09 13:39:55 -03:00
764bfe189e monitoring/recovery: harden ananke checks and OIDC-gated service validation 2026-04-09 01:44:26 -03:00
e0b124ca4e monitoring: switch power telemetry to ananke metrics 2026-04-08 23:33:17 -03:00
cfdd5a377d atlasbot: keep retrying MAS login during transient Synapse outages 2026-04-07 13:09:36 -03:00
9a07aa9be9 keycloak: make metis ssh db key optional during migration 2026-04-07 04:40:56 -03:00
a4631dee81 maintenance: migrate metis ssh key names to ananke 2026-04-07 04:36:42 -03:00
525a0f9e71 harbor/bootstrap: pin via dynamic host label managed by recovery script 2026-04-06 21:32:43 -03:00
d168f02c7f harbor/recovery: remove fixed titan-05 pin and auto-select ready arm64 node 2026-04-06 21:27:23 -03:00
5e387e8e4d maintenance/metis: remove legacy hecate ssh key vars 2026-04-06 19:43:16 -03:00
1ccb04a18a maintenance/metis: default missing ananke ssh keys to empty 2026-04-06 19:36:01 -03:00
25ea022c2e maintenance/metis: migrate ssh key vars to ananke 2026-04-06 19:28:44 -03:00