1007 Commits

Author SHA1 Message Date
98d405bc42 monitoring: regenerate dashboards with expanded infra namespaces 2026-01-11 23:55:43 -03:00
879ff7c16b monitoring: fix infra scopes and add jetson metrics 2026-01-11 23:46:24 -03:00
84cc7de437 mailu: use postmark token for relay auth 2026-01-11 19:01:31 -03:00
0e36e8ce12 knowledge: relocate metis doc; monitoring: add cpu high alert 2026-01-11 08:59:51 -03:00
f500e81606 monitoring: maintenance panels, extra alerts, update overview 2026-01-11 02:28:39 -03:00
25907da229 monitoring: remove titan-16 and add titan-20/21 to worker dashboards 2026-01-11 02:20:47 -03:00
4a01632f6b monitoring: add alert rules and include titan-20/21 in dashboards 2026-01-11 02:02:47 -03:00
ea7f1bfb5a monitoring: fix grafana alerting root policy 2026-01-11 01:40:07 -03:00
b89aa57a13 monitoring: allow smtp sync to get target secret 2026-01-11 00:32:41 -03:00
8f03fbcd5c monitoring: fix smtp sync image reference 2026-01-11 00:30:45 -03:00
6ac61e7b44 monitoring: wire grafana smtp sync and alerting provisioning 2026-01-11 00:29:20 -03:00
dc80d09018 maintenance: run image sweeper on all nodes 2026-01-10 23:57:26 -03:00
6d16d20240 maintenance: fix image sweeper script indentation 2026-01-10 20:26:46 -03:00
1e7c5567ad maintenance: sweep unused images on arm workers 2026-01-10 20:20:54 -03:00
d7c4bf19ff logging: tune rpi4 image gc and rpi5 prune 2026-01-10 06:57:07 -03:00
40ebe52ced logging: tune kubelet image GC on rpi5 2026-01-10 06:22:56 -03:00
f75e91dbf4 logging: extend fluent-bit helm timeout 2026-01-10 05:55:45 -03:00
cdcb6f5604 logging: add data-prepper pull secret 2026-01-10 05:52:16 -03:00
6f436022ca logging: force data-prepper repo override 2026-01-10 05:42:39 -03:00
a7ce64adba logging: use streaming repo for data-prepper 2026-01-10 05:28:03 -03:00
ed32416975 logging: use kaniko debug image 2026-01-10 05:22:27 -03:00
198fc0bb20 logging: drop timestamps option from data-prepper job 2026-01-10 05:15:19 -03:00
7a00f813f7 logging: add rpi5 log retention tuning 2026-01-10 05:06:34 -03:00
e25c8e3701 logging: add Jenkins build for data-prepper 2026-01-10 05:01:17 -03:00
17ab7762f1 logging: pin otel collector image 2026-01-10 00:16:41 -03:00
c887aaeecf logging: add trace analytics ingestion 2026-01-10 00:13:59 -03:00
flux-bot
76cc512859 chore(bstein-dev-home): automated image update 2026-01-10 03:05:43 +00:00
flux-bot
a4815195e8 chore(bstein-dev-home): automated image update 2026-01-10 03:03:44 +00:00
9c2f2631ce logging: seed OpenSearch observability 2026-01-09 23:58:12 -03:00
flux-bot
887dada6b6 chore(bstein-dev-home): automated image update 2026-01-10 02:05:39 +00:00
flux-bot
8de57506e8 chore(bstein-dev-home): automated image update 2026-01-10 02:04:39 +00:00
ea6d1e0baa logging: expand OpenSearch dashboards 2026-01-09 22:55:39 -03:00
cd1c5232cc logging: add OpenSearch dashboards generator 2026-01-09 22:20:36 -03:00
ec4e491fa5 logging: force dark theme in dashboards 2026-01-09 21:17:08 -03:00
1bfc48fce1 logging: throttle fluent-bit backfill 2026-01-09 18:18:58 -03:00
e37c1e6a41 logging: force opensearch replicas to 0 2026-01-09 18:17:02 -03:00
66d8b98b50 logging: manage opensearch pvc size 2026-01-09 18:11:32 -03:00
a8da8731d0 logging: remove loki and backfill to opensearch 2026-01-09 18:08:39 -03:00
dc9d396b37 logging: extend dashboards helm timeout 2026-01-09 09:07:40 -03:00
f404f22be9 logging: fix opensearch ism job yaml 2026-01-09 09:01:15 -03:00
5653e1fb0e logging: pin opensearch to rpi5 2026-01-09 09:00:25 -03:00
a581029a58 logging: pin opensearch ISM job to rpi 2026-01-09 08:58:48 -03:00
9242efd8c6 keycloak: fix logs oauth2 cookie secret 2026-01-09 08:57:13 -03:00
3dcf40449b logging: fix dashboards cpu limits 2026-01-09 08:55:39 -03:00
abc6e45d17 logging: add opensearch dashboards ui 2026-01-09 08:54:07 -03:00
a9410b0c20 logging: route oauth2-proxy via loki gateway 2026-01-09 08:07:46 -03:00
1e9e6c7f0b logging: keep loki canary on rpi5 workers 2026-01-09 07:26:12 -03:00
91e3b4e96b logging: pin loki canary to rpi5 nodes 2026-01-09 07:19:59 -03:00
86e3682781 logging: shrink loki caches for rpi nodes 2026-01-09 07:16:10 -03:00
f335a8fa68 logging: fix oauth2 scope and pin loki to rpi 2026-01-09 07:12:40 -03:00