, Loki, ELK, Graylog). Lead incident response activities, define SLOs/SLIs, and optimize alerting and monitoring pipelines..., and security hardening. Implement automation for provisioning and operations using Ansible, Bash/Python, and GitOps workflows...