and operate observability stack (Prometheus, Grafana, Loki, Tempo) and integrate with external tools (Datadog, Dynatrace, Grafana... Cloud). - Implement application observability and Real User Monitoring (RUM) to improve client experience. - Automate...
and DevOps. Job responsibilities Lead and mentor a team of SRE/DevOps engineers, fostering a culture of collaboration... on experience in SRE/DevOps roles, with a strong background in cloud infrastructure, automation, and CI/CD. Hands-on practical...
of experience in SW Industry. Hands-on experience with end-to-end system Observability implementation. Expertise in Monitoring...
Infra and Observability Engineer Role TypeFull Time The opportunity We are the only professional services organization... years of experience in Observability and Cloud Infrastructure across Azure and AWS (with GCP as an added advantage...
operational overhead. Collaborate across teams, engineers, promote observability best practices, and partner with DevOps, SRE... observability Lead Engineer to design, maintain, and optimize our observability platform. The ideal candidate will be an expert...
Managers and L2, L3 Skillset Steer, Guide and modernize the observability capability to scale up for the Future Hosting Model... Transition & Transform all ABB Managed Servers, Infrastructure and Network device Observability and build & run core...
, secure, and cost-effective observability solutions that support our global operations. As the Sr. consultant SRE.... Responsibilities Lead SRE and DevOps operations during APAC hours, ensuring alignment with project objectives, delivery timelines...
, and SRE teams to define observability standards and KPIs. Enable proactive incident detection and root cause analysis through...Responsibilities : Key Responsibilities: End-to-end observability solutions across applications and infrastructure...
, CloudWatch, Splunk, ELK) with AI models for intelligent alerting and diagnostics. Collaborate with SRE, DevOps, and platform... integrated with observability platforms. Utilize Agentic AI and GenAI technologies to automate and enhance decision-making...
-impact solutions. You will define SRE best practices, drive automation, observability, incident response, performance...Job Description Summary As a Principal Engineer in Site Reliability Engineering (SRE), you'll be a technical leader...
of modern SRE and DevOps practices. You will be responsible for ensuring high availability, reliability, scalability... Job: We are looking for a passionate and experienced Site Reliability Engineer (SRE) to join our Cloud Platform team. The ideal candidate will have hands...
Reliability Engineer (SRE) for the IT Observability and Automation Team, you will be responsible for ensuring the reliability... and industry tools (CNCF, DevOps, CI/CD, Secrets Management, Container Registries, Service Mesh, etc.). Experience deploying...
of businesses worldwide.: Qualifications EXP :- B3 band (8+ years) Must to have: SRE Ops, AWS Cloud Infra, DevOps, Linux..., Observability/Automation, CI/CD, Kubernetes/Docker Good to have: Tools extensive knowledge (like AppDynamics, Nagios, Splunk...
learning. What You May Need to be Successful 10+ years of experience in software engineering, DevOps, or SRE roles... organization. Contributions to open-source SRE or DevOps tools Why First Advantage is Your Next Big Career Move First...
learning. What You May Need to be Successful 10+ years of experience in software engineering, DevOps, or SRE roles... organization. Contributions to open-source SRE or DevOps tools Why First Advantage is Your Next Big Career Move First...
, automation, and operational excellence while managing high-performing DevOps/SRE Responsibilities: Leadership & Team.... Required Qualifications :· Experience & Leadership Proven leadership experience in SRE, DevOps, or Production Engineering roles, with 3...
Must have - 6 -10+ years of experience in SRE, DevOps, or Infrastructure roles. - Expertise in Kubernetes (EKS or self-managed... on AWS (EKS, EC2, S3, IAM, VPC, etc.). - Ensure observability through logging, monitoring, and alerting systems (e.g...
improving application availability, automating deployments, and ensuring data systems meet SLAs through observability... environments. Implement Infrastructure as Code (IaC) using Terraform or Ansible. Work with Data Engineering and DevOps...
is to ensure availability, data accuracy, and performance of analytics systems while driving automation and observability... observability tools (Monte Carlo, Bigeye, or custom monitoring). Participate in incident response and root-cause analysis...
engineering, DevOps, or SRE experience in enterprise environments CI/CD Expertise: Hands-on experience with Jenkins, GitLab... and infrastructure-as-code solutions using modern DevOps toolchains Developer Experience: Create self-service tools and frameworks...