pipelines, observability, release management, and reliability engineering (SRE). Innovation & Customer Focus: Understand... management. DevOps: building and operating CI/CD pipelines, infrastructure as code (IaC), observability (logs/metrics/traces...
—recognized globally for its innovation, scale, and engineering excellence. As a Lead Software Engineer, you’ll help architect... As a Lead Software Engineer (Java/Python) and drive large-scale digital transformation. You will architect and modernize...
observability to manage complex routing, failover, and multi-region connectivity. - AI/ML & GPU Infrastructure: Architect large... teams spanning platform engineering, SRE, networking/traffic, storage and databases, data infrastructure, and GPU/ML...
scalable and resilient infrastructure on AWS. Architect and maintain Windows/Linux based environments, ensuring seamless... AI adoption at the platform level. Implement observability, security, data privacy and cost-optimization strategies specifically...
Collaborate with SRE and engineering teams to improve reliability, observability, and operational efficiency. Participate..., and end-user support. This role is rooted in modern IT Operations but works closely with our SRE team to improve reliability...
enterprise-grade CI/CD pipelines end-to-end. Architect and manage highly available cloud infrastructure on AWS / Azure / GCP... monitoring, logging, and observability using Prometheus, Grafana, ELK, Splunk, Datadog, etc. Manage production issues, identify...
Reliability Engineering (SRE) best practices into our workflows. Key Responsibilities Cloud-Native Development Architect... in processes, tools, and technologies to maintain a competitive edge. Implement monitoring and observability solutions (e.g...
requirements for observability, alerting, and maintenance to ensure smooth operations. Additionally, you will deliver... in DevOps or SRE-type roles. Strongly proficient in using cloud services like AWS, Azure, or Google Cloud Platform...
with SRE and application teams to keep releases smooth and stable. The mandate is reliable services, disciplined change..., and HQ facilities. Reliability & Release support: Partner with SRE to define SLIs/SLOs, harden CI/CD paths, and reduce MTTR...
, and observability. DevOps & CI/CD Establish CI/CD pipelines using GitHub Actions/GitLab CI/Azure DevOps. Implement progressive... & System Design Develop end-to-end product flows and robust system architectures. Architect secure, multi-cloud...
Job Category: Product Development Job Description: Architect Operational Processes: Design and implement scalable... and root cause analysis for critical issues. Capacity and Performance Management: Architect and implement systems to monitor...
· Architect and implement networking for hybrid K3s clusters. · Configure and optimize CNIs (Multus, Cilium, Calico) for multi... with CRDs and controllers. · Implement network policies, encryption, and observability (Cilium Hubble, Prometheus...
the charge in redefining CI/CD for modern cloud development. In this critical role, you will architect the data-driven... spanning pipeline orchestration, infrastructure-as-code, observability, and developer experience. Persona-Driven Product...
on continuous improvement and learning. What You Will Do: Design, architect, and build secure, scalable IAM services using..., including requirements analysis, architecture, development, testing, deployment, observability, and long-term reliability...
Splunk-based observability and security analytics solutions across enterprise environments. The ideal candidate... (AWS, Azure, GCP) and third-party tools (e.g., Datadog, ServiceNow). Collaborate with DevOps, SRE, and Security teams...
, building production systems that leverage LLMs and AI agents to accelerate developer workflows. You'll architect and implement... their own AI developer tools such as deployment pipelines, observability integration, security controls, and operational tooling...
availability, resilience, and observability for our mission-critical mobility infrastructure serving millions of transactions..., and improving the observability foundation that will enable Metropolis to scale to new markets while maintaining 99.9%+ uptime...
in incident response, deployment automation, observability, and capacity planning—leveraging modern DevOps/SRE methodologies... Engineering, DevOps, or SRE roles within healthcare, medical device, or life sciences industries. Expertise in containerization...