matter expert and an incident lead during the incident response process Initiate and contribute to continuous improvement... for multiple POS systems infrastructure and developer experiences. The team is at the helm of providing a stable, reliable...
or GitHub Actions. Operate production platforms with ownership for monitoring, alerting, incident response, capacity planning..., and post incident RCA. Enforce security and governance through Azure AD, RBAC, Key Vault, network segmentation, encryption...
, and build observability that powers real-time insight and rapid incident response. Our work spans L4/L7 load balancing, service... Traffic team is responsible for the reliability, performance, and security of network traffic across our edge and core...
dashboards, problem elimination and incident response. This is an operationally focused DevOps role requiring participation... release velocity whilst delivering at scale without compromising security or reliability. Reporting directly to the Data...
, Datadog) to enable real-time visibility and incident response Lead incident investigation and resolution for production... infrastructure issues and lead incident response across highly available, large-scale systems Benefits: Extended health...
developer will be playing a key role in the platform QA to ensure that development standards are adhered to and platform... platform applications (ex. Incident & MIM, Change, Problem, and custom applications). What will you do: Lead the...
developer will be playing a key role in the platform QA to ensure that development standards are adhered to and platform... platform applications (ex. Incident & MIM, Change, Problem, and custom applications). What will you do: Lead the...
to accelerate development, improve consistency, and increase developer velocity. Own operational excellence, including incident... response, root-cause analysis, and long-term reliability improvements. Advocate for scalability, performance, security...
specializing in Security Operations (SecOps), particularly in Security Incident Response (SIR) and Vulnerability Response (VR). The..., design, and delivery of ServiceNow Security Incident Response and Vulnerability Response solutions. Translate business...
· Lead incident response, root cause analysis, and continuous improvement for platform-related issues · Define and enforce..., observability, and incident response. Excellent interpersonal skills: collaboration, stakeholder management, mentoring...
-based platforms; lead incident response and disaster recovery readiness. Define and execute the roadmap for scalability... and addressing scalability issues with the platform as a whole. Recommend security enhancements for triage/risk assessment. Oversee...
-based platforms; lead incident response and disaster recovery readiness. Define and execute the roadmap for scalability... and addressing scalability issues with the platform as a whole. Recommend security enhancements for triage/risk assessment. Oversee...
and manage incident response, escalation processes, and documentation for AI-related operational issues. Drive continuous... Developer-Java Developer-UX/UI/Graphic Financial Analyst HR-Admin/Recruiter/Learning and Development IT Security-Analyst...
infrastructure and experimental platforms Operational Excellence & Incident Response Participate in on-call rotations and serve... platforms. Candidates should be comfortable with incident response responsibilities and working in a fast-paced production...
response workflows. Contributions to infrastructure projects, open-source systems, or developer tooling that improved... protocols, erasure coding, and data placement algorithms. Experience with production monitoring, observability, and incident...
recovery/incident response protocols, and business continuity planning. Team Leadership and Development Develop leaders... security service providers. Ensure alignment with applicable industry frameworks and regulatory standards (e.g., ITIL, ISO...
rotation and collaborate with SRE to ensure production reliability through proactive issue identification and rapid incident... response Implement observability in your code (metrics, logging, tracing) and work with monitoring tools to track application...
, deployment, monitoring, and incident response. Design and implement comprehensive monitoring, logging, and alerting solutions... community, united by the relentless pursuit to help keep people safer everywhere. Our critical communications, video security...
. Participate in incident response for platform-level issues as part of a globally distributed, sustainable on-call rotation... infrastructure. Strong troubleshooting and incident response skills across distributed systems and cloud-native environments...
. Participate in on-call rotations and incident response. Collaborate with engineering teams to identify and automate operational... role, you’ll play a key part in strengthening the foundation of our cloud and developer platforms, with a focus...