health reviews and process simplification. - Incident management and prevention: lead postmortems/RCAs, coordinate fixes.... - Automation: eliminate toil by automating operational workflows, recovery procedures, code delivery, and configuration management...
-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, and effective... will be a hands-on engineer who can create Terraform modules, write Lambda functions, and maintain observability using ELK...
JD :- 8 years of experience in Java/.NET based application support like Issues Resolution and Incident management...
environments 8 years’ hands on experience with Windows/Linux Servers. Proactive incident management with Azure/AWS/GCP-based... sustainable growth. Role Profile In this role, you'll be joining our Site Reliability Engineering Team within Cloud...
of ideas and perspectives at AHEAD. The Cloud/DevOps Engineer delivers platform capabilities and automation that speed up... development while improving reliability and security. You'll own GitHub Actions workflows, Helm/Kubernetes deployments, and cloud...
, reliability, and data integrity. Lead incident response and RCA (Root Cause Analysis) for any database-related issues and outages... helps companies attract, engage, and retain top talent. Role Summary We are looking for a Principal DBOps Engineer...
for our clients. By delivering the combined power of our distinctive investment management capabilities, we provide a wide range... continuity of Invesco. Your Role: The Senior Cloud Security Engineer, based in Hyderabad, India, is responsible for designing...
, or PowerShell Hands-on experience with incident management and troubleshooting in large-scale environments Solid understanding... as an Infrastructure SRE, supporting large-scale enterprise systems. As a Software Engineer III at JPMorgan Chase within the Global ESX...
Job Description Summary Staff Software Engineer - DevOps will be responsible for providing build and release strategy... software development, its effect on build management and releasing the builds across versions and environments...
Job Description Summary We are seeking a highly skilled and experienced Staff DevOps Engineer to join our Smart Factory... and RDS databases for performance and reliability. Enforce security best practices in cloud environments and advocate...
Job Description Summary Staff Software Engineer - DevOps, will be responsible for providing build and release strategy... software development, its effect on build management and releasing the builds across versions and environments...
Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission... best practices for monitoring, alerting, and incident management. Oversee the management of high-severity incidents, driving quick...
more on Cubic.com. Job Details: Principal Platform Engineer – FinOps We are seeking an exceptional Principal Platform Engineer.... Lead by example in incident response and post-mortem culture, turning failures into platform improvements...
performance monitoring (APM) and user monitoring is essential. Sound knowledge of ITSM process, SI/SLO/SLA management, incident... development in the business. About the Role In this opportunity as a lead solutions engineer, you will: Project...
and/or technical project management including: 4+ years of experience as a Site Reliability Engineer (SRE), building and managing...Job Description Role Title:Principal Engineer Position Summary: TheEngineering Manager Site Reliability Engineer...
, problem, and change management processes. Use enterprise ticketing systems (ServiceNow, Jira) for incident management...Job Description Production Support Engineer Mobile (Android) About Us: Marriott International Inc., headquartered...
to medium bugs Participate in on-call rotations and adhere to ITIL-based incident, problem, and change management processes... improvement of monitoring, alerting, and troubleshooting workflows. Adhere to ITIL-based incident, problem, and change management...
, and incidents. Incident & Change Management - Manage incident resolution and root cause analysis for critical operational issues..., Incident, Problem and Change management) Familiarity with various Agile methodologies esp. Scrum and Kanban Complete...
, DevOps, and SREs, to optimize system observability, and improve our incident response capabilities. Observability Engineer..., and event-driven systems. Proven track record of handling on-call rotations and incident management workflows in high...
adherence to change management policies for thorough documentation and compliance. Incident and Request Management...: Maintain alignment with organizational requirements for incident management, including SLA and SLT compliance...