Reliability Engineering, DevOps, or Infrastructure Engineering roles. Strong proficiency in Kubernetes, Docker, and container...Reliability & Availability: Ensure uptime, resiliency, and fault tolerance of AI model training and inference systems...
in system design consulting, platform management, and capacity planning. Improve reliability, quality, and time-to-market... sustainable systems and services through automation and uplifts. Balance feature development speed and reliability with well...
This is a developed professional level role for an SRE. Individuals are responsible for challenging reliability and toil reduction... Contributes to SRE knowledge documentation Functional Competencies/Technical Skills: Design for Reliability Can support...
This is a developed professional level role for an SRE. Individuals are responsible for challenging reliability and toil reduction... Contributes to SRE knowledge documentation Functional Competencies/Technical Skills: Design for Reliability Can support...
time, take increasing responsibility for leading incidents end-to-end. Improve operational reliability: Identify... recurring issues and reliability risks, and drive fixes through better alerting, automation, system changes, or process...
Owns reliability architecture and end-to-end service understanding (dependencies, failure modes, and customer journeys... readiness criteria. Drives cross-team reliability reviews and recommends design changes, runbooks, and safe rollout/rollback...
on performance, reliability, and scalability. Participate in on-call rotations, providing critical support as needed. Ensure timely... problems. Collaborate with product support teams to improve reliability, drive efficiency, and enhance customer experience...
. Reliability: Ensure the reliability, availability, and performance of applications and services. Develop and track new service.... Identify and drive improvements in reliability, performance, and efficiency through data and root cause analysis. Participate...
deployments. Reliability: Ensure the reliability, availability, and performance of applications and services. Develop and track.... Identify and drive improvements in reliability, performance, and efficiency through data and root cause analysis. Participate...
utilization, and data flow. Implement infrastructure-as-code, CI/CD pipelines, and reliability standards across thousands... of nodes. Diagnose performance bottlenecks and drive continuous improvements in reliability, latency, and throughput...
are multiple ways to solve most technical problems and are willing to debate the trade-offs. Have become a stronger engineer..., and better monitoring. In the future there will be new architectural or coding problems that we will need an experienced engineer...
Payrate: $61.00 - $62.00/hr. Summary: We are seeking a skilled and proactive Senior DevOps Engineer/SRE... and templates for deployment automation Required Experience: 8 years in DevOps/SRE roles Hands-on experience with Octopus...
network of the future using Cisco’s latest networking solutions. We leverage a DevOps approach to deploy, monitor, and manage... operations. Familiarity with DevOps principles and agile practices. Commitment to quality-driven development. Basic...
in Ember.js front-end applications. Execute OS-level patching and package updates in coordination with DevOps... of remediated services. Collaborate with security and DevOps teams to meet accelerated remediation targets. Maintain documentation...
DevOps Software Engineering role, you will design, develop, and test Java code in a Linux, Agile, DevOps environment... written code and make modifications as necessary. Ensure software performance, reliability, and scalability. Participate...
in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the... outcomes. Demonstrates and champions site reliability culture and practices and exerts technical influence Leads initiatives...
organization. A strong technical background as a hands-on software engineer or site reliability engineer prior to moving...About the Role: Site Reliability Engineering (SRE) at Tubi is not a traditional operations team. We are a software...
engineering and operations to create highly scalable and fault-tolerant systems. At Opentext, Site Reliability Engineer position... to design and implement DevOps toolchains in large, complex organizations. Experience running Kubernetes across multiple...
& Site Reliability (SRE), you will lead DevSecOps and SRE practices for CAS Engineering, including release management for 7... Cloud DevOps Engineer, CISSP/CCSP. Domain experience in Pharmacy Benefit Management, Health Insurance, Clinical Utilization...
of Cloud Operations, you will guide a team of Site Reliability Engineers, drive automation, and own the operational health... within strict government confines. You will: Lead, mentor, and grow a high-performing team of Site Reliability Engineers...