Reliability Lead the creation of reusable, enterprise-wide Azure DevOps CI/CD templates for ML assets, ensuring consistency... service deployments. Establish platform wide SRE practices for ML systems, including reliability, scalability, and cost...
. You will define and enforce DevSecOps and Site Reliability Engineering (SRE) standards and best practices, enabling secure, reliable.../SLOs, implementing robust monitoring and alerting, and partnering with product and engineering teams to balance reliability...
Reliability Engineering (SRE), privacy, and Machine Learning (ML) teams. Detection Algorithm Enablement: Architect the core data... a Distinguished Engineer to lead the architecture, scalability, reliability, and advanced detection algorithms of Palo Alto Networks...
across treasury and payments platforms (telemetry, tracing, logging, metrics, SRE practices, refactor/replatform plans). Lead... architecture reviews, set technical objectives, and ensure consistent engineering patterns across teams. And provide expert...
practical knowledge of Site Reliability Engineering (SRE) principles, chaos engineering, and advanced Observability tooling (e.g... boundaries. You will promote a culture of engineering excellence, and strike the right balance between lending expertise...
looking for a seasoned Senior Site Reliability Engineer (SRE) to ensure the reliable deployment, operation, and continuous improvement... budgets, and reliability targets for each factory site; drive root-cause analysis and post-mortems for incidents. Lead...
– Lead, mentor, and develop a team of Site Reliability Operations Engineers across all levels (1,2,3, & Senior). Foster... leadership role responsible for overseeing the Site Reliability Operations (SRO) team that provides 24/7 monitoring and support...
to an engineering-driven organization that emphasizes reliability, scalability, and automation. Position Description Site... Reliability Engineering (SRE) blends software engineering and systems administration to design, develop, and manage large-scale...
Familiarity with site reliability engineering (SRE) best practices Strong analytical, problem-solving, and organizational skills... your career. THE ROLE We are seeking a Technical Program Manager to lead the execution of AMD Data Center Graphics hardware...
also considered). Professional Experience: 3–4 years of experience in SRE, DevOps, or Platform Engineering roles. Technical Skill..., transformation, and routing using OpenTelemetry (OTel). DevX Advocacy: Partner with software engineering teams to understand...
. Ensure adherence to security, compliance, and regulatory standards. Apply Site Reliability Engineering (SRE) principles... will be deeply involved across the full SDLC and bring a site reliability mindset while ensuring secure, scalable, and highly...
AI-driven operations (AIOps), predictive analytics, DevOps, SRE, and platform engineering practices to boost uptime... (with ability to travel on-site for leadership, incident response, and strategic engagements) • Dallas, TX possible Position...
in technical leadership roles within production support, Site Reliability Engineering (SRE), or technical operations, specifically... for Instant Payments We are seeking a highly experienced and technically hands-on Global Support Manager to lead the production...
and operational processes. DevOps and Site Reliability Advance DevOps and SRE practices across engineering and platform teams... IT, Cloud Operations, DevOps, Site Reliability Engineering, and IT Security, and plays a critical role in enabling...
practical knowledge of Site Reliability Engineering (SRE) principles, chaos engineering, and advanced Observability tooling (e.g.... You will promote a culture of engineering excellence, and strike the right balance between lending expertise and providing an inclusive...
% requirement for the site. \n Responsibilities Staff Network Engineering - AWS and Hybrid Cloud AWS VPC Engineering Design... flexible work arrangement policy requires that a minimum of 60%, or 24 hours, of your total work week be on-site. Your specific...
, and Reliability Lead the strategic adoption of AI/ML methods as a superpower for scale and speed. Target automation...: We are seeking an exceptional Principal Security Architect to join eBay's Security Engineering leadership team at a critical...
with strong knowledge of site reliability engineering, observability tooling, and large-scale distributed systems. Position... Partner with SRE, Product, and Engineering teams to translate reliability goals into measurable ML objectives and maintain...
and accessible for all. We're searching for a Staff Client Platform Engineer to join our Enterprise Site Reliability (SRE) team... to lead the management of our enterprise Windows, Mac, and Ubuntu clients. The team is responsible for designing, implementing...
and accessible for all. We're searching for a Staff Client Platform Engineer to join our Enterprise Site Reliability (SRE) team... to lead the management of our enterprise Windows, Mac, and Ubuntu clients. The team is responsible for designing, implementing...