of SRE principles, observability, or automation tools Nice to Have Hands-on technical background (former software..., observability, automation tools) Drive platform adoption across engineering teams through customer-obsessed product thinking...
beyond maintaining servers. You'll architect, automate, and secure infrastructure, improve resilience, and embed best-in-class DevOps... and SRE practices into workflows. You'll collaborate across engineering, product, and security to translate requirements...
a Platform Reliability Engineering (PRE) or Site Reliability Engineering (SRE) function focused on proactive monitoring..., automation, and resilience. Implement enterprise monitoring, observability, and event management tools (e.g., Splunk, ServiceNow...
) to architect and manage the cloud infrastructure behind this blockchain-powered payment network. This role involves building highly... automation and testing workflows Set up observability systems for blockchain events, transactions, and system performance Use...
developer experience. Platform Development: Architect and implement scalable platform components and services... cloud-native infrastructure (Azure, AWS, or GCP). Observability & Reliability: Implement monitoring, logging, and alerting...
, observability, and storage services. We build tooling to perform automated operations in order to scale the FortiCNAPP...'s infrastructure, ensuring it meets the demands of our customers and supports rapid growth. Responsibilities: Architect...
closely with R&D development teams What You'll Bring to Us: 4+ years of experience in DevOps / CIE / SRE roles. A B.Sc... / Kubernetes Experience in cloud-oriented/automation software development Ability to architect, build and manage complex systems...
with Argo CD. Architect and evolve a self-serve MLOps platform (standards, templates, CLI/scaffolds) enabling repeatable.... Integrate telemetry and observability (logging, metrics, tracing) and establish SLOs for model services. Monitor model and data...
Reports to the OpenSearch Development Lead / Observability Architect. Works closely with the Axiom, OneConsole, and SRE... pipelines supporting T-Mobile’s observability and analytics ecosystem. This role is responsible for migrating critical network...
. Architect and optimize solutions leveraging AWS services such as S3, EC2, RDS, CloudFront, Lambda, Athena, EKS, Route53..., and infrastructure-as-code principles. Drive system reliability engineering (SRE) practices, ensuring high availability, scalability...
Architect to lead our DevOps strategy and architecture while managing a distributed team of DevOps engineers. This role combines... deep technical leadership with future-state thinking-you will architect enterprise-scale solutions, champion modern secure...
Certifications: ITIL, SRE Foundation, DevOps Leader, TOGAF, AWS/GCP/Azure Architect-level Tools & Technology Exposure Cloud...- and agent-facing platforms. The ideal candidate has deep expertise in DevOps, Site Reliability Engineering (SRE), and IT...
Certifications: ITIL, SRE Foundation, DevOps Leader, TOGAF, AWS/GCP/Azure Architect-level Tools & Technology Exposure Cloud...- and agent-facing platforms. The ideal candidate has deep expertise in DevOps, Site Reliability Engineering (SRE), and IT...
has designed and implemented large-scale, reusable, and secure IaC frameworks, not just consumed them. Observability Architect... and passionate people like you to help us! THE ROLE: At FloSports, SRE is the team that acts as a force multiplier...