requirements for observability, alerting, and maintenance to ensure smooth operations. Additionally, you will deliver... in DevOps or SRE type roles Strongly proficient in utilizing cloud services like AWS, Azure, or Google Cloud Platform...
planning experience Experience with agile software development Preferred certifications: SRE, Cloud Architect/Engineer (OCI...Job Category: Product Development Job Description: Role Summary As a Senior Site Reliability Engineer (SRE...
architect reliability frameworks, drive automation across incident response and observability, and collaborate with engineering... and product teams to embed SRE principles into every layer of the stack. This role offers the excitement of solving real-world...
. Knowledge of SRE, observability, and reliability engineering. Leadership & Core Competencies Strong communication... with NYL's security and regulatory requirements. Architect resilient, scalable, secure, cloud-native applications. Lead...
certifications such as AWS Certified Solutions Architect Kubernetes and container orchestration experience Familiarity with SRE... of software delivery practices and solutions that optimize for fast feedback, observability, and operational excellence Evaluate...
, you will be instrumental in defining and implementing advanced observability, resiliency, and recoverability solutions. You will engage deeply... strong hands-on technical leadership in designing, implementing, and continuously improving observability, resiliency...
, observability, reliability engineering, and technical debt management Familiarity with DevOps, SRE, security, and compliance... PM, Solution Architect, or Engineering Lead) Bachelor’s degree in Computer Science, Engineering, Data/Information...
on for more details. ROLE AND RESPONSIBILITIES: A Senior Site Reliability Engineer (SRE) is expected to own the operational stability... hybrid cloud infrastructure across Nutanix HCI, AWS, and GCP platforms Architect and implement multi-cloud solutions...
operations, automation, observability, and AIOps to ensure mission-critical distributed systems deliver exceptional uptime... excellence and reduce toil through automation. Key Responsibilities Cloud Platform, Network & Infrastructure Architect...
. Build and maintain a comprehensive observability stack, including distributed tracing, metrics, and logging (e.g...., Prometheus, Grafana, Jaeger, ELK Stack). Collaborate with cross-functional teams, including DevOps and SRE, to ensure the...
. Responsibilities Architect and build scalable backend services that handle millions of transactions reliably. Own the full... systems are tuned for high availability, fault tolerance, and observability. Collaborate with cross-functional teams (Product...
will be primarily on-site with residency commutable to one of our offices required. Responsibilities · As a Lead Engineer of the SRE... / Production Operations team for FedNow, you will operate the production environment for the program. · You will architect...
-native CI/CD, OTEL observability, cost monitoring). Modern IaC & App Delivery: Architect and enforce Terraform standards... like multi-region resiliency, observability standardization, cost anomaly monitoring, and testing in production. What you'll...
with strong knowledge of site reliability engineering, observability tooling, and large-scale distributed systems. Position... and autonomous action safety Architect vector database solutions (Milvus, pgvector, Qdrant) for semantic search and RAG to enable...
Network Engineer to architect, secure, and scale the hybrid network infrastructure behind our high-traffic digital properties... to the Director of Network Engineering, collaborating closely with cloud, security, DevOps, SRE, and application engineering...
. Architect and optimize solutions leveraging AWS services such as S3, EC2, RDS, CloudFront, Lambda, Athena, EKS, Route53..., and infrastructure-as-code principles. Drive system reliability engineering (SRE) practices, ensuring high availability, scalability...
. - Implement and evolve observability across logs, metrics, traces, dashboards, and alerting systems. - Apply SRE principles... to infrastructure, automation, and reliability. - Architect and evolve our AWS ecosystem - multi-region, multi-account, secure...