. As a Staff SRE, you will be a technical leader on a highly skilled and senior team. You will be a key driver of our architecture... leadership, and close collaboration with other Staff and Senior engineers to set the technical direction for the entire...
for you, why not say hello? About the Role This is a full-time staff engineering position based in the U.S. What you will be doing... Slack’s worldwide infrastructure and adopt new cloud technologies. Drive improvements in system observability, reliability...
, Salesforce and NetSuite Improve systems observability and invest in reliability & correctness of data Develop customer empathy... or internal tools in the past Flexible and adaptive in a fast-paced startup environment Passion for observability...
complex systems into manageable components Drive for observability to understand performance and be able to diagnose...
, and utilities to improve observability, cost monitoring, workload efficiency, user, or administration experience. Document...
end-to-end observability and operations through metrics, tracing, logging, dashboard development, monitoring...
automation, data quality and observability for the pipelines using technologies such as TestNG, Junit, PyTest, Terratest...
integrating observability tools (APM, centralized logging, metrics) and security controls (e.g., IAM, encryption, scanning...
, and observability) that balance scale, cost, and operational simplicity. Build core infrastructure. Implement and own platform features... (e.g., transformation frameworks, feature stores, real-time ingestion pipelines, lineage and observability) that power...
and enhance system observability through effective monitoring, logging, tracing, and alerting strategies. Stay abreast... with observability tooling, chaos testing, and incident management. Excellent influencing, problem-solving, and analytical skills...
and architectural conversations Recommending observability and alerting configurations The SRE team benefits from experience... automation, observability, and configuration management development and product experience The SRE team is seeking seasoned...
solutions, and patterns that improve the availability, reliability, efficiency, observability, and performance of products...
and caching. Is skilled in observability and reliability practices: defining SLOs, implementing metrics/alerts, debugging...
-on with observability (metrics, tracing, logs) and model evaluation frameworks. Bachelor's Degree in Computer Science or related technical...
Deep technical expertise A passion for automation and observability Fluency in distributed systems Creativity to design...
, reconciliation, and failure recovery at global scale. Drive Technical Excellence: Set the bar for reliability, observability...
). Strong proficiency in distributed system design, architecture, performance optimization, observability, reliability engineering...