Find your dream job NOW!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Site Reliability / Observability Engineer, Location: USA

Page: 12

Staff Site Reliability Engineer - Eng

junior team members and serve as a champion for Site Reliability Engineering best practices. - Actively participate..., service delivery, reliability, and automation, including the definition and monitoring of service health indicators (latency...

Location: Lowell, MA
Posted Date: 07 Feb 2026

Site Reliability Engineer

, Vercel, Plaid, and hundreds of others. About the Site Reliability Engineering Team The Site Reliability Engineering (SRE... teams. We embed reliability into everything we do-whether it's designing scalable systems, improving observability...

Company: WorkOS
Location: USA
Posted Date: 20 Jan 2026

Site Reliability Engineer - Hardware Infrastructure

At NVIDIA, Site Reliability Engineering provides a rare chance to define, develop, and support large-scale production... to guarantee flawless service operation with consistent reliability and uptime. As an SRE here, you will be part of a welcoming...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 17 Jan 2026

Senior Site Reliability Engineer

the availability, reliability, efficiency, observability, and performance of products while also driving consistency... issues impacting performance or functionality of Live Site service and escalates as necessary. Reviews and writes issues...

Company: Microsoft
Location: USA
Posted Date: 19 Dec 2025

MTS - Site Reliability Engineer

Reliability & Availability: Ensure uptime, resiliency, and fault tolerance of AI model training and inference systems.... Observability: Design and maintain monitoring, alerting, and logging systems to provide real-time visibility into model serving...

Company: Microsoft
Location: Redmond, WA
Posted Date: 18 Dec 2025

Site Reliability Engineer

Build and improve CI/CD pipelines in GitHub Actions to reduce manual steps and increase deployment reliability Use... dashboards, alerts, and reliability improvements using Prometheus and Grafana Partner with development teams to automate...

Company: CPS Solutions
Location: Eden Prairie, MN
Posted Date: 12 Mar 2026
Salary: $72800 - 130000 per year

Engineer Lead, Site Reliability

in system design consulting, platform management, and capacity planning. Improve reliability, quality, and time-to-market... sustainable systems and services through automation and uplifts. Balance feature development speed and reliability with well...

Company: FIS
Location: Jacksonville, FL
Posted Date: 11 Mar 2026

Senior Site Reliability Engineer I

This is a developed professional level role for an SRE. Individuals are responsible for challenging reliability and toil reduction... Contributes to SRE knowledge documentation Functional Competencies/Technical Skills: Design for Reliability Can support...

Company: LexisNexis
Location: California
Posted Date: 06 Mar 2026
Salary: $95300 - 158800 per year

Senior Site Reliability Engineer I

This is a developed professional level role for an SRE. Individuals are responsible for challenging reliability and toil reduction... Contributes to SRE knowledge documentation Functional Competencies/Technical Skills: Design for Reliability Can support...

Company: RELX
Location: California
Posted Date: 06 Mar 2026
Salary: $95300 - 158800 per year

Senior Site Reliability Engineer (Upmarket)

time, take increasing responsibility for leading incidents end-to-end. Improve operational reliability: Identify... recurring issues and reliability risks, and drive fixes through better alerting, automation, system changes, or process...

Company: Heidi
Location: San Francisco, CA
Posted Date: 28 Feb 2026

Site Reliability Engineer, Search Infrastructure - USDS

for observability and automation across complex, large-scale service mesh architectures. Responsibilities Engage in and improve the... and refinement Deliver tools/software to improve the reliability and scalability of services, automate operations and improve R...

Company: TikTok
Location: San Jose, CA
Posted Date: 26 Feb 2026

Site Reliability Engineer, Search Infrastructure - USDS

for observability and automation across complex, large-scale service mesh architectures. Responsibilities Engage in and improve the... and refinement Deliver tools/software to improve the reliability and scalability of services, automate operations and improve R...

Company: TikTok
Location: Seattle, WA
Posted Date: 26 Feb 2026

Site Reliability Engineer, Search Infrastructure - USDS

for observability and automation across complex, large-scale service mesh architectures. Responsibilities Engage in and improve the... and refinement Deliver tools/software to improve the reliability and scalability of services, automate operations and improve R...

Company: TikTok
Location: San Jose, CA
Posted Date: 26 Feb 2026

Site Reliability Engineer, Search Infrastructure - USDS

for observability and automation across complex, large-scale service mesh architectures. Responsibilities: Engage in and improve the... and refinement Deliver tools/software to improve the reliability and scalability of services, automate operations and improve R...

Company: TikTok
Location: Seattle, WA
Posted Date: 25 Feb 2026

Site Reliability Engineer, Search Infrastructure - USDS

for observability and automation across complex, large-scale service mesh architectures. Responsibilities: Engage in and improve the... and refinement Deliver tools/software to improve the reliability and scalability of services, automate operations and improve R...

Company: TikTok
Location: Seattle, WA
Posted Date: 25 Feb 2026

Site Reliability Engineer, Recommendation Infrastructure - USDS

and be responsible for observability and automation across complex, large-scale service mesh architectures. Responsibilities Engage..., operation and refinement Deliver tools/software to improve the reliability and scalability of services, automate operations...

Company: TikTok
Location: San Jose, CA
Posted Date: 22 Feb 2026

Site Reliability Engineer - CTJ - POLY

Owns reliability architecture and end-to-end service understanding (dependencies, failure modes, and customer journeys... readiness criteria. Drives cross-team reliability reviews and recommends design changes, runbooks, and safe rollout/rollback...

Company: Microsoft
Location: Virginia
Posted Date: 22 Feb 2026

Site Reliability Engineer, Recommendation Infrastructure - USDS

and be responsible for observability and automation across complex, large-scale service mesh architectures. In order to enhance... and refinement Deliver tools/software to improve the reliability and scalability of services, automate operations and improve R...

Company: TikTok
Location: Seattle, WA
Posted Date: 21 Feb 2026

Cloud Site Reliability Engineer

Description Responsibilities Implement observability tooling to monitor AWS EKS-based systems focusing... on performance, reliability, and scalability. Participate in on-call rotations, providing critical support as needed. Ensure timely...

Location: Yarmouth, ME
Posted Date: 14 Feb 2026
Salary: $93547 - 152629 per year

Senior Cloud Site Reliability Engineer

career opportunity that will light a fire within you. The Senior Cloud SRE works to improve the reliability... and occurrence of outages. A Typical Day Might Include the Following: Create a new dashboard to provide observability...

Company: NICE Systems
Location: Sandy, UT
Posted Date: 08 Feb 2026