Tasks Data Infrastructure & Operations Offer flexible and secure data ingestion, streaming, transformation, analytics and data lake storage paired with self-service compute & ML workspaces so that in-house data teams can spin up service...
recovery strategies; validate restore procedures and ensure readiness for mission-critical environments. Apply SRE... deployment reliability, and prevent recurring incidents. Instrument and enhance observability using monitoring/APM stacks (e.g...
for your individual contributions, and a variety of benefit options for you to choose from. Job Title: Development Expert (Principal... Engineer (Development Expert) with deep Java/J2EE expertise to join the SuccessFactors Application Engineering team...
implementing observability using Prometheus, Grafana, AWS CloudWatch, and OpenTelemetry. AWS Security skills, including IAM... and AWS Config rules. Exposure to OpenTelemetry-based distributed tracing and SRE concepts such as SLIs, SLOs, and error...
Set up observability tools (Prometheus, Grafana, ELK, etc.) for metrics, logging, and alerting. Ensure high availability...) Country: India City: Bangalore Required Skills & Qualifications Experience: 5+ years in DevOps/SRE or related...
, SRE & DevOps methodologies, and infrastructure automation."– VP, Software Engineering. What You’ll Contribute Cloud AWS... and integrations with Security Scanning software to ensure compliance Observability & IDP; Implement and support Observability...
, and continuous improvement About You: 5+ years in SRE, DevOps, or Systems Engineering roles Expert-level Linux/UNIX... We are seeking a Senior Site Reliability Engineer (SRE) to support our infrastructure, production data, and mission...
We are looking for a Site Reliability Engineering (SRE) Technical Lead to help drive the adoption of modern SRE practices within the CIAM... organisation at LSEG. This role is deeply technical and hands‑on, designed for an expert engineer who can set the standard...
and observability requirements for all new Prisma Access features. Identify opportunities and create prototype tools to improve support... efficiency through built-in diagnostics, telemetry, and AI/ML applications. Develop and deliver expert-level training materials...
criteria. Collaborate closely with cross‑functional teams (development, SRE/DevOps, product) to clarify requirements... capacity/scalability testing. Familiarity with environment provisioning and observability tooling (ArgoCD, Terraform...
motivated and experienced Site Reliability Engineer (SRE) with 7 to 12 years of experience to manage, scale, and ensure the high..., and ensure data sync. Handle backups of databases, logs, and configurations. Monitoring & Observability: Implement and manage...
observability, monitoring, and operational best practices. Ensure effective incident response, disaster recovery, and business... recommend cost‑optimization strategies. Technical Leadership & Collaboration Act as a cloud subject‑matter expert...
across the full SDLC, including Agile, DevOps, testing automation, observability, and SRE practices. High-ownership, self... secure, reliable, and performant products for our guests and partners. We embrace the philosophies of Agile, DevOps, and SRE...
Riverbed. Empower the Experience Riverbed, the leader in AI observability, helps organizations optimize their user... of experience in data collection and AI and machine learning, Riverbed’s open and AI-powered observability platform and solutions...
Architecture Facilitate SRE/Engineering teams to create, deploy, and manage secure, scalable infrastructure across AWS and GCP.... Observability & Reliability Maintain a robust monitoring stack using Prometheus, Grafana, and ELK/Sumo Logic/Coralogix. Implement...
that facilitate rapid, reliable, and automated deployment and rollback of microservices. Implement and manage observability tools..., ensuring optimal resource utilization and cost efficiency. Mentorship & Collaboration Act as a subject matter expert (SME...
What you’ll do (Job Responsibilities): Enable SRE support and monitoring for HPE Networking SASE products to ensure...-7 years of overall experience in DevOps or SRE. 3+ years of experience developing cloud-native applications...
Observability: Architect and manage our comprehensive monitoring stack (Prometheus, Grafana, Loki, Thanos), setting the standards... subject matter expert on the platform, guiding development teams on complex integrations and best practices. Tooling...