secure delivery pipelines, platform automation, and robust observability for mission-critical services. Key Responsibilities... workload distribution. ⦁ Deploy and maintain observability pipelines using Prometheus, Grafana, Splunk, Datadog, ELK/EFK...
· Apply best practices in backend performance, security, reliability, and observability. · Collaborate with other backend engineers...
platform development across the organization. Operational Health: Define and implement advanced observability, monitoring...
microservices-based data platforms with containerization (Docker/Kubernetes). Establish comprehensive observability, monitoring... Protocol (MCP). Experience with feature stores, model serving infrastructure, and AI/ML observability. Understanding of data...
Engineer - Observability Operations will: Monitoring – Oversee 24/7 health monitoring of the company’s IT Infrastructure... using tools such as AWS CloudWatch and New Relic. Drive observability maturity across the organization by identifying...
understanding of data quality, governance, and observability principles and practices. Working knowledge of DevOps concepts...
scalability and growth. Own system SLAs, metrics, and observability to ensure platform reliability. Serve as the escalation point... including testing, CI/CD, observability, and performance optimization. Demonstrated ability to lead through influence, alignment...
and customer experience. Build and improve observability across backend systems using metrics, structured logging, and distributed... balancing and application-level services. We use Retool for some internal support applications. Observability Tools: Datadog...
reliability, observability, and performance in a production environment. Support and evolve AI workflows, including pipeline..., and observability tools. Interest in or experience working with AI-enabled products (e.g., LLMs, automation, or applied ML...
platform automation, and robust observability for mission-critical services. Key Responsibilities: ⦁ Architect, deploy... distribution. ⦁ Deploy and maintain observability pipelines using Prometheus, Grafana, Splunk, Datadog, ELK/EFK, or similar tools...
design, risk-based testing, and observability-led quality. Provide technical input on automation strategies, test frameworks... and observability. Lead complex test initiatives supporting large, cross-functional features and platform-level changes. Triage...
(Adobe, Comscore) and QoS (Conviva). Deep understanding of software development lifecycle, agile practices, observability...
operational excellence by implementing monitoring, metrics, and diagnostic tools that improve system observability and reduce time...
We are a global team of innovators and pioneers dedicated to shaping the future of observability. At New Relic...
controlled changes to production systems. Identify and implement automation, observability, and configuration updates throughout...
in deployments and services on cloud platforms. Experience in observability and tooling. Experience with Docker and Kubernetes...
monitoring and observability tools such as Dynatrace, AppDynamics, SCOM, Prometheus, or Grafana. Demonstrated experience...
including error handling, retries, and observability. Experience using Git and participating in a collaborative, agile... APIs or event schemas intended for external partners. Familiarity with observability tools such as Application Insights...
and automated scaling solutions. Establish monitoring, alerting, and observability frameworks across the data platform. Lead code...