, Kubernetes) and service mesh architectures. Experience with monitoring, observability, and alerting tools (e.g., Prometheus...
refinement) and enforce agile discipline across teams. Partner with DevOps to improve CI/CD pipelines, observability, deployment...
pipelines. Understanding software quality principles, including reliability, observability, and production readiness. Ability...
and Agentic AI tools in Vertex AI. Enhance observability through Cloud Monitoring and Logging and New Relic. Optimize performance... across GCP and AWS. Knowledge of SRE, observability, and reliability engineering. Leadership & Core Competencies...
projects. You should have experience in building application observability through Prometheus / OpenTelemetry metrics...
observability for data workloads (metrics, logs, traces) and basic performance tuning. Participate in resilience testing (failover... and working with YAML-based configurations. Exposure to data systems (Kafka, Hadoop) or observability tooling (Prometheus/Grafana...
and implement observability best practices (e.g., logging, metrics, alerting). Ensure compliance with enterprise security standards...
where you'll work across the entire model lifecycle: training, evaluation, deployment, observability, and continuous optimization..., checkpointing workflows). Lead the implementation of observability for ML systems (monitor drift, performance, throughput...
governance, data controls, and observability. What You'll Do: Enterprise Cloud & AI Platform Design and maintain enterprise... for deeper observability and troubleshooting. Enforce guardrails, approvals, and policy-as-code for AI usage with Cloud Security...
observability, data quality, and cost governance across data, graph, and AI pipelines. Leadership and Team Building Build...-as-Code, CI/CD, containers, and observability tooling. Governance and Trust Experience implementing fine-grained access...
reliability, governance, lineage, and observability across the stack. Mentor other engineers and help set strong engineering... to have. Experience with Docker, ECS, EKS, or similar container platforms is a nice to have. Familiar with data observability tooling...
production environments in cloud providers. Experience with monitoring, logging, and observability tools. Nice...
platforms that keep a mission critical network running. You can update SolarWinds Observability and create dashboards, reports...
engineering, internal GenAI tooling) Telemetry, observability, and system intelligence Direct interaction with founders...
projects from inception to production, including architecture decisions, implementation, and observability. Collaborate...
In-depth knowledge of the financial services industry and their IT systems Observability and production management tools (ex...
We are a global team of innovators and pioneers dedicated to shaping the future of observability. At New Relic... Observability, DevOps, or related cloud software solutions. Educational Background: A related university degree or equivalent...