Infrastructure: Kubernetes (AWS EKS), Istio, Datadog, Terraform, Cloudflare, Helm Backend: Java / Spring Boot microservices (Gradle...
Infrastructure tooling includes Istio, Datadog, Terraform, CloudFlare, and Helm Our backend is Java / Spring Boot microservices...
and a high sense of ownership What We Use Infrastructure: Kubernetes (AWS EKS), Istio, Datadog, Terraform, Cloudflare, Helm...
with observability platforms to analyze and visualize performance data, such as: Prometheus, Grafana, Datadog (or similar tools...
debugging and root-cause analysis (Sentry/Datadog). Knack for simple and intuitive Data Visualizations for impactful...
with observability platforms to analyze and visualize performance data, such as: Prometheus, Grafana, Datadog (or similar tools...
/observability stacks (Prometheus, Grafana, Datadog, Open Telemetry, Jaeger) Proven experience with AWS cost optimization...
monitoring, alerting, and observability using tools like - Pager Duty, CloudWatch, Grafana, Datadog and OpenSearch Support...
/observability stacks (Prometheus, Grafana, Datadog, Open Telemetry, Jaeger) Proven experience with AWS cost optimization...
architectures. Exposure to observability tools (Datadog, Site 24x7, etc). Experience with CICD pipelines and DevOps automation...
, DataDog and ELK. Ability to work independently on complex technical tasks and deliver high-quality results within deadlines...
. Familiarity with CI/CD pipelines. Monitoring/analytics (Sentry, Datadog, PostHog). Experience with PWAs or Web3/wallet...
, DataDog) and turning telemetry into actionable insights. Strong fixing and operational skills across distributed systems...
, and optimizing systems for GPU resource management. Familiarity with modern observability tools (e.g., DataDog, Prometheus, Grafana...
expertise through mentorship and peer feedback. Experience using observability tools (e.g., Datadog, Splunk, or New Relic...
and practices (metrics, logs, traces) and their management at scale. Experience with major obs platforms eg Grafana, Datadog...
flows, Monte Carlo, DataDog). · Certifications (Preferred) · Databricks Certified Data Engineer Professional...
tools (Grafana, Datadog, OpenTelemetry, etc.) Experience running large-scale GPU clusters for ML/AI workloads Experience...
of your solutions Experience with observability and monitoring tools (Datadog, Prometheus, Grafana, or similar) is a plus Located...
., Prometheus, Grafana, ELK, Datadog). Proven troubleshooting chops - you're the person people call when "nothing makes sense...