architectures. Exposure to observability tools (Datadog, Site24x7, etc.). Experience with CI/CD pipelines and DevOps automation...
/observability stacks (Prometheus, Grafana, Datadog, OpenTelemetry, Jaeger). Proven experience with AWS cost optimization...
, Datadog and ELK. Ability to work independently on complex technical tasks and deliver high-quality results within deadlines...
, Datadog) and turning telemetry into actionable insights. Strong fixing and operational skills across distributed systems...
. Familiarity with CI/CD pipelines. Monitoring/analytics (Sentry, Datadog, PostHog). Experience with PWAs or Web3/wallet...
., Prometheus, Grafana, ELK, Datadog). Proven troubleshooting chops - you're the person people call when "nothing makes sense...
tools (Grafana, Datadog, OpenTelemetry, etc.). Experience running large-scale GPU clusters for ML/AI workloads. Experience...
expertise through mentorship and peer feedback. Experience using observability tools (e.g., Datadog, Splunk, or New Relic...
and practices (metrics, logs, traces) and their management at scale. Experience with major observability platforms, e.g., Grafana, Datadog...
, and optimizing systems for GPU resource management. Familiarity with modern observability tools (e.g., Datadog, Prometheus, Grafana...
of your solutions. Experience with observability and monitoring tools (Datadog, Prometheus, Grafana, or similar) is a plus. Located...
, including Graylog and Datadog, to enable automated observability and troubleshooting within pipelines. Exceptional analytical...
Familiarity with observability tools and metrics (e.g., CloudWatch, Datadog). Experience using AI tools to assist with routine...
, implementing, and optimizing systems for GPU resource management. Familiarity with modern observability tools (e.g., Datadog...
and practices (e.g., Git, GitLab, Docker, Jenkins, Bash, Linux, CI/CD, Sentry, Datadog). Practical expertise in data analysis...
, Direct Connect, and Transit Gateway; logging with CloudWatch, AWS CloudTrail, Grafana, Prometheus, Dynatrace, Datadog, Splunk...
: AWS. Version control: Git & GitHub. AI Tooling: Copilot on GitHub. Observability: Datadog. When you join Metropolis...
, and observability using tools like PagerDuty, CloudWatch, Grafana, Datadog, and OpenSearch. Support and guide tooling initiatives...