journeys, observability, and proactive operations through AIOps. You will be responsible for evolving core ITSM processes..., and operational excellence, with a strong orientation toward DevOps, platform engineering, and digital service management...
. We are looking for an experienced Senior Site Reliability Engineer to join our SRE Cloud Infrastructure & Operations team. Reporting to the Director... fundamentals Experience in observability, building dashboards, good understanding of SLI/SLO, error budget, familiar with grafana...
, Databricks, Synapse; AWS IoT Core/SiteWise, Kinesis, S3/Glue, Athena; Power BI, Grafana. · DevOps/SRE: Git, Azure DevOps/GitHub... (least privilege, certificate-based auth, network segmentation, jump hosts, secure protocols). o Establish observability (logging...
the philosophies of Agile, DevOps, and SRE to accelerate our development process and provide the most enjoyable, inclusive... and troubleshoot Vault environments using Datadog,Prometheus, Grafana, and ELK stack for observability and performance insights....
. Own features from concept to production, ensuring observability, efficiency, and operational excellence. Mentor engineers... in customer-facing, technical support work/SRE Working knowledge of kubernetes/docker/Tanzu/Vmware container solutions (CKA...
ecosystem. The ideal candidate will have 5+ years of hands-on experience with distributed systems, data platforms, and SRE..., and observability solutions for big data pipelines and infrastructure components Lead incident response and post-mortem analysis...
deployments (AKS/EKS/GKE) using Helm charts and manage service mesh and traffic routing with Istio for enhanced observability... tools (e.g., Azure CLI, AWS CLI) for operational tasks. Orchestrate end-to-end DevOps workflows using tools like ArgoCD...
reliable and observable digital experiences through the application of concepts from Site Reliability Engineering, DevOps... Improve the availability, reliability, and observability of Kotak's services and reduce the burden of human toil...
solutions that provide a clear and consistent view of system health. You will design dashboards, work with observability tools.... This role emphasizes observability, automation, and collaboration with cross-functional teams, rather than direct code...
design docs and participates in design reviews. Cloud and DevOps: Hands-on with Azure: AKS, APIM, Key Vault, Service Bus..., Akeyless. Works with Azure DevOps (Repos, Pipelines; Boards/PBIs for tracking). Builds and deploys containers using Docker...
how to interpret observability data and translate it into actionable insights Enjoy enabling others-you're more coach than controller..., and you lead through enablement, not enforcement Are familiar with service management principles and practices (e.g., ITIL, SRE...
and tools like Jenkins, Gitlabs, or Azure DevOps. Monitoring & Observability: Proficient in at least two monitoring tools..., we will work to shape the future through innovation and continuous learning. Position Details: Job Title: OPS SRE Lead...
Evaluation & Observability: Eval framework, Langfuse, ELK Stack DevOps: GitHub ecosystem, CI/CD Pipeline, VC, branching... solutions through SRE practices. Mentor and guide junior engineers in AI agents, GitHub DevOps, and SaaS reliability. Soft...
Enterprises 2023. About Role: The Opportunity: Site Reliability Engineer (SRE-2) Are you an SRE with a few years... of cloud platforms and container orchestration, and a burning desire to automate everything in sight? As an SRE-2 at MoEngage...
. Own features from concept to production, ensuring observability, efficiency, and operational excellence. Mentor engineers... in customer-facing, technical support work/SRE Working knowledge of kubernetes/docker/Tanzu/Vmware container solutions (CKA...
Job Category: KMBL Degree Level: Bachelor's Degree Job Description: Title : Observability Platforms and SRE Engg... and associated delivery platform. The Observability Platforms and SRE team is a group of experts developing, maintaining, scaling...
across AWS, Azure, GCP, or other platforms. Collaborate with DevOps and SRE teams to ensure high availability, scalability... and manage service discovery, traffic routing, and observability using Istio. Drive infrastructure-as-code practices...
/Eventhub, Redis, Mongo Atlas, IoTHub). DevOps and SRE Practices Establish and promote DevOps and Site Reliability... Engineering (SRE) practices throughout HON IA-PSS Group. Implement continuous integration and delivery (CI/CD) workflows...
) · Bachelor's in Computer Science or related field, or equivalent experience, with 4+ years in Cloud-SRE, DevOps, or Systems..., Observability & Chaos Testing. The team comes from diverse technical backgrounds, and the responsibilities provide the opportunity...
. Mentorship: Coach and elevate SRE and DevOps teams, promoting best practices in reliability and incident/problem management... Your Experience 12+ years of experience in SRE/DevOps/Infrastructure roles, with a strong foundation in GCP cloud-based environments...