Int9 Solutions
Job description

We are looking for a skilled Datadog Observability Engineer to enhance and maintain our cloud infrastructure's observability and performance monitoring capabilities. The ideal candidate will bring strong experience in DevOps practices, AWS infrastructure, and infrastructure as code using Terraform, with a specific focus on deploying and managing observability tools, primarily Datadog.

Requirements and Qualifications:
5+ years of hands-on experience in DevOps or Site Reliability Engineering
Proven expertise in implementing observability strategies using Datadog
Strong experience with AWS services (EC2, ECS, CloudWatch, Lambda, etc.)
Proficiency in Terraform for infrastructure automation
Solid knowledge of monitoring, logging, alerting, and performance tuning
Experience integrating observability into CI/CD pipelines
Familiarity with containerization (Docker, Kubernetes) is a plus
Strong analytical and troubleshooting skills, and excellent communication skills

Roles and Responsibilities:
Design, implement, and maintain observability solutions using Datadog
Develop dashboards, alerts, and monitors to provide actionable insights into infrastructure and applications
Collaborate with development and DevOps teams to define SLAs, SLOs, and error budgets
Optimize AWS resource utilization and observability configurations
Automate monitoring infrastructure setup using Terraform
Conduct root cause analysis of performance and availability issues
Ensure best practices in logging, metrics collection, and distributed tracing
Provide support during incident response and postmortem processes
Keep observability tooling up to date with platform and business needs

Location: Remote in India, Mumbai, Delhi / NCR, Bengaluru, Kolkata, Chennai, Hyderabad, Ahmedabad

Role: Site Reliability Engineer
Industry Type: IT Services & Consulting
Department: Engineering - Software & QA
Employment Type: Full Time, Permanent
Role Category: DevOps

Education
UG: Any Graduate
PG: Any Postgraduate
Doctorate: Doctorate Not Required

Key Skills
DevOps, Terraform, Datadog, AWS, Observability, Infrastructure as Code, Docker, CI/CD, Site Reliability Engineering, Kubernetes