Description
Join a skilled team as a Senior Site Reliability Engineer, leveraging your expertise in Azure Kubernetes Service and observability tools like Dynatrace and Splunk. Deliver high-impact solutions to enhance system reliability and performance.
As a critical member in this role, you will design observability-as-code solutions using Terraform to create effective monitoring pipelines and dashboards. Your responsibilities will encompass driving real-time performance insights, troubleshooting complex production incidents, and automating operational tasks to build resilient systems. You will collaborate with cross-functional teams to ensure service excellence and reliability.
Key Responsibilities:
• Design observability-as-code solutions with Terraform
• Drive improvements using Dynatrace, ELK, and Splunk
• Instrument applications for comprehensive observability
• Troubleshoot complex incidents in production environments
• Lead incident response and blameless postmortems
As a critical member in this role, you will design observability-as-code solutions using Terraform to create effective monitoring pipelines and dashboards. Your responsibilities will encompass driving real-time performance insights, troubleshooting complex production incidents, and automating operational tasks to build resilient systems. You will collaborate with cross-functional teams to ensure service excellence and reliability.
Key Responsibilities:
• Design observability-as-code solutions with Terraform
• Drive improvements using Dynatrace, ELK, and Splunk
• Instrument applications for comprehensive observability
• Troubleshoot complex incidents in production environments
• Lead incident response and blameless postmortems
Ready to Seal the Deal?
Submit your application today and take the next step in your career with ITRiders.
Apply for this Job