Description
Excel as a Senior Site Reliability Engineer at Intact and ensure operational reliability across multi-cloud environments! This hybrid position emphasizes deep investigations and advanced observability solutions.
In this role, you’ll be part of the SRE & Resiliency team within the Intelligent Operations Department. You will lead investigations, implement auto-healing tools, and define SLI/SLO policies to enhance service resilience. Collaborate with various teams to coach on best practices and facilitate system reliability improvements.
Key Responsibilities:
• Conduct root cause analysis for high-severity incidents
• Implement end-to-end observability with metrics and logs
• Develop auto-healing solutions and progressive delivery practices
• Enforce SLIs/SLOs and publish reliability reports
• Provide training and support to incident response teams
Requirements:
• 8+ years experience in SRE or related fields
• Expertise in observability tools and reliability ...
In this role, you’ll be part of the SRE & Resiliency team within the Intelligent Operations Department. You will lead investigations, implement auto-healing tools, and define SLI/SLO policies to enhance service resilience. Collaborate with various teams to coach on best practices and facilitate system reliability improvements.
Key Responsibilities:
• Conduct root cause analysis for high-severity incidents
• Implement end-to-end observability with metrics and logs
• Develop auto-healing solutions and progressive delivery practices
• Enforce SLIs/SLOs and publish reliability reports
• Provide training and support to incident response teams
Requirements:
• 8+ years experience in SRE or related fields
• Expertise in observability tools and reliability ...
Ready to Seal the Deal?
Submit your application today and take the next step in your career with Intact.
Apply for this Job