Description
L1 SRE Operations Engineer
The L1 SRE is the first line of defense in monitoring, triaging, and executing standardized operational tasks for all enterprise applications running on standard patterns and platforms such as Kubernetes, APIs, WAF, databases, API proxies (Gloo, APIGEE), Kafka, and cloud (AWS/Azure/GCP). They follow runbooks, leverage automation, and escalate appropriately to minimize downtime.
Responsibilities
- Monitor system health, alerts, dashboards, and logs across cloud and on‑prem infrastructure.
- Determine whether a functional issue originates in the application or in the platform.
- Execute standardized runbooks for incident resolution, deployments, and routine tasks.
- Perform initial triage of incidents and escalate to L2/L2+ as needed to mitigate the issue.
- Document new issues, gaps in runbooks, and automation opportunities.
- Communicate clearly and promptly with stakeholders during incidents.
- Support onboardin...
Ready to Seal the Deal?
Submit your application today and take the next step in your career with Hitachi.