Job Overview
This role will be responsible for leading the design, development, implementation and support of Site Reliability Engineering (SRE) solutions for applications supported by the Cloud organization.
Responsibilities
- Lead code and non-functional reviews of all production-bound SRE solutions
- Drive transformation by automating existing processes and conducting engineering mindset meetups
- Manage SRE application assets such as cloud instances and source code repositories and publish technical designs
- Publish and review implementation plans for SRE solutions bound to production, explore new capabilities and technologies, and document how-to guides
- Track, audit, monitor and implement on technical work streams, acting as a portfolio SME and documenting common components and infrastructure
- Act as the escalation point in the on-call rotation, supporting maintena...