Position Overview
Job Description
This role is primarily an operations incident response role for cloud issues in AWS, Azure, and GCP, and it includes cloud infrastructure management. The engineer will troubleshoot and analyze performance from a cloud perspective, with the goal of reducing time to resolution. The role focuses on responding to incidents as well as strategizing and designing solutions that prevent future incidents in the cloud environment. The engineer will collaborate with the NOC, network engineering teams, platform teams, and application support teams, in addition to working with the cloud providers. Our goal is to modernize and stabilize our infrastructure. When we are pulled into incidents and issues, we want to resolve them quickly, then address the underlying problem and prevent it from recurring.
We are seeking a Cloud Site Reliability Engineer (SRE) to drive the reliability, scalability, and performance of our cloud-based infrastructure. The ideal candidate combines software engineering expertise...