Position Overview
Principal Engineer (SRE/DevOps in leading and hands on capacity)
Location: Irving, TX/Charlotte, NC/Minneapolis, MN
Key Responsibilities
Lead production support efforts across a portfolio of 20+ applications, ensuring stability, performance, and rapid issue resolution
Design and build advanced monitoring, alerting, and observability dashboards using tools such as Splunk, Grafana, AppDynamics, and Prometheus
Proactively identify risks through gap analysis, anomaly detection, and predictive alerting, preventing production incidents before they occur
Troubleshoot complex production issues across distributed microservices environments, reducing MTTR through deep technical expertise
Drive adoption of modern SRE practices, including automation, AIOps, and intelligent monitoring solutions
Support applications running on OpenShift and cloud-native platforms, with a focus on reliability an...