Britain's Premier Job Portal
. Maintain open source-based application monitoring infrastructure. Enhance, optimize, and migrate to new solutions if required.
. Support application teams to migrate to latest OpenShift versions, perform deployment of stateful/stateless apps, and troubleshoot issues in Kubernetes/OpenShift platforms.
. Work with application developers to implement application instrumentation libraries and frameworks.
. Maintain metrics data store using TSDBs like Prometheus. Perform administration and tuning like cardinality optimization, resource optimization.
. Maintain distributing tracing infrastructure like Otel, Jaeger, Zipkin, etc. Perform administrative functions and tuning like sampling strategy. Troubleshoot distributed tracing in microservices.
. Perform production support activities of enterprise logging platforms like ELK stack, Grafana Loki, etc. Work on Index Lifecycle management in Elastic search.
. Implementing alerting inf...