Job Description:
Take part in managing multiple k8s clusters, including upgrades, scaling, monitoring, resolving production issues, etc.
Help developers and quants deploy new services, including writing Kustomize manifests and build pipelines.
Advise developers on best practices for observability, metrics, logging, and tracing.
Configure Istio for a microservices environment, including routing, mirroring, A/B deployments, and circuit breakers.
Set up alerts with Prometheus and help troubleshoot in a microservice environment.
Implement CI/CD using Jenkins, Argo CD, and GitOps.
Job Qualifications:
Expert level with Kubernetes and Docker, with at least 3 years of hands-on experience deploying and managing Kubernetes clusters.
Experience with tracing, e.g., Jaeger/OpenTelemetry.
Experience with monitoring/metrics, e.g., Prometheus, Thanos.
Experience with the Grafana LGTM stack (Tempo, Loki, Grafana).
Experience with Kustomize/Helm.
Experience with ArgoCD and Argo Workflows.
Coding skills in Python and Bash.
GitOps.
Excellent troubleshooting and analytical skills.
Self-starter able to execute independently, on a deadline, and under pressure; good at multitasking.
Excellent written and verbal communication skills.
Preferred Requirements:
Experience with EKS.
Experience with AWS.
Experience with service meshes (Istio).
Experience with Jenkins and Jenkins pipelines/Groovy.
Experience with the CNCF landscape.
Familiarity with SRE terminology, SLOs/SLIs, etc.
Company Occupation:
High Tech
Company Size:
Small (0 - 50)