Rethinkit City of Cape Town, Western Cape, South Africa Direct message the job poster from Rethinkit We are looking for a Senior DevOps Engineer to help us design, implement, and operate enterprise-grade logging, metrics, monitoring, and alerting across our production platforms. You will be a key contributor in managing a production RKE2 Kubernetes cluster , operated via Rancher , and improving our CI/CD pipelines using GitLab . The role supports high-performance, highly secure systems for one of the largest stock brokers in the United States . Note: Candidates must be able to work, when required, US Eastern Time hours (14:00 - 00:00 SAST) as this will be required for production deployments, troubleshooting and collaboration with engineering teams. This is a fully remote role , but candidates must reside in the greater Cape Town area for team lunches and get togethers. It is a hands‑on role working closely with backend, web, and mobile engineering teams as well as the platform team in the US. Responsibilities Observability & Reliability Design and implement enterprise-grade logging, metrics, monitoring, and alerting Build and maintain centralised log aggregation pipelines Define alerting strategies, SLOs, and operational dashboards Ensure system health, performance, and reliability in production Set up, configure, and manage production Kubernetes clusters (RKE2) Deploy and manage workloads using Helm Troubleshoot cluster, networking, and workload-level issues Apply security and hardening best practices for production systems CI/CD & Automation Build, migrate, and maintain GitLab CI/CD pipelines Migrate existing services into standardised pipelines Support and maintain pipelines for new services Improve developer experience through automation and reusable templates Security & Networking Implement and maintain security best practices Manage SSL certificates Secure Kubernetes workloads, networks, and CI/CD pipelines Work with complex, high-security networking environments Containers & Software Collaboration Set up and maintain Docker containers and registries Work closely with backend teams and assist where required C++ experience is a bonus Required Experience Senior-level experience in DevOps / Platform Engineering Experience with RKE2 and Rancher Strong experience with Helm Hands‑on experience with GitLab CI/CD Experience implementing: Metrics and monitoring Alerting strategies Strong networking and security fundamentals Experience supporting mission‑critical production systems Nice to Have Observability tools such as: Grafana Loki, ELK, or OpenSearch OpenTelemetry Experience with Vector for log aggregation AWS/GCP experience Infrastructure as Code (Terraform, Ansible) GitOps workflowsExperience in financial or regulated environments What We Offer Fully remote role, but candidates must reside in the greater Cape Town area for team get togethers. Work on large-scale, high-performance systems Opportunity to contribute to platforms used by one of the largest US stock brokers Collaborative engineering environment Competitive compensation Seniority level Mid-Senior level Employment type Full-time Job function Information Technology Referrals increase your chances of interviewing at Rethinkit by 2x #J-18808-Ljbffr