Job Description
Responsibilities
- Own reliability, availability, scalability, and security of production systems
- Design and operate highly available, fault‑tolerant, multi‑region cloud architectures
- Define and manage SLOs, SLIs, SLAs, and error budgets for critical services
- Lead high‑severity incidents and drive effective post‑incident reviews
- Improve MTTD and MTTR through automation, tooling, and runbooks
- Operate and evolve Kubernetes (EKS) platforms and multi‑tenant deployments
- Work with Infrastructure‑as‑Code (Terraform, CloudFormation, Pulumi) at scale
- Build and improve CI/CD pipelines and deployment safeguards
- Design and maintain observability (metrics, logs, traces, alerting)
- Drive capacity planning, performance optimisation, and cloud cost efficiency
- Partner with Security & Compliance on SOC 2, ISO 27001, GDPR, and DORA controls
- Mentor SREs and influence reliability‑first engineering practices across teams

Qualifications
- 6+ years in SRE, DevOps, or cloud infrastructure roles (2+ years in a sen...
Ready to Apply?
Submit your application today and join our talented team at Yellosa.
Job Details
- Location: Johannesburg, Gauteng
- Job Type: Full-time
- Category: Other-General
- Posted Date: June 26, 2026
- Application Deadline: August 05, 2026