📌 Job Opportunity: Site Reliability Engineer (Infra Project)

Full Time
Posted 3 months ago

🚀 Senior Site Reliability Engineer (SRE)

We’re looking for a Senior SRE to own the reliability, resilience, and operational excellence of our clients’ SaaS platforms.

You’ll play a key role in shaping how we design, run, and scale production systems — working hands-on with cloud infrastructure, Kubernetes, and observability, while leading incident response and mentoring engineers.

What You’ll Do

  • Own production reliability, availability, scalability, and security

  • Design highly available, multi-region AWS architectures

  • Define and manage SLOs, SLIs, SLAs, and error budgets

  • Lead high-severity incidents and blameless post-mortems

  • Improve MTTD / MTTR through automation and runbooks

  • Operate and evolve Kubernetes (EKS) platforms

  • Build and scale IaC, CI/CD, and observability (OpenTelemetry)

  • Drive performance, capacity planning, and cloud cost efficiency

What We’re Looking For

  • 6+ years of experience in SRE / DevOps / Cloud Infrastructure

  • Strong AWS & Kubernetes experience

  • Proven incident management leadership

  • Solid experience with IaC, CI/CD, and automation

  • Coding skills in Python, Go, or Bash

  • Strong communication and collaboration skills

Why Join Us

  • Real ownership of production systems

  • Influence SRE culture and standards

  • Work on complex systems at scale

  • Collaborative, reliability-first engineering culture

📩 Interested? Apply or reach out to learn more.

Please send your CV to dalisha.curpennaick@succexa.mu or call 5916 9939 for more details.
