Site Reliability Engineering (SRE)

Ant Group
📍 estepona, andalucía, Spain 💼 Full-time 🕒 Posted June 02, 2026

Job Description

Overview

  • Ensuring Payment System Stability and High Availability: Lead technical initiatives to strengthen the reliability of our payment systems. This includes designing and implementing monitoring tools, logging frameworks, dashboards, diagnostic utilities, and disaster recovery plans. Conduct routine drills, develop contingency strategies, and participate in on-call rotations to ensure rapid response and resolution of production issues across regions.
  • Incident Handling and Emergency Response: Conduct routine drills, develop contingency strategies, and participate in on-call rotations to ensure rapid response and resolution of production issues.
  • Analyze and Optimize Production Issues: Investigate and analyze real-world production cases, such as performance bottlenecks or system inefficiencies, to derive actionable insights and establish technical best practices. Contribute to the evolution of a highly available and resilient payment architecture....

Ready to Apply?

Submit your application today and join our talented team at Ant Group.

Submit Application

Job Details

  • Location estepona, andalucía
  • Job Type Full-time
  • Category Informática y tecnología
  • Posted Date June 02, 2026
  • Application Deadline July 12, 2026