Site Reliability Engineer

Devsu
📍 Peru, Peru, Peru 💼 Full-time 🕒 Posted March 02, 2026

Job Description

We are seeking a Site Reliability Engineer (SRE) with deep expertise in monitoring, observability, and reliability engineering to support systems running across on-premises infrastructure and Google Cloud Platform (GCP).

This role is primarily responsible for designing, operating, and improving monitoring, alerting, and observability platforms, with a strong focus on Grafana and Kubernetes environments.

As a secondary responsibility, this role provides backup coverage for the Application Support team during periods of resource constraints or major incidents, offering L2/L3 technical support when required.

Responsibilities
Monitoring & Observability (Core Focus)

  • Own and operate the monitoring and observability stack across on-prem and GCP environments
  • Design, build, and maintain Grafana dashboards for infrastructure, Kubernetes, and applications
  • Define, tune, and maintain alerts to ensure high signal-to-noise ratio
  • Establ...

Ready to Apply?

Submit your application today and join our talented team at Devsu.

Submit Application

Job Details

  • Location Peru, Peru
  • Job Type Full-time
  • Category Other-General
  • Posted Date March 02, 2026
  • Application Deadline April 11, 2026