Lead - Site Reliability Engineer

FundsIndia
📍 chennai, tamil nadu, India 💼 Full-time 🕒 Posted June 30, 2026

Job Description

Role Overview


We are looking for a Lead Site Reliability Engineer with 6-7 years of experience to drive reliability, observability, and incident management practices. The ideal candidate will have strong expertise in Grafana stack , production monitoring, and handling critical incidents in high-availability systems.


Key Responsibilities

  • Act as the Incident Commander during production outages, ensuring timely resolution and stakeholder communication
  • Lead incident response, triage, RCA (Root Cause Analysis), and postmortems
  • Build and enhance observability systems using Grafana (Prometheus, Loki, Tempo)
  • Define and manage SLIs, SLOs, and SLAs for critical services.
  • Develop and maintain monitoring, alerting, and dashboards for proactive issue detection.
  • Collaborate with Dev, Infra, an...

Ready to Apply?

Submit your application today and join our talented team at FundsIndia.

Submit Application

Job Details

  • Location chennai, tamil nadu
  • Job Type Full-time
  • Category automation,engineering,grafana,kubernetes,lead,linux,python,red,solution,ux
  • Posted Date June 30, 2026
  • Application Deadline August 09, 2026