RAG / LLM Specialist

EXL
📍 gurugram, haryana, India 💼 Full-time 🕒 Posted June 30, 2026

Job Description

Role Overview:

Lead the design and optimization of advanced RAG pipelines and model fine tuning processes. Bridge the gap between prototype and enterprise-scale LLM deployment.


Key Responsibilities

Pipeline Ownership: Design and manage complex, multi-stage RAG pipelines ensuring low latency and high relevance. Model Optimization: Lead fine-tuning initiatives (PEFT/LoRA) for open-source models to improve domain-specific task performance. Advanced Evaluation: Develop automated evaluation frameworks (e.g., RAGAS) to continually measure LLM accuracy, context precision, and recall. Vector Strategy: Architect metadata filtering and hybrid search strategies within vector databases (e.g., Pinecone, Milvus). Team Mentorship: Guide junior analysts in prompt engineering, chunking strategies, and code quality.


Required Skills & Qualifications

Tech Stack: Python, PyTorch/TensorFlow, LangChain, LlamaIndex, advanced embedding models. G...

Ready to Apply?

Submit your application today and join our talented team at EXL.

Submit Application

Job Details

  • Location gurugram, haryana
  • Job Type Full-time
  • Category Other-General
  • Posted Date June 30, 2026
  • Application Deadline August 09, 2026