AI SW Stack Deployment Architect

Sandisk
📍 Bengaluru, Karnataka, India 💼 full-time 🕒 Posted June 08, 2026

Job Description

Job Description

Role Overview

We are looking for a Software Architect (12+ years experience) to lead the application/framework layer and deployment stack for the Next Generation Accelerator AI platform. This role owns how models run on Next Generation Accelerator—from vLLM / PyTorch / TensotFlow/XLA to production deployment—ensuring correctness, performance, and scalability.

Key Responsibilities

  • Architect integration of vLLM, PyTorch, and TensorFlow, JAX/XLA into Next Generation Accelerator stack
  • Define framework → compiler → runtime APIs and contracts
  • Own LLM execution behavior (batching, KV cache, streaming inference)
  • Design and implement end-to-end deployment workflows (packaging, versioning, reproducibility)
  • Drive performance optimization across mod...

Ready to Apply?

Submit your application today and join our talented team at Sandisk.

Submit Application

Job Details

  • Location Bengaluru, Karnataka
  • Job Type full-time
  • Category Computer Occupations
  • Posted Date June 08, 2026
  • Application Deadline July 18, 2026