Data Engineer - Python, AI

Citigroup
📍 Pune, India 💼 Full-time 🕒 Posted June 06, 2026

Job Description

Role Summary
We are looking for a mid-level Python Developer with combined experience in Data Engineering and AI/NLP engineering. The candidate will build NLP pipelines using tools such as Flair, BERT, and LLM frameworks, and will also work on large-scale data processing using PySpark, Pandas, and related data tools. The role includes developing APIs, integrating with platform services, and supporting CI/CD deployments using GitHub and LightSpeed Enterprise.

Key Responsibilities

+ Develop and optimize ETL/data processing jobs using PySpark, Pandas, PyArrow, and related libraries.
+ Build and maintain NLP pipelines using Flair, BERT, and LLM-based models.
+ Develop scalable ingestion and data transformation pipelines for AI and analytics use cases.
+ Build and maintain Flask-based APIs for model inference and service integrations.
+ Use regular expressions for text cleaning, parsing, and NLP preprocessing.
+ Integrate caching and fast lookups ...

Job Details

  • Location: Pune, India
  • Job Type: Full-time
  • Category: other-general
  • Posted Date: June 06, 2026
  • Application Deadline: June 11, 2026