Platform architect ( AI/ Gen AI )

india, Maharashtra, Mumbai

Full–time

Posted on: 2 days ago

Platform Architect — AI / Gen AI

Location: Mumbai (or flexible)
Experience: 8+ Years

About the Role

We are seeking an experienced Platform Architect (AI / Gen AI) to design and scale enterprise-grade AI/ML and GenAI platforms. This role sits at the intersection of data engineering, MLOps, distributed systems, and large-scale LLM deployment.

You will work closely with data scientists, product teams, and engineering leaders to architect robust, scalable, and cost-efficient systems that power next-generation AI applications, including semantic search, RAG pipelines, and intelligent assistants.

Key Responsibilities
  • Collaborate with data scientists, product managers, and cross-functional teams to enable seamless AI/ML model integration
  • Design, build, and maintain scalable, enterprise-grade data pipelines for AI/ML workloads
  • Architect and optimize LLM inference systems with a focus on GPU utilization, memory efficiency, and latency
  • Implement advanced techniques such as model quantization, distillation, and efficient serving strategies
  • Develop and optimize RAG-based systems, semantic search, and chatbot solutions
  • Perform prompt engineering and fine-tuning of LLMs for domain-specific use cases
  • Build and manage CI/CD pipelines using tools like GitHub Actions and Jenkins
  • Implement Infrastructure as Code (IaC) using Terraform, CloudFormation, or Pulumi
  • Deploy and operate containerized workloads using Kubernetes, Helm, and service mesh architectures
  • Design and execute A/B testing frameworks for ML models and analyze performance metrics
  • Maintain experiment tracking, model versioning, and documentation using tools like MLflow or DVC
  • Research and adopt emerging AI/ML and GenAI technologies to improve system capabilities
  • Contribute to domain-specific AI solutions (e.g., healthcare) with production-ready deployment strategies

  • Skills & CompetenciesTechnical Skills
  • Advanced proficiency in Python
  • Strong experience with Apache Spark for large-scale data processing
  • Solid experience with SQL and data querying
  • Proficiency in Git, MLflow, and experiment tracking/versioning

  • Software Engineering
  • Strong background in system design and scalable software architecture
  • Experience with Go or Rust (preferred)
  • Expertise in microservices architecture and concurrent processing
  • Familiarity with test-driven development (TDD)
  • Ability to create detailed Low-Level Designs (LLDs)

  • DevOps, MLOps & Infrastructure
  • Infrastructure as Code: Terraform, CloudFormation, Pulumi
  • CI/CD pipelines: GitHub Actions, Jenkins
  • Containerization and orchestration: Docker, Kubernetes, Helm, service mesh
  • Experience with ML model serving tools: TorchServe, TensorFlow Serving
  • Automated model retraining and deployment pipelines

  • LLM & AI Expertise
  • Hands-on experience with LLM frameworks: Hugging Face, PyTorch, TensorFlow
  • Experience with LLM serving stacks (e.g., vLLM, FastAPI)
  • Strong understanding of model optimization techniques (quantization, distillation)
  • Experience with vector databases and retrieval systems
  • Proven experience building:
  • Chatbots
  • Recommendation systems
  • Translation systems

  • Cloud & Security
  • Strong experience with cloud platforms: AWS, GCP, Azure
  • Understanding of cloud networking, security, and access control for ML systems
  • Experience implementing secure and compliant AI systems

  • What We’re Looking For
  • Architect-level thinking with hands-on execution ability
  • Strong ownership and ability to lead platform-level initiatives
  • Deep understanding of scaling AI systems in production
  • Passion for GenAI and staying ahead of emerging trends
  • Ability to balance performance, cost, and reliability in system design

Job Type: Full-time

Pay: ₹885,405.90 - ₹2,821,384.80 per year

Work Location: In person