india, Maharashtra, Mumbai

Full–time

Posted on: 2 days ago

Full–time

Posted on: 2 days ago

Platform Architect — AI / Gen AI

Location: Mumbai (or flexible)
Experience: 8+ Years

About the Role

We are seeking an experienced Platform Architect (AI / Gen AI) to design and scale enterprise-grade AI/ML and GenAI platforms. This role sits at the intersection of data engineering, MLOps, distributed systems, and large-scale LLM deployment.

You will work closely with data scientists, product teams, and engineering leaders to architect robust, scalable, and cost-efficient systems that power next-generation AI applications, including semantic search, RAG pipelines, and intelligent assistants.

Key Responsibilities

Collaborate with data scientists, product managers, and cross-functional teams to enable seamless AI/ML model integration
Design, build, and maintain scalable, enterprise-grade data pipelines for AI/ML workloads
Architect and optimize LLM inference systems with a focus on GPU utilization, memory efficiency, and latency
Implement advanced techniques such as model quantization, distillation, and efficient serving strategies
Develop and optimize RAG-based systems, semantic search, and chatbot solutions
Perform prompt engineering and fine-tuning of LLMs for domain-specific use cases
Build and manage CI/CD pipelines using tools like GitHub Actions and Jenkins
Implement Infrastructure as Code (IaC) using Terraform, CloudFormation, or Pulumi
Deploy and operate containerized workloads using Kubernetes, Helm, and service mesh architectures
Design and execute A/B testing frameworks for ML models and analyze performance metrics
Maintain experiment tracking, model versioning, and documentation using tools like MLflow or DVC
Research and adopt emerging AI/ML and GenAI technologies to improve system capabilities
Contribute to domain-specific AI solutions (e.g., healthcare) with production-ready deployment strategies

Advanced proficiency in Python
Strong experience with Apache Spark for large-scale data processing
Solid experience with SQL and data querying
Proficiency in Git, MLflow, and experiment tracking/versioning

Strong background in system design and scalable software architecture
Experience with Go or Rust (preferred)
Expertise in microservices architecture and concurrent processing
Familiarity with test-driven development (TDD)
Ability to create detailed Low-Level Designs (LLDs)

Infrastructure as Code: Terraform, CloudFormation, Pulumi
CI/CD pipelines: GitHub Actions, Jenkins
Containerization and orchestration: Docker, Kubernetes, Helm, service mesh
Experience with ML model serving tools: TorchServe, TensorFlow Serving
Automated model retraining and deployment pipelines

Hands-on experience with LLM frameworks: Hugging Face, PyTorch, TensorFlow
Experience with LLM serving stacks (e.g., vLLM, FastAPI)
Strong understanding of model optimization techniques (quantization, distillation)
Experience with vector databases and retrieval systems
Proven experience building:
Chatbots
Recommendation systems
Translation systems

Strong experience with cloud platforms: AWS, GCP, Azure
Understanding of cloud networking, security, and access control for ML systems
Experience implementing secure and compliant AI systems

Architect-level thinking with hands-on execution ability
Strong ownership and ability to lead platform-level initiatives
Deep understanding of scaling AI systems in production
Passion for GenAI and staying ahead of emerging trends
Ability to balance performance, cost, and reliability in system design

Job Type: Full-time

Pay: ₹885,405.90 - ₹2,821,384.80 per year

Work Location: In person

GET IT ON

Google Play

india, Maharashtra, Mumbai