Backend Developer (Python – LLM APIs)

india, Telangana, Hyderabad

Full–time

Posted on: 6 days ago

We are looking for aBackend Developer with strong Python experienceto buildhigh-performance APIs that integrate with Large Language Models (LLMs)on platforms such asGoogle Cloud Platform (GCP)andMicrosoft Azure.The role focuses on buildinglow-latency AI APIs, implementingprompt orchestration workflows, and optimizing requests fortoken usage, streaming responses, and inference latency.The ideal candidate should have experience buildingscalable APIs, working withcloud services, and understanding the performance considerations involved inLLM-based applications.ResponsibilitiesAPI DevelopmentDesign and develop high-performance REST APIs using Python.Build APIs that integrate with LLM services on GCP and Azure.Implement streaming responses for real-time AI applications.Optimize APIs for low latency and high throughput.LLM IntegrationBuild prompt orchestration workflows across multiple LLM providers.Optimize requests by managing token usage and context windows.Implement streaming and asynchronous API responses for LLM outputs.Security & IdentityImplement authentication and authorization using OAuth 2.0 and OpenID Connect (OIDC).Ensure APIs follow secure access patterns and proper authorization controls.Cloud & InfrastructureDeploy and manage applications using Docker containers.Work with cloud services on GCP and Azure.Collaborate with infrastructure teams on deployment and scaling.Data & StorageWork with NoSQL databases to store prompts, metadata, and responses.Design data structures optimized for AI workloads and API performance.QualificationsProgrammingStrong experience in PythonExperience building REST APIs using frameworks such as FastAPI or Flask AI & LLM IntegrationUnderstanding of:TokenizationLatency considerations in LLM APIsStreaming responsesPrompt orchestration conceptsSecurityExperience implementing OAuth 2.0Understanding of OpenID Connect (OIDC)ContainersExperience building and deploying applications using DockerDatabasesFamiliarity with NoSQL databases such as:MongoDBFirestoreDynamoDBCosmos DBNice to HaveExperience working with GCP or Azure AI servicesFamiliarity with LLM frameworks (LangChain, LlamaIndex, CrewAI)Experience with vector databasesUnderstanding of RAG architecturesKnowledge of observability tools (OpenTelemetry, Prometheus, Grafana)Experience Needed: 2 - 5 years