Job Title: Senior Development Lead – AI Engineer (LLMs, RAG & Agentic AI)
Location: Onsite or Hybrid. Atlanta, GA.
Visa: Any Except GC
Experience - 12+ Years
Relocation Works
** Must Have Healthcare Domain**
Role Summary:
We are seeking a Senior Development Lead – AI Engineer with 12+ years of overall software development experience and deep expertise in Python, Large Language Models (LLMs), LangChain, LangGraph, and Retrieval-Augmented Generation (RAG). This role is both hands-on and leadership-focused, responsible for architecting enterprise-scale AI platforms, leading AI engineering teams, and delivering secure, scalable, production-ready AI solutions.
Key Responsibilities:
- Lead architecture, design, and development of enterprise AI and LLM-powered solutions
- Own and design RAG architectures integrating structured and unstructured data sources
- Build and orchestrate agentic and multi-agent workflows using LangChain and LangGraph
- Drive Python-based backend development for AI services and APIs
- Evaluate, integrate, and optimize LLMs (OpenAI, Anthropic, AWS Bedrock, Azure OpenAI, Google Vertex AI)
- Establish best practices for prompt engineering, memory, tool usage, and agent orchestration
- Lead and mentor senior engineers, AI engineers, and developers
- Partner with product, architecture, data, and cloud teams to deliver business-aligned AI solutions
- Ensure solutions meet performance, scalability, security, and compliance standards
- Review architecture and code, enforce engineering standards, and ensure delivery excellence
- Guide CI/CD pipelines, cloud deployment, monitoring, and observability for AI workloads
Required Technical Skills
Core AI & Development
- 12+ years of overall software development experience
- Python (expert level) – building large-scale, production-grade systems
- Large Language Models (LLMs) – prompt design, context management, inference optimization
- LangChain – chains, agents, tools, retrievers, memory
- LangGraph – stateful workflows and multi-agent orchestration
- Retrieval-Augmented Generation (RAG) – end-to-end design and optimization
- AI, Data & Search
- Vector databases: FAISS, Pinecone, Weaviate, Chroma, OpenSearch
- Embeddings, semantic search, ranking, and chunking strategies
- Document ingestion pipelines and metadata enrichment
- Model evaluation, grounding techniques, and hallucination reduction
- Backend, Cloud & DevOps
- API and microservices frameworks: FastAPI, Flask
- Cloud platforms: AWS, Azure, GCP
- Containers & orchestration: Docker, Kubernetes
- CI/CD pipelines, Git-based workflows, DevOps best practices