Onsite : Lead ML Engineer II – RAG / Gen AI Platform Architect (Open AI, MongoDB Atlas, AWS

0 views

Skip to first unread message

Jobs Jobs

unread,

Jun 9, 2026, 5:43:12 PM (18 hours ago) Jun 9

to Sharath Sharath

Job Summary

We are seeking a highly experienced Lead ML Engineer / Lead RAG Engineer to architect and deliver a production-grade Retrieval-Augmented Generation (RAG) platform. The ideal candidate will have deep expertise in Python, OpenAI APIs, MongoDB Atlas Vector Search, AWS data platforms, and enterprise-scale AI solution delivery.

This role will lead the design and implementation of end-to-end RAG architecture, including ingestion, indexing, retrieval, grounding, citations, tool integrations, observability, and production readiness.

Required Experience

10+ years of hands-on software engineering and Python development
Proven experience leading production-grade RAG/LLM platforms
Strong expertise with OpenAI APIs (Embeddings, Chat Completions, Tool Calling)
Hands-on experience with MongoDB Atlas Vector Search
Experience integrating AWS Aurora MySQL and AWS DocumentDB
Strong API development experience using FastAPI
Experience with asynchronous processing, data pipelines, batch jobs, and event-driven architectures
Expertise in CI/CD, testing frameworks, Docker, monitoring, and security best practices
Experience mentoring engineers and leading architecture reviews

Key Responsibilities

Own end-to-end RAG architecture from ingestion through generation
Design scalable extraction pipelines from Aurora MySQL and DocumentDB
Build chunking, metadata, embedding, and indexing strategies
Implement vector search, metadata filtering, and multi-tenant retrieval
Integrate OpenAI models for grounded response generation and citations
Establish MCP-style tool integrations with auditability and governance
Drive production readiness, observability, security, and reliability
Lead technical design reviews, coding standards, and mentoring initiatives

Must-Have Skills

End-to-end RAG Architecture & LLM Orchestration – 10+ Years
Python Backend/Data Engineering – 8+ Years
OpenAI APIs (Embeddings, Chat, Tools) – 6+ Years
MongoDB Atlas Vector Search – 6+ Years
AWS Aurora MySQL & DocumentDB Integration – 6+ Years
FastAPI, Async Pipelines, CI/CD – 6+ Years

Nice to Have

Hybrid Retrieval (Keyword + Vector)
Reranking frameworks
RAGAS, TruLens, LLM Evaluation Frameworks
OCR, Document Parsing, Knowledge Graphs
AWS ECS/EKS/Lambda deployments
Healthcare domain experience

Preferred Technologies

Python, FastAPI
OpenAI APIs
MongoDB Atlas Vector Search
AWS Aurora MySQL
AWS DocumentDB
Docker, Kubernetes, ECS/EKS
Kafka, SQS, Celery, RQ
Pytest, CI/CD Pipelines
Monitoring & Observability Stack

Thanks,

Everest Global Solutions INC

Email :Jo...@everestglobalsolutionsinc.com

Reply all

Reply to author

Forward

0 new messages