Hi,
I hope this message finds you well.
I am Arjun from Vyze Inc., currently working on an urgent requirement with one of our esteemed clients. Based on your profile, I believe this opportunity would be a strong fit for you. If you are available and interested in this contract opportunity, kindly share your updated resume along with your availability for a discussion at your earliest convenience.
Due to the urgent nature of this requirement, we would appreciate a prompt response.
Job Description -
We are urgently looking to onboard an On-Premises LLM Inference & GPU Systems Engineer for a project with one of our premium clients. We are specifically seeking senior professionals with deep, hands-on experience running LLM inference on on-premises GPU systems.
Please confirm the candidate's current location and their availability for an in-person interview upon submission.
Kindly review the detailed JD below before submitting profiles.
Key Requirements:
LinkedIn profile with photo, created before 2020
Copy of driver's license (DL) and visa
Genuine visa
Client: NTT Data
Important Note: Please avoid submitting junior or unrelated profiles. We are looking for strong, hands-on professionals who can lead the technical direction of AI products.
Job Description:
We are seeking an AI Infrastructure Runtime Engineer to build and maintain large-scale on-premises LLM infrastructure. This is an enterprise private GenAI environment running on NVIDIA H200 GPU clusters and an OpenShift AI deployment ecosystem. You will manage production inference internally, including self-hosting open-source LLMs like Llama. We are focused exclusively on inference; this role involves no model training infrastructure or fine-tuning pipelines.
Key Responsibilities:
Required Qualifications:
Best regards,
Arjun Tomar
Recruiter | Vyze Inc.
Call: +1 571-456-3086 | E-Mail: ato...@vyzeinc.com
Hangout: atoma...@gmail.com
24718 Tribe Square #306, Dulles, VA 20166
We believe great business comes from honest relationships.