Urgently looking for AI/ML Engineer with CUDA & GPU Expertise


Suman Bakshi

Aug 12, 2025, 1:40:24 PM
to Suman Guha Bakshi
Hi Vendors,
Please submit matching candidates for the requirement below.
AI/ML Engineer

Primary Responsibilities

  • Deploy and optimize AI models on both Systalyze and Baseten platforms
  • Implement and benchmark RAG (Retrieval-Augmented Generation) pipelines
  • Conduct comprehensive performance testing and optimization
  • Analyze GPU utilization and optimize CUDA code
  • Analyze costs and evaluate resource efficiency
  • Benchmark model inference latency and throughput

Required Technical Skills

Core AI/ML Expertise:

  • Programming Languages: Python (advanced), C++ (intermediate for CUDA optimization)
  • ML Frameworks: PyTorch, TensorFlow, Hugging Face Transformers, LangChain
  • Model Types: LLMs (GPT, BERT, T5), Computer Vision models, Embedding models

CUDA & GPU Expertise:

  • CUDA Programming: CUDA C/C++
  • GPU Optimization: Memory management, kernel optimization, multi-GPU scaling
  • Performance Profiling: NVIDIA Nsight, nvprof, CUDA profiler
  • GPU Architectures: Understanding of Ampere, Hopper, Ada Lovelace architectures
  • Tensor Operations: TensorRT optimization, ONNX Runtime
  • Memory Management: GPU memory optimization, batch processing strategies

Platform & Infrastructure:

  • Containerization: Docker, NVIDIA Container Toolkit, GPU-enabled containers
  • Orchestration: Kubernetes with GPU scheduling, NVIDIA GPU Operator
  • Cloud Platforms: AWS (EC2 P/G instances), Azure (NC/ND series), GCP (A2/N1 instances)
  • Model Serving: TorchServe, TensorFlow Serving, Triton Inference Server

Suman G. Bakshi | Recruitment Head

Global IT Con LLC, Wilmington, DE 19801

Email:  sum...@globalitcon.com

https://www.linkedin.com/in/sumanbakshi/

