Share resumes at Deepak...@nvish.com
Title: Site Reliability Engineer (GenAI Platform Focus)
Location: Austin, TX
Duration: 12 Months + Possible Extension
Rate: $45/hr C2C (Max)
Work Model: Onsite/Hybrid as required
Role Overview
We are seeking a Site Reliability Engineer (SRE) with strong cloud and Kubernetes expertise, and exposure to GenAI workloads, to support scalable infrastructure and platform reliability initiatives.
This role focuses on maintaining and optimizing cloud-native systems running on AWS, with a strong emphasis on container orchestration, observability, automation, and infrastructure reliability for AI-driven applications.
Required Skills
Strong experience with AWS and AWS EKS
Hands-on Kubernetes experience
Experience with: Docker, Helm, GitHub / GitOps, Jenkins, Ansible
Strong Linux/Unix and Bash scripting skills
Knowledge of networking fundamentals (TCP/IP, SSL, HTTPS, SFTP)
Monitoring tools: Prometheus, Grafana, Splunk
Python scripting experience
Experience supporting DevOps/SRE environments
Exposure to GenAI workloads (deployment, scaling, or infrastructure support)