Hi Vendors
Job Title: Rancher & Kubernetes SME
Location: Princeton - NJ - 08540
Mode : Contract (6+ Months) – Onsite
Qualifications:
• Design and implement Rancher-managed Kubernetes clusters (RKE, RKE2, K3s, EKS, AKS, GKE).
• Architect high availability (HA) Rancher setups.
• Define multi-cluster and multi-tenant strategies using Rancher projects, namespaces, and RBAC.
• Integrate Kubernetes with VMware, Bare Metal, and Cloud platforms.
• Establish standardized cluster blueprints and reference architectures.
• Act as final escalation (L3) for Kubernetes and Rancher incidents.
• Diagnose and resolve Control plane failures
o etcd performance and corruption issues
o Pod scheduling and node pressure issues
o CNI (Calico / Cilium) networking problems
o CSI storage failures (Ceph, Longhorn, EBS, Azure Disk, NFS)
• Perform root cause analysis (RCA) and provide preventive recommendations.
• Install, upgrade, and maintain Rancher Server.
• Manage cluster lifecycles using Rancher UI & APIs.
• Implement and manage Rancher RBAC, Authentication (AD / LDAP / Azure AD / SSO)
• Global & cluster-level policies
• Maintain Rancher backups, DR, and recovery procedures
• Enforce Kubernetes security best practices like Pod Security Standards (PSS)
• Network policies and Secrets management
• integrate Kubernetes with CI/CD tools e.g., GitHub Actions, GitLab CI, Jenkins, Argo CD
• Enable GitOps workflows for application and cluster configuration.
• Support Helm chart development and lifecycle management.
• Assist development teams with Deployment strategies, Resource optimization
• Troubleshooting application issues on Kubernetes
Experience:
• 6–10+ years in Linux / Infrastructure / Cloud
• 3–5+ years hands-on Kubernetes production experience
• Strong expertise in Rancher (RKE / RKE2 / K3s)
• Deep understanding of:
o Kubernetes control plane
o etcd
o Networking (CNI)
o Storage (CSI)