Job Opening || Site Reliability Engineer SRE – ML platform || Austin, TX and Sunnyvale, CA (Onsite)

2 views
Skip to first unread message

pradeep bhondwe

unread,
Feb 20, 2026, 9:34:19 AM (3 days ago) Feb 20
to pradeep...@ktekresourcing.com

Hello,


My name is Pradeep Bhondve, and I work as a Technical Recruiter for K-Tek Resourcing.
 
We are searching for Professionals below business requirements for one of our clients. Please read through the requirements and connect with us in case it suits your profile.

Please see the Job Description and if you feel Interested then send me your updated resume at Pradeep.bhondve@ktekresourcing.com or give me a call at  . 



Job Title: Site Reliability Engineer SRE – ML platform
Location: Austin, TX and Sunnyvale, CA (Onsite)
Duration: Long Term


  • Please share profiles only with 
  •  Strong hands-on ML Platform Experience
Job Opening:

We are seeking an experienced Site Reliability Engineer (SRE) – ML Platform / MLOps Engineer to support the reliability, scalability, and performance of our Machine Learning platforms. This role focuses on building and operating production-grade ML systems using Kubernetes, Python, cloud infrastructure, and modern MLOps practices.

The ideal candidate has strong experience in MLOps, cloud-native architecture, containerization, and CI/CD, along with a solid understanding of ML models and Large Language Models (LLMs). You will work closely with data scientists, ML engineers, and software teams to design, deploy, and maintain robust ML pipelines and services.


Key Responsibilities

  • Design, deploy, and maintain scalable ML platforms using Kubernetes, Docker, and cloud services (primarily AWS)

  • Build and operate end-to-end MLOps pipelines, including model training, validation, deployment, and monitoring

  • Ensure high availability, reliability, and performance of ML production systems

  • Develop automation tools and services using Python

  • Implement and manage CI/CD pipelines for ML and microservices workloads

  • Support ML workloads involving LLMs and traditional ML models

  • Collaborate with data scientists to productionize models and optimize workflows

  • Administer Linux systems and troubleshoot infrastructure issues

  • Design cloud-native microservices and APIs for ML applications

  • Manage and integrate data stores such as MongoDB and search platforms like Apache Solr

  • Implement monitoring, alerting, logging, and benchmarking for ML systems

  • Translate business requirements into technical solutions

  • Contribute to best practices around testing, security, and operational excellence


Required Experience

  • 6+ years of hands-on experience in MLOps / SRE / Platform Engineering

  • Strong proficiency in Python

  • Extensive experience with Kubernetes and containerized environments

  • Solid knowledge of AWS (or Azure/GCP) cloud platforms

  • Experience with MongoDB

  • Strong Linux administration skills

  • Experience with microservices architectures

  • Hands-on experience with CI/CD pipelines

  • Working knowledge of ML models and Large Language Models (LLMs)

  • Experience productionizing ML systems built with open-source tools


Technical Skills

  • Python

  • Kubernetes & Docker

  • AWS (or Azure/GCP)

  • MongoDB

  • Microservices Architecture

  • Apache Solr

  • MLOps frameworks (Kubeflow, MLflow, Airflow, DataRobot, Argo, etc.)

  • CI/CD pipelines

  • Linux system administration

  • REST APIs and cloud integrations


Preferred Qualifications

  • Experience with workflow orchestration tools such as Kubeflow, Airflow, or Argo

  • Experience building custom integrations between cloud-based systems using APIs

  • Strong understanding of software testing, benchmarking, and continuous integration

  • Exposure to ML methodology and best practices

  • Ability to design and implement cloud-based ML solutions

  • Excellent communication skills and ability to collaborate across teams



image.png

Pradeep Bhondve   

Talent Acquisition Specialist,

KTEK Resourcing LLC

O 832.632.9328 | 832.968.6386

E Pradeep.bhondve@ktekresourcing.com

Linkedin: https://www.linkedin.com/in/pradeep-bhondve-aba57b166/

W www.ktekresourcing.com

A 9494 Southwest Freeway, Suite #350, Houston, TX -77074


Reply all
Reply to author
Forward
0 new messages