Job Title: AI
Architect withTrueFoundry
Location: Santa
Clara, CA (Onsite)
Type:
Contract
Job
Summary
We are
seeking an experienced TrueFoundry Architect to design, implement, and govern
enterprise-scale AI/ML platforms. The ideal candidate will have deep expertise
in TrueFoundry, Kubernetes, MLOps, LLMOps, cloud-native architectures, and
Generative AI solutions. The architect will work closely with data scientists,
ML engineers, platform teams, and business stakeholders to build scalable and
secure AI platforms supporting Applied Materials' AI initiatives.
Key
Responsibilities
- Design and implement
enterprise AI/ML platforms using TrueFoundry.
- Architect scalable model
deployment and serving infrastructure on Kubernetes.
- Build and maintain MLOps
and LLMOps pipelines for model training, deployment, monitoring, and
governance.
- Integrate TrueFoundry
with cloud platforms such as Azure, AWS, or GCP.
- Design CI/CD pipelines
for AI applications and machine learning workflows.
- Implement model
observability, monitoring, logging, and performance optimization.
- Develop AI platform
security, governance, and compliance standards.
- Enable deployment of
GenAI and Agentic AI applications using LLM frameworks.
- Collaborate with Data
Science, DevOps, Security, and Infrastructure teams.
- Provide architecture
guidance and technical leadership to engineering teams.
- Conduct architecture
reviews, PoCs, and platform modernization initiatives.
Required
Skills
- 10+ years of IT
experience with 3+ years in AI/ML platform architecture.
- Strong experience with
TrueFoundry platform.
- Expertise in Kubernetes,
Docker, Helm, and container orchestration.
- Experience with MLOps
tools and frameworks.
- Strong Python
programming skills.
- Experience with Azure,
AWS, or GCP cloud services.
- Knowledge of CI/CD tools
such as GitHub Actions, Jenkins, or Azure DevOps.
- Experience deploying and
managing LLM-based applications.
- Understanding of Vector
Databases, RAG, LangChain, LangGraph, or similar frameworks.
- Strong understanding of
AI governance, security, and compliance.
Preferred
Skills
- Experience with Agentic
AI architectures.
- Experience with MLFlow,
Kubeflow, Ray, or KServe.
- Knowledge of NVIDIA GPU
infrastructure and model optimization.
- Semiconductor or
manufacturing domain experience.
- Experience supporting
large enterprise AI transformation programs.
Typical
Tech Stack
- TrueFoundry
- Kubernetes (AKS/EKS/GKE)
- Docker
- Python
- MLFlow
- LangChain / LangGraph
- OpenAI / Azure OpenAI
- Vector DBs (Pinecone,
Weaviate, Chroma)
- Terraform
- Azure DevOps / GitHub
Actions