PyTorch & Triton Engineer
Bellevue, WA (Onsite)
Contract
Note: Performance testing engineer
(Python + AI workloads) who can stress test systems, find problems early, and
improve system performance and stability.
Job Summary
Design and implement high intensity stress workloads using
PyTorch and Triton to identify performance bottlenecks and improve platform
stability and maturity
Job Description
Design and implement high intensity stress workloads
using PyTorch and Triton Exercise core MAIA execution paths including compute
memory DMA and collectives Enable early detection of performance cliffs
stability issues and system bottlenecks across simulator and real hardware
Improve platform maturity reduce latestage escapes and increase confidence for
broader internal and external adoption Develop PyTorch workloads stressing
modellevel execution such as large GEMMs attention patterns MoElike behavior
mixed precision and longrunning loops Author custom Triton kernels to stress
hardware execution units memory hierarchies and synchronization paths Build
parameterized stress harnesses scalable by problem size number of devices and
runtime duration Integrate workloads with existing profiling monitoring and
failure triage tooling Collaborate with platform firmware and SDK teams to
target known risk areas and emerging issues Document usage patterns and provide
reproducible scripts for lab and continuous integration CI usage
Roles and Responsibilities
Develop and maintain a library of reusable PyTorch
stress workloads Create Tritonbased micro and macrokernels designed
specifically for stress and saturation testing Build and support test harnesses
and scripts for singledevice and multidevice execution Ensure workload designs
align with platform risk areas and emerging hardwaresoftware issues Collaborate
crossfunctionally with platform firmware and SDK teams to refine stress tests
Provide comprehensive documentation describing workload intent configuration
options and expected stress characteristics Support profiling monitoring and
failure triage by integrating stress workloads with existing tools Deliver
reproducible and scalable testing solutions for lab and CI environments
Regards
Pavan
VDart Inc
Email: Pavank...@vdartinc.com
Website: https://vdart.com


Confidentiality Notice
The information contained in this message may be privileged and confidential and protected from disclosure. If the reader of this message is not the intended recipient, or an employee or agent responsible for delivering this message to the intended recipient, you are hereby notified that any dissemination, distribution or copying of this communication is strictly prohibited. If you have received this communication in error, please notify us immediately by replying to the message and deleting it from your computer.