Data
Scientist
New
Ulm, MN
12
Months Contract to Hire
GC
or US Citizens only.
The
Impact You'll Make in this Role As a contractor, you will support the
development and maintenance of deep learning models for medical document
analysis. You will focus on training, evaluating, and troubleshooting models,
processing large text datasets, and running experiments in our AWS environment.
You will work with the technical lead to ensure model quality and deliverables,
while operating independently in day-to-day execution.
You
Will Make an Impact By:
- Training,
fine-tuning, and evaluating deep learning models (transformers and related
architectures)
- Processing,
merging, and analyzing large-scale text datasets
- Troubleshooting
model behavior, parameters, and training pipelines
- Running
experiments and documenting results
- Deploying
models into our AWS-based environment using established tools and
workflows
Required
Qualifications
- Strong
Python skills, especially with PyTorch and Transformers
- Experience
training and debugging deep learning models for text
- Solid
grounding in statistics, EDA, and machine learning concepts
Exploratory Data Analysis (EDA) in
machine learning involves analyzing datasets to summarize their main
characteristics, often using visualizations and statistical techniques. It
helps identify patterns, relationships, and anomalies in the data,
- Ability
to work in AWS
and use GitHub-based workflows
- Strong
communication and ability to work independently
Preferred
Qualifications
- Experience
with LLMs, prompt-based methods, or agentic AI
- Familiarity
with PySpark or large-scale ETL for textual data
- Experience
with experiment tracking and MLOps tools
- Background
in healthcare or medical text (nice to have, not required)