Role: Python SRE Engineer
Location: Austin, TX (Hybrid)
$60/hr on C2C
Description:
Looking
for an experienced Site Reliability Engineer. In this role you will design,
build and deliver highly scalable, reliable, secure cloud infrastructure which
powers the applications and services used by Client's customers every day. You
will work closely with cross functional teams, business leaders and other
partners across Client to implement new solutions. If infrastructure as code,
automation and intelligent monitoring excites you then this is the job for you.
Minimum
Qualifications:
- 5+ years’ experience in designing and building resilient,
large-scale, low latency, cloud and on-prem Infrastructure including
Compute, Storage, and Network
- Deep expertise in building, deploying and managing Kubernetes
clusters using Spinnaker and Helm
- Experience in monitoring using Splunk or ELK stack, Grafana,
Prometheus, Alertmanager
- Experience in setting up and managing CI/CD pipeline using Jenkins
Preferred
Qualifications:
- Experience in Linux Shell Scripting, Python, Terraform
- Cloud architecture, building reliable, scalable, and secure
Infrastructure as Code
- Troubleshooting of application specific, network, system &
performance issues in production during on-call rotations
- Building automation tools to deliver infrastructure services
reliably and in a repeatable fashion
- Collaborating cross-functionally with distributed teams of software
engineers, quality engineers, or other site reliability engineers to
gather, analyze, and define non-functional/technical requirements and
drive its implementation
- Experience with Cassandra, MongoDB, Couchbase databases, AWS S3 or
similar storage technologies
- Experience deploying and supporting java applications
- Deep understanding of networking protocols: DNS, TCP, HTTP/HTTPS
- Excellent problem solving, critical thinking, and interpersonal
skills
- BS or MS in Computer Science, or equivalent experience.