Hi Folks,
My client is looking for a Site Reliability Engineer for a 12-month contract role based in Parsippany, NJ (onsite).
Position: Site Reliability Engineer
Location: Parsippany, NJ (Local Only)
Duration: 12-Month Contract
POSITION OVERVIEW: Site Reliability Engineer
We are looking for a talented Site Reliability Engineer (SRE) with a strong background in Google Cloud Platform (GCP) and Kubernetes.
The ideal candidate will be responsible for ensuring the reliability, performance, and scalability of our on-premises and cloud-based systems, with a focus on reducing Google Cloud costs.
POSITION GENERAL DUTIES AND TASKS:
• System Reliability: Ensure the reliability and uptime of critical services and infrastructure.
• Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud services.
• Automation: Develop and maintain automation scripts and tools to improve system efficiency and reduce manual intervention.
• Monitoring and Incident Response: Implement monitoring solutions and respond to incidents to minimize downtime and ensure quick recovery.
• Collaboration: Work closely with development and operations teams to improve system reliability and performance.
• Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle future growth.
• Documentation: Create and maintain comprehensive documentation for system configurations, processes, and procedures.
Qualifications:
• Education: Bachelor’s degree in Computer Science, Engineering, or a related field.
• Experience: 4+ years of experience in site reliability engineering or a similar role.
Skills:
• Proficiency in Google Cloud services (Compute Engine, Kubernetes Engine, Cloud Storage, BigQuery, Pub/Sub, etc.).
• Familiarity with Google BI and AI/ML tools (Looker, BigQuery ML, Vertex AI, etc.).
• Experience with automation tools (Terraform, Ansible, Puppet).
• Familiarity with CI/CD pipelines and tools (Azure Pipelines, Jenkins, GitLab CI, etc.).
• Strong scripting skills (Python, Bash, etc.).
• Knowledge of networking concepts and protocols (service mesh experience is a plus).
• Experience with monitoring tools (Prometheus, Grafana, etc.).
Preferred Certifications:
• Google Cloud Professional DevOps Engineer
• Google Cloud Professional Cloud Architect
Thanks
Sid