GCP Data Engineer with Python/Scala, DataProc, BigQuery Exp. Remote || Need Opt's only || W2 only |
Duration: 6 Months This is not for AWS/Azure Data Engineers |
Consultant LinkedIn profile should be created before 2018 || No Junk Profiles please. Need 3-5+ yrs of IT Exp. Profiles || Need Passport number, I94, Travel History documents during submission for EAD |
Additional Skills: |
Strong knowledge of data processing in Scala or Python. |
Experience with data modeling and query optimization. |
Deep understanding of BigQuery architecture, best practices, and performance optimization. |
Proficiency in LookML for building data models and metrics. |
Experience with DataProc for running Hadoop/ Spark jobs on GCP. |
Knowledge of configuring and optimizing DataProc clusters |
Responsibilities: |
Design and implement data pipelines in GCP DataProc Cluster |
Develop data models and schema to support business requirements |
Ensure data quality and reliability by developing automated checks in place |
Optimize performance and ensure adherence to SLAs |
Document and share learnings with the rest of the team. |
Required skills |
GCP/ Big query Experience – 1 year minimum |
Scala/ Python - 4+ years |
DataProc for running workloads – 6+ months |
Preferred skills |
Building data models in LookML |