Role: Big Data / Hadoop Lead
Location: Santa Clara, CA
Duration: 6 – 12 Months
· Minimum 10 years of experience
· Strong understanding of a Hadoop production environment, including HDFS, YARN, Hive, NiFi, Ranger, Atlas, and Spark
· Good understanding of data warehousing concepts and relational star-schema database designs
· Develop and implement ETL frameworks using Python and Spark
· Good knowledge of SQL and relational database models
· Knowledge of Git repository branching and code versioning is a plus
· Understanding of machine learning models is a plus