Strong Experience in Scala, Spark, hive
SQL, Hadoop and Kafka
Proficiency in Hive and SQL optimization.
Understanding
of distributed systems and big data architecture.
Knowledge of streaming frameworks (Spark
Streaming, Kafka Streams).
Good to have – Aerospike experience
Skills required:
Experience : 6-9 Years
Must have Primary skills
required Cloudera (Hadoop), Spark + Scala or Spark + Java and SQL
The resources should also have
good understanding of Hive, Aerospike.
The resources should have
strong analytical skills.
Should have worked on large
scale ETL and DW projects and pipelines.
Real time data streaming
experience and batch orchestration, data quality and reconciliation,
understanding of concepts like Data Governance is a must.
Strong communication skills and
ability to independently work and troubleshoot problems and come up with
solutions.