Senior Big Data Engineer
Quantity: 2
Term: 12+ months
Interview Format: 1st – Phone, 2nd – In-person
Location: Reston, VA
Rate: $65/hr
Client needs two (2) Senior Big Data Engineer contractors to build distributed components, systems, and tools that power decisions at FEPOC. Each individual will work as part of the Informatics Team to optimize our use of Big Data, supporting the enhancement of our data solutions to incorporate a new product.
Background:
The Senior Big Data Engineer is an experienced technical software development professional who can design and develop complex solutions within our Cloudera clusters. This person must be able to support the integration of Big Data technologies into our mainstream data solutions from architecture through implementation.
Tasks:
Candidate will support the design, development and implementation of Big Data solutions for the Informatics Teams
· Assess requirements and determine design for Informatics solutions to be implemented in Big Data
· Design, build and launch extremely efficient & reliable data pipelines to move data (both large and small amounts) to our Data Hub
· Develop code using Hive, Spark Streaming, Kafka, and Flume
· Work closely with Ab Initio ETL developers to leverage that technology as appropriate within our Cloudera Big Data environment
Required Qualifications
· 7+ years of full-time, industry experience
· 3+ years of Solution Design and Development experience using various Hadoop components such as Hive, Sqoop, Oozie, and Spark
· Highly proficient with programming languages (Java/Scala/Python) and Shell scripting
· 2+ years of working knowledge of relational databases and query authoring (SQL)
· Rigor in high code quality, automated testing, and other engineering best practices; ability to write reusable code components
Lead Big Data Engineer with HBase/Solr Expertise
Term: 12+ months
Interview Format: 1st – Phone, 2nd – In-person
Location: Reston, VA
Rate: $75/hr
Tasks:
Candidate will support the design, development and implementation of Big Data solutions for the Informatics Teams
· Assess requirements and determine best technologies and design patterns to utilize for data-centric solutions
· Design, build and launch extremely efficient & reliable data pipelines for real-time streaming, search, and indexing solutions
· Write code and mentor developers using Hive, Spark Streaming, HBase, and Solr
· Work closely with Cloudera Administrators to optimize the usage of our clusters and plan for future expansion and usage
· Work in an agile environment, collaborating with Solution Architects, Scrum Masters, Developers and Testers
Required Qualifications
· 7+ years of full-time, industry experience
· 3+ years of Solution Design and Development experience using various Hadoop components such as Hive, Sqoop, Oozie, and Spark
· Must have implemented HBase/Solr for real-time search/indexing at production scale
· Highly proficient with programming languages (Java/Scala/Python) and Shell scripting
· 2+ years of working knowledge of relational databases and query authoring (SQL)
· Rigor in high code quality, automated testing, and other engineering best practices; ability to write reusable code components
Preferred Qualifications:
· BS/MS in Mathematics, Engineering, or Computer Science
· Working knowledge of U.S. Healthcare Industry
· Experience working with large data environments – petabytes or hundreds/thousands of terabytes
· Experience with NoSQL technologies such as HBase or Cassandra
· Experience with real-time data processing using Flume/Kafka
· Past experience with data warehouse/data analytics solutions and technologies, especially ETL tools
Thanks & Regards
Veera
GarreIT Solutions LLC.,
Phone – 732-639-5080 Ext: 179
Email Id: ve...@garreit.com