Greetings from GarreIT Solutions Inc.,, & I hope you are doing great.
Senior Big Data Engineer
Term: 12+ months
Interview Format: In-person
Location: Reston, VA
Rate: $60-70/hr
The Informatics Team needs one (1) Senior Big Data Engineer contractor to build distributed components, systems, and tools that power decisions at FEPOC. This individual will work as part of the Informatics Team to optimize our use of Big Data supporting the enhancement of our data solutions to incorporate a new product.
Background:
The Senior Big Data Engineer is an experienced technical software development professional who can design and develop complex solutions within our Cloudera clusters. This person must be able to support the integration of Big Data technologies into our mainstream data solutions from architecture through implementation.
Tasks:
Candidate will support the design, development and implementation of Big Data solutions for the Informatics Teams
Assess requirements and determine design for Informatics solutions to be implemented in Big Data
Design, build and launch extremely efficient & reliable data pipelines to move data (both large and small amounts) to our Data Hub
Code development using HIVE, Spark Streaming, Kafka, and Flume
Work closely with Ab Initio ETL developers to leverage that technology as appropriate within our Cloudera Big Data environment
Required Qualifications
· 7+ years of full-time, industry experience
· 3+ years of Solution Design and Development experience using various Hadoop components such as Hive, Sqoop, Oozie, SPARK
· Highly proficient with programming languages (Java/Scala/Python) and Shell scripting
· 2+ years of working knowledge of relational databases and query authoring (SQL)
· Rigor in high code quality, automated testing, and other engineering best practices; ability to write reusable code components
Preferred Qualifications
BS/MS in Mathematics, Engineering, or Computer Science
Working knowledge of U.S. Healthcare Industry
Experience working with large data environments – petabytes or hundreds/thousands of terabytes
Experience with No SQL technologies such as HBase or Cassandra.
Experience with real-time data processing using Flume/Kafka
Past experience with data warehouse/data analytics solutions and technologies, especially ETL tools