|
Exact Job
Location/Work Address
|
Requirements are
scattered across 4 main locations: Richmond VA, McLean, VA, Wilmington,
DE, Chicago, IL
|
|
Required
Technologies
|
- Strong Programming
experience with object-oriented/object function scripting languages: Python, PySpark, Scala,
etc.
- Experience with big data
tools: Hadoop, Apache Spark, Kafka, etc.
- Experience with AWS
cloud services: S3, EC2, EMR, RDS, Redshift
- Experience with
stream-processing systems: Storm, Spark-Streaming, etc.
- Experience with
relational SQL, Snowflake and NoSQL databases, including Postgres and
Cassandra.
|
|
Job
Description: Detailed overview of
functional and technical role expectations
|
Candidate with 5+ years of experience in a Data Engineer role,
who has attained a Graduate degree in Computer Science, Statistics,
Informatics, Information Systems or another quantitative field. They should
also have working experience using the following software/tools:
- Strong Programming
experience with object-oriented/object function scripting languages:
Python, PySpark, Scala, etc.
- Experience with big data
tools: Hadoop, Apache Spark, Kafka, etc.
- Experience with AWS
cloud services: S3, EC2, EMR, RDS, Redshift
- Experience with
stream-processing systems: Storm, Spark-Streaming, etc.
- Experience with
relational SQL, Snowflake and NoSQL databases, including Postgres and
Cassandra.
Responsibilities for Data Engineer:
- Create and maintain
optimal data pipeline architecture, Assemble large, complex data sets
that meet functional / non-functional business requirements.
- Identify, design,
and implement internal process improvements: automating manual
processes, optimizing data delivery, re-designing infrastructure for
greater scalability etc.
- Build the
infrastructure required for optimal extraction, transformation, and
loading of data from a wide variety of data sources using SQL and AWS
‘Big data’ technologies.
- Build analytics tools
that utilize the data pipeline to provide actionable insights into
customer acquisition, operational efficiency and other key business
performance metrics.
- Work with
stakeholders including the Executive, Product, Data and Design teams to
assist with data-related technical issues and support their data
infrastructure needs.
- Create data tools for
analytics and data scientist team members that assist them in building
and optimizing our product into an innovative industry leader.
|