Role: AWS Data Engineer
Location: Bay Area, CA/Hybrid Role/Need Only Local
Job Type: Contract
Job Description:
We are seeking a highly skilled Data Engineer with expertise in modern data platforms, semantic data modeling, and cloud-native lakehouse architectures. In this role, you will be responsible for designing and implementing a scalable semantic data layer that enables self-service analytics, consistent business intelligence, and efficient data access across the organization. You will leverage Dremio, Apache Iceberg, and AWS data services to build high-performance, governed, and scalable data solutions.
Responsibilities:
• Design, develop, and maintain enterprise-scale semantic data layers using Dremio to support self-service analytics and business intelligence initiatives.
• Build and manage semantic models, virtual datasets, business views, and curated data products to ensure consistent reporting and analytics across teams.
• Implement and optimize Apache Iceberg-based lakehouse architectures for scalable and reliable data storage and management.
• Leverage Apache Arrow technologies to enable high-performance, columnar query processing and analytical workloads.
• Integrate Dremio with AWS cloud data platforms including Amazon S3, Amazon Redshift, and Amazon RDS.
• Design and implement data virtualization and query federation solutions across multiple data sources.
• Develop and maintain logical and physical data models aligned with business requirements and enterprise data standards.
• Implement data governance, metadata management, data cataloging, and access control frameworks to ensure data quality and compliance.
• Optimize query performance, data access patterns, and workload management across the data platform.
• Collaborate with business stakeholders, analytics teams, and data architects to translate reporting and analytics requirements into scalable technical solutions.
• Support data platform modernization initiatives and promote best practices for data engineering and analytics enablement.
Required Skills:
• 5+ years of experience in Data Engineering, Data Warehousing, or Analytics Engineering roles.
• Hands-on experience with Dremio, including semantic layer design, virtual datasets, reflections, and query optimization.
• Strong expertise in SQL, data modeling, and dimensional modeling techniques.
• Experience working with Apache Iceberg for modern lakehouse implementations and table management.
• Solid understanding of Apache Arrow and columnar data processing concepts.
• Experience integrating and managing data platforms on AWS, including S3, Redshift, and RDS.
• Strong knowledge of data virtualization, query federation, and distributed query processing architectures.
• Experience implementing data governance, metadata management, data lineage, and security controls.
• Familiarity with ETL/ELT processes, data pipelines, and large-scale analytical workloads.
• Strong analytical and problem-solving skills with the ability to optimize complex data environments.
• Excellent communication skills and ability to work closely with business and technical stakeholders.
Preferred Skills:
• Experience with cloud-native lakehouse architectures and modern data platform ecosystems.
• Familiarity with data catalog and governance tools such as Collibra, Alation, AWS Glue Data Catalog, or similar platforms.
• Experience working with large-scale enterprise analytics and business intelligence environments.
• Knowledge of data orchestration tools such as Apache Airflow or similar workflow platforms.
• Experience with AWS certifications such as AWS Certified Data Engineer or AWS Certified Solutions Architect.
• Familiarity with Python, Spark, or other big data processing technologies.
• Understanding of modern data mesh, data fabric, and self-service analytics principles.