We are seeking an experienced and highly skilled Data Solutions Architect with strong expertise in AWS Glue, ETL Pipeline Development, PySpark, and Python. The ideal candidate will lead the design, development, optimization, and maintenance of scalable data processing solutions within the AWS ecosystem.
The candidate should possess deep technical expertise in building enterprise-grade ETL pipelines, managing cloud-based data platforms, and implementing efficient big data processing solutions using PySpark and AWS Glue.
Areas for evaluation with weightage
Level | Skill Name | Concepts | Expectations | Weightage |
Expert | AWS Glue | ETL, AWS Glue, Data Catalog, Job Scheduling | Demonstrate ability to design, implement, and manage ETL processes using AWS Glue | 20 |
Expert | ETL Pipeline Design | ETL, Data Validation, Data Integrity | Ability to design and implement robust ETL pipelines | 20 |
Proficient | Python | Python, OOP, Libraries, Error Handling | Demonstrate ability to write clean, efficient, and scalable Python code | 20 |
Proficient | PySpark | PySpark, DataFrames, RDD, Spark SQL | Show proficiency in writing and optimizing PySpark scripts for data processing | 20 |
Competent | RDBMS (PostgreSQL) | PostgreSQL - Query and DML Snowflake - SQL, Query Optimization, Database Design | Demonstrate ability to write complex SQL queries and manage PostgreSQL databases | 15 |
Competent | Data Platforms Management | AWS Ecosystem, Data Infrastructure, Security | Manage and optimize data platforms within the AWS ecosystem | 5 |
Pradeep Bhondve Talent Acquisition Specialist, KTEK Resourcing LLC
E Pradeep.bhondve@ktekresourcing.com Linkedin: https://www.linkedin.com/in/pradeep-bhondve-aba57b166/ A 9494 Southwest Freeway, Suite #350, Houston, TX -77074 |