Hello NumFOCUS Mentors,
I hope you are doing well.
My name is Shaurya Singh, and I am an MCA student with a strong interest in data engineering and interactive data visualization. I would like to contribute to your organization for Google Summer of Code 2026.
I have built an end-to-end football analytics platform that combines:
• A data warehouse (MySQL, star schema design) integrating multiple football datasets
• An ETL pipeline for data cleaning, transformation, and loading
• An interactive analytics dashboard using Panel (HoloViz)
The system allows users to:
• Analyze player performance (goals, assists)
• Compare players interactively
• Apply filters (team, position, league)
• Explore trends and top contributors
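To make the filter and comparison behaviour concrete, here is a minimal sketch of the kind of logic that could sit behind the dashboard widgets. It is illustrative only and not taken from the repository: the `PlayerStats` fields, the function names `apply_filters` and `compare`, and the sample values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlayerStats:
    """One player's season totals (hypothetical shape, for illustration)."""
    name: str
    team: str
    position: str
    league: str
    goals: int
    assists: int


# Made-up sample data; not real players or real statistics.
PLAYERS = [
    PlayerStats("Alice Example", "Team A", "FW", "League X", 12, 4),
    PlayerStats("Bob Sample", "Team B", "MF", "League X", 5, 9),
    PlayerStats("Cara Demo", "Team A", "FW", "League Y", 0, 3),
]


def apply_filters(players, team=None, position=None, league=None):
    """Keep players that match every filter that is set; None means 'any'."""
    wanted = {"team": team, "position": position, "league": league}
    return [
        p for p in players
        if all(value is None or getattr(p, field) == value
               for field, value in wanted.items())
    ]


def compare(a, b):
    """Return per-metric differences between two players (a minus b)."""
    return {"goals": a.goals - b.goals, "assists": a.assists - b.assists}


if __name__ == "__main__":
    for player in apply_filters(PLAYERS, team="Team A"):
        print(player.name, player.goals, player.assists)
    print(compare(PLAYERS[0], PLAYERS[1]))
```

In a dashboard, each keyword argument would typically be bound to a dropdown widget, so that changing a selection re-runs the filter.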
🔗 GitHub Repository:
https://github.com/DA-Shaurya/Sports-Analytics/tree/main/football-analytics-data-warehouse
---
## Proposal Overview
### Title:
Interactive Football Analytics Platform using Data Warehousing & HoloViz
### Problem:
Many open-source sports analytics tools focus on either backend data processing or frontend visualization, but do not combine the two in a single integrated system.
### Proposed Solution:
I propose building a complete analytics platform:
Raw Data → ETL → Data Warehouse → SQL Analytics → Interactive Dashboard
Key features include:
• Scalable star schema warehouse
• Efficient analytical queries
• Interactive dashboard with filters and comparisons
• Trend and performance visualization
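The pipeline and the first two features above can be sketched end to end in a few lines. This is a simplified illustration under stated assumptions, not the proposed implementation: it uses Python's built-in `sqlite3` as a stand-in for MySQL so that it runs without a server, and the table names (`dim_player`, `dim_team`, `fact_player_stats`), the helper functions, and the sample rows are all hypothetical.

```python
import sqlite3

# Made-up raw rows, as an extract step might deliver them (all strings).
RAW_ROWS = [
    {"player": " Alice Example ", "team": "Team A", "position": "FW",
     "league": "League X", "goals": "12", "assists": "4"},
    {"player": "Bob Sample", "team": "Team B", "position": "MF",
     "league": "League X", "goals": "5", "assists": "9"},
    {"player": "Cara Demo", "team": "Team A", "position": "FW",
     "league": "League X", "goals": "", "assists": "3"},
]

SCHEMA = """
CREATE TABLE dim_team (
    team_id INTEGER PRIMARY KEY, team_name TEXT, league TEXT,
    UNIQUE (team_name, league));
CREATE TABLE dim_player (
    player_id INTEGER PRIMARY KEY, player_name TEXT UNIQUE, position TEXT);
CREATE TABLE fact_player_stats (
    player_id INTEGER REFERENCES dim_player (player_id),
    team_id INTEGER REFERENCES dim_team (team_id),
    goals INTEGER, assists INTEGER);
"""


def clean(row):
    """Transform: trim text fields and coerce counts to int (blank becomes 0)."""
    out = {k: row[k].strip() for k in ("player", "team", "position", "league")}
    out["goals"] = int(row["goals"] or 0)
    out["assists"] = int(row["assists"] or 0)
    return out


def build_warehouse(rows):
    """Load cleaned rows into an in-memory star schema and return the connection."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    for r in map(clean, rows):
        # Dimensions are inserted once; repeats are ignored via UNIQUE.
        conn.execute("INSERT OR IGNORE INTO dim_team (team_name, league) "
                     "VALUES (?, ?)", (r["team"], r["league"]))
        conn.execute("INSERT OR IGNORE INTO dim_player (player_name, position) "
                     "VALUES (?, ?)", (r["player"], r["position"]))
        team_id = conn.execute(
            "SELECT team_id FROM dim_team WHERE team_name = ? AND league = ?",
            (r["team"], r["league"])).fetchone()[0]
        player_id = conn.execute(
            "SELECT player_id FROM dim_player WHERE player_name = ?",
            (r["player"],)).fetchone()[0]
        conn.execute("INSERT INTO fact_player_stats VALUES (?, ?, ?, ?)",
                     (player_id, team_id, r["goals"], r["assists"]))
    return conn


def top_contributors(conn, league, limit=10):
    """Rank players in one league by goals plus assists."""
    return conn.execute(
        """
        SELECT p.player_name, t.team_name, f.goals + f.assists AS contributions
        FROM fact_player_stats AS f
        JOIN dim_player AS p ON p.player_id = f.player_id
        JOIN dim_team AS t ON t.team_id = f.team_id
        WHERE t.league = ?
        ORDER BY contributions DESC, p.player_name
        LIMIT ?
        """, (league, limit)).fetchall()


if __name__ == "__main__":
    warehouse = build_warehouse(RAW_ROWS)
    for name, team, total in top_contributors(warehouse, "League X"):
        print(f"{name} ({team}): {total}")
```

Keeping descriptive attributes in the dimension tables and only keys and measures in the fact table is what lets the dashboard filters map directly onto `WHERE` clauses over the dimensions.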
### Tech Stack:
Python, Panel (HoloViz), Pandas, MySQL, SQL, hvPlot
---
## Timeline (Brief)
Weeks 1–2: Data ingestion & schema design
Weeks 3–4: ETL pipeline & warehouse setup
Weeks 5–6: Dashboard development
Weeks 7–8: Filtering & comparison features
Weeks 9–10: Query system & optimization
Weeks 11–12: Testing, documentation, polishing
---
## Why Me
I have already developed a working prototype of this system, demonstrating my ability to design and implement real-world analytics workflows. I am comfortable working with data pipelines, SQL analytics, and interactive visualization tools.
I am fully available during the GSoC period and can dedicate 30–40 hours per week. I am committed to maintaining regular communication and contributing actively throughout the program.
---
I would really appreciate your feedback on this idea and how I can align it better with your organization’s goals.
Looking forward to your guidance.
Best regards,
Shaurya Singh
Email: shauryasin...@gmail.com
GitHub: https://github.com/DA-Shaurya