GSoC 2026 Proposal – Football Analytics Platform (Panel + Data Warehouse)

14 views
Skip to first unread message

Shaurya Singh

unread,
Mar 30, 2026, 2:47:19 PM (5 days ago) Mar 30
to NumFOCUS GSOC
Hello NumFOCUS Mentors,

I hope you are doing well.

My name is Shaurya Singh, and I am an MCA student with a strong interest in data engineering and interactive data visualization. I am very interested in contributing to your organization for Google Summer of Code 2026.

I have built an end-to-end football analytics platform that combines:

• A data warehouse (MySQL, star schema design) integrating multiple football datasets  
• An ETL pipeline for data cleaning, transformation, and loading  
• An interactive analytics dashboard using Panel (HoloViz)  

The system allows users to:
• Analyze player performance (goals, assists)  
• Compare players interactively  
• Apply filters (team, position, league)  
• Explore trends and top contributors  

🔗 GitHub Repository: 
https://github.com/DA-Shaurya/Sports-Analytics/tree/main/football-analytics-data-warehouse

---

##  Proposal Overview

### Title:
Interactive Football Analytics Platform using Data Warehousing & HoloViz

### Problem:
Many open-source sports analytics tools either focus on backend data processing or frontend visualization, but lack an integrated system that combines both effectively.

### Proposed Solution:
I propose building a complete analytics platform:

Raw Data → ETL → Data Warehouse → SQL Analytics → Interactive Dashboard

Key features include:
• Scalable star schema warehouse  
• Efficient analytical queries  
• Interactive dashboard with filters and comparisons  
• Trend and performance visualization  

### Tech Stack:
Python, Panel (HoloViz), Pandas, MySQL, SQL, hvPlot

---

## Timeline (Brief)

Weeks 1–2: Data ingestion & schema design  
Weeks 3–4: ETL pipeline & warehouse setup  
Weeks 5–6: Dashboard development  
Weeks 7–8: Filtering & comparison features  
Weeks 9–10: Query system & optimization  
Weeks 11–12: Testing, documentation, polishing  

---

##  Why Me

I have already developed a working prototype of this system, demonstrating my ability to design and implement real-world analytics workflows. I am comfortable working with data pipelines, SQL analytics, and interactive visualization tools.

I am fully available during the GSoC period and can dedicate 30–40 hours per week. I am committed to maintaining regular communication and contributing actively throughout the program.

---

I would really appreciate your feedback on this idea and how I can align it better with your organization’s goals.

Looking forward to your guidance.

Best regards,  
Shaurya Singh  
Email: shauryasin...@gmail.com 
GitHub: https://github.com/DA-Shaurya
Reply all
Reply to author
Forward
0 new messages