GSoC Inquiry - Data Science and Analytical Backend

31 views
Skip to first unread message

Anant Jamuar

unread,
Jan 26, 2026, 6:58:38 PMJan 26
to cBioPortal for Cancer Genomics Discussion Group

Hi cBioPortal Team,

My name is Anant Jamuar and I am a college student planning to apply for GSoC 2026. I have a strong interest in Data Science and Machine Learning, and I’ve been analyzing how these disciplines integrate with the portal’s analytical engine.

I am particularly interested in the logic underlying statistical enrichment, survival analysis, and the ongoing transition to ClickHouse for high-performance data processing. I also noticed the early-stage infrastructure for AI integrations and MCP, and I am curious if the team is looking for contributions that bridge genomic visualisation with advanced predictive modelling or AI-driven querying.

As there are a few months before GSoC projects are finalised, I would like to spend this time contributing actively to the codebase. I am interested in building a prototype or working on an analytical module to better understand the portal's data architecture and demonstrate my technical fit for these types of projects.

Could you suggest which analytical modules or performance-oriented tasks are high priority for this year's cycle?

Thank you!

Anant Jamuar

de Bruijn, Ino

unread,
Jan 28, 2026, 6:44:19 PMJan 28
to Anant Jamuar, cBioPortal for Cancer Genomics Discussion Group
Hi Anant,

Thanks for reaching out! The best way to get started is to:

1. Browse open issues — look for good first issue or help wanted labels on GitHub
2. Pick something that interests you and open a PR — doesn't need to be big
3. Join our community on Slack (https://slack.cbioportal.org) if you have questions

Since you mentioned MCP and AI integrations — check out the newly released MCP Apps spec (https://blog.modelcontextprotocol.io/posts/2026-01-26-mcp-apps/). It allows tools to return interactive UI components (dashboards, visualizations) directly in AI clients. Building MCP Apps for cBioPortal's genomic visualizations could be an interesting GSoC project direction

We don't typically assign tasks pre-GSoC; applicants who do well are the ones who dive in and start contributing. Your PR history will speak louder than the proposal

Looking forward to seeing what you pick up!

Best,
Ino

From: cbiop...@googlegroups.com <cbiop...@googlegroups.com> on behalf of Anant Jamuar <jamua...@gmail.com>
Date: Monday, January 26, 2026 at 6:58 PM
To: cBioPortal for Cancer Genomics Discussion Group <cbiop...@googlegroups.com>
Subject: [EXTERNAL] [cbioportal] GSoC Inquiry - Data Science and Analytical Backend

Be Careful With This Message
This message came from outside MSK.
 
--
You received this message because you are subscribed to the Google Groups "cBioPortal for Cancer Genomics Discussion Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cbioportal+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/cbioportal/bbde4848-b616-4461-a3eb-6d2c858ec365n%40googlegroups.com.
=====================================================================

Please note that this e-mail and any files transmitted from
Memorial Sloan Kettering Cancer Center may be privileged, confidential,
and protected from disclosure under applicable law. If the reader of
this message is not the intended recipient, or an employee or agent
responsible for delivering this message to the intended recipient,
you are hereby notified that any reading, dissemination, distribution,
copying, or other use of this communication or any of its attachments
is strictly prohibited. If you have received this communication in
error, please notify the sender immediately by replying to this message
and deleting this message, any attachments, and all copies and backups
from your computer.

Disclaimer ID:MSKCC
Reply all
Reply to author
Forward
0 new messages