Hi,
I'm getting " Entrez gene id not known to the cBioPortal instance. This record will not be loaded. Might be new or deprecated Entrez gene id. " warning during data validation of my CNA data, which suggests my gene reference data may be outdated or incomplete.
I've tried matching against gene_info.txt from the data curation tools on cBioPortal GitHub, but still encountering recognition issues with what appear to be valid current gene IDs/symbols.
Question: Does this meant I need to update my cBioPortal instance with the latest gene reference data, or can I somehow add missing genes to the existing gene database?
Since ignoring these genes will have about 10% of my data being dropped, so I wish that I could keep these data.
Any guidance on the proper procedure to refresh or supplement the gene reference data would be appreciated.
I deployed cBioPortal v6.2.0 via Docker on 22nd May 2025, following the guide.
Thanks!
Regards,
Ji Hen