Greetings,
I'm part of a collaborative open science project called Project Cognoma (
https://github.com/cognoma/cognoma). We are building a website to help biologists do machine learning on TCGA data. Our current plan is to use the TCGA Pan-Cancer data from Xena, which is available at
https://genome-cancer.soe.ucsc.edu/proj/site/xena/datapages/?cohort=TCGA%20Pan-Cancer%20(PANCAN).
I didn't see a license specified for this data. Due to past experience (
https://doi.org/bfmk), I try to clarify all data licensing issues before proceeding with data. We would like to reproduce and publish derivatives of the data under an open license, preferably CC0. Is TCGA data on Xena free of copyright because it was produced by the United States Government? Or is the data subject to copyright and available under a specific license? Or is the data publicly available without a specified license, which defaults to all rights reserved?
Best,
Daniel