New Collections
The MATCH Screening Trial with Expert Annotations
The MATCH Screening Trial: Targeted Therapy Directed by Genetic Testing in Treating Patients with Advanced Refractory Solid Tumors, Lymphomas, or Multiple Myeloma (EAY131) is a NCI‑sponsored precision‑oncology study evaluating whether genetically targeted therapies can benefit patients with advanced, treatment‑refractory solid tumors, lymphomas, or multiple myeloma. It has 2,813 patients with a wide range of cancer types, including bladder, brain, breast, colorectal, lung, melanoma, ovarian, pancreatic, prostate, sarcomas, thyroid, lymphomas, multiple myeloma, and many others.
Expert‑reviewed imaging annotations were also created for patients enrolled in the MATCH Screening Trial (EAY131-Tumor-Annotations). The annotations were created as part of an NCI initiative to enhance TCIA collections with high‑quality labels that support cancer research and AI development.
This is one of the largest precision‑oncology screening efforts ever conducted. Making this imaging dataset publicly available with expert annotations supports reproducible research, cross‑cancer biomarker discovery, and development of generalizable imaging‑AI models.
|
The University of Texas Southwestern Glioma MRI Dataset
The University of Texas Southwestern Glioma MRI dataset with molecular marker characterization and segmentations (UTSW‑Glioma) dataset is a large, well‑curated brain tumor imaging dataset from The University of Texas Southwestern Medical Center. It is designed to support research on MRI‑based prediction of glioma molecular markers. It includes multi‑contrast pre‑operative MRI, clinical and demographic data, molecular profiles, and expert multi‑label tumor segmentations for 625 patients treated between 2006 and 2023. This dataset can enable development and benchmarking of deep learning models for non‑invasive prediction of glioma molecular markers. It supports research into MRI‑genomic relationships and provides a robust foundation for reproducible AI‑driven glioma studies.
The University of Texas Southwestern Glioma MRI dataset with molecular marker characterization and segmentations (UTSW‑Glioma) dataset
High Grade Serous Carcinoma Precursor Lesion
The High Grade Serous Carcinoma precursor lesion digital slide images (TP53-Precursor-Lesions) collection contains whole‑slide digital pathology images of p53 signatures and precursor lesions associated with high‑grade serous carcinoma (HGSC), the most lethal gynecologic cancer. It offers a rare, well‑characterized resource for studying the earliest detectable changes in the fallopian tube epithelium. There are 78 whole‑slide images from a retrospective cohort of 242 women over age 50 who underwent benign gynecologic surgery. It is one of the few publicly available resources focused specifically on pre‑malignant fallopian tube lesions, making it valuable for both research and education.
|
Hancock: Multimodal Head and Neck Cancer Dataset
The HANCOCK Collection is a multimodal dataset of 763 head and neck squamous cell carcinoma patients. It brings together digital pathology whole‑slide images, immunohistochemistry tissue microarrays, structured clinical data, de‑identified clinical text, and expert annotations to support integrated research in AI and precision oncology. By combining histology, clinical variables, and clinical narratives, the dataset enables studies in digital pathology, multimodal machine learning, prognostic modeling, and biomarker discovery, helping researchers explore how complementary data types improve disease characterization and outcome prediction while advancing reproducible cancer research.
|
TCIA is funded by the Cancer Imaging Program (CIP) and is managed by CIP and the Frederick National Laboratory for Cancer Research (FNLCR). CIP is part of the Division of Cancer Treatment and Diagnosis (DCTD) of the National Cancer Institute. For complete information about the Cancer Imaging Program, please see the Cancer Imaging Program website.
|