Skip to content

TCGA

Description

The Cancer Genome Atlas (TCGA), a landmark cancer genomics program, molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types. This joint effort between NCI and the National Human Genome Research Institute began in 2006, bringing together researchers from diverse disciplines and multiple institutions.

TCGA has generated over 2.5 petabytes of genomic, epigenomic, transcriptomic, and proteomic data. The data, which has already led to improvements in our ability to diagnose, treat, and prevent cancer, will remain publicly available for anyone in the research community to use.

Click here to visualize the list of tumor types available in TCGA. For each cancer type, TCGA published an overview of the characterizations performed and an initial analysis of the data.

Data access

You can find the data from TCGA in the folder:

/workspace/datasets/intogen_datasets/genomes/tcga_20171006/filtered

You can find clinical data from samples from TCGA in the folder:

/workspace/datasets/intogen_datasets/genomes/tcga_20171006/metadata

Citing in Publications and Presentations

Click here to learn how to cite TCGA.

Reference

  • Paula Gomis
  • Monica Sanchez