CRDC, NCI Cloud Resources, and Data Nodes

The vision for the Cancer Research Data Commons (CRDC) is a virtual, expandable infrastructure that provides secure access to many different data types across scientific domains, allowing users to analyze, share, and store results, leveraging the storage and elastic compute, or ability to easily scale resources, of the cloud. The ability to combine diverse data types and perform cross-domain analysis of large data sets can lead to new discoveries in cancer prevention, treatment and diagnosis, and supports the goals of precision medicine and the Cancer Moonshot.
https://datascience.cancer.gov/data-commons

Figure 1: NCI Cancer Research Data Commons (CRDC website, https://datascience.cancer.gov/data-commons)

Figure 1: NCI Cancer Research Data Commons (CRDC website, https://datascience.cancer.gov/data-commons)

National Cancer Institute (NCI) Cancer Research Data Commons (CRDC) provides an ecosystem that enables access to many NCI funded programs including through interconnected infrastructures like the below (text from CRDC):


FireCloud Powered by Terra

“Broad Institute FireCloud is a NCI Cloud Resource project powered by Terra for biomedical researchers to access data, run analysis tools, and collaborate.”
FireCloud: https://firecloud.terra.bio
Resource Portal: https://portal.firecloud.org


Institute for Systems Biology ISB Cloud

“The ISB Cancer Genomics Cloud is democratizing access to NCI Cancer Data (TCGA, TARGET, CCLE) and coupling it with unprecedented computational power to allow researchers to explore and analyze this vast data-space.”
ISB-CGC: https://isb-cgc.appspot.com
Resource Portal: https://isb-cgc.appspot.com


Seven Bridges Cancer Genomics Cloud

“The Cancer Genomics Cloud (CGC), powered by Seven Bridges, is one of three systems funded by the National Cancer Institute to explore the paradigm of colocalizing massive public datasets, like The Cancer Genomics Atlas (TCGA), alongside secure and scalable computational resources to analyze them. The CGC makes more than two petabytes of multi-dimensional data available immediately to authorized researchers. You can add your own data to analyze alongside the public datasets using predefined analytical workflows or your own tools. Every execution is fully reproducible, and collaborating with your team is simple and secure.”
Seven Bridges CGC: http://www.cancergenomicscloud.org/
Resource Portal: https://cgc-accounts.sbgenomics.com/