posted on 2024-05-02, 07:44authored byErika Kim, Tanja Davidsen, Brandi N Davis-Dusenbery, Alexander Baumann, Angela Maggio, Zhaoyi Chen, Daoud Meerzaman, Esmeralda Casas-Silva, David Pot, Todd Pihl, John Otridge, Eve Shalley, Jill S. Barnholtz-Sloan, Anthony R. Kerlavage
CRDC Dataset Citations by Year
Funding
Center for Biomedical Informatics and Information Technology (CBIIT)
History
ARTICLE ABSTRACT
More than ever, scientific progress in cancer research hinges on our ability to combine datasets and extract meaningful interpretations to better understand diseases and ultimately inform the development of better treatments and diagnostic tools. To enable the successful sharing and use of big data, the NCI developed the Cancer Research Data Commons (CRDC), providing access to a large, comprehensive, and expanding collection of cancer data. The CRDC is a cloud-based data science infrastructure that eliminates the need for researchers to download and store large-scale datasets by allowing them to perform analysis where data reside. Over the past 10 years, the CRDC has made significant progress in providing access to data and tools along with training and outreach to support the cancer research community. In this review, we provide an overview of the history and the impact of the CRDC to date, lessons learned, and future plans to further promote data sharing, accessibility, interoperability, and reuse.See related articles by Brady et al., p. 1384, Wang et al., p. 1388, and Pot et al., p. 1396