ARTICLE ABSTRACTThe development of single-cell RNA sequencing (scRNA-seq) technologies has greatly contributed to deciphering the tumor microenvironment (TME). An enormous amount of independent scRNA-seq studies have been published representing a valuable resource that provides opportunities for meta-analysis studies. However, the massive amount of biological information, the marked heterogeneity and variability between studies, and the technical challenges in processing heterogeneous datasets create major bottlenecks for the full exploitation of scRNA-seq data. We have developed IMMUcan scDB (https://immucanscdb.vital-it.ch), a fully integrated scRNA-seq database exclusively dedicated to human cancer and accessible to nonspecialists. IMMUcan scDB encompasses 144 datasets on 56 different cancer types, annotated in 50 fields containing precise clinical, technological, and biological information. A data processing pipeline was developed and organized in four steps: (i) data collection; (ii) data processing (quality control and sample integration); (iii) supervised cell annotation with a cell ontology classifier of the TME; and (iv) interface to analyze TME in a cancer type–specific or global manner. This framework was used to explore datasets across tumor locations in a gene-centric (CXCL13) and cell-centric (B cells) manner as well as to conduct meta-analysis studies such as ranking immune cell types and genes correlated to malignant transformation. This integrated, freely accessible, and user-friendly resource represents an unprecedented level of detailed annotation, offering vast possibilities for downstream exploitation of human cancer scRNA-seq data for discovery and validation studies.
The IMMUcan scDB database is an accessible supportive tool to analyze and decipher tumor-associated single-cell RNA sequencing data, allowing researchers to maximally use this data to provide new insights into cancer biology.