Cluster and dataset comparison

Inclusion Criteria for Data in the Single Cell Type Resource

The single cell RNA-seq datasets were retrieved from published studies based on healthy human tissues. We performed a meta-analysis of the scRNA-seq literature and searched single cell databases, including the Single Cell Expression Atlas (https://www.ebi.ac.uk/gxa/sc/home), the Human Cell Atlas (https://www.humancellatlas.org), the Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/), the Tabula Sapiens (https://tabula-sapiens-portal.ds.czbiohub.org/), the Allen Brain Atlas (https://portal.brain-map.org/) and the European Genome-phenome Archive (https://www.ebi.ac.uk/ega/). To avoid technical bias and to ensure that the single cell datasets best represent the corresponding tissues, we applied the following criteria for data selection: (1) single cell RNA sequencing was performed on single cell suspensions from tissues without pre-enrichment of cell types; (2) datasets included >3,000 cells and 20 million read counts; (3) pseudo-bulk gene expression profiles were highly correlated with bulk RNA-seq profiles. In total, datasets from 30 tissue types and human blood were included. The samples, their references, and cluster details are listed here.

Tabula Sapiens

The Tabula Sapiens project (Tabula Sapiens Consortium* et al. (2022)) includes nearly 500,000 cells from 24 different tissues and organs. The data is publicly available (https://tabula-sapiens.sf.czbiohub.org/) and included in the CZ CellxGene tool, where you can explore separate cell types across tissues, here exemplified by epithelial cells across the different tissue samples. Currently, 12 tissue types in the aggregated HPA single cell type data, which is used for cell type classification, are imported from the Tabula Sapiens (blood, bone marrow, eye, lung, lymph node, prostate, salivary gland, spleen, thymus, tongue, urinary bladder and vascular).
Additionally, another 23 datasets are currently included in the HPA Single Cell Type resource for comparison and validation of cell cluster expression profiles. For the tissue datasets represented by Tabula Sapiens, the original clustering is added to the gene detail pages so that the cluster expression overviews can be compared.

Tabula Sapiens clustering with HPA clustering of the same data

For the 12 tissue types (blood, bone marrow, eye, lung, lymph node, prostate, salivary gland, spleen, thymus, tongue, urinary bladder and vascular*) represented by Tabula Sapiens data in the HPA aggregated cell type expression profile, the addition of Tabula Sapiens' own clustering details enables comparison and verification of the robustness of the HPA pipeline. The examples compare two views of the same data: Tabula Sapiens data processed with the HPA pipeline and clustering methods, and Tabula Sapiens data with Tabula Sapiens clustering details.

Eye
RHO is a protein enriched in the rod and cone photoreceptor cells of the retina.

Lung
DNAI2 is a protein enriched in ciliated cells, highly expressed in the cilia cluster of the lung sample.

Prostate
KLK3 is a protein enriched in prostatic glandular cells.

Salivary gland
LPO is a protein enriched in serous glandular cells of the salivary gland.

Thymus
THEMIS is a protein enriched in T-cells.

Tongue
KRT5 is a protein enriched in basal keratinocytes.

Vascular
SELE is a protein enriched in endothelial cells, and specifically detected in the endothelial cell clusters.

Blood
GP9 is a platelet-enriched protein, detected in the erythroid cluster of both datasets.

Bone marrow
HBD is an erythroid-enriched protein, detected in the erythroid cluster of both datasets.

Lymph node
MS4A1 is a B-cell specific protein.

Spleen
IGHA2 is a protein enriched in plasma cells.

Tabula Sapiens comparison with non-Tabula Sapiens

For the tissues represented by non-Tabula Sapiens data, the addition of Tabula Sapiens clustering data provides a dataset for comparison and result validation.
Here, we show examples of expression overview in each of the tissues that are represented by a non-Tabula Sapiens dataset and compare the cell type expression profile with the Tabula Sapiens results. The comparison for these tissues is available for each protein-coding gene at the gene detail page.

Adipose tissue
In the HPA cell type aggregated data, cell data representing the adipose tissue is based on data from Lazarescu O et al. (2025).

Heart muscle
In the HPA cell type aggregated data, cell data representing heart muscle is based on data from Koenig AL et al. (2022).

Kidney
In the HPA cell type aggregated data, cell data representing the kidney is based on data from Lake BB et al. (2023).

Liver
In the HPA cell type aggregated data, cell data representing the liver is based on data from MacParland SA et al. (2018).

Pancreas
In the HPA cell type aggregated data, cell data representing the pancreas is based on data from Craig-Schapiro R et al. (2025).

Skeletal muscle
In the HPA cell type aggregated data, cell data representing skeletal muscle is based on data from Pass CG et al. (2023).

Skin
In the HPA cell type aggregated data, cell data representing the skin is based on data from Solé-Boldo L et al. (2020).

Small intestine
In the HPA cell type aggregated data, cell data representing the small intestine is based on data from Wang Y et al. (2020).
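
Inclusion criteria (2) and (3) and the dataset comparisons on this page all rest on aggregated expression profiles. The following is a minimal sketch of how such a check could look. It is not the HPA pipeline: the function names, the counts-per-million scaling, the log1p transform, the use of Pearson correlation and the `min_corr` cutoff of 0.8 are all assumptions made for illustration, since the text only states that profiles were "highly correlated". The example input values are invented.

```python
import math

# Thresholds mirroring criterion (2). Whether the limits are inclusive and
# how the read count is measured are assumptions in this sketch.
MIN_CELLS = 3_000
MIN_READS = 20_000_000


def pseudo_bulk(counts):
    """Sum per-cell count vectors into one profile, scaled to counts per million."""
    n_genes = len(counts[0])
    totals = [sum(cell[g] for cell in counts) for g in range(n_genes)]
    library = sum(totals)
    return [t * 1e6 / library for t in totals]


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def passes_criteria(n_cells, n_reads, counts, bulk, min_corr=0.8):
    """Apply the size thresholds, then compare pseudo-bulk with bulk on log1p values.

    min_corr is a hypothetical cutoff; the source gives no numeric value.
    """
    if n_cells <= MIN_CELLS or n_reads < MIN_READS:
        return False
    pb = [math.log1p(v) for v in pseudo_bulk(counts)]
    ref = [math.log1p(v) for v in bulk]
    return pearson(pb, ref) >= min_corr


# Toy input: 3 cells x 4 genes, plus an invented bulk profile for the same genes.
cells = [[5, 0, 1, 10], [3, 1, 0, 12], [4, 0, 2, 9]]
bulk = [400_000, 30_000, 90_000, 900_000]
print(passes_criteria(5_000, 25_000_000, cells, bulk))
```

The same `pseudo_bulk` and `pearson` helpers could in principle be applied per cluster to compare a cell type profile from one dataset with the matching profile from another, which is the kind of comparison shown on the gene detail pages.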