All transcripts of all genes have been analyzed regarding the location(s) of corresponding protein based on prediction methods for signal peptides and transmembrane regions.
Genes with at least one transcript predicted to encode a secreted protein, according to prediction methods or to UniProt location data, have been further annotated and classified with the aim to determine if the corresponding protein(s) are secreted or actually retained in intracellular locations or membrane-attached.
Remaining genes, with no transcript predicted to encode a secreted protein, will be assigned the prediction-based location(s).
The annotated location overrules the predicted location, so that a gene encoding a predicted secreted protein that has been annotated as intracellular will have intracellular as the final location.
The RNA data was used to cluster genes according to their expression across tissues. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.
The RNA specificity category is based on mRNA expression levels in the consensus dataset which is calculated from the RNA expression levels in samples from HPA and GTEX. The categories include: tissue enriched, group enriched, tissue enhanced, low tissue specificity and not detected.
Tissue enhanced (intestine, kidney, placenta)
The regional specificity category is based on mRNA expression levels in the analysed brain samples, grouped into 13 main brain regions and calculated for the three different species. All brain expression profiles are based on data from HPA. The specificity categories include: regionally enriched, group enriched, regionally enhanced, low regional specificity and not detected. The classification rules are the same used for the tissue specificity category.
Low region specificity
Low region specificity
Tau specificity scorei
Tau specificity score is a numerical indicator of the specificity of the gene expression across cells or tissues. The value ranges from 0 and 1, where 0 indicates identical expression across all cells/tissue types, while 1 indicates expression in a single cell/tissue type.
The regional distribution category is based on mRNA expression detected above cut off or not in the analysed brain samples, grouped into 13 main brain regions and calculated for the three different species. Brain expression for all species is based on data from HPA. The distribution categories include: detected in all, detected in many, detected in some, detected in single and not detected. The classification rules are the same used for the tissue distribution category.
Detected in all
Detected in all
A summary of the overall protein localization across the analyzed brain tissue. The summary is based on manual annotation.
Mouse brain profiles includes a more extensive summary of the protein location while summaries with only human protein profile are more limited and are based on imported data/images from the Tissue Atlas stainings that has undergone an additional brain-centric curation step.
Protein detected in microglia in all brain regions.
Normalized RNA expression levels (nTPM) shown for the 13 brain regions. Color coding is based on brain region and the bar shows the highest expression among the subregions included. To access sample data, click on region name or bar.
Read more about normalized expression levels in <a href="/about/assays+annotation#normalization_rna"_blank">Assays & Annotation.
Additional Prefrontal Cortex dataset
COMPARISON BRAIN RNA EXPRESSION
GTEX AND FANTOM HUMAN BRAIN
GTEx Human brain RNA-Seq dataseti
GTEx dataset RNA-seq tissue data generated by the Genotype-Tissue Expression (GTEx) project is reported as mean nTPM, corresponding to mean values of the different individual samples for respective subregion. Highest expression among the subregions represents the brain region. To access sample data, click on region name or bar.
FANTOM5 dataset Tissue data for RNA expression obtained through Cap Analysis of Gene Expression (CAGE) generated by the FANTOM5 project are reported as Scaled Tags Per Million. To access sample data, click on region name or bar.
HPA Pig dataset HPA RNA-seq tissue data is reported as mean nTPM (normalized expression) for each of the brain regions analyzed in pig. The detailed pages (reached when clicking a bar or regional name) show nTPM values at the individual sample level. To access sample data, click on region name or bar.
HPA Mouse dataset HPA RNA-seq tissue data is reported as mean nTPM (normalized expression) for each of the brain regions analyzed in mouse. The detailed pages (reached when clicking a bar or regional name) show nTPM values at the individual sample level. To access sample data, click on region name or bar.
The Human brain protein data is based on curated and manually selected Tissue Atlas data. The standard brain regions used in the Tissue Atlas are cerebral cortex, caudate nucleus, hippocampus and cerebellum, only selected cases include information on hypothalamus or retina. The score is based on knowledge-based annotation of the protein location in the main cell types. For genes where more than one antibody has been used, a collective score is set displaying the estimated true protein expression.
Each bar represents the highest expression score found in a particular tissue. Protein expression scores are based on a best estimate of the "true" protein expression from a knowledge-based annotation, described more in detail under Assays & annotation. For genes where more than one antibody has been used, a collective score is set displaying the estimated true protein expression.
EXPRESSION CLUSTERING & CORRELATIONi
The RNA data was used to cluster genes according to their expression across samples. The resulting clusters have been manually annotated to describe common features in terms of function and specificity. The annotation of the cluster is displayed together with a confidence score of the gene's assignment to the cluster. The confidence is calculated as the fraction of times the gene was assigned to this cluster in repeated calculations and is reported between 0 to 1, where 1 is the highest possible confidence. The clustering results are shown in a UMAP, where the cluster this gene was assigned to is highlighted as a colored area in which most of the cluster genes reside. A table shows the 15 most similar genes in terms of expression profile.
SCIN is part of cluster 20Macrophages & Microglia - Immune response with confidencei
Confidence is the fraction of times a gene was assigned to the cluster in repeated clustering, and therefore reflects how strongly associated it is to the cluster. A confidence of 1 indicates that the gene was assigned to this cluster in all repeated clusterings.