The germ cell-specific proteome

Germ cells give rise to the gametes in humans, which are cells that can undergo meiosis, as opposed to somatic cells that undergo mitosis. Gametes are diploid germ cells that undergo development into haploid eggs and sperm through oogenesis and spermatogenesis.

  • 4103 elevated genes
  • 1339 enriched genes
  • 585 group enriched genes
  • Main function: Gamete development

Transcriptome analysis shows that 67% (n=13603) of all human proteins (n=20162) are detected in germ cells and 4103 of these genes show an elevated expression in any germ cells compared to other cell type groups. In-depth analysis of the elevated genes in germ cells using scRNA-seq and antibody-based protein profiling allowed us to visualize the expression patterns of these proteins in the following types of germ cells: spermatogenic cell types of the testis (undifferentiated spermatogonia, differentiating spermatogonia, early primary spermatocytes, late primary spermatocytes, early spermatids and late spermatids), and oocytes in the ovaries.


The germ cell transcriptome

The scRNA-seq-based germ cell transcriptome can be analyzed with regard to specificity, illustrating the number of genes with elevated expression in each specific germ cell type compared to other cell types (Table 1). Genes with an elevated expression are divided into three subcategories:

  • Cell type enriched: At least four-fold higher mRNA level in a certain cell type compared to any other cell type.
  • Group enriched: At least four-fold higher average mRNA level in a group of 2-5 cell type groups (out of 53) or 2-15 cell types (out of 154) compared to any other.
  • Cell type enhanced: At least four-fold higher mRNA level in a certain cell type compared to the average level in all other cell types.


Table 1. Number of genes in the subdivided specificity categories of elevated expression in the analyzed germ cell types compared to other grouped cell types.

Cell type Tissue origin Cell type enrichedGroup enrichedCell type enhancedTotal elevated
Spermatogenic cell types testis 1283 546 1921 3750
Oocytes ovary 56 107 395 558
Any germ cells 1339 585 2179 4103


Spermatogenic cell types

Spermatogenic cells represent the developmental stages of sperm formation, progressing from undifferentiated spermatogonia to mature spermatids. This process involves successive differentiation and meiosis, as spermatogonia develop into primary spermatocytes and then spermatids, ensuring the production of haploid sperm capable of fertilization.

Table 2. Number of genes in the subdivided specificity categories of elevated expression in the analyzed spermatogenic cell types compared to all other cell types.

Cell type Tissue origin Cell type enrichedGroup enrichedCell type enhancedTotal elevated
Undifferentiated spermatogonia testis 4 52 343 399
Differentiating spermatogonia testis 1 68 454 523
Early primary spermatocytes testis 33 105 766 904
Late primary spermatocytes testis 28 424 1701 2153
Early spermatids testis 31 658 1823 2512
Late spermatids testis 244 576 2001 2821
Any Spermatogenic cell types 341 818 3672 4831
Expression profiles are compared across the 154 cell types, in addition to the comparison across the 53 grouped cell types. This results in cell specificity categories based on two datasets, one more detailed across all cell types and one more suitable for the general overview based on the grouped cell types with similar expression profile and functions. Cell types are grouped based on function and biology, to facilitate a better overview and to complement the detailed information based on all cell types.

Undifferentiated spermatogonia

Diploid primitive germ cells residing on the basement membrane of seminiferous tubules. They undergo asymmetric division to self-renew and produce differentiating (type B) spermatogonia, maintaining the spermatogonial stem cell pool. As shown in the table, 399 genes show elevated expression in undifferentiated spermatogonia compared to other cell types. The majority of genes with elevated expression in is shared with differentiating spermatogonia and early primary spermatocytes, such is the case for MAGEC2.


MAGEC2 - testis

MAGEC2 - testis

MAGEC2 - testis


Differentiating spermatogonia

Diploid germ cells derived from undifferentiated spermatogonia that have committed to the spermatogenic lineage. They proliferate and mature toward the primary spermatocyte stage, marking the transition to meiosis. As shown in the table, 523 genes show elevated expression in differentiating spermatogonia compared to other cell types. An example of proteins with elevated expression in differentiating spermatogonia is sarcoma antigen 1 (SAGE1), which is a cancer related gene that belongs to the Cancer Testis Antigen (CTA) family, suggested to constitute important targets for immunotherapy.



SAGE1 - testis

SAGE1 - testis

SAGE1 - testis


Early primary spermatocytes

Spermatocytes are derived from type B spermatogonia and can be subdivided into primary spermatocytes that enter the first meiosis, and secondary spermatocytes that enter the second meiosis to produce haploid spermatids. The longest phase of meiosis is prophase I which is subdivided into different stages including preleptotene and pachytene. Several of the testis-specific proteins localized to spermatocytes are involved in testicular differentiation, proliferation, and meiosis. As shown in the table, 904 genes show elevated expressio in early primary spermatocytes compared to other cell types. Pachytene spermatocytes are easily recognizable by being the only meiotic germ cells observed on testicular histology sections due to the long duration of meiotic prophase I. The structural synaptonemal complex protein (SYCP3) is involved in the recombination and segregation of meiotic chromosomes. The synaptogyrin 4 (SYNGR4) protein is a membranous protein that belongs to the synaptogyrin family and its function is unclear.



SYCP3 - testis

SYCP3 - testis

SYCP3 - testis


Late primary spermatocytes

Germ cells progressing through late prophase I toward the first meiotic division. They will generate haploid secondary spermatocytes. As shown in in the table, 2153 genes are elevated in late primary spermatocytes compared to other cell types. An example of a protein with elevated expression in late primary spermatocytes is A Disintegrin and A Metalloprotease domain 2 (ADAM2), also called fertilin beta, involved in the sperm-egg membrane binding.


ADAM2 - testis

ADAM2 - testis

ADAM2


Early spermatids

Spermatids result from the division of secondary spermatocytes and hence meiosis completion. These cells are divided into early spermatids (round spermatids) and late spermatids (elongated spermatids). Although spermatids are not divided, they undergo a complex process of metamorphosis (called spermiogenesis). Early spermatids are transcriptionally active. The late spermatid will eventually mature into spermatozoa which will be released in the seminiferous tubule lumen. The DNA will become densely packed and form an acrosome and a mid area with mitochondria. As shown in Table 2, 2512 genes are elevated in early spermatids compared to other cell types. One example of an early spermatid protein is the sperm acrosome associated 4 (SPACA4), which is a sperm surface membrane protein that may be involved in sperm-egg plasma membrane adhesion and fusion during fertilization.



SPACA4 - testis

SPACA4 - testis

SPACA4 - testis


Late spermatids

Different from the early spermatids, late spermatids are transcriptionally inert when the acrosome is fully developed because of the tightly packed chromatin. As shown in Table 2, 2821 genes are elevated in late spermatids compared to other cell types. An examples of a cell type enriched protein in late spermatids is spermatogenesis associated 3 (SPATA3)



SPATA3 - testis

SPATA3 - testis

SPATA3 - testis


Oocytes

The female germ cells, oocytes, are produced in the ovaries, in an anatomical structure called the ovarian follicle. Oogenesis is a discontinuous process that starts during fetal life with the development of the primary oocyte. A single layer of granulosa cells surrounds each primary oocyte, arrested in prophase I until puberty. The granulosa cells, in turn, are enclosed in a thin layer of extracellular matrix, called the zona pellucida. These structures are referred to as primordial follicles. At puberty, the primordial follicles eventually develop into primary, secondary, and tertiary vesicular follicles. Each month, typically only one developing primary follicle becomes dominant and achieves complete maturation to release the oocyte into the fallopian tube.

Cell type Tissue origin Cell type enrichedGroup enrichedCell type enhancedTotal elevated
Oocytes ovary 56 62 816 934

As shown in the table, 934 genes are elevated in Oocytes compared to all other cell types. Examples of cell type enriched proteins in oocytes include folliculogenesis specific bHLH transcription factor (FIGLA) and zona pellucida glycoprotein 3 (ZP3). FIGLA is a transcription factor that plays an important role in the regulation of oocyte-specific genes essential for normal ovarian follicular development, including ZP3, a component of the zona pellucida, a structure that is important for subsequent fertilization by a sperm cell.



FIGLA - ovary

FIGLA - ovary

FIGLA - ovary



ZP3 - ovary

ZP3 - ovary

ZP3 - ovary


Germ cell function

Germ cells are precursor cells to the gametes of the human male and female. These cells are the only cells in the body that undergo meiosis instead of mitosis as compared to other somatic cells. The germ cells will differentiate into egg and sperm cells through spermatogenesis (sperm cell development) and oogenesis (egg cell development). When germ cells differentiate into unfertilized eggs and sperm they will become haploid (half of the genetic code) gametes through meiosis. Haploid cells, when merged, will contribute to genetic differences in the offspring, creating individuals that are genetically different.

The histology of organs that contain germ cells, including interactive images, is described in the Protein Atlas Histology Dictionary.


Background

Here, the protein-coding genes expressed in germ cells are described and characterized, together with examples of immunohistochemically stained tissue sections that visualize corresponding protein expression patterns of genes with elevated expression in different germ cell types.

The transcript profiling was based on publicly available genome-wide expression data from scRNA-seq and snRNA-seq experiments (36 datasets) covering 34 tissues. All datasets (unfiltered read counts of cells) were clustered independenty using leiden clustering, resulting in a total of 1175 different cell type clusters. The clusters were then manually annotated based on a survey of known tissue and cell type-specific markers. The RNA-seq data from each cluster of cells was aggregated to mean normalized protein-coding counts per million (nCPM) for all protein-coding genes. A specificity and distribution classification was performed for both single cell types individually, as well as grouped into 53 main cell type groups. The specificity classification determined the number of elevated genes, while the distribution determined whether genes are detected in one, several or all cell types or cell type groups.

It should be noted that since the analysis was limited to datasets representing 34 tissue types, not all human cell types are represented. Furthermore, some cell types are present only in low amounts, or identified only in mixed cell clusters, which may affect the results and bias the cell type specificity.


Relevant links and publications

Uhlén M et al., Tissue-based map of the human proteome. Science (2015)
PubMed: 25613900 DOI: 10.1126/science.1260419

Fagerberg L et al., Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics. Mol Cell Proteomics. (2014)
PubMed: 24309898 DOI: 10.1074/mcp.M113.035600

Guo J et al., The adult human testis transcriptional cell atlas. Cell Res. (2018)
PubMed: 30315278 DOI: 10.1038/s41422-018-0099-2

Wagner M et al., Single-cell analysis of human ovarian cortex identifies distinct cell populations but no oogonial stem cells. Nat Commun. (2020)
PubMed: 32123174 DOI: 10.1038/s41467-020-14936-3