The gene information section lists the gene name (HUGO Gene Nomenclature Committee (HGNC) name if available), any approved gene synonyms, Ensembl gene description, and the Entrez gene summary from the National Center for Biotechnology Information.
The chromosomal and cytoband location of the gene according to Ensembl is reported together with the Ensembl gene identifier and Ensembl database version. The Entrez gene identifier for the gene is also given. If any of the protein products of the gene is linked to a UniProt KB/SWISS-PROT entry, links to the UniProt and the neXtProt databases for these proteins are displayed.
Gene name
CTSF (HGNC Symbol)
Synonyms
CATSF, CLN13
Description
cathepsin F (HGNC Symbol)
Entrez gene summary
Cathepsins are papain family cysteine proteinases that represent a major component of the lysosomal proteolytic system. Cathepsins generally contain a signal sequence, followed by a propeptide and then a catalytically active mature region. The very long (251 amino acid residues) proregion of the cathepsin F precursor contains a C-terminal domain similar to the pro-segment of cathepsin L-like enzymes, a 50-residue flexible linker peptide, and an N-terminal domain predicted to adopt a cystatin-like fold. The cathepsin F proregion is unique within the papain family cysteine proteases in that it contains this additional N-terminal segment predicted to share structural similarities with cysteine protease inhibitors of the cystatin superfamily. This cystatin-like domain contains some of the elements known to be important for inhibitory activity. CTSF encodes a predicted protein of 484 amino acids which contains a 19 residue signal peptide. Cathepsin F contains five potential N-glycosylation sites, and it may be targeted to the endosomal/lysosomal compartment via the mannose 6-phosphate receptor pathway. The cathepsin F gene is ubiquitously expressed, and it maps to chromosome 11q13, close to the gene encoding cathepsin W. [provided by RefSeq, Jul 2008]
The protein view displays protein features. The tabs at the top of the protein view section can be used to switch between the different splice variants encoded by this gene. The mouse over function displays additional data for the features in the protein view.
At the top of the protein view, the maximum percent sequence identity of the protein to all other proteins from other human genes is shown, using a sliding window of 10 aa residues (HsID 10) or 50 aa residues (HsID 50) (read more).
If a signal peptide is predicted by a majority of the signal peptide predictors SPOCTOPUS, SignalP 4.0 and Phobius (turquoise) and/or transmembrane regions (orange) are predicted by MDM, these are displayed.
Common (purple) and unique (grey) regions between alternative processed transcripts from the same gene are also displayed (read more), and at the bottom of the protein view is the protein scale.
The protein information section displays the alternative protein-coding transcripts (splice variants) encoded by this gene, according to the Ensembl database.
The ENSP identifier links to the Ensembl website for that protein, and the ENST identifier links to the Ensembl website for that transcript. The data in the UniProt column can be expanded to show links to all matching UniProt identifiers for this protein.
The protein classes to which this protein has been assigned are shown if expanding the data in the protein class column. Parent protein classes are in bold font and subclasses are listed under the parent class.
The Gene Ontology terms assigned to this protein are listed if expanding the Gene ontology column.
The length of the protein (amino acid residues) (according to Ensembl), molecular mass (kDalton), predicted signal peptide (according to a majority of the signal peptide predictors SPOCTOPUS, SignalP 4.0 and Phobius and predicted transmembrane region(s) (according to MDM) are also reported.