User:Gturns/sandbox

Source: Wikipedia, the free encyclopedia.

C2orf72

Chromosome 2 open reading frame 72 (c2orf72) is a protein that in humans is encoded by the gene c2orf72. The function of this protein is currently unknown.

Gene

This gene consists of 3,629 nucleotides, contains 3 exons and 2 introns, and the protein it encodes is 295 amino acids long.[1][2]

It has orthologs in a number of different organisms, including mice, dogs, fish, frogs, chickens, snakes, and many others. C2orf72 has no known isoforms.

Location

This gene is located in the human genome at 2q37.1.[2] Its transcript contains 3,629 nucleotides, and has 3 exons.

Expression

On average, the most abundant tissues for gene c2orf72 are the liver and brain, while the abundance in the placenta has the potential to be quite high. During the developmental phase, the gene is most prevalent in the intestine and stomach.[3] In the brain, it is found primarily in the cingulate cortex.[4]

Protein

Primary Sequences

The protein C2orf72 is 295 amino acids long. It has an experimental molecular weight of 38 kDa.[5] It is predicted to be located in the cytoplasm.[6] There is a high amount of alanine and arginine relative to other proteins, as they make up 22.4% and 11.5% of the amino acids in the protein respectively.[6]

Post-Translational Modifications

The protein contains both a globular domain and a section of high disorder. It contains no trans-membrane regions.[7]

Secondary Structure and Tertiary Structure

The predicted secondary structure of c2orf72 consists primarily of alpha helices, with significant coiling in the final 30 amino acids.

The secondary structure of protein c2orf72, as predicted by CFSSP[8].
The tertiary structure of protein c2orf72, as modeled by NCBI.[9]







Homology and Evolution

Paralogs

No paralogs exist for protein c2orf72. There is one paralogous domain found in hCG1777421.[10]

Orthologs

Genus and species Common name Taxonomic group Date of divergence from human lineage (MYA) (estimated) Accession number Sequence length (aa) Sequence identity to human protein Sequence similarity to human protein
Homo sapiens Human Primate 0 NP_001138466.1 295 100% 100%
Urocitellus parryii Arctic ground squirrel Rodent 90 XP_026252042.1 286 82% 85%
Microtus ochrogaster Prairie vole Rodent 90 XP_026642192.1 298 68% 75%
Canis lupus dingo Dingo Canids 96 XP_025319191.1 284 80% 83%
Eumeotopias jubatus Steller sea lion Seal 96 XP_027960872.1 281 79% 83%
Phyllostomus discolor Pale spear-nosed bat Chiroptera 96 XP_028364834.1 289 73% 75%
Delphinapterus leucas Beluga whale Cetacean 96 XP_022423643.1 383 58% 60%
Ornithorhynchus anatinus Platypus Monotreme 177 XP_028928459.1 387 26% 34%
Terrapene carolina triunguis Three-toed box turtle Reptile 312 XP_026505723.1 269 40% 50%
Gallus gallus Red junglefowl Bird 312 XP_004943655.1 244 38% 47%
Anas platyrhynchos Mallard Bird 312 XP_027320562.1 251 36% 48%
Pseudonaja textilis Eastern brown snake Reptile 312 XP_026577460.1 272 33% 43%
Pelodiscus sinensis Chinese soft-shelled turtle Reptile 312 XP_025038651.1 252 28% 40%
Python bivittatus Burmese python Reptile 312 XP_025023716.1 279 24% 37%
Xenopus tropicalis Western clawed frog Amphibian 351.8 XP_002937397.2 291 30% 41%
Nanorana parkeri High Himalaya frog Amphibian 351.8 XP_018432004.1 304 27% 39%
Scleropages formosus Asian arowana Fish 435 XP_029110182.1 311 26% 38%
Erpetoichthys calabaricus Reedfish Fish 435 XP_028650671.1 280 24% 38%
Kryptolebias marmoratus Mangrove rivulus Fish 435 XP_017268814.1 317 22% 37%

Distant Homologs

A tree depiction of the evolutionary distance between some homologs of c2orf72.

The most distant homolog discovered is the Mangrove Rivulus.




Interacting Proteins

Abbreviated Name Full Name Aliases Info & Function
CD81 Cluster of differentiation 81 CVID6, S5.7, TAPA1, TSPAN28 Cell surface glycoprotein

Complexes with integrins Appears to promote muscle cell fusion and support myotube maintenance May be involved in signal transduction

KRAS Kirsten rat sarcoma viral oncogene homolog C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A, K-RAS4B, KI-RAS, KRAS1, KRAS2, NS, NS3, RALD, RASK2, K-ras, KRAS proto-oncogene, GTPase, c-Ki-ras2 Acts as a molecular on/off switch

Upon activation, it recruits and activates proteins necessary for the propagation of growth factors, as well as other cell signaling receptors

NRAS Neuroblastoma RAS viral (v-ras) oncogene homolog ALPS4, CMNS, N-ras, NCMS, NRAS1, NS6, Neuroblastoma RAS viral oncogene homolog, NRAS proto-oncogene, GTPase Involved in regulating cell division
SPATA3 Spermatogenesis-associated protein 3 --- Associated with spermatogenesis
PSMD1 26S proteasome non-ATPase regulatory subunit 1 P112, Rpn2, S1, proteasome 26S subunit, non-ATPase 1 ---
GPR55 G-protein coupled receptor 55 --- ---


  1. ^ "uncharacterized protein C2orf72 [Homo sapiens]". NCBI Protein. Retrieved 07/02/2019. {{cite web}}: Check date values in: |access-date= (help)
  2. ^ a b "C2orf72 chromosome 2 open reading frame 72 [ Homo sapiens (human) ]". NCBI Gene. Retrieved 07/02/2019. {{cite web}}: Check date values in: |access-date= (help)
  3. ^ "C2orf72 chromosome 2 open reading frame 72 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-08-04.
  4. ^ "GDS596 / 213143_at". www.ncbi.nlm.nih.gov. Retrieved 2019-08-04.
  5. ^ "C2orf72 Antibody". www.thermofisher.com. Retrieved 2019-08-04.
  6. ^ a b "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2019-08-04.
  7. ^ "ELM - Search the ELM resource". elm.eu.org. Retrieved 2019-08-04.
  8. ^ "CFSSP: Chou & Fasman Secondary Structure Prediction Server". www.biogem.org. Retrieved 2019-08-04.
  9. ^ "CFSSP: Chou & Fasman Secondary Structure Prediction Server". www.biogem.org. Retrieved 2019-08-04.
  10. ^ "hCG1777421 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-07-30.