G2Cdb::Gene report

Gene id
Gene symbol
C22orf28 (HGNC)
Homo sapiens
chromosome 22 open reading frame 28
G00001097 (Mus musculus)

Databases (8)

Curated Gene
OTTHUMG00000030300 (Vega human gene)
ENSG00000100220 (Ensembl human gene)
51493 (Entrez Gene)
1263 (G2Cdb plasticity & disease)
C22orf28 (GeneCards)
Marker Symbol
HGNC:26935 (HGNC)
Protein Expression
1103 (human protein atlas)
Protein Sequence
Q9Y3I0 (UniProt)

Synonyms (1)

  • HSPC117

Literature (11)

Pubmed - other

  • Proteomic analysis of SUMO4 substrates in HEK293 cells under serum starvation-induced stress.

    Guo D, Han J, Adam BL, Colburn NH, Wang MH, Dong Z, Eizirik DL, She JX and Wang CY

    Center for Biotechnology and Genomic Medicine, Medical College of Georgia, 1120 15th Street, CA4098, Augusta, GA 30912, USA.

    The substrates of SUMO4, a novel member for the SUMO gene family, were characterized in HEK293 cells cultured under serum starvation by proteomic analysis. We identified 90 SUMO4 substrates including anti-stress proteins such as antioxidant enzymes and molecular chaperones or co-chaperones. The substrates also include proteins involved in the regulation of DNA repair and synthesis, RNA processing, protein degradation, and glucose metabolism. Several SUMO4-associated transcription factors were characterized by Western blot analyses. AP-1 was selected for in vitro conjugation assays to confirm SUMO4 sumoylation of these transcription factors. Further functional analyses of the transcription factors suggested that SUMO4 sumoylation represses AP-1 and AP-2alpha transcriptional activity, but enhances GR DNA binding capacity. These results demonstrate that SUMO4 sumoylation may play an important role in the regulation of intracellular stress.

    Biochemical and biophysical research communications 2005;337;4;1308-18

  • Towards a proteome-scale map of the human protein-protein interaction network.

    Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, Klitgord N, Simon C, Boxem M, Milstein S, Rosenberg J, Goldberg DS, Zhang LV, Wong SL, Franklin G, Li S, Albala JS, Lim J, Fraughton C, Llamosas E, Cevik S, Bex C, Lamesch P, Sikorski RS, Vandenhaute J, Zoghbi HY, Smolyar A, Bosak S, Sequerra R, Doucette-Stamm L, Cusick ME, Hill DE, Roth FP and Vidal M

    Center for Cancer Systems Biology and Department of Cancer Biology, Dana-Farber Cancer Institute, Harvard Medical School, 44 Binney Street, Boston, Massachusetts 02115, USA.

    Systematic mapping of protein-protein interactions, or 'interactome' mapping, was initiated in model organisms, starting with defined biological processes and then expanding to the scale of the proteome. Although far from complete, such maps have revealed global topological and dynamic features of interactome networks that relate to known biological properties, suggesting that a human interactome map will provide insight into development and disease mechanisms at a systems level. Here we describe an initial version of a proteome-scale map of human binary protein-protein interactions. Using a stringent, high-throughput yeast two-hybrid system, we tested pairwise interactions among the products of approximately 8,100 currently available Gateway-cloned open reading frames and detected approximately 2,800 interactions. This data set, called CCSB-HI1, has a verification rate of approximately 78% as revealed by an independent co-affinity purification assay, and correlates significantly with other biological attributes. The CCSB-HI1 data set increases by approximately 70% the set of available binary interactions within the tested space and reveals more than 300 new connections to over 100 disease-associated proteins. This work represents an important step towards a systematic and comprehensive human interactome project.

    Funded by: NCI NIH HHS: R33 CA132073; NHGRI NIH HHS: P50 HG004233, R01 HG001715, RC4 HG006066, U01 HG001715; NHLBI NIH HHS: U01 HL098166

    Nature 2005;437;7062;1173-8

  • The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).

    Gerhard DS, Wagner L, Feingold EA, Shenmen CM, Grouse LH, Schuler G, Klein SL, Old S, Rasooly R, Good P, Guyer M, Peck AM, Derge JG, Lipman D, Collins FS, Jang W, Sherry S, Feolo M, Misquitta L, Lee E, Rotmistrovsky K, Greenhut SF, Schaefer CF, Buetow K, Bonner TI, Haussler D, Kent J, Kiekhaus M, Furey T, Brent M, Prange C, Schreiber K, Shapiro N, Bhat NK, Hopkins RF, Hsie F, Driscoll T, Soares MB, Casavant TL, Scheetz TE, Brown-stein MJ, Usdin TB, Toshiyuki S, Carninci P, Piao Y, Dudekula DB, Ko MS, Kawakami K, Suzuki Y, Sugano S, Gruber CE, Smith MR, Simmons B, Moore T, Waterman R, Johnson SL, Ruan Y, Wei CL, Mathavan S, Gunaratne PH, Wu J, Garcia AM, Hulyk SW, Fuh E, Yuan Y, Sneed A, Kowis C, Hodgson A, Muzny DM, McPherson J, Gibbs RA, Fahey J, Helton E, Ketteman M, Madan A, Rodrigues S, Sanchez A, Whiting M, Madari A, Young AC, Wetherby KD, Granite SJ, Kwong PN, Brinkley CP, Pearson RL, Bouffard GG, Blakesly RW, Green ED, Dickson MC, Rodriguez AC, Grimwood J, Schmutz J, Myers RM, Butterfield YS, Griffith M, Griffith OL, Krzywinski MI, Liao N, Morin R, Morrin R, Palmquist D, Petrescu AS, Skalska U, Smailus DE, Stott JM, Schnerch A, Schein JE, Jones SJ, Holt RA, Baross A, Marra MA, Clifton S, Makowski KA, Bosak S, Malek J and MGC Project Team

    The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.

    Funded by: PHS HHS: N01-C0-12400

    Genome research 2004;14;10B;2121-7

  • RNA and RNA binding proteins participate in early stages of cell spreading through spreading initiation centers.

    de Hoog CL, Foster LJ and Mann M

    Center for Experimental BioInformatics (CEBI), Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Denmark.

    Focal adhesions are specialized attachment and signaling centers that form at sites of cell-matrix contacts. We employed a quantitative mass spectrometry-based method called SILAC to identify and quantify proteins interacting in an attachment-dependent manner with focal adhesion proteins. Subsequent confocal microscopy revealed a previously undescribed structure, which we have termed a spreading initiation center (SIC), existing only in early stages of cell spreading. SICs contain focal adhesion markers, appear to be surrounded by an actin sheath, and, surprisingly, contain numerous RNA binding proteins, ribosomal RNA, and perhaps other RNAs. Interfering with the function of FUS/TLS, hnRNP K, and hnRNP E1 results in increased spreading. Spreading initiation centers are ribonucleoprotein complexes distinct from focal adhesions and demonstrate a role for RNA and RNA binding proteins in the initiation of cell spreading.

    Cell 2004;117;5;649-62

  • A genome annotation-driven approach to cloning the human ORFeome.

    Collins JE, Wright CL, Edwards CA, Davis MP, Grinham JA, Cole CG, Goward ME, Aguado B, Mallya M, Mokrab Y, Huckle EJ, Beare DM and Dunham I

    The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.

    We have developed a systematic approach to generating cDNA clones containing full-length open reading frames (ORFs), exploiting knowledge of gene structure from genomic sequence. Each ORF was amplified by PCR from a pool of primary cDNAs, cloned and confirmed by sequencing. We obtained clones representing 70% of genes on human chromosome 22, whereas searching available cDNA clone collections found at best 48% from a single collection and 60% for all collections combined.

    Genome biology 2004;5;10;R84

  • Reevaluating human gene annotation: a second-generation analysis of chromosome 22.

    Collins JE, Goward ME, Cole CG, Smink LJ, Huckle EJ, Knowles S, Bye JM, Beare DM and Dunham I

    The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.

    We report a second-generation gene annotation of human chromosome 22. Using expressed sequence databases, comparative sequence analysis, and experimental verification, we have extended genes, fused previously fragmented structures, and identified new genes. The total length in exons of annotation was increased by 74% over our previously published annotation and includes 546 protein-coding genes and 234 pseudogenes. Thirty-two potential protein-coding annotations are partial copies of other genes, and may represent duplications on an evolutionary path to change or loss of function. We also identified 31 non-protein-coding transcripts, including 16 possible antisense RNAs. By extrapolation, we estimate the human genome contains 29,000-36,000 protein-coding genes, 21,300 pseudogenes, and 1500 antisense RNAs. We suggest that our revised annotation criteria provide a paradi 1f40 gm for future annotation of the human genome.

    Genome research 2003;13;1;27-36

  • Cloning and functional analysis of cDNAs with open reading frames for 300 previously undefined genes expressed in CD34+ hematopoietic stem/progenitor cells.

    Zhang QH, Ye M, Wu XY, Ren SX, Zhao M, Zhao CJ, Fu G, Shen Y, Fan HY, Lu G, Zhong M, Xu XR, Han ZG, Zhang JW, Tao J, Huang QH, Zhou J, Hu GX, Gu J, Chen SJ and Chen Z

    Shanghai Institute of Hematology (SIH), Rui Jin Hospital affiliated with Shanghai Second Medical University, Shanghai 200025, China.

    Three hundred cDNAs containing putatively entire open reading frames (ORFs) for previously undefined genes were obtained from CD34+ hematopoietic stem/progenitor cells (HSPCs), based on EST cataloging, clone sequencing, in silico cloning, and rapid amplification of cDNA ends (RACE). The cDNA sizes ranged from 360 to 3496 bp and their ORFs coded for peptides of 58-752 amino acids. Public database search indicated that 225 cDNAs exhibited sequence similarities to genes identified across a variety of species. Homology analysis led to the recognition of 50 basic structural motifs/domains among these cDNAs. Genomic exon-intron organization could be established in 243 genes by integration of cDNA data with genome sequence information. Interestingly, a new gene named as HSPC070 on 3p was found to share a sequence of 105bp in 3' UTR with RAF gene in reversed transcription orientation. Chromosomal localizations were obtained using electronic mapping for 192 genes and with radiation hybrid (RH) for 38 genes. Macroarray technique was applied to screen the gene expression patterns in five hematopoietic cell lines (NB4, HL60, U937, K562, and Jurkat) and a number of genes with differential expression were found. The resource work has provided a wide range of information useful not only for expression genomics and annotation of genomic DNA sequence, but also for further research on the function of genes involved in hematopoietic development and differentiation.

    Genome research 2000;10;10;1546-60

  • Gene expression profiling in the human hypothalamus-pituitary-adrenal axis and full-length cDNA cloning.

    Hu RM, Han ZG, Song HD, Peng YD, Huang QH, Ren SX, Gu YJ, Huang CH, Li YB, Jiang CL, Fu G, Zhang QH, Gu BW, Dai M, Mao YF, Gao GF, Rong R, Ye M, Zhou J, Xu SH, Gu J, Shi JX, Jin WR, Zhang CK, Wu TM, Huang GY, Chen Z, Chen MD and Chen JL

    Rui-Jin Hospital, Shanghai Institute of Endocrinology, Shanghai Second Medical University, China.

    The primary neuroendocrine interface, hypothalamus and pituitary, together with adrenals, constitute the major axis responsible for the maintenance of homeostasis and the response to the perturbations in the environment. The gene expression profiling in the human hypothalamus-pituitary-adrenal axis was catalogued by generating a large amount of expressed sequence tags (ESTs), followed by bioinformatics analysis (http://www.chgc.sh.cn/ database). Totally, 25,973 sequences of good quality were obtained from 31,130 clones (83.4%) from cDNA libraries of the hypothalamus, pituitary, and adrenal glands. After eliminating 5,347 sequences corresponding to repetitive elements and mtDNA, 20,626 ESTs could be assembled into 9, 175 clusters (3,979, 3,074, and 4,116 clusters in hypothalamus, pituitary, and adrenal glands, respectively) when overlapping ESTs were integrated. Of these clusters, 2,777 (30.3%) corresponded to known genes, 4,165 (44.8%) to dbESTs, and 2,233 (24.3%) to novel ESTs. The gene expression profiles reflected well the functional characteristics of the three levels in the hypothalamus-pituitary-adrenal axis, because most of the 20 genes with highest expression showed statistical difference in terms of tissue distribution, including a group of tissue-specific functional markers. Meanwhile, some findings were made with regard to the physiology of the axis, and 200 full-length cDNAs of novel genes were cloned and sequenced. All of these data may contribute to the understanding of the neuroendocrine regulation of human life.

    Proceedings of the National Academy of Sciences of the United States of America 2000;97;17;9543-8

  • The DNA sequence of human chromosome 22.

    Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt AR, Collins JE, Bruskiewich R, Beare DM, Clamp M, Smink LJ, Ainscough R, Almeida JP, Babbage A, Bagguley C, Bailey J, Barlow K, Bates KN, Beasley O, Bird CP, Blakey S, Bridgeman AM, Buck D, Burgess J, Burrill WD, O'Brien KP et al.

    Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK. id1@sanger.ac.uk

    Knowledge of the complete genomic DNA sequence of an organism allows a systematic approach to defining its genetic components. The genomic sequence provides access to the complete structures of all genes, including those without known function, their control elements, and, by inference, the proteins they encode, as well as all other biologically important sequences. Furthermore, the sequence is a rich and permanent source of information for the design of further biological studies of the organism and for the study of evolution through cross-species sequence comparison. The power of this approach has been amply demonstrated by the determination of the sequences of a number of microbial and model organisms. The next step is to obtain the complete sequence of the entire human genome. Here we report the sequence of the euchromatic part of human chromosome 22. The sequence obtained consists of 12 contiguous segments spanning 33.4 megabases, contains at least 545 genes and 134 pseudogenes, and provides the first view of the complex chromosomal landscapes that will be found in the rest of the genome.

    Nature 1999;402;6761;489-95

  • [Pregnancies after conization of the cervix uteri].

    Garcia-Huidobro M

    Revista chilena de obstetricia y ginecologia 1971;36;3;175-7

Gene lists (5)

Gene List Source Species Name Description Gene count
L00000009 G2C Homo sapiens Human PSD Human orthologues of mouse PSD adapted from Collins et al (2006) 1080
L00000016 G2C Homo sapiens Human PSP Human orthologues of mouse PSP adapted from Collins et al (2006) 1121
L00000061 G2C Homo sapiens BAYES-COLLINS-MOUSE-PSD-CONSENSUS Mouse cortex PSD consensus (ortho) 984
L00000069 G2C Homo sapiens BAYES-COLLINS-HUMAN-PSD-FULL Human cortex biopsy PSD full list 1461
L00000071 G2C Homo sapiens BAYES-COLLINS-MOUSE-PSD-FULL Mouse cortex PSD full list (ortho) 1556
© G2C 2014. The Genes to Cognition Programme received funding from The Wellcome Trust and the EU FP7 Framework Programmes:
EUROSPIN (FP7-HEALTH-241498), SynSys (FP7-HEALTH-242167) and GENCODYS (FP7-HEALTH-241995).

Cookies Policy | Terms and Conditions. This site is hosted by Edinburgh University and the Genes to Cognition Programme.