G2Cdb::Gene report

Gene id
Gene symbol
Homo sapiens
engulfment and cell motility 2
G00000247 (Mus musculus)

Databases (8)

Curated Gene
OTTHUMG00000033070 (Vega human gene)
ENSG00000062598 (Ensembl human gene)
63916 (Entrez Gene)
569 (G2Cdb plasticity & disease)
ELMO2 (GeneCards)
606421 (OMIM)
Marker Symbol
HGNC:17233 (HGNC)
Protein Sequence
Q96JJ3 (UniProt)

Synonyms (5)

  • CED-12
  • CED12
  • ELMO-2
  • FLJ11656
  • KIAA1834

Literature (16)

Pubmed - other

  • Heterogeneity in gene loci associated with type 2 diabetes on human chromosome 20q13.1.

    Bento JL, Palmer ND, Zhong M, Roh B, Lewis JP, Wing MR, Pandya H, Freedman BI, Langefeld CD, Rich SS, Bowden DW and Mychaleckyj JC

    Department of Biochemistry, Wake Forest University School of Medicine, Winston-Salem, NC 27157, USA.

    Human chromosome 20q12-q13.1 has been linked to type 2 diabetes mellitus (T2DM) in multiple studies. We screened a 5.795-Mb region for diabetes-related susceptibility genes in a Caucasian cohort of 310 controls and 300 cases with T2DM and end-stage renal disease (ESRD), testing 390 SNPs for association with T2DM-ESRD. The most significant SNPs were found in the perigenic regions: HNF4A (hepatocyte nuclear factor 4alpha), SLC12A5 (potassium-chloride cotransporter member 5), CDH22 (cadherin-like 22), ELMO2 (engulfment and cell motility 2), SLC13A3 (sodium-dependent dicarboxylate transporter member 3), and PREX1 (phosphatidylinositol 3,4,5-triphosphate-dependent RAC exchanger 1). Haplotype analysis found six haplotype blocks globally associated with disease (p<0.05). We replicated the PREX1 SNP association in an independent case-control T2DM population and inferred replication of CDH22, ELMO2, SLC13A3, SLC12A5, and PREX1 using in silico perigenic analysis of two T2DM Genome-Wide Association Study data sets. We found substantial heterogeneity between study results.

    Funded by: NIDDK NIH HHS: R01 DK056289, R01 DK056289-07, R01 DK056289-08, R01 DK56289

    Genomics 2008;92;4;226-34

  • The LIFEdb database in 2006.

    Mehrle A, Rosenfelder H, Schupp I, del Val C, Arlt D, Hahne F, Bechtel S, Simpson J, Hofmann O, Hide W, Glatting KH, Huber W, Pepperkok R, Poustka A and Wiemann S

    Division Molecular Genome Analysis, German Cancer Research Center, Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany. a.mehrle@dkfz.de

    LIFEdb (http://www.LIFEdb.de) integrates data from large-scale functional genomics assays and manual cDNA annotation with bioinformatics gene expression and protein analysis. New features of LIFEdb include (i) an updated user interface with enhanced query capabilities, (ii) a configurable output table and the option to download search results in XML, (iii) the integration of data from cell-based screening assays addressing the influence of protein-overexpression on cell proliferation and (iv) the display of the relative expression ('Electronic Northern') of the genes under investigation using curated gene expression ontology information. LIFEdb enables researchers to systematically select and characterize genes and proteins of interest, and presents data and information via its user-friendly web-based interface.

    Nucleic acids research 2006;34;Database issue;D415-8

  • Time-resolved mass spectrometry of tyrosine phosphorylation sites in the epidermal growth factor receptor signaling network reveals dynamic modules.

    Zhang Y, Wolf-Yadlin A, Ross PL, Pappin DJ, Rush J, Lauffenburger DA and White FM

    Biological Engineering Division, Massachusetts Institute of Technnology, Cambridge, Massachusetts 02139, USA.

    Ligand binding to cell surface receptors initiates a cascade of signaling events regulated by dynamic phosphorylation events on a multitude of pathway proteins. Quantitative features, including intensity, timing, and duration of phosphorylation of particular residues, may play a role in determining cellular response, but experimental data required for analysis of these features have not previously been available. To understand the dynamic operation of signaling cascades, we have developed a method enabling the simultaneous quantification of tyrosine phosphorylation of specific residues on dozens of key proteins in a time-resolved manner, downstream of epidermal growth factor receptor (EGFR) activation. Tryptic peptides from four different EGFR stimulation time points were labeled with four isoforms of the iTRAQ reagent to enable downstream quantification. After mixing of the labeled samples, tyrosine-phosphorylated peptides were immunoprecipitated with an anti-phosphotyrosine antibody and further enriched by IMAC before LC/MS/MS analysis. Database searching and manual confirmation of peptide phosphorylation site assignments led to the identification of 78 tyrosine phosphorylation sites on 58 proteins from a single analysis. Replicate analyses of a separate biological sample provided both validation of this first data set and identification of 26 additional tyrosine phosphorylation sites and 18 additional proteins. iTRAQ fragment ion ratios provided time course phosphorylation profiles for each site. The data set of quantitative temporal phosphorylation profiles was further characterized by self-organizing maps, which resulted in identification of several cohorts of tyrosine residues exhibiting self-similar temporal phosphorylation profiles, operationally defining dynamic modules in the EGFR signaling network consistent with particular cellular processes. The presence of novel proteins and associated tyrosine phosphorylation sites within these modules indicates additional components of this network and potentially localizes the topological action of these proteins. Additional analysis and modeling of the data generated in this study are likely to yield more sophisticated models of receptor tyrosine kinase-initiated signal transduction, trafficking, and regulation.

    Funded by: NCI NIH HHS: CA96504; NIDDK NIH HHS: DK070172, DK42816; NIGMS NIH HHS: GM68762

    Molecular & cellular proteomics : MCP 2005;4;9;1240-50

  • Immunoaffinity profiling of tyrosine phosphorylation in cancer cells.

    Rush J, Moritz A, Lee KA, Guo A, Goss VL, Spek EJ, Zhang H, Zha XM, Polakiewicz RD and Comb MJ

    Cell Signaling Technology Inc., 166B Cummings Center, Beverly, Massachusetts 01915, USA.

    Tyrosine kinases play a prominent role in human cancer, yet the oncogenic signaling pathways driving cell proliferation and survival have been difficult to identify, in part because of the complexity of the pathways and in part because of low cellular levels of tyrosine phosphorylation. In general, global phosphoproteomic approaches reveal small numbers of peptides containing phosphotyrosine. We have developed a strategy that emphasizes the phosphotyrosine component of the phosphoproteome and identifies large numbers of tyrosine phosphorylation sites. Peptides containing phosphotyrosine are isolated directly from protease-digested cellular protein extracts with a phosphotyrosine-specific antibody and are identified by tandem mass spectrometry. Applying this approach to several cell systems, including cancer cell lines, shows it can be used to identify activated protein kinases and their phosphorylated substrates without prior knowledge of the signaling networks that are activated, a first step in profiling normal and oncogenic signaling networks.

    Funded by: NCI NIH HHS: 1R43CA101106

    Nature biotechnology 2005;23;1;94-101

  • Comparative gene finding in chicken indicates that we are closing in on the set of multi-exonic widely expressed human genes.

    Castelo R, Reymond A, Wyss C, Câmara F, Parra G, Antonarakis SE, Guigó R and Eyras E

    Research Unit on Biomedical Informatics, Institut Municipal d'Investigació Mèdica/Universitat Pompeu Fabra, Centre de Regulació Genòmica E08003 Barcelona, Spain. rcastelo@imim.es

    The recent availability of the chicken genome sequence poses the question of whether there are human protein-coding genes conserved in chicken that are currently not included in the human gene catalog. Here, we show, using comparative gene finding followed by experimental verification of exon pairs by RT-PCR, that the addition to the multi-exonic subset of this catalog could be as little as 0.2%, suggesting that we may be closing in on the human gene set. Our protocol, however, has two shortcomings: (i) the bioinformatic screening of the predicted genes, applied to filter out false positives, cannot handle intronless genes; and (ii) the experimental verification could fail to identify expression at a specific developmental time. This highlights the importance of developing methods that could provide a reliable estimate of the number of these two types of genes.

    Nucleic acids research 2005;33;6;1935-9

  • The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).

    Gerhard DS, Wagner L, Feingold EA, Shenmen CM, Grouse LH, Schuler G, Klein SL, Old S, Rasooly R, Good P, Guyer M, Peck AM, Derge JG, Lipman D, Collins FS, Jang W, Sherry S, Feolo M, Misquitta L, Lee E, Rotmistrovsky K, Greenhut SF, Schaefer CF, Buetow K, Bonner TI, Haussler D, Kent J, Kiekhaus M, Furey T, Brent M, Prange C, Schreiber K, Shapiro N, Bhat NK, Hopkins RF, Hsie F, Driscoll T, Soares MB, Casavant TL, Scheetz TE, Brown-stein MJ, Usdin TB, Toshiyuki S, Carninci P, Piao Y, Dudekula DB, Ko MS, Kawakami K, Suzuki Y, Sugano S, Gruber CE, Smith MR, Simmons B, Moore T, Waterman R, Johnson SL, Ruan Y, Wei CL, Mathavan S, Gunaratne PH, Wu J, Garcia AM, Hulyk SW, Fuh E, Yuan Y, Sneed A, Kowis C, Hodgson A, Muzny DM, McPherson J, Gibbs RA, Fahey J, Helton E, Ketteman M, Madan A, Rodrigues S, Sanchez A, Whiting M, Madari A, Young AC, Wetherby KD, Granite SJ, Kwong PN, Brinkley CP, Pearson RL, Bouffard GG, Blakesly RW, Green ED, Dickson MC, Rodriguez AC, Grimwood J, Schmutz J, Myers RM, Butterfield YS, Griffith M, Griffith OL, Krzywinski MI, Liao N, Morin R, Morrin R, Palmquist D, Petrescu AS, Skalska U, Smailus DE, Stott JM, Schnerch A, Schein JE, Jones SJ, Holt RA, Baross A, Marra MA, Clifton S, Makowski KA, Bosak S, Malek J and MGC Project Team

    The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.

    Funded by: PHS HHS: N01-C0-12400

    Genome research 2004;14;10B;2121-7

  • From ORFeome to biology: a functional genomics pipeline.

    Wiemann S, Arlt D, Huber W, Wellenreuther R, Schleeger S, Mehrle A, Bechtel S, Sauermann M, Korf U, Pepperkok R, Sültmann H and Poustka A

    Molecular Genome Analysis, German Cancer Research Center, 69120 Heidelberg, Germany. s.wiemann@dkfz.de

    As several model genomes have been sequenced, the elucidation of protein function is the next challenge toward the understanding of biological processes in health and disease. We have generated a human ORFeome resource and established a functional genomics and proteomics analysis pipeline to address the major topics in the post-genome-sequencing era: the identification of human genes and splice forms, and the determination of protein localization, activity, and interaction. Combined with the understanding of when and where gene products are expressed in normal and diseased conditions, we create information that is essential for understanding the interplay of genes and proteins in the complex biological network. We have implemented bioinformatics tools and databases that are suitable to store, analyze, and integrate the different types of data from high-throughput experiments and to include further annotation that is based on external information. All information is presented in a Web database (http://www.dkfz.de/LIFEdb). It is exploited for the identification of disease-relevant genes and proteins for diagnosis and therapy.

    Genome research 2004;14;10B;2136-44

  • Complete sequencing and characterization of 21,243 full-length human cDNAs.

    Ota T, Suzuki Y, Nishikawa T, Otsuki T, Sugiyama T, Irie R, Wakamatsu A, Hayashi K, Sato H, Nagai K, Kimura K, Makita H, Sekine M, Obayashi M, Nishi T, Shibahara T, Tanaka T, Ishii S, Yamamoto J, Saito K, Kawai Y, Isono Y, Nakamura Y, Nagahari K, Murakami K, Yasuda T, Iwayanagi T, Wagatsuma M, Shiratori A, Sudo H, Hosoiri T, Kaku Y, Kodaira H, Kondo H, Sugawara M, Takahashi M, Kanda K, Yokoi T, Furuya T, Kikkawa E, Omura Y, Abe K, Kamihara K, Katsuta N, Sato K, Tanikawa M, Yamazaki M, Ninomiya K, Ishibashi T, Yamashita H, Murakawa K, Fujimori K, Tanai H, Kimata M, Watanabe M, Hiraoka S, Chiba Y, Ishida S, Ono Y, Takiguchi S, Watanabe S, Yosida M, Hotuta T, Kusano J, Kanehori K, Takahashi-Fujii A, Hara H, Tanase TO, Nomura Y, Togiya S, Komai F, Hara R, Takeuchi K, Arita M, Imose N, Musashino K, Yuuki H, Oshima A, Sasaki N, Aotsuka S, Yoshikawa Y, Matsunawa H, Ichihara T, Shiohata N, Sano S, Moriya S, Momiyama H, Satoh N, Takami S, Terashima Y, Suzuki O, Nakagawa S, Senoh A, Mizoguchi H, Goto Y, Shimizu F, Wakebe H, Hishigaki H, Watanabe T, Sugiyama A, Takemoto M, Kawakami B, Yamazaki M, Watanabe K, Kumagai A, Itakura S, Fukuzumi Y, Fujimori Y, Komiyama M, Tashiro H, Tanigami A, Fujiwara T, Ono T, Yamada K, Fujii Y, Ozaki K, Hirao M, Ohmori Y, Kawabata A, Hikiji T, Kobatake N, Inagaki H, Ikema Y, Okamoto S, Okitani R, Kawakami T, Noguchi S, Itoh T, Shigeta K, Senba T, Matsumura K, Nakajima Y, Mizuno T, Morinaga M, Sasaki M, Togashi T, Oyama M, Hata H, Watanabe M, Komatsu T, Mizushima-Sugano J, Satoh T, Shirai Y, Takahashi Y, Nakagawa K, Okumura K, Nagase T, Nomura N, Kikuchi H, Masuho Y, Yamashita R, Nakai K, Yada T, Nakamura Y, Ohara O, Isogai T and Sugano S

    Helix Research Institute, 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan.

    As a base for human transcriptome and functional genomics, we created the "full-length long Japan" (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at approximately 58% compared with a peak at approximately 42%for predicted cDNAs. Thus, there seems to be a slight bias against GC-rich transcripts in current gene prediction procedures. The rest of the cDNAs unique to the FLJ collection (5,481) contained no obvious open reading frames (ORFs) and thus are candidate noncoding RNAs. About one-fourth of them (1,378) showed a clear pattern of splicing. The distribution of GC content of noncoding cDNAs was narrow and had a peak at approximately 42%, relatively low compared with that of protein-coding cDNAs.

    Nature genetics 2004;36;1;40-5

  • RhoG activates Rac1 by direct interaction with the Dock180-binding protein Elmo.

    Katoh H and Negishi M

    Laboratory of Molecular Neurobiology, Graduate School of Biostudies, Kyoto University, Sakyo-ku, Kyoto 606-8502, Japan. hirokato@pharm.kyoto-u.ac.jp

    The small GTPase Rac has a central role in regulating the actin cytoskeleton during cell migration and axon guidance. Elmo has been identified as an upstream regulator of Rac1 that binds to and functionally cooperates with Dock180 (refs 2-4). Dock180 does not contain a conventional catalytic domain for guanine nucleotide exchange on Rac, but possesses a domain that directly binds to and specifically activates Rac1 (refs 5, 6). The small GTPase RhoG mediates several cellular morphological processes, such as neurite outgrowth in neuronal cells, through a signalling cascade that activates Rac1 (refs 7-12); however, the downstream target of RhoG and the mechanism by which RhoG regulates Rac1 activity remain unclear. Here we show that RhoG interacts directly with Elmo in a GTP-dependent manner and forms a ternary complex with Dock180 to induce activation of Rac1. The RhoG-Elmo-Dock180 pathway is required for activation of Rac1 and cell spreading mediated by integrin, as well as for neurite outgrowth induced by nerve growth factor. We conclude that RhoG activates Rac1 through Elmo and Dock180 to control cell morphology.

    Nature 2003;424;6947;461-4

  • The DNA sequence and comparative analysis of human chromosome 20.

    Deloukas P, Matthews LH, Ashurst J, Burton J, Gilbert JG, Jones M, Stavrides G, Almeida JP, Babbage AK, Bagguley CL, Bailey J, Barlow KF, Bates KN, Beard LM, Beare DM, Beasley OP, Bird CP, Blakey SE, Bridgeman AM, Brown AJ, Buck D, Burrill W, Butler AP, Carder C, Carter NP, Chapman JC, Clamp M, Clark G, Clark LN, Clark SY, Clee CM, Clegg S, Cobley VE, Collier RE, Connor R, Corby NR, Coulson A, Coville GJ, Deadman R, Dhami P, Dunn M, Ellington AG, Frankland JA, Fraser A, French L, Garner P, Grafham DV, Griffiths C, Griffiths MN, Gwilliam R, Hall RE, Hammond S, Harley JL, Heath PD, Ho S, Holden JL, Howden PJ, Huckle E, Hunt AR, Hunt SE, Jekosch K, Johnson CM, Johnson D, Kay MP, Kimberley AM, King A, Knights A, Laird GK, Lawlor S, Lehvaslaiho MH, Leversha M, Lloyd C, Lloyd DM, Lovell JD, Marsh VL, Martin SL, McConnachie LJ, McLay K, McMurray AA, Milne S, Mistry D, Moore MJ, Mullikin JC, Nickerson T, Oliver K, Parker A, Patel R, Pearce TA, Peck AI, Phillimore BJ, Prathalingam SR, Plumb RW, Ramsay H, Rice CM, Ross MT, Scott CE, Sehra HK, Shownkeen R, Sims S, Skuce CD, Smith ML, Soderlund C, Steward CA, Sulston JE, Swann M, Sycamore N, Taylor R, Tee L, Thomas DW, Thorpe A, Tracey A, Tromans AC, Vaudin M, Wall M, Wallis JM, Whitehead SL, Whittaker P, Willey DL, Williams L, Williams SA, Wilming L, Wray PW, Hubbard T, Durbin RM, Bentley DR, Beck S and Rogers J

    The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK. panos@sanger.ac.uk

    The finished sequence of human chromosome 20 comprises 59,187,298 base pairs (bp) and represents 99.4% of the euchromatic DNA. A single contig of 26 megabases (Mb) spans the entire short arm, and five contigs separated by gaps totalling 320 kb span the long arm of this metacentric chromosome. An additional 234,339 bp of sequence has been determined within the pericentromeric region of the long arm. We annotated 727 genes and 168 pseudogenes in the sequence. About 64% of these genes have a 5' and a 3' untranslated region and a complete open reading frame. Comparative analysis of the sequence of chromosome 20 to whole-genome shotgun-sequence data of two other vertebrates, the mouse Mus musculus and the puffer fish Tetraodon nigroviridis, provides an independent measure of the efficiency of gene annotation, and indicates that this analysis may account for more than 95% of all coding exons and almost all genes.

    Nature 2001;414;6866;865-71

  • CED-12/ELMO, a novel member of the CrkII/Dock180/Rac pathway, is required for phagocytosis and cell migration.

    Gumienny TL, Brugnera E, Tosello-Trampont AC, Kinchen JM, Haney LB, Nishiwaki K, Walk SF, Nemergut ME, Macara IG, Francis R, Schedl T, Qin Y, Van Aelst L, Hengartner MO and Ravichandran KS

    Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11743, USA.

    The C. elegans genes ced-2, ced-5, and ced-10, and their mammalian homologs crkII, dock180, and rac1, mediate cytoskeletal rearrangements during phagocytosis of apoptotic cells and cell motility. Here, we describe an additional member of this signaling pathway, ced-12, and its mammalian homologs, elmo1 and elmo2. In C. elegans, CED-12 is required for engulfment of dying cells and for cell migrations. In mammalian cells, ELMO1 functionally cooperates with CrkII and Dock180 to promote phagocytosis and cell shape changes. CED-12/ELMO-1 binds directly to CED-5/Dock180; this evolutionarily conserved complex stimulates a Rac-GEF, leading to Rac1 activation and cytoskeletal rearrangements. These studies identify CED-12/ELMO as an upstream regulator of Rac1 that affects engulfment and cell migration from C. elegans to mammals.

    Funded by: NIGMS NIH HHS: GM52540, GM63310, R01 GM063310

    Cell 2001;107;1;27-41

  • The C. elegans PH domain protein CED-12 regulates cytoskeletal reorganization via a Rho/Rac GTPase signaling pathway.

    Zhou Z, Caron E, Hartwieg E, Hall A and Horvitz HR

    Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge 02139, USA.

    The C. elegans gene ced-12 functions in the engulfment of apoptotic cells and in cell migration, acting in a signaling pathway with ced-2 Crkll, ced-5 DOCK180, and ced-10 Rac GTPase and acting upstream of ced-10 Rac. ced-12 encodes a protein with a pleckstrin homology (PH) domain and an SH3 binding motif, both of which are important for ced-12 function. CED-12 acts in engulfing cells for cell corpse engulfment and interacts physically with CED-5, which contains an SH3 domain. CED-12 has Drosophila and human counterparts. Expression of CED-12 and its counterparts in murine Swiss 3T3 fibroblasts induced Rho GTPase-dependent formation of actin filament bundles. We propose that through interactions with membranes and with a CED-2/CED-5 protein complex, CED-12 regulates Rho/Rac GTPase signaling and leads to cytoskeletal reorganization by an evolutionarily conserved mechanism.

    Developmental cell 2001;1;4;477-89

  • Prediction of the coding sequences of unidentified human genes. XX. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro.

    Nagase T, Nakayama M, Nakajima D, Kikuno R and Ohara O

    Kazusa DNA Research Institute, Kisarazu, Chiba, Japan. nagase@kazusa.or.jp

    To accumulate information on the coding sequences of unidentified genes, we have carried out a sequencing project of human cDNA clones which encode large proteins. We herein present the entire sequences of 100 cDNA clones of unidentified human genes, named KIAA1776 and KIAA1780-KIAA1878, from size-fractionated cDNA libraries derived from human fetal brain, adult whole brain, hippocampus and amygdala. Most of the cDNA clones to be entirely sequenced were selected as cDNAs which were shown to have coding potentiality by in vitro transcription/translation experiments, and some clones were chosen by using computer-assisted analysis of terminal sequences of cDNAs. Three of these clones (fibrillin2/KIAA1776, MEGF10/KIAA1780 and MEGF11/KIAA1781) were isolated as genes encoding proteins with multiple EGF-like domains by motif-trap screening. The average sizes of the inserts and corresponding open reading frames of eDNA clones analyzed here reached 4.7 kb and 2.4 kb (785 amino acid residues), respectively. From the results of homology and motif searches against the public databases, the functional categories of the predicted gene products of 54 genes were determined; 93% of these predicted gene products (50 gene products) were classified as proteins related to cell signaling/communication, nucleic acid management, or cell structure/motility. To collect additional information on these genes, their expression profiles were also studied in 10 human tissues, 8 brain regions, spinal cord, fetal brain and fetal liver by reverse transcription-coupled polymerase chain reaction, products of which were quantified by enzyme-linked immunosorbent assay.

    DNA research : an international journal for rapid publication of reports on genes and genomes 2001;8;2;85-95

  • Toward a catalog of human genes and proteins: sequencing and analysis of 500 novel complete protein coding human cDNAs.

    Wiemann S, Weil B, Wellenreuther R, Gassenhuber J, Glassl S, Ansorge W, Böcher M, Blöcker H, Bauersachs S, Blum H, Lauber J, Düsterhöft A, Beyer A, Köhrer K, Strack N, Mewes HW, Ottenwälder B, Obermaier B, Tampe J, Heubner D, Wambutt R, Korn B, Klein M and Poustka A

    Molecular Genome Analysis, German Cancer Research Center, 69120 Heidelberg, Germany. s.wiemann@dkfz.de

    With the complete human genomic sequence being unraveled, the focus will shift to gene identification and to the functional analysis of gene products. The generation of a set of cDNAs, both sequences and physical clones, which contains the complete and noninterrupted protein coding regions of all human genes will provide the indispensable tools for the systematic and comprehensive analysis of protein function to eventually understand the molecular basis of man. Here we report the sequencing and analysis of 500 novel human cDNAs containing the complete protein coding frame. Assignment to functional categories was possible for 52% (259) of the encoded proteins, the remaining fraction having no similarities with known proteins. By aligning the cDNA sequences with the sequences of the finished chromosomes 21 and 22 we identified a number of genes that either had been completely missed in the analysis of the genomic sequences or had been wrongly predicted. Three of these genes appear to be present in several copies. We conclude that full-length cDNA sequencing continues to be crucial also for the accurate identification of genes. The set of 500 novel cDNAs, and another 1000 full-coding cDNAs of known transcripts we have identified, adds up to cDNA representations covering 2%--5 % of all human genes. We thus substantially contribute to the generation of a gene catalog, consisting of both full-coding cDNA sequences and clones, which should be made freely available and will become an invaluable tool for detailed functional studies.

    Genome research 2001;11;3;422-35

  • DNA cloning using in vitro site-specific recombination.

    Hartley JL, Temple GF and Brasch MA

    Life Technologies, Inc., Rockville, Maryland 20850, USA. jhartley@lifetech.com

    As a result of numerous genome sequencing projects, large numbers of candidate open reading frames are being identified, many of which have no known function. Analysis of these genes typically involves the transfer of DNA segments into a variety of vector backgrounds for protein expression and functional analysis. We describe a method called recombinational cloning that uses in vitro site-specific recombination to accomplish the directional cloning of PCR products and the subsequent automatic subcloning of the DNA segment into new vector backbones at high efficiency. Numerous DNA segments can be transferred in parallel into many different vector backgrounds, providing an approach to high-throughput, in-depth functional analysis of genes and rapid optimization of protein expression. The resulting subclones maintain orientation and reading frame register, allowing amino- and carboxy-terminal translation fusions to be generated. In this paper, we outline the concepts of this approach and provide several examples that highlight some of its potential.

    Genome research 2000;10;11;1788-95

Gene lists (6)

Gene List Source Species Name Description Gene count
L00000009 G2C Homo sapiens Human PSD Human orthologues of mouse PSD adapted from Collins et al (2006) 1080
L00000016 G2C Homo sapiens Human PSP Human orthologues of mouse PSP adapted from Collins et al (2006) 1121
L00000059 G2C Homo sapiens BAYES-COLLINS-HUMAN-PSD-CONSENSUS Human cortex PSD consensus 748
L00000061 G2C Homo sapiens BAYES-COLLINS-MOUSE-PSD-CONSENSUS Mouse cortex PSD consensus (ortho) 984
L00000069 G2C Homo sapiens BAYES-COLLINS-HUMAN-PSD-FULL Human cortex biopsy PSD full list 1461
L00000071 G2C Homo sapiens BAYES-COLLINS-MOUSE-PSD-FULL Mouse cortex PSD full list (ortho) 1556
© G2C 2014. The Genes to Cognition Programme received funding from The Wellcome Trust and the EU FP7 Framework Programmes:
EUROSPIN (FP7-HEALTH-241498), SynSys (FP7-HEALTH-242167) and GENCODYS (FP7-HEALTH-241995).

Cookies Policy | Terms and Conditions. This site is hosted by Edinburgh University and the Genes to Cognition Programme.