A cDNA encoding a beta-1,4-galactosyltransferase named beta-1,4-GalT II was cloned from a cDNA library of the human breast tumor cell line, MRK-nu-1. Initially, a 860-bp PCR fragment was obtained from MRK-nu-1 mRNA by 3'-rapid amplification of cDNA ends by using two nested degenerate oligonucleotide primers based on a highly conserved amino acid sequence found in the catalytic domain of mammalian beta-1,4-galactosyltransferases and Lymnaea stagnalis beta-1,4-N-acetylglucosaminyltransferase (beta-1,4-GlcNAcT), both of which utilize the same sugar acceptor. This subsequently was used as a probe to isolate a 4.7-kb cDNA that contained an ORF of 1,164 bp predicting a polypeptide of 388 aa. Its deduced amino acid sequence shows an identity of 37% with that of the previously characterized human beta-1,4-galactosyltransferase (referred to as beta-1,4-GalT I) and of 28% with that of L. stagnalis beta-1,4-GlcNAcT. Study of the properties of the beta-1,4-GalT II fused to protein A expressed as a soluble form in COS-7 cells revealed that it is a genuine beta-1,4-GalT but has no lactose synthetase activity in the presence of alpha-lactalbumin. Northern blot analysis of 24 human tissues showed that they all express the beta-1,4-GalT II transcript, although the levels varied. These results indicate that human cells contain another beta-1,4-GalT.

From a systematic search of the UniGene and dbEST databanks, using human beta 4-galactosyltransferase (beta 4GalT-I), which is recognized to function in lactose biosynthesis, as the query sequence, we have identified five additional gene family members denoted as beta 4GalT-II, -III, -IV, -V, and -VI. Complementary DNA clones containing the complete coding regions for each of the five human homologs were obtained or generated by a PCR-based strategy (RACE) and sequenced. Relative to beta 4GalT-I, the percent sequence identity at the amino acid level between the individual family members, ranges from 33% (beta 4GalT-VI) to 55% (beta 4GalT-II). The highest sequence identity between any of the homologs is between beta 4GalT-V and beta 4GalT-VI (68%). beta 4GalT-II is the ortholog of the chicken beta 4GalT-II gene, which has been demonstrated to encode an alpha-lactalbumin responsive beta 4-galactosyltransferase (Shaper et al., J. Biol. Chem., 272, 31389-31399, 1997). As established by Northern analysis, beta 4GalT-II and -IV show the most restricted pattern of tissue expression. High steady state levels of beta 4GalT-II mRNA are seen only in fetal brain and adult heart, muscle, and pancreas; relatively high levels of beta 4GalT-VI mRNA are seen only in adult brain. When the corresponding mouse EST clone for each of the beta 4GalT family members was used as the hybridization probe for Northern analysis of murine mammary tissue, transcription of only the beta 4GalT-I gene could be detected in the lactating mammary gland. These observations support the conclusion that among the six known beta 4GalT family members in the mammalian genome, that have been generated through multiple gene duplication events of an ancestral gene(s), only the beta 4GalT-I ancestral lineage was recruited for lactose biosynthesis during the evolution of mammals.

Keywords

Enzymes that catalyze the transfer of glycosyl (sugar) residues to an acceptor, both during degradation (cosubstrates= water or inorganic phosphate) and during biosynthesis of polysaccharides, glycoproteins and glycolipids. In biosynthetic glycosyl transfers, the common activated monomeric sugar intermediate is a nucleoside diphosphate sugar.

Protein which is part of a reference proteome. Reference proteomes are a subset of proteomes that have been selected either manually or algorithmically according to a number of criteria to provide a broad coverage of the tree of life and a representative cross-section of the taxonomic diversity found within UniProtKB, as well as the proteomes of well-studied model organisms and other species of interest for biomedical research.