Ketoreductase polypeptides for the production of (r)-3-hydroxythiolane

Title: Ketoreductase polypeptides for the production of (r)-3-hydroxythiolane.Abstract: The present disclosure provides engineered ketoreductase enzymes having improved properties as compared to a naturally occurring wild-type ketoreductase enzyme. Also provided are polynucleotides encoding the engineered ketoreductase enzymes, host cells capable of expressing the engineered ketoreductase enzymes, and methods of using the engineered ketoreductase enzymes to synthesize chiral compounds. ...

The Patent Description & Claims data below is from USPTO Patent Application 20120276599, Ketoreductase polypeptides for the production of (r)-3-hydroxythiolane.

1. CROSS REFERENCES TO RELATED APPLICATIONS

This application claims benefit under 35 U.S.C. §120 of application Ser. No. 13/110,789, filed May 18, 2011, which claims benefit under 35 U.S.C. §120 of application Ser. No. 12/197,286, filed Aug. 24, 2008, which issued as U.S. Pat. No. 7,977,078 B2 on Jul. 12, 2011, and under 35 U.S.C. §119(e) of application Ser. No. 60/957,974, filed Aug. 24, 2007, the contents of each of which are incorporated herein by reference.

2.

TECHNICAL FIELD

The present disclosure relates to engineered polypeptides and uses of the polypeptides.

3. REFERENCE TO SEQUENCE LISTING, TABLE OR COMPUTER PROGRAM

The Sequence Listing concurrently submitted herewith under 37 C.F.R. §1.821 via EFS-Web in a computer readable form (CRF) as file name CX2-047USD1_ST25.txt is herein incorporated by reference. The electronic copy of the Sequence Listing was created on May 17, 2011, with a file size of 281 kilobytes. This Sequence Listing is identical except for minor formatting corrections to file 376247-016.txt created on Aug. 24, 2008, with a file size of 272 kilobytes, which was incorporated by reference in the parent U.S. application Ser. No. 12/197,286.

4. BACKGROUND

Enzymes belonging to the ketoreductase (KRED) or carbonyl reductase class (EC1.1.1.184) are useful for the synthesis of optically active alcohols from the corresponding prostereoisomeric ketone substrate. KREDs typically convert a ketone or aldehyde substrate to the corresponding alcohol product, but may also catalyze the reverse reaction, oxidation of an alcohol substrate to the corresponding ketone/aldehyde product. The reduction of ketones and aldehydes, and the oxidation of alcohols by enzymes such as KRED requires a co-factor, most commonly reduced nicotinamide adenine dinucleotide (NADH) or reduced nicotinamide adenine dinucleotide phosphate (NADPH), and nicotinamide adenine dinucleotide (NAD) or nicotinamide adenine dinucleotide phosphate (NADP) for the oxidation reaction. NADH and NADPH serve as electron donors, while NAD and NADP serve as electron acceptors. It is frequently observed that ketoreductases and alcohol dehydrogenases accept either the phosphorylated or the non-phosphorylated co-factor (in its oxidized and reduced state).

In order to circumvent many chemical synthetic procedures for the production of key compounds, ketoreductases are being increasingly employed for the enzymatic conversion of different keto substrates to chiral alcohol products. These applications can employ whole cells expressing the ketoreductase for biocatalytic ketone and aldehyde reductions, or purified enzymes in those instances where presence of multiple ketoreductases in whole cells would adversely affect the stereopurity and yield of the desired product. For in vitro applications, a co-factor (NADH or NADPH) regenerating enzyme such as glucose dehydrogenase (GDH), formate dehydrogenase etc., is used in conjunction with the ketoreductase. Examples using ketoreductases to generate useful chemical compounds include asymmetric reduction of 4-chloroacetoacetate esters (Zhou, J. Am. Chem. Soc., 1983, 105:5925-5926; Santaniello, J. Chem. Res. (S) 1984:132-133; U.S. Pat. No. 5,559,030; U.S. Pat. No. 5,700,670 and U.S. Pat. No. 5,891,685), reduction of dioxocarboxylic acids (e.g., U.S. Pat. No. 6,399,339), reduction of tert-butyl (S) chloro-5-hydroxy-3-oxohexanoate (e.g., U.S. Pat. No. 6,645,746 and WO 01/40450), reduction pyrrolotriazine-based compounds (e.g., US application No. 2006/0286646); reduction of substituted acetophenones (e.g., U.S. Pat. No. 6,800,477); and reduction of ketothiolanes (WO 2005/054491).

It is desirable to identify other ketoreductase enzymes that can be used to carryout conversion of various keto and aldehyde substrates to its corresponding chiral alcohol products.

5.

SUMMARY

The present disclosure provides ketoreductase polypeptides having the ability to reduce 3-ketothiolane (hereafter referred to as “the substrate”) to (R)-3-hydroxythiolane (hereafter referred to as “the product”), the polynucleotides encoding such polypeptides, and methods for using the polypeptides. Generally, the engineered ketoreductase polypeptides of the disclosure have an improved property as compared to the naturally-occurring wild-type ketoreductase enzymes obtained from Lactobacillus kefir (“L. kefir”; SEQ ID NO:4), Lactobacillus brevis (“L. brevis”; SEQ ID NO:2), and Lactobacillus minor (“L. minor”; SEQ ID NO:142) in converting the defined keto substrate to the corresponding chiral alcohol product. Improvements in enzyme activity can include increases in stereoselectivity, enzymatic activity, thermostability, solvent stability, reduced product inhibition, or combinations thereof.

In some embodiments, the ketoreductase polypeptides of the disclosure comprise an amino acid sequence in which the amino acid residue corresponding to position 145 of a reference sequence of SEQ ID NO:2, 4 or 142 is not an acidic residue, i.e., glutamic acid or aspartic acid. As such, the residue corresponding to position 145 is a non-acidic residue. In some embodiments, the ketoreductase polypeptides of the disclosure have an amino acid sequence in which the residue corresponding to position 145 is a polar residue. In some embodiments, this residue corresponding to X145 is a serine.

In some embodiments, the ketoreductase polypeptides of the disclosure have an amino acid sequence that is at least about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more identical to a reference sequence based on SEQ ID NO:2 or SEQ ID NO:4 or SEQ ID NO:142 (or a region or domain thereof, such as residues 90-211) having at the residue corresponding to X145a serine, with the proviso that the ketoreductase polypeptide amino acid sequence has at the residue corresponding to X145a polar residue. In some embodiments, the residue corresponding to residue X145 is a serine. In some embodiments, the ketoreductase polypeptide is based on the sequence formulas of SEQ ID NO:143, 144, or 145, or a domain thereof, such as residues 90-211, in which the residue corresponding to X145 is a polar residue, particularly serine. In some embodiments, the ketoreductase polypeptides can additionally have one or more amino acid residue differences in the amino acid sequence, or the defined domain or region, as compared to the reference sequence, such as the reference sequences of SEQ ID NO:4, 2, or 142. In some embodiments, the amino acid sequence differences in the domain can comprise non-conservative, conservative, as well as a combination of non-conservative and conservative amino acid substitutions. Various amino acid residue positions where such changes can be made are described herein.

In some embodiments, the ketoreductase polypeptides described herein are capable of stereoselectively reducing the substrate to the product with at a higher stereomeric excess than the wildtype enzyme from L. kefir (i.e., SEQ ID NO:4). In some embodiments, the ketoreductase polypeptides described herein are capable of stereoselectively reducing the substrate ketothiolane to the product (R)-3-hydroxythiolane with at least about 70% stereomeric excess.

In another aspect, the present disclosure provides host cells comprising the polynucleotides and/or expression vectors for purposes of manipulation and expression of the ketoreductase polypeptides. The host cells may be L. kefir, L. brevis, or L. minor, or they may be a different organism. The host cells can be used for the expression and isolation of the engineered ketoreductase enzymes described herein, or, alternatively, they can be used directly for the conversion of the 3-ketothiolane substrate to the chiral (R)-3-hydroxythiolane product.

Whether carrying out the method with whole cells, cell extracts or purified ketoreductase enzymes, a single ketoreductase enzyme may be used or, alternatively, mixtures of two or more ketoreductase enzymes may be used.

As noted above, the ketoreductase enzymes described herein are capable of catalyzing the reduction reaction of the keto group in the compound of structural formula (I), 3-ketothiolane,

As such, in some embodiments, the present disclosure provides a method for reducing 3-ketothiolane (“the substrate”) to (R)-3-hydroxythiolane (“the product”), which method comprises contacting or incubating the substrate with a ketoreductase polypeptide of the disclosure under reaction conditions suitable for reducing or converting the substrate to the product. In some embodiments of this method, the substrate is reduced to the product with a stereomeric excess at least greater than about 65% or at least greater than wild-type.

In some embodiments, the methods relate to use of the ketoreductase polypeptides in the synthesis of other compounds, such as drug compounds. In some embodiments, the methods relate to use of the ketoreductase polypeptides in the synthesis of antibiotic sulopenem (CP-70,429), having the following structural formula (III):

Thus, in some embodiments, in a method for the synthesis of the antibiotic of structural formula (III) (i.e., CP-70,429), a step in the method can comprise contacting the substrate of formula (I) with any of the ketoreductases described herein, under reaction conditions suitable for converting or reducing the substrate of structural formula (I) to the product of structural formula (II). The compound of structural formula (II) in stereomeric excess of at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% or more can be used in the preparation of the compound of formula (III).

6.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates the role of ketoreductases (KRED) in the conversion of the substrate compound of formula (I), 3-ketothiolane, to the corresponding chiral alcohol product of formula (II), (R)-3-hydroxythiolane. In this reaction, the substrate is reduced biocatalytically to the corresponding (R)-alcohol. This reduction uses a KRED of the disclosure and a co-factor such as NADPH. A glucose dehydrogenase (GDH) can be used to covert/recycle NADP to NADPH. Glucose is converted to gluconic acid, which in turn is converted to its sodium salt (sodium gluconate) with the addition of sodium hydroxide. Example 9 provides a non-limiting method for conducting the reaction.

7.

DETAILED DESCRIPTION

7.1 Definitions

As used herein, the following terms are intended to have the following meanings.

“Ketoreductase” and “KRED” are used interchangeably herein to refer to a polypeptide having an enzymatic capability of reducing a carbonyl group to its corresponding alcohol. More specifically, the ketoreductase polypeptides described herein are capable of stereoselectively reducing the compound of formula (I), supra to the corresponding product of formula (II), supra. The polypeptide typically utilizes a cofactor reduced nicotinamide adenine dinucleotide (NADH) or reduced nicotinamide adenine dinucleotide phosphate (NADPH) as the reducing agent. Ketoreductases as used herein include naturally occurring (wild type) ketoreductases as well as non-naturally occurring engineered polypeptides generated by human manipulation.

“Coding sequence” refers to that portion of a nucleic acid (e.g., a gene) that encodes an amino acid sequence of a protein.

“Naturally-occurring” or “wild-type” refers to the form found in nature. For example, a naturally occurring or wild-type polypeptide or polynucleotide sequence is a sequence present in an organism that can be isolated from a source in nature and which has not been intentionally modified by human manipulation.

“Recombinant” when used with reference to, e.g., a cell, nucleic acid, or polypeptide, refers to a material, or a material corresponding to the natural or native form of the material, that has been modified in a manner that would not otherwise exist in nature, or is identical thereto but produced or derived from synthetic materials and/or by manipulation using recombinant techniques. Non-limiting examples include, among others, recombinant cells expressing genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise expressed at a different level.

“Percentage of sequence identity” and “percentage homology” are used interchangeably herein to refer to comparisons among polynucleotides and polypeptides, and are determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage may be calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity. Alternatively, the percentage may be calculated by determining the number of positions at which either the identical nucleic acid base or amino acid residue occurs in both sequences or a nucleic acid base or amino acid residue is aligned with a gap to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity. Those of skill in the art appreciate that there are many established algorithms available to align two sequences. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman, 1981, Adv. Appl. Math. 2:482, by the homology alignment algorithm of Needleman and Wunsch, 1970, J. Mol. Biol. 48:443, by the search for similarity method of Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the GCG Wisconsin Software Package), or by visual inspection (see generally, Current Protocols in Molecular Biology, F. M. Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1995 Supplement) (Ausubel)). Examples of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al., 1990, J. Mol. Biol. 215: 403-410 and Altschul et al., 1977, Nucleic Acids Res. 3389-3402, respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information website. This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as, the neighborhood word score threshold (Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=−4, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff, 1989, Proc Natl Acad Sci USA 89:10915). Exemplary determination of sequence alignment and % sequence identity can employ the BESTFIT or GAP programs in the GCG Wisconsin Software package (Accelrys, Madison Wis.), using default parameters provided.

“Reference sequence” refers to a defined sequence used as a basis for a sequence comparison. A reference sequence may be a subset of a larger sequence, for example, a segment of a full-length gene or polypeptide sequence. Generally, a reference sequence is at least 20 nucleotide or amino acid residues in length, at least 25 residues in length, at least 50 residues in length, or the full length of the nucleic acid or polypeptide. Since two polynucleotides or polypeptides may each (1) comprise a sequence (i.e., a portion of the complete sequence) that is similar between the two sequences, and (2) may further comprise a sequence that is divergent between the two sequences, sequence comparisons between two (or more) polynucleotides or polypeptide are typically performed by comparing sequences of the two polynucleotides over a “comparison window” to identify and compare local regions of sequence similarity.

In some embodiments, a “reference sequence” can be based on a primary amino acid sequence, where the reference sequence is a sequence that can have one or more changes in the primary sequence. For instance, a reference sequence “based on SEQ ID NO:4 having at the residue corresponding to X145a serine” refers to a reference sequence in which the corresponding residue at X145 in SEQ ID NO:4 has been changed to a serine.

“Comparison window” refers to a conceptual segment of at least about 20 contiguous nucleotide positions or amino acids residues wherein a sequence may be compared to a reference sequence of at least 20 contiguous nucleotides or amino acids and wherein the portion of the sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The comparison window can be longer than 20 contiguous residues, and includes, optionally 30, 40, 50, 100, or longer windows.

“Substantial identity” refers to a polynucleotide or polypeptide sequence that has at least 80 percent sequence identity, at least 85 percent identity and 89 to 95 percent sequence identity, more usually at least 99 percent sequence identity as compared to a reference sequence over a comparison window of at least 20 residue positions, frequently over a window of at least 30-50 residues, wherein the percentage of sequence identity is calculated by comparing the reference sequence to a sequence that includes deletions or additions which total 20 percent or less of the reference sequence over the window of comparison. In specific embodiments applied to polypeptides, the term “substantial identity” means that two polypeptide sequences, when optimally aligned, such as by the programs GAP or BESTFIT using default gap weights, share at least 80 percent sequence identity, preferably at least 89 percent sequence identity, at least 95 percent sequence identity or more (e.g., 99 percent sequence identity). Preferably, amino acid residue positions which are not identical differ by conservative amino acid substitutions.

“Corresponding to”, “reference to” or “relative to” when used in the context of the numbering of a given amino acid or polynucleotide sequence refers to the numbering of the residues of a specified reference sequence when the given amino acid or polynucleotide sequence is compared to the reference sequence. In other words, the residue number or residue position of a given polymer is designated with respect to the reference sequence rather than by the actual numerical position of the residue within the given amino acid or polynucleotide sequence. For example, a given amino acid sequence, such as that of an engineered ketoreductase, can be aligned to a reference sequence by introducing gaps to optimize residue matches between the two sequences. In these cases, although the gaps are present, the numbering of the residue in the given amino acid or polynucleotide sequence is made with respect to the reference sequence to which it has been aligned.

“Stereoselectivity” refers to the preferential formation in a chemical or enzymatic reaction of one stereoisomer over another. Stereoselectivity can be partial, where the formation of one stereoisomer is favored over the other, or it may be complete where only one stereoisomer is formed. When the stereoisomers are enantiomers, the stereoselectivity is referred to as enantioselectivity, the fraction (typically reported as a percentage) of one enantiomer in the sum of both. It is commonly alternatively reported in the art (typically as a percentage) as the enantiomeric excess (e.e.) calculated therefrom according to the formula [major enantiomer−minor enantiomer]/[major enantiomer+minor enantiomer]. This can also be referred to as stereomeric excess (s.e.). Where the stereoisomers are diastereoisomers, the stereoselectivity is referred to as diastereoselectivity, the fraction (typically reported as a percentage) of one diastereomer in a mixture of two diastereomers.

“Highly stereoselective” refers to a ketoreductase polypeptide that is capable of converting or reducing the substrate to the corresponding (R)-product with at least about 85% stereomeric excess.

“Improved enzyme property” refers to a ketoreductase polypeptide that exhibits an improvement in any enzyme property as compared to a reference ketoreductase. For the engineered ketoreductase polypeptides described herein, the comparison is generally made to the wild-type ketoreductase enzyme, although in some embodiments, the reference ketoreductase can be another improved engineered ketoreductase. Enzyme properties for which improvement is desirable include, but are not limited to, enzymatic activity (which can be expressed in terms of percent conversion of the substrate), thermal stability, pH activity profile, cofactor requirements, refractoriness to inhibitors (e.g., product inhibition), stereospecificity, and stereoselectivity (including enantioselectivity).

“Increased enzymatic activity” refers to an improved property of the engineered ketoreductase polypeptides, which can be represented by an increase in specific activity (e.g., product produced/time/weight protein) or an increase in percent conversion of the substrate to the product (e.g., percent conversion of starting amount of substrate to product in a specified time period using a specified amount of KRED) as compared to the reference ketoreductase enzyme. Exemplary methods to determine enzyme activity are provided in the Examples. Any property relating to enzyme activity may be affected, including the classical enzyme properties of Km, Vmax or kcat, changes of which can lead to increased enzymatic activity. Improvements in enzyme activity can be from about 1.5 times the enzymatic activity of the corresponding wild-type ketoreductase enzyme, to as much as 2 times. 5 times, 10 times, 20 times, 25 times, 50 times, 75 times, 100 times, or more enzymatic activity than the naturally occurring ketoreductase or another engineered ketoreductase from which the ketoreductase polypeptides were derived. In specific embodiments, the engineered ketoreductase enzyme exhibits improved enzymatic activity in the range of 1.5 to 50 times or 1.5 to 100 times greater than that of the parent ketoreductase enzyme. It is understood by the skilled artisan that the activity of any enzyme is diffusion limited such that the catalytic turnover rate cannot exceed the diffusion rate of the substrate, including any required cofactors. The theoretical maximum of the diffusion limit, or kcat/Km, is generally about 108 to 109 (M−1 s−1). Hence, any improvements in the enzyme activity of the ketoreductase will have an upper limit related to the diffusion rate of the substrates acted on by the ketoreductase enzyme. Ketoreductase activity can be measured by any one of standard assays used for measuring ketoreductase, such as a decrease in absorbance or fluorescence of NADPH (see Example 6) due to its oxidation with the concomitant reduction of a ketone or aldehyde to an alcohol, or by product produced in a coupled assay. Comparisons of enzyme activities are made using a defined preparation of enzyme, a defined assay under a set condition, and one or more defined substrates, as further described in detail herein. Generally, when lysates are compared, the numbers of cells and the amount of protein assayed are determined as well as use of identical expression systems and identical host cells to minimize variations in amount of enzyme produced by the host cells and present in the lysates.

“Conversion” refers to the enzymatic reduction of the substrate to the corresponding product. “Percent conversion” refers to the percent of the substrate that is reduced to the product within a period of time under specified conditions. Thus, the “enzymatic activity” or “activity” of a ketoreductase polypeptide can be expressed as “percent conversion” of the substrate to the product.

“Thermostable” refers to a ketoreductase polypeptide that maintains similar activity (more than 60% to 80% for example) after exposure to elevated temperatures (e.g., 40-80° C.) for a period of time (e.g., 0.5-24 hrs) compared to the untreated enzyme.

“Solvent stable” refers to a ketoreductase polypeptide that maintains similar activity (more than e.g., 60% to 80%) after exposure to varying concentrations (e.g., 5-99%) of solvent (e.g., isopropylalcohol, tetrahydrofuran, 2-methyltetrahydrofuran, acetone, toluene, butylacetate, methyl tert-butylether, etc.) for a period of time (e.g., 0.5-24 hrs) compared to the untreated enzyme.

“pH stable” refers to a ketoreductase polypeptide that maintains similar activity (more than e.g., 60% to 80%) after exposure to high or low pH (e.g., 4.5-6 or 8 to 12) for a period of time (e.g., 0.5-24 hrs) compared to the untreated enzyme.

“Thermo- and solvent stable” refers to a ketoreductase polypeptide that are both thermostable and solvent stable.

“Derived from” as used herein in the context of engineered ketoreductase enzymes, identifies the originating ketoreductase enzyme, and/or the gene encoding such ketoreductase enzyme, upon which the engineering was based. For example, the engineered ketoreductase enzyme of SEQ ID NO: 134 was obtained by artificially evolving, over multiple generations the gene encoding the L. kefir ketoreductase enzyme of SEQ ID NO:4. Thus, this engineered ketoreductase enzyme is “derived from” the wild-type ketoreductase of SEQ ID NO.: 4.

“Acidic Amino Acid or Residue” refers to a hydrophilic amino acid or residue having a side chain exhibiting a pK value of less than about 6 when the amino acid is included in a peptide or polypeptide. Acidic amino acids typically have negatively charged side chains at physiological pH due to loss of a hydrogen ion. Genetically encoded acidic amino acids include L-Glu (E) and L-Asp (D).

“Basic Amino Acid or Residue” refers to a hydrophilic amino acid or residue having a side chain exhibiting a pK value of greater than about 6 when the amino acid is included in a peptide or polypeptide. Basic amino acids typically have positively charged side chains at physiological pH due to association with hydronium ion. Genetically encoded basic amino acids include L-Arg (R) and L-Lys (K).

“Polar Amino Acid or Residue” refers to a hydrophilic amino acid or residue having a side chain that is uncharged at physiological pH, but which has at least one bond in which the pair of electrons shared in common by two atoms is held more closely by one of the atoms. Genetically encoded polar amino acids include L-Asn (N), L-Gln (O), L-Ser (S) and L-Thr (T).

“Aromatic Amino Acid or Residue” refers to a hydrophilic or hydrophobic amino acid or residue having a side chain that includes at least one aromatic or heteroaromatic ring. Genetically encoded aromatic amino acids include L-Phe (F), L-Tyr (Y) and L-Trp (W). Although owing to the pKa of its heteroaromatic nitrogen atom L-His (H) it is sometimes classified as a basic residue, or as an aromatic residue as its side chain includes a heteroaromatic ring, herein histidine is classified as a hydrophilic residue or as a “constrained residue” (see below).

“Constrained amino acid or residue” refers to an amino acid or residue that has a constrained geometry. Herein, constrained residues include L-pro (P) and L-his (H). Histidine has a constrained geometry because it has a relatively small imidazole ring. Proline has a constrained geometry because it also has a five membered ring.

“Non-polar Amino Acid or Residue” refers to a hydrophobic amino acid or residue having a side chain that is uncharged at physiological pH and which has bonds in which the pair of electrons shared in common by two atoms is generally held equally by each of the two atoms (i.e., the side chain is not polar). Genetically encoded non-polar amino acids include L-Gly (G), L-Leu (L), L-Val (V), L-Ile (I), L-Met (M) and L-Ala (A).

“Cysteine”. The amino acid L-Cys (C) is unusual in that it can form disulfide bridges with other L-Cys (C) amino acids or other sulfanyl- or sulfhydryl-containing amino acids. The “cysteine-like residues” include cysteine and other amino acids that contain sulfhydryl moieties that are available for formation of disulfide bridges. The ability of L-Cys (C) (and other amino acids with —SH containing side chains) to exist in a peptide in either the reduced free —SH or oxidized disulfide-bridged form affects whether L-Cys (C) contributes net hydrophobic or hydrophilic character to a peptide. While L-Cys (C) exhibits a hydrophobicity of 0.29 according to the normalized consensus scale of Eisenberg (Eisenberg et al., 1984, supra), it is to be understood that for purposes of the present disclosure L-Cys (C) is categorized into its own unique group.

“Small Amino Acid or Residue” refers to an amino acid or residue having a side chain that is composed of a total three or fewer carbon and/or heteroatoms (excluding the α-carbon and hydrogens). The small amino acids or residues may be further categorized as aliphatic, non-polar, polar or acidic small amino acids or residues, in accordance with the above definitions. Genetically-encoded small amino acids include L-Ala (A), L-Val (V), L-Cys (C), L-Asn (N), L-Ser (S), L-Thr (T) and L-Asp (D).

“Conservative” amino acid substitutions or mutations refer to the interchangeability of residues having similar side chains, and thus typically involves substitution of the amino acid in the polypeptide with amino acids within the same or similar defined class of amino acids. However, as used herein, in some embodiments, conservative mutations do not include substitutions from a hydrophilic to hydrophilic, hydrophobic to hydrophobic, hydroxyl-containing to hydroxyl-containing, or small to small residue, if the conservative mutation can instead be a substitution from an aliphatic to an aliphatic, non-polar to non-polar, polar to polar, acidic to acidic, basic to basic, aromatic to aromatic, or constrained to constrained residue. Further, as used herein, A, V, L, or I can be conservatively mutated to either another aliphatic residue or to another non-polar residue. The table below shows exemplary conservative substitutions.

How KEYWORD MONITOR works... a FREEservice from FreshPatents1. Sign up (takes 30 seconds). 2. Fill in the keywords to be monitored. 3. Each week you receive an email with patent applications related to your keywords. Start now! - Receive info on patent apps like Ketoreductase polypeptides for the production of (r)-3-hydroxythiolane or other areas of interest.###

Data source: patent applications published in the public domain by the United States Patent and Trademark Office (USPTO). Information published here is for research/educational purposes only. FreshPatents is not affiliated with the USPTO, assignee companies, inventors, law firms or other assignees. Patent applications, documents and images may contain trademarks of the respective companies/authors. FreshPatents is not responsible for the accuracy, validity or otherwise contents of these public document patent application filings. When possible a complete PDF is provided, however, in some cases the presented document/images is an abstract or sampling of the full patent application for display purposes. FreshPatents.com Terms/Support -g2-0.3021