WO2001012791A1 - DNA shuffling of dioxygenase genes for production of industrial chemicals
- Publication number
- WO2001012791A1 (application PCT/US2000/022038)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- polypeptide
- dioxygenase
- improved
- recombinant
- substituted
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0069—Oxidoreductases (1.) acting on single donors with incorporation of molecular oxygen, i.e. oxygenases (1.13)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
- C12N15/1027—Mutagenizing nucleic acids by DNA shuffling, e.g. RSR, STEP, RPR
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0006—Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
Definitions
- This invention pertains to the shuffling of nucleic acids to achieve or enhance industrial production of chemicals by dioxygenase genes.
- Oxygen-containing organic chemicals such as organic acids, hydroxy carboxylic acids, alcohols, hydroxyaryls (e.g., hydroxyaryl carboxylic acids, alkylphenols, etc.) and glycols are important classes of industrial chemicals.
- These products are generated by successive introduction of various chemical functional groups by oxidation, (trans)alkylation, reduction, desaturation and other reactions of inexpensive raw materials such as saturated and unsaturated hydrocarbons (alkanes, alkenes, etc.) and simple aromatic compounds (benzene, ethyl benzene, cumene, naphthalene, styrene, toluene, xylenes, etc.).
- DOs: dioxygenases
- ADOs: arene dioxygenases
- DOs typically catalyze limited oxidation of these basic chemical building blocks. While potentially interesting from an industrial standpoint, these enzymes typically do not exhibit sufficient turnover numbers and/or desired specificity or regioselectivity to make them usable as industrial catalysts.
- the present invention provides a general (broad utility) method for providing dioxygenase enzymes with higher activity and desired selectivity and specificity of the catalyzed reactions, thereby solving the problems outlined above, as well as providing a variety of other features which will be apparent upon review.
- nucleic acid shuffling is used to generate new or improved dioxygenase ("DO") genes.
- These dioxygenase genes are used to provide dioxygenase enzymes, especially for industrial processes.
- These new or improved genes have surprisingly superior properties as compared to naturally occurring dioxygenase genes.
- a plurality of parental forms are used to obtain dioxygenase genes.
- the selected nucleic acid is derived either from one or more parental nucleic acid(s) which encodes a dioxygenase enzyme, or a fragment thereof, or from a parental nucleic acid which does not encode dioxygenase, but which is a candidate for nucleic acid shuffling to develop dioxygenase activity.
- the plurality of forms of the selected nucleic acid differ from each other in at least one (and typically two or more) nucleotides, and, upon recombination, provide a library of recombinant dioxygenase nucleic acids.
- the library can be an in vitro set of molecules, or present in cells, phage or the like.
- the library is screened to identify at least one recombinant dioxygenase nucleic acid that exhibits distinct or improved dioxygenase activity compared to the parental nucleic acid or nucleic acids.
- the starting DNA segments are first recombined by any of the formats described herein to generate a diverse library of recombinant DNA segments.
- a library can vary widely in size, from fewer than 10 to more than 10^5, 10^7, or 10^9 members.
- the starting segments and the recombinant libraries generated include full-length coding sequences and any essential regulatory sequences, such as a promoter and polyadenylation sequence, required for expression. However, if this is not the case, the recombinant DNA segments in the library can be inserted into a common vector providing the missing sequences before performing screening/selection.
- the library is typically generated by nucleic acid shuffling.
- One preferred method comprises initiating a polynucleotide amplification process on overlapping segments of a population of variant polynucleotides, e.g., allelic or species variants, at least one of which variant polynucleotides typically encodes a dioxygenase polypeptide.
- This amplification is typically carried out under conditions whereby one segment serves as a template for extension of another segment, to generate a population of recombinant polynucleotides.
- the recombinant polynucleotides are typically selected or screened for a desired property, e.g., improved dioxygenase activity.
- the overlapping segments are optionally produced by cleavage of the population of variant polynucleotides, e.g., by DNase I digestion. Alternatively, the overlapping segments are produced by chemical synthesis or by amplification of the population of polynucleotides.
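The fragment-and-reassemble scheme described above can be pictured with a small in silico sketch. It is only an illustration under simplifying assumptions, not the laboratory protocol: the parents are assumed to be pre-aligned strings of equal length, template switching during extension is modeled as random crossover points, and the names `shuffle_parents` and `make_library` and their parameters are hypothetical.

```python
import random


def shuffle_parents(parents, n_crossovers, rng):
    """Build one chimera by copying from aligned parents and switching
    to a different parent at each randomly chosen crossover point."""
    length = len(parents[0])
    if len(parents) < 2 or any(len(p) != length for p in parents):
        raise ValueError("need at least two pre-aligned parents of equal length")
    # Distinct crossover positions strictly inside the sequence.
    points = sorted(rng.sample(range(1, length), n_crossovers))
    bounds = [0] + points + [length]
    current = rng.randrange(len(parents))
    pieces = []
    for start, end in zip(bounds, bounds[1:]):
        pieces.append(parents[current][start:end])
        # Template switch: continue copying from a different parent.
        others = [i for i in range(len(parents)) if i != current]
        current = rng.choice(others)
    return "".join(pieces)


def make_library(parents, size, n_crossovers=2, seed=0):
    """Generate a toy library of recombinant sequences."""
    rng = random.Random(seed)
    return [shuffle_parents(parents, n_crossovers, rng) for _ in range(size)]
```

Each library member is a mosaic in which every position is inherited from one of the parents at that same position, which is the property the screening steps then exploit.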
- shuffling optionally comprises recombining at least first and second forms of a nucleic acid that encodes a dioxygenase polypeptide, or fragment thereof, wherein the first and second forms differ from each other in two or more nucleotides.
- the library of recombinant polynucleotides is then typically expressed to obtain a library of recombinant polypeptides.
- the method further comprises recombining at least one recombinant polynucleotide that encodes a member of the library of recombinant dioxygenase polypeptides with a further polynucleotide, which is the same or different from the first and second forms, to produce a further library of recombinant polynucleotides.
- the further library of recombinant polynucleotides is expressed to obtain a further library of recombinant dioxygenase polypeptides.
- shuffling comprises hybridizing at least two sets of nucleic acids, wherein a first set of nucleic acids comprises single-stranded nucleic acid templates and a second set of nucleic acids comprises at least one set of nucleic acid fragments.
- the method further comprises elongating, ligating, or both, sequence gaps between the hybridized nucleic acid fragments, to generate one or more substantially full-length chimeric nucleic acid sequences corresponding to the single-stranded nucleic acid templates.
- the method optionally comprises denaturing the one or more substantially full-length chimeric nucleic acid sequences and the single-stranded nucleic acid templates; separating the substantially full-length chimeric nucleic acid sequences from the single-stranded nucleic acid templates; and fragmenting the separated substantially full-length chimeric nucleic acid sequences by nuclease digestion or physical fragmentation to provide chimeric nucleic acid fragments.
- Where the sequence recombination format employed is an in vivo format, the library of recombinant DNA segments generated already exists in a cell, which is usually the cell type in which expression of the enzyme with altered substrate specificity is desired.
- Where sequence recombination is performed in vitro, the recombinant library is preferably introduced into the desired cell type before screening/selection.
- the members of the recombinant library can be linked to an episome or virus before introduction or can be introduced directly.
- the library is amplified in a first host, and is then recovered from that host and introduced to a second host more amenable to expression, selection, or screening, or any other desirable parameter.
- the manner in which the library is introduced into the cell type depends on the DNA-uptake characteristics of the cell type (e.g., having viral receptors, being capable of conjugation, or being naturally competent). If the cell type is not susceptible to natural or chemically induced competence, but is susceptible to electroporation, one preferably employs electroporation. If the cell type is not susceptible to electroporation either, one can employ biolistics.
- the biolistic PDS-1000 Gene Gun uses helium pressure to accelerate DNA-coated gold or tungsten microcarriers toward target cells. The process is applicable to a wide range of tissues, including plants, bacteria, fungi, algae, intact animal tissues, tissue culture cells, and animal embryos.
- a candidate shuffled DNA can be tested for encoded dioxygenase activity in essentially any synthetic process.
- Common processes that can be screened include, for example, dihydroxylation of an aromatic ring, olefinic or polyenic alkene π-bond dihydroxylation, oxidation of methyl or methylene groups attached to an aromatic ring, oxidation of methyl or methylene groups attached to a π-bond which is not a part of an aromatic system, sulfur heteroatom monooxygenation, desaturation of alkane groups attached to aromatic ring or non-aromatic π-bonds, oxidative elimination of halide (F, Cl, Br, I), nitrite, ammonia from halogen, nitro or amino substituted π-bonds (aromatic and/or olefinic), N-dealkylation and dearylation of alkylamino- and arylamino-substituted aromatic compounds, O-dealkylation of alkoxy-
- a variety of screening methods can be used to screen a library, depending on the dioxygenase activity for which the library is selected.
- the library to be screened can be present in a population of cells.
- the library is selected by growing the cells in or on a medium comprising the chemical or compound to be oxidized or reduced and selecting for a detected physical difference between the oxidized or reduced form of the chemical or compound and the non-oxidized or reduced form of the chemical or compound, either in the cell, or the extracellular medium. Iterative selection for dioxygenase nucleic acids is also a feature of the invention.
- a selected nucleic acid identified as encoding dioxygenase activity can be shuffled, either with the parental nucleic acids, or with other nucleic acids (e.g., mutated forms of the selected nucleic acid) to produce a second shuffled library.
- the second shuffled library is then selected for one or more forms of dioxygenase activity, which can be the same as or different from the dioxygenase activity previously selected. This process can be iteratively repeated as many times as desired, until a nucleic acid with optimized properties is obtained.
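The iterative cycle of shuffling, selection, and reshuffling described above can be sketched as a simple loop. This is a toy model under stated assumptions, not the patented procedure: sequences are equal-length strings, recombination is a single random crossover, and the `score` callable stands in for whatever screen or selection measures the dioxygenase property of interest. The names `recombine` and `evolve` and all parameters are hypothetical.

```python
import random


def recombine(a, b, rng):
    """Single-crossover recombination of two equal-length sequences."""
    point = rng.randrange(1, len(a))
    return a[:point] + b[point:]


def evolve(parents, score, rounds=3, library_size=50, keep=4, seed=0):
    """Repeat shuffle-then-screen rounds and return the best sequence found."""
    rng = random.Random(seed)
    pool = sorted(set(parents))
    if len(pool) < 2:
        raise ValueError("need at least two distinct parental sequences")
    for _ in range(rounds):
        library = []
        for _ in range(library_size):
            a, b = rng.sample(pool, 2)
            library.append(recombine(a, b, rng))
        # Screen the new library together with the current pool, so the
        # best score can never decrease from one round to the next.
        candidates = sorted(set(library) | set(pool))
        pool = sorted(candidates, key=score, reverse=True)[:keep]
    return pool[0]
```

Carrying the previous round's best candidates into the next pool mirrors the option of shuffling a selected nucleic acid with the parental nucleic acids or with other variants.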
- any dioxygenase nucleic acid identified by any of the methods herein can be cloned and, optionally, expressed.
- the invention also provides methods of increasing dioxygenase activity by whole genome shuffling.
- a plurality of genomic nucleic acids are shuffled in a cell (in whole cell shuffling, entire genomes are shuffled, rather than specific sequences).
- the resulting shuffled nucleic acids are selected for one or more dioxygenase traits.
- the genomic nucleic acids can be from a species or strain different from the cell in which dioxygenase activity is desired.
- the shuffling reaction can be performed in cells using genomic DNA from the same or different species, or strains. Strains or enzymes exhibiting enhanced or modified DO activity can be identified.
- the distinct or improved dioxygenase activity encoded by a nucleic acid identified after shuffling can comprise one or more altered properties selected from a variety of properties of practical interest.
- Several properties of DOs are of particular relevance to the practical use of DOs in making industrial chemicals.
- Preferred catalytic properties of DOs that can be altered in a variety of combinations using nucleic acid shuffling are as follows:
- 1. Rate of reaction which can be catalyzed by the enzyme. Changing the rate of oxidative reactions (turnover numbers, Vmax or other related parameters indicative of reaction rate) for a given substrate, or for a set of substrates, is often useful as wild-type dioxygenases accept substrates of varying structure, but the reaction is often so slow as to be impractical for preparative use.
- 2. Specificity of oxidation.
- oxidation of a compound of interest occurs in a mixture of structurally related compounds, some of which may also serve as DO substrates (e.g. isomers of xylene and other alkylbenzenes).
- Evolving DOs by nucleic acid shuffling for greater specificity towards a particular compound of interest provides a means for using these enzymes in, for example, the reactive separation of compound mixtures, thus allowing for downstream separation of undesired substrates and their alternative uses.
- DOs form multiple or alternative products from a substrate when alternative substrate binding is possible and two or more sites for introducing oxygen are present.
- high or otherwise altered selectivity of DO reaction is often preferred.
- Other properties roughly falling into the category of regioselectivity of reactions catalyzed by DOs include, for example, absolute configuration of chiral products and their enantiomeric purity (e.g., chiral hydroxylation), where chirality of products is possible.
- DOs can catalyze different reactions including, but not limited to: a) monooxygenation of sulfur atoms in various thioethers; b) O- and N-dealkylation of appropriately substituted arenes; c) oxidative dehalogenations and denitrations of halogenated and nitrated arenes; d) monooxygenation (e.g., monohydroxylation) of "benzylic" (i.e., attached to a benzene or other aromatic ring) carbon atoms, whether methylene or methyl groups; e) monooxygenation (e.g., monohydroxylation) of allylic (i.e., attached to a non-aromatic π-bond) carbon atoms, whether methylene or methyl groups; f) desaturation reactions of alkyl or cycloalkyl carbon fragments attached to an aromatic ring (e.g. formation of styrene
- properties of interest are those which may not be directly associated with the catalytic mechanism; however these physical and other general properties can make a profound impact on biocatalyst performance in a practical setting.
- These properties include, for example: an increase in the range of dioxygenase substrates which the distinct or improved polypeptide operates on, an increased expression level of a polypeptide encoded by the nucleic acid, a decrease in susceptibility of a polypeptide encoded by the nucleic acid to protease cleavage, a decrease in susceptibility of a polypeptide encoded by the nucleic acid to high or low pH levels, a decrease in susceptibility of the protein encoded by the nucleic acid to high or low temperatures, an optimization of nucleic acid codon usage for effective expression of ADO polypeptides in a particular host cell, a reduction in the sensitivity of the ADO polypeptides and/or an organism expressing the polypeptide to inactivation by organic solvents, a decrease in ADO inactivation or inhibition
- ADO reaction or from other metabolic reactions of a host cell and a decrease in toxicity to a host cell of a polypeptide encoded by the selected nucleic acid
- the selected nucleic acids to be shuffled can be from any of a variety of sources, including synthetic or cloned DNAs.
- Exemplary targets for recombination include nucleic acids encoding arene dioxygenases and the like.
- shuffled nucleic acids are cloned into expression vectors to achieve desired expression levels.
- a phage display library comprising shuffled forms of a nucleic acid is provided.
- a shuffling mixture comprising at least three homologous DNAs, each of which is derived from a nucleic acid encoding a polypeptide or polypeptide fragment is provided.
- These polypeptides can be, for example, arene dioxygenases, and the like.
- Isolated nucleic acids identified by selection of the libraries in the methods above are also a feature of the invention.
- Biotransformative methods using the dioxygenases of the invention for preparing diverse oxidized organic species, including vicinal diols, hydroxylated aromatic carboxylic acids, hydroxy alkylarenes, α-hydroxycarboxylic acids and the like, are provided. Methods for preparing adducts of oxidized organic species are also provided.
- improved polypeptides and host organisms expressing these polypeptides are also provided.
- Figure 1. Schematic showing the initial dioxygenation of an arene π-bond of a trialkylbenzene to produce a diol using a dioxygenase and the subsequent dehydration of this diol to a hydroxy trialkylbenzene.
- Figure 2. Schematic showing the conversion of a monohydroxy arene to a dihydroxy arene using a dioxygenase.
- Figure 3. Schematic showing esterification and de-esterification strategies using transferases, esterases and chemical dehydration.
- Figure 4. Schematic showing the initial dioxygenation of an arene π-bond of a dialkylbenzene to produce a diol using a dioxygenase and the subsequent dehydration of this diol to a hydroxy dialkylbenzene.
- Figure 5. Schematic showing the oxygenation of an alkylarene to the corresponding arene carboxylic acid, the subsequent oxidation of an arene π-bond to the corresponding diol and the alkylation and/or acylation of the carboxylic acid and/or hydroxyl moieties of the diol.
- Figure 6. Schematic of the selective oxidation of one alkyl group of a dialkylarene to the corresponding arene carboxylic acid, the subsequent oxidation of an arene π-bond to the corresponding diol, the dehydration of the diol and the alkylation and/or acylation of the carboxylic acid and/or hydroxyl moieties of the diol.
- Figure 7. Schematic showing the oxygenation of an alkylarene to the corresponding arenealkyl carboxylic acid, the subsequent oxidation of an arene π-bond to the corresponding diol and the alkylation and/or acylation of the carboxylic acid and/or hydroxyl moieties of the diol.
- Figure 8. Schematic showing coumarin and coumarin derivatives.
- Figure 9. Schematic showing synthetic routes to cinnamic acid, coumarin and derivatives of these compounds.
- Figure 10. Schematic showing the oxygenation of a π-bond of a polycyclic arene to the corresponding diol, the opening of the diol functionalized ring and the conversion of the ring opened product into a lactone.
- Figures 14A-14R. Table of presently preferred substrates and oxygenations.
- Figure 15. Enzymatic reaction schemes for multistep biochemical transformations of olefins to AHAs.
- Figure 16. Schematic illustrating crossovers for clones obtained from dioxygenase shuffling.
- the absolute configuration of the chiral centers is not indicated in the above Figures.
- the chiral centers of the chiral compounds can be R, S, or a mixture of these configurations.
- AHA refers to an α-hydroxycarboxylic acid.
- HCA refers to a hydroxylated aromatic carboxylic acid.
- ADO refers to an arene dioxygenase.
- DO refers to a dioxygenase.
- "Shuffling" is used herein to indicate recombination between non-identical sequences. In some embodiments, shuffling may include crossover via homologous recombination or via non-homologous recombination, such as via cre/lox and/or flp/frt systems.
- Shuffling can be carried out by employing a variety of different formats, including for example, in vitro and in vivo shuffling formats, in silico shuffling formats, shuffling formats that utilize either double-stranded or single-stranded templates, primer based shuffling formats, nucleic acid fragmentation-based shuffling formats, and oligonucleotide-mediated shuffling formats, all of which are based on recombination events between non-identical sequences and are described in more detail or referenced herein below, as well as other similar recombination-based formats.
- a “recombinant” nucleic acid is a nucleic acid produced by recombination between two or more nucleic acids, or any nucleic acid made by an in vitro or artificial process.
- the term "recombinant" when used with reference to a cell indicates that the cell includes (and optionally replicates) a heterologous nucleic acid, or expresses a peptide or protein encoded by a heterologous nucleic acid.
- Recombinant cells can contain genes that are not found within the native (non-recombinant) form of the cell. Recombinant cells can also contain genes found in the native form of the cell where the genes are modified and re-introduced into the cell by artificial means.
- the term also encompasses cells that contain a nucleic acid endogenous to the cell that has been artificially modified without removing the nucleic acid from the cell; such modifications include those obtained by gene replacement, site-specific mutation, and related techniques.
- a "recombinant dioxygenase nucleic acid" is a recombinant nucleic acid encoding a protein or RNA which confers dioxygenase activity to a cell when the nucleic acid is expressed in the cell.
- a "plurality of forms" of a selected nucleic acid refers to a plurality of homologs of the nucleic acid.
- the homologs can be from naturally occurring homologs (e.g., two or more homologous genes) or by artificial synthesis of one or more nucleic acids having related sequences, or by modification of one or more nucleic acid to produce related nucleic acids.
- Nucleic acids are homologous when they are derived, naturally or artificially, from a common ancestor sequence. During natural evolution, this occurs when two or more descendent sequences diverge from a parent sequence over time, i.e., due to mutation and natural selection. Under artificial conditions, divergence occurs, e.g., in one of two ways.
- a given sequence can be artificially recombined with another sequence, as occurs, e.g., during typical cloning, to produce a descendent nucleic acid.
- a nucleic acid can be synthesized de novo, by synthesizing a nucleic acid which varies in sequence from a given parental nucleic acid sequence.
- homology is typically inferred by sequence comparison between two sequences. Where two nucleic acid sequences show sequence similarity it is inferred that the two nucleic acids share a common ancestor. The precise level of sequence similarity required to establish homology varies in the art depending on a variety of factors. For purposes of this disclosure, two sequences are considered homologous where they share sufficient sequence identity to allow recombination to occur between two nucleic acid molecules. Typically, nucleic acids require regions of close similarity spaced roughly the same distance apart to permit recombination to occur.
- The terms "identical" or percent "identity," in the context of two or more nucleic acid or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence, as measured using one of the sequence comparison algorithms described below (or other algorithms available to persons of skill) or by visual inspection.
- The phrase "substantially identical," in the context of two nucleic acids or polypeptides (e.g., DNAs encoding a dioxygenase, or the amino acid sequence of the dioxygenase), refers to two or more sequences or subsequences that have at least about 60%, preferably 80%, most preferably 90-95% nucleotide or amino acid residue identity, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection. Such "substantially identical" sequences are typically considered to be homologous.
- Preferably, the "substantial identity" exists over a region of the sequences that is at least about 50 residues in length, more preferably over a region of at least about 100 residues, and most preferably the sequences are substantially identical over at least about 150 residues, or over the full length of the two sequences to be compared.
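The identity thresholds above can be made concrete with a short sketch. It assumes the two sequences have already been aligned (gaps written as `-`) and computes identity over gap-free columns only, which is one of several conventions in use. The function names, and the choice of the 60% and 50-residue figures as defaults, are illustrative assumptions drawn from the ranges quoted above.

```python
def compared_columns(aligned_a, aligned_b):
    """Return the alignment columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    return [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]


def percent_identity(aligned_a, aligned_b):
    """Percent of gap-free columns in which the two residues are the same."""
    columns = compared_columns(aligned_a, aligned_b)
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def is_substantially_identical(aligned_a, aligned_b,
                               min_identity=60.0, min_region=50):
    """Apply an identity threshold over a minimum compared region."""
    region = len(compared_columns(aligned_a, aligned_b))
    return (region >= min_region
            and percent_identity(aligned_a, aligned_b) >= min_identity)
```

Raising `min_identity` to 80 or 90 and `min_region` to 100 or 150 corresponds to the more preferred ranges stated in the text.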
- For sequence comparison and homology determination, typically one sequence acts as a reference sequence to which test sequences are compared.
- When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated.
- The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
- Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by visual inspection (see generally, Ausubel et al, infra).
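As an illustration of the local homology approach cited above, the following is a minimal sketch of the Smith-Waterman recurrence that returns only the best local alignment score. The scoring values (match, mismatch, and a linear gap penalty) are arbitrary assumptions chosen for the example; packaged implementations such as those named above use configurable matrices and affine gap penalties.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between sequences a and b.

    Uses the Smith-Waterman recurrence with a linear gap penalty and
    keeps only two rows of the dynamic-programming matrix in memory.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            # Scores are floored at zero so an alignment can start anywhere.
            curr.append(max(0, diag, up, left))
            best = max(best, curr[j])
        prev = curr
    return best
```

The zero floor is what distinguishes this local method from the global Needleman-Wunsch alignment, which scores the two sequences end to end.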
- HSPs: high scoring sequence pairs
- initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them.
- the word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction is halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached.
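The three stopping rules for extending a word hit can be sketched as follows. This is a simplified, ungapped illustration for nucleotide sequences rather than the BLAST implementation itself: only the rightward extension is shown (the leftward extension is symmetric), and the function name, argument layout, and default values of M, N, and X are assumptions made for the example.

```python
def extend_right(query, subject, q_start, s_start, word_len, M=1, N=-3, X=5):
    """Extend a word hit to the right without gaps.

    Returns (best_score, best_length), where best_length counts aligned
    positions from the start of the word hit.
    """
    def pair_score(k):
        return M if query[q_start + k] == subject[s_start + k] else N

    # Score of the initial word hit (the seed).
    score = sum(pair_score(k) for k in range(word_len))
    best_score, best_len = score, word_len
    k = word_len
    # Rule 3: stop when the end of either sequence is reached.
    while q_start + k < len(query) and s_start + k < len(subject):
        score += pair_score(k)
        k += 1
        if score > best_score:
            best_score, best_len = score, k
        # Rule 1: score has dropped by X from its maximum.
        # Rule 2: cumulative score has gone to zero or below.
        if best_score - score >= X or score <= 0:
            break
    return best_score, best_len
```

For example, with `query = "ACGTACGTTTTT"`, `subject = "ACGTACGTGGGG"`, and a four-letter seed at position 0 of both, the extension keeps the eight matching positions and stops once two mismatches pull the score 6 below its maximum.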
- the BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment.
- the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1992)).
- In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA 90:5873-5877 (1993)).
- One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance.
- a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
- The phrase "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions, including when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA.
- "Bind(s) substantially" refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target polynucleotide sequence.
- "Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments such as Southern and northern hybridizations are sequence dependent, and are different under different environmental parameters. Longer sequences hybridize specifically at higher temperatures.
- An extensive guide to the hybridization of nucleic acids is found in Tijssen, LABORATORY TECHNIQUES IN BIOCHEMISTRY AND MOLECULAR BIOLOGY-HYBRIDIZATION WITH NUCLEIC ACID PROBES, part I, chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays," Elsevier, New York (1993).
- highly stringent hybridization and wash conditions are selected to be about 5 °C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH.
- Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe.
- Very stringent conditions are selected to be equal to the Tm for a particular probe.
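The relationship between Tm and the stringency levels above amounts to simple arithmetic, sketched below. The text does not say how Tm is obtained, so the estimator here is a swapped-in assumption: the Wallace rule of thumb (2 °C per A or T, 4 °C per G or C), which is only a rough guide for short oligonucleotide probes and is not suitable for the longer duplexes discussed elsewhere in this section. The function names are hypothetical.

```python
def wallace_tm(oligo):
    """Rough Tm estimate in degrees C for a short DNA oligonucleotide.

    Assumption: Wallace rule, Tm = 2*(A+T) + 4*(G+C). Not from the source
    text, and only meaningful for probes of roughly 14 to 20 nucleotides.
    """
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("sequence must contain only A, C, G, T")
    return 2.0 * at + 4.0 * gc


def stringency_temperatures(tm_celsius):
    """Temperatures implied by the definitions in the text for a given Tm."""
    return {
        "very_stringent": tm_celsius,           # equal to Tm
        "highly_stringent": tm_celsius - 5.0,   # about 5 degrees C below Tm
    }
```

In practice a measured Tm, or an estimate that accounts for ionic strength and formamide, would be passed to `stringency_temperatures` instead of the Wallace value.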
- An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42 °C, with the hybridization being carried out overnight.
- An example of highly stringent wash conditions is 0.15M NaCl at 72 °C for about 15 minutes.
- An example of stringent wash conditions is a 0.2x SSC wash at 65 °C for 15 minutes (see, Sambrook, infra., for a description of SSC buffer). Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal.
- An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is 1x SSC at 45 °C for 15 minutes.
- An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6x SSC at 40 °C for 15 minutes.
- stringent conditions typically involve salt concentrations of less than about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30 °C. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide.
- a signal-to-noise ratio at least 2x that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization.
- Nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides which they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code.
- a further indication that two nucleic acid sequences or polypeptides are substantially identical/homologous is that the polypeptide encoded by the first nucleic acid is immunologically cross reactive with, or specifically binds to, the polypeptide encoded by the second nucleic acid.
- a polypeptide is typically substantially identical to a second polypeptide, for example, where the two peptides differ only by conservative substitutions.
- “Conservatively modified variations” of a particular polynucleotide sequence refers to those polynucleotides that encode identical or essentially identical amino acid sequences, or where the polynucleotide does not encode an amino acid sequence, to essentially identical sequences.
- such nucleic acid variations are “silent variations,” which are one species of “conservatively modified variations.”
- individual substitutions, deletions or additions which alter, add or delete a single amino acid or a small percentage of amino acids in an encoded sequence are also “conservatively modified variations.” Sequences that differ by conservative variations are generally homologous.
- a “subsequence” refers to a sequence of nucleic acids or amino acids that comprises a part of a longer sequence of nucleic acids or amino acids (e.g., polypeptide), respectively.
- the term “gene” is used broadly to refer to any segment of DNA associated with expression of a given RNA or protein. Thus, genes include regions encoding expressed RNAs (which typically include polypeptide coding sequences) and, often, the regulatory sequences required for their expression. Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted sequence information, and may include sequences designed to have desired parameters.
- the term “isolated,” when applied to a nucleic acid or protein, denotes that the nucleic acid or protein is essentially free of other cellular components with which it is associated in the natural state.
- nucleic acid refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides which have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g. degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated.
- degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19: 5081 (1991); Ohtsuka et al., J. Biol. Chem. 260: 2605-2608 (1985); Cassol et al. (1992) ; Rossolini et al, Mol. Cell. Probes 8: 91-98 (1994)).
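The idea of silent variations and degenerate codon substitutions can be illustrated with a short sketch. It assumes the standard genetic code and DNA written with the letters A, C, G and T; the function names are hypothetical and are not part of the described methods.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard code applies to the organism of interest).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_silent_variant(seq_a, seq_b):
    """True if the two sequences differ but encode the same polypeptide."""
    return seq_a.upper() != seq_b.upper() and translate(seq_a) == translate(seq_b)


def synonymous_codons(codon):
    """All codons that encode the same amino acid as the given codon."""
    aa = CODON_TABLE[codon.upper()]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


if __name__ == "__main__":
    print(is_silent_variant("ATGGCT", "ATGGCC"))
    print(synonymous_codons("GCT"))
```

For alanine, every third-position base yields the same amino acid, which is why substituting the third position with mixed bases can leave the encoded sequence unchanged.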
- nucleic acid is generic to the terms "gene”, "DNA,” “cDNA”, “oligonucleotide,” “RNA,” “mRNA,” “polynucleotide” and the like.
- Nucleic acid derived from a gene refers to a nucleic acid for whose synthesis the gene, or a subsequence thereof, has ultimately served as a template.
- an mRNA, a cDNA reverse transcribed from an mRNA, an RNA transcribed from that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the amplified DNA, etc. are all derived from the gene and detection of such derived products is indicative of the presence and/or abundance of the original gene and/or gene transcript in a sample.
- a nucleic acid is "operably linked” when it is placed into a functional relationship with another nucleic acid sequence.
- a promoter or enhancer is operably linked to a coding sequence if it increases the transcription of the coding sequence.
- a "recombinant expression cassette” or simply an “expression cassette” is a nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements that are capable of effecting expression of a structural gene in hosts compatible with such sequences. Expression cassettes include at least promoters and optionally, transcription termination signals.
- the recombinant expression cassette includes a nucleic acid to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a promoter.
- an expression cassette can also include nucleotide sequences that encode a signal sequence that directs secretion of an expressed protein from the host cell. Transcription termination signals, enhancers, and other nucleic acid sequences that influence gene expression, can also be included in an expression cassette.
- Alkyl refers to straight- and branched-chain, saturated and unsaturated hydrocarbons.
- Lower alkyl refers to “alkyl” groups having from about 1 to about 6 carbon atoms.
- Substituted alkyl refers to alkyl as just described including one or more functional groups such as lower alkyl, aryl, acyl, halogen (i.e., alkylhalos, e.g., CF 3 ), hydroxy, amino, alkoxy, alkylamino, acylamino, acyloxy, aryloxy, aryloxyalkyl, mercapto, both saturated and unsaturated cyclic hydrocarbons, heterocycles and the like. These groups may be attached to any carbon of the alkyl moiety.
- aryl is used herein to refer to an aromatic substituent which may be a single aromatic ring or multiple aromatic rings which are fused together, linked covalently, or linked to a common group such as a methylene or ethylene moiety.
- the common linking group may also be a carbonyl as in benzophenone.
- the aromatic ring(s) may include phenyl, naphthyl, biphenyl, diphenylmethyl and benzophenone among others.
- aryl encompasses "arylalkyl.”
- alkylarene is used herein to refer to a subset of "aryl” in which the aryl group is substituted with an alkyl group as defined herein.
- Substituted aryl refers to aryl as just described including one or more functional groups such as lower alkyl, acyl, halogen, alkylhalos (e.g. CF 3 ), hydroxy, amino, alkoxy, alkylamino, acylamino, acyloxy, mercapto and both saturated and unsaturated cyclic hydrocarbons which are fused to the aromatic ring(s), linked covalently or linked to a common group such as a methylene or ethylene moiety.
- the linking group may also be a carbonyl such as in cyclohexyl phenyl ketone.
- substituted aryl encompasses "substituted arylalkyl.”
- acyl is used to describe a ketone substituent, — C(O)R, wherein R is alkyl or substituted alkyl, aryl or substituted aryl as defined herein.
- halogen is used herein to refer to fluorine, bromine, chlorine and iodine atoms.
- hydroxy is used herein to refer to the group — OH.
- amino is used to describe primary amines, R — NH 2 , wherein R is alkyl or substituted alkyl, aryl or substituted aryl as defined herein.
- alkoxy is used herein to refer to the — OR group, wherein R is a lower alkyl, substituted lower alkyl, aryl, substituted aryl, arylalkyl or substituted arylalkyl wherein the alkyl, aryl, substituted aryl, arylalkyl and substituted arylalkyl groups are as described herein.
- Suitable alkoxy radicals include, for example, methoxy, ethoxy, phenoxy, substituted phenoxy, benzyloxy, phenethyloxy, t-butoxy, etc.
- alkylamino denotes secondary and tertiary amines wherein the alkyl groups may be either the same or different and may consist of straight or branched, saturated or unsaturated hydrocarbons.
- unsaturated cyclic hydrocarbon is used to describe a non-aromatic group with at least one double bond, such as cyclopentene, cyclohexene, etc. and substituted analogues thereof.
- heteroaryl refers to aromatic rings in which one or more carbon atoms of the aromatic ring(s) are substituted by a heteroatom such as nitrogen, oxygen or sulfur.
- Heteroaryl refers to structures which may be a single aromatic ring, multiple aromatic ring(s), or one or more aromatic rings coupled to one or more non-aromatic ring(s). In structures having multiple rings, the rings can be fused together, linked covalently, or linked to a common group such as a methylene or ethylene moiety.
- the common linking group may also be a carbonyl as in phenyl pyridyl ketone.
- rings such as thiophene, pyridine, isoxazole, phthalimide, pyrazole, indole, furan, etc. or benzo-fused analogues of these rings are defined by the term "heteroaryl.”
- Alkylheteroaryl defines a subset of “heteroaryl” substituted with an alkyl group, as defined herein.
- Substituted heteroaryl refers to heteroaryl as just described wherein the heteroaryl nucleus is substituted with one or more functional groups such as lower alkyl, acyl, halogen, alkylhalos (e.g. CF 3 ), hydroxy, amino, alkoxy, alkylamino, acylamino, acyloxy, mercapto, etc.
- substituted analogues of heteroaromatic rings such as thiophene, pyridine, isoxazole, phthalimide, pyrazole, indole, furan, etc. are defined by the term “substituted heteroaryl.”
- heterocyclic is used herein to describe a saturated or unsaturated non-aromatic group having a single ring or multiple condensed rings from about 1 to about 12 carbon atoms and from about 1 to about 4 heteroatoms selected from nitrogen, sulfur or oxygen within the ring.
- heterocycles are, for example, tetrahydrofuran, morpholine, piperidine, pyrrolidine, etc.
- substituted heterocyclic as used herein describes a subset of “heterocyclic” wherein the heterocycle nucleus is substituted with one or more functional groups such as lower alkyl, acyl, halogen, alkylhalos (e.g., CF 3 ), hydroxy, amino, alkoxy, alkylamino, acylamino, acyloxy, mercapto, etc.
- alkylheterocyclic defines a subset of "heterocyclic” substituted with an alkyl group, as defined herein.
- substituted heterocyclicalkyl defines a subset of “heterocyclic alkyl” wherein the heterocyclic nucleus is substituted with one or more functional groups such as lower alkyl, acyl, halogen, alkylhalos (e.g. CF 3 ), hydroxy, amino, alkoxy, alkylamino, acylamino, acyloxy, mercapto, etc.
- This invention describes the generation of evolved dioxygenases with enhanced performance for use in the production of chemicals of industrial interest using any of a variety of shuffling techniques, including, for example, gene, family and whole genome shuffling as described herein.
- shuffling is used to enhance properties of dioxygenases, such as forward rate kinetics, substrate specificity and affinity and also to decrease susceptibility of dioxygenases to reversible inhibitors and inactivation by solvents, starting materials and reaction products and intermediates generated during the catalytic cycle.
- the present invention provides a method for obtaining a nucleic acid that encodes an improved polypeptide possessing dioxygenase activity.
- the improved polypeptide has at least one property improved over a naturally occurring dioxygenase polypeptide.
- the method includes: (a) creating a library of recombinant polynucleotides encoding a recombinant dioxygenase polypeptide; and (b) screening the library to identify a recombinant polynucleotide that encodes an improved recombinant dioxygenase polypeptide that has at least one property improved over the naturally occurring polypeptide.
- also provided are nucleic acids produced by this method that encode a dioxygenase polypeptide having at least one property improved over a naturally occurring dioxygenase polypeptide.
- the nucleic acid libraries of the invention are constructed by a method that includes shuffling a plurality of parental polynucleotides to produce one or more recombinant dioxygenase polynucleotides encoding the improved property.
- the polynucleotides are homologous. A detailed description of shuffling techniques is provided in Part A, herein below.
- At least one of the parental polynucleotides is selected from polynucleotides that encode at least one dioxygenase activity and those that do not encode at least one dioxygenase activity.
- the parental dioxygenase polynucleotide encodes a complete polypeptide or a polypeptide fragment selected from an arene dioxygenase or fragments thereof.
- the activity catalyzed by a dioxygenase polypeptide is a member selected from one or more reactions described in Figures 14A- 14R. Other oxidative transformations will be apparent to those of skill in the art.
- nucleic acid shuffling can result in optimization of a desirable property even in the absence of a detailed understanding of the mechanism by which the particular property is mediated.
- entirely new properties can be obtained upon shuffling of DNAs, i.e., shuffled DNAs can encode polypeptides or RNAs with properties entirely absent in the parental DNAs which are shuffled.
- properties or characteristics that can be acquired or improved vary widely, and depend on the choice of substrate.
- properties that one can improve include, but are not limited to, increased range of dioxygenase activity encoded by a particular gene, increased potency against a dioxygenase target, increased expression level of the dioxygenase gene, increased tolerance of the protein encoded by the dioxygenase gene to protease degradation (or other natural protein or RNA degradative processes), increased dioxygenase activity ranges for conditions such as heat, cold, low or high pH, reduced toxicity to the host cell, and increased resistance of the polypeptide and/or the organism expressing the polypeptide to organic solvents, and reaction feedstocks, intermediates and products.
- targets for modification vary in different applications, as does the property sought to be acquired or improved.
- candidate targets for acquisition of a property or improvement in a property include genes that encode proteins which have enzymatic or other activities useful in dioxygenase reactions.
- the methods typically use at least two variant forms of a starting target.
- the variant forms of candidate substrates can show substantial sequence or secondary structural similarity with each other, but they should also differ in at least one and preferably at least two positions.
- the initial diversity between forms can be the result of natural variation, e.g., the different variant forms (homologs) are obtained from different individuals or strains of an organism, or constitute related sequences from the same organism (e.g., allelic variations), or constitute homologs from different organisms (interspecific variants).
- initial diversity can be induced, e.g., the variant forms can be generated by error-prone transcription, such as an error-prone PCR or use of a polymerase which lacks proof-reading activity (see, Liao (1990) Gene 88:107-111), of the first variant form, or, by replication of the first form in a mutator strain (mutator host cells are discussed in further detail below, and are generally well known).
- the initial diversity between substrates is greatly augmented in subsequent steps of recombination for library generation.
- a mutator strain can include any mutants in any organism impaired in the functions of mismatch repair. These include mutant gene products of mutS, mutT, mutH, mutL, uvrD, dcm, vsr, umuC, umuD, sbcB, recJ, etc.
- the impairment is achieved by genetic mutation, allelic replacement, selective inhibition by an added reagent such as a small molecule or an expressed antisense RNA, or other techniques. Impairment can be of the genes noted, or of homologous genes in any organism.
- At least two variant forms of a nucleic acid which can confer dioxygenase activity are recombined to produce a library of recombinant dioxygenase genes.
- the library is then screened to identify at least one recombinant dioxygenase gene that is optimized for the particular property or properties of interest.
- the parental polynucleotides can be shuffled in substantially any cell type, including prokaryotes, eukaryotes, yeast, bacteria and fungi.
- the one or more recombinant dioxygenase nucleic acid is present in one or more bacterial, yeast, or fungal cells and the method involves: pooling multiple separate dioxygenase nucleic acids; screening the resulting pooled dioxygenase nucleic acids to identify a distinct or improved recombinant dioxygenase nucleic acid that exhibits distinct or improved dioxygenase activity compared to a non-recombinant dioxygenase activity nucleic acid; and cloning the distinct or improved recombinant nucleic acid.
- Recursive sequence recombination can be employed to achieve still further improvements in a desired property, or to bring about new (or "distinct") properties.
- Recursive sequence recombination entails successive cycles of recombination to generate molecular diversity. That is, one creates a family of nucleic acid molecules showing some sequence identity to each other but differing in the presence of mutations. In any given cycle, recombination can occur in vivo or in vitro, intracellularly or extracellularly.
- diversity resulting from recombination can be augmented in any cycle by applying prior methods of mutagenesis (e.g., error-prone PCR or cassette mutagenesis) to either the substrates or products for recombination.
- a recombination cycle is usually followed by at least one cycle of screening or selection for molecules having a desired property or characteristic.
- if a recombination cycle is performed in vitro, the products of recombination, i.e., recombinant segments, are sometimes introduced into cells before the screening step.
- Recombinant segments can also be linked to an appropriate vector or other regulatory sequences before screening.
- products of recombination generated in vitro are sometimes packaged in viruses (e.g., bacteriophage) before screening.
- recombination products can sometimes be screened in the cells in which recombination occurred.
- recombinant segments are extracted from the cells, and optionally packaged as viruses, before screening.
- a dioxygenase gene can have many component sequences each having a different intended role (e.g., coding sequence, regulatory sequences, targeting sequences, stability-conferring sequences, subunit sequences and sequences affecting integration). Each of these component sequences can be varied and recombined simultaneously. Screening/selection can then be performed, for example, for recombinant segments that have increased ability to confer dioxygenase activity upon a cell without the need to attribute such improvement to any of the individual component sequences of the vector.
- initial round(s) of screening are sometimes performed using bacterial cells due to high transfection efficiencies and ease of culture.
- for eukaryotic dioxygenases such as eukaryotic arene dioxygenases, yeast, fungal or other eukaryotic systems are used for library expression and screening.
- other types of screening that are not amenable to screening in bacterial or simple eukaryotic library cells, are performed in cells selected for use in an environment close to that of their intended use. Final rounds of screening can be performed in the precise cell type of intended use.
- At least one and usually a collection of recombinant segments surviving a first round of screening/selection are subject to a further round of recombination.
- These recombinant segments can be recombined with each other or with exogenous segments representing the original substrates or further variants thereof. Again, recombination can proceed in vitro or in vivo. If the previous screening step identifies desired recombinant segments as components of cells, the components can be subjected to further recombination in vivo, or can be subjected to further recombination in vitro, or can be isolated before performing a round of in vitro recombination.
- if the previous screening step identifies desired recombinant segments in naked form or as components of viruses, these segments can be introduced into cells to perform a round of in vivo recombination.
- the second round of recombination, irrespective of how it is performed, generates further recombinant segments which encompass more diversity than is present in recombinant segments resulting from previous rounds.
- the second round of recombination is optionally followed by a further round of screening/selection according to the principles discussed above for the first round.
- the stringency of screening/selection can be increased between rounds.
- the nature of the screen and the property being screened for can vary between rounds if improvement in more than one property is desired or if acquiring more than one new property is desired. Additional rounds of recombination and screening can then be performed until the recombinant segments have sufficiently evolved to acquire the desired new or improved property or function.
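The cycle of recombination followed by screening/selection described above can be sketched as a simple in-memory loop. This is only a toy model under stated assumptions: sequences are equal-length strings, recombination is a single random crossover, and the `score` function is a hypothetical stand-in for whatever assay measures the desired property.

```python
import random


def crossover(a, b, rng):
    """Single-point crossover between two equal-length sequences."""
    point = rng.randrange(1, len(a))
    return a[:point] + b[point:]


def recursive_recombination(parents, score, rounds=5, library_size=50,
                            keep=5, seed=0):
    """Alternate recombination and screening for a fixed number of rounds.

    `score` is a hypothetical screening function (higher is better).
    Survivors of each round become the substrates of the next round.
    """
    rng = random.Random(seed)
    pool = list(parents)
    for _ in range(rounds):
        # Recombination: build a library from randomly paired survivors.
        library = [crossover(*rng.sample(pool, 2), rng)
                   for _ in range(library_size)]
        # Screening: rank parents and recombinants together, keep the best.
        candidates = sorted(set(pool + library))  # sorted for determinism
        pool = sorted(candidates, key=score, reverse=True)[:keep]
    return pool


if __name__ == "__main__":
    parents = ["AAAATTTT", "TTTTAAAA"]
    best = recursive_recombination(parents, score=lambda s: s.count("A"))
    print(best[0])
```

Because survivors are carried forward, the best score in the pool never decreases between rounds; raising `keep` or changing `score` between rounds would model a change in screening stringency or in the property screened for.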
- the invention provides a recursive method for making a nucleic acid encoding a specific dioxygenase activity.
- the parental nucleic acids are shuffled in a plurality of cells and the method optionally further includes one or more of: (a) recombining DNA from the plurality of cells that display dioxygenase activity with a library of DNA fragments, at least one of which undergoes recombination with a segment in a cellular DNA present in the cells to produce recombined cells, or recombining DNA between the plurality of cells that display dioxygenase activity to produce cells with modified dioxygenase activity; (b) recombining and screening the recombined or modified cells to produce further recombined cells that have evolved additionally modified dioxygenase activity; and, (c) repeating (a) or (b) until the further recombined cells have acquired a desired dioxygenase activity.
- the invention provides a method for making a nucleic acid encoding a specific dioxygenase activity.
- This method includes: (a) recombining at least one distinct or improved recombinant nucleic acid with a further dioxygenase activity nucleic acid, which further nucleic acid is the same or different from one or more of the plurality of parental nucleic acids to produce a library of recombinant dioxygenase nucleic acids; (b) screening the library to identify at least one further distinct or improved recombinant dioxygenase nucleic acid that exhibits a further improvement or distinct property compared to the plurality of parental nucleic acids; and, optionally; (c) repeating (a) and (b) until the resulting further distinct or improved recombinant nucleic acid shows an additionally distinct or improved dioxygenase property.
- the practice of this invention involves the construction of recombinant nucleic acids and the expression of genes in transfected host cells.
- Molecular cloning techniques to achieve these ends are known in the art.
- a wide variety of cloning and in vitro amplification methods suitable for the construction of recombinant nucleic acids such as expression vectors are well-known to persons of skill.
- suitable in vitro amplification methods include the polymerase chain reaction (PCR), the ligase chain reaction (LCR), and RNA polymerase mediated techniques (e.g., NASBA).
- the present invention provides a method of increasing dioxygenase activity in a cell.
- the method includes performing whole genome shuffling of a plurality of genomic nucleic acids in the cell and selecting for one or more dioxygenase activities.
- the genomic nucleic acids can be from substantially any source.
- the genomic nucleic acids are from a species or strain different from the cell.
- the cell is of prokaryotic or eukaryotic origin.
- any dioxygenase property can be selected for using the methods of the invention.
- a preferred property is the activity of the polypeptide towards a particular class of substrates.
- the dioxygenase property is its ability to effect one or more reactions described in Figures 14A-14R.
- the invention provides a nucleic acid shuffling mixture comprising: at least three homologous DNAs, each of which is derived from a nucleic acid encoding a polypeptide or polypeptide fragment which encodes dioxygenase activity.
- the at least three homologous DNAs are present in cell culture or in vitro.
- Oligonucleotides for use as probes e.g., in in vitro amplification methods, for use as gene probes, or as shuffling targets (e.g., synthetic genes or gene segments) are typically synthesized chemically according to the solid phase phosphoramidite triester method described by Beaucage and Caruthers, Tetrahedron Letts., 22(20): 1859-1862
- Oligonucleotides can also be custom made and ordered from a variety of commercial sources known to persons of skill.
- the methods of the invention entail performing recombination ("shuffling") and screening or selection to "evolve" individual genes, whole plasmids or viruses, multigene clusters, or even whole genomes (Stemmer, Bio/Technology 13:549-553 (1995)). Reiterative cycles of recombination and screening/selection can be performed to further evolve the nucleic acids of interest. Such techniques do not require the extensive analysis and computation required by conventional methods for polypeptide engineering. Shuffling allows the recombination of large numbers of mutations in a minimum number of selection cycles, in contrast to natural pairwise recombination events (e.g., as occur during sexual replication).
- sequence recombination techniques described herein provide particular advantages in that they provide recombination between mutations in any or all of these, thereby providing a very fast way of exploring the manner in which different combinations of mutations can affect a desired result. In some instances, however, structural and/or functional information is available which, although not required for sequence recombination, provides opportunities for modification of the technique.
- a variety of nucleic acid shuffling protocols are available and fully described in the art. Descriptions of a variety of shuffling methods for generating modified nucleic acid sequences for use in the methods of the present invention include the following publications and the references cited therein: Stemmer, et al.
- Tumor Targeting 4:1-4; Ness et al. (1999) "DNA Shuffling of subgenomic sequences of subtilisin" Nature Biotechnology 17:893-896; Chang et al. (1999) "Evolution of a cytokine using DNA family shuffling" Nature Biotechnology 17:793-797; Minshull and Stemmer (1999) "Protein evolution by molecular breeding" Current Opinion in Chemical Biology 3:284-290; Christians et al. (1999) "Directed evolution of thymidine kinase for AZT phosphorylation using DNA family shuffling" Nature Biotechnology 17:259-264; Crameri et al.
- First, nucleic acids can be recombined in vitro by any of a variety of techniques discussed in the references above, including e.g., DNAse digestion of nucleic acids to be recombined followed by ligation and/or PCR reassembly of the nucleic acids.
- Second, nucleic acids can be recursively recombined in vivo, e.g., by allowing recombination to occur between nucleic acids in cells.
- Third, whole genome recombination methods can be used in which whole genomes of cells or other organisms are recombined, optionally including spiking of the genomic recombination mixtures with desired library components (e.g., genes corresponding to the pathways of the present invention).
- Fourth, synthetic recombination methods can be used, in which oligonucleotides corresponding to targets of interest are synthesized and reassembled in PCR or ligation reactions which include oligonucleotides which correspond to more than one parental nucleic acid, thereby generating new recombined nucleic acids.
- Oligonucleotides can be made by standard nucleotide addition methods, or can be made, e.g., by tri-nucleotide synthetic approaches.
- Fifth, in silico methods of recombination can be effected in which genetic algorithms are used in a computer to recombine sequence strings which correspond to homologous (or even non-homologous) nucleic acids.
- the resulting recombined sequence strings are optionally converted into nucleic acids by synthesis of nucleic acids which correspond to the recombined sequences, e.g., in concert with oligonucleotide synthesis/gene reassembly techniques.
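An in silico recombination of sequence strings might look like the following sketch. It assumes the parental strings are already aligned to equal length and models only a crossover operator (random switching between parents along the sequence), which is one building block of a genetic algorithm rather than a complete one; all names and the `switch_prob` parameter are hypothetical.

```python
import random


def chimera(parents, switch_prob=0.1, rng=None):
    """Build one recombined string by walking along aligned parents.

    At each position the current parent is kept, or, with probability
    `switch_prob`, a parent is re-drawn at random (a crossover).
    """
    rng = rng or random.Random()
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("parents must be pre-aligned to equal length")
    current = rng.randrange(len(parents))
    out = []
    for i in range(length):
        if rng.random() < switch_prob:
            current = rng.randrange(len(parents))
        out.append(parents[current][i])
    return "".join(out)


def build_library(parents, size, switch_prob=0.1, seed=0):
    """Generate `size` recombined sequence strings from the parents."""
    rng = random.Random(seed)
    return [chimera(parents, switch_prob, rng) for _ in range(size)]


if __name__ == "__main__":
    homologs = ["ATGAAACCCGGG", "ATGAAGCCAGGC", "ATGCAACCTGGA"]
    for seq in build_library(homologs, size=3, switch_prob=0.2):
        print(seq)
```

Each output string could then, as the text notes, be converted into a nucleic acid by synthesis; that step is outside the scope of this sketch.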
- any of the preceding general recombination formats can be practiced in a reiterative fashion to generate a more diverse set of recombinant nucleic acids.
- Sixth, methods of accessing natural diversity, e.g., by hybridization of diverse nucleic acids or nucleic acid fragments to single-stranded templates, followed by polymerization and/or ligation to regenerate full-length sequences, optionally followed by degradation of the templates and recovery of the resulting modified nucleic acids can be used.
- nucleic acids of the invention can be recombined (with each other, or with related (or even unrelated) sequences) to produce a diverse set of recombinant nucleic acids, including, e.g., sets of homologous nucleic acids.
- any nucleic acids which are produced can be selected for a desired activity.
- this can include testing for and identifying any activity that can be detected e.g., in an automatable format, by any of the assays in the art.
- a variety of related (or even unrelated) properties can be assayed for, using any available assay.
- DNA mutagenesis and shuffling provide a robust, widely applicable, means of generating diversity useful for the engineering of proteins, pathways, cells and organisms with improved characteristics.
- In addition to the basic formats described above, it is sometimes desirable to combine shuffling methodologies with other techniques for generating diversity.
- a variety of diversity generation methods can be practiced and the results (i.e., diverse populations of nucleic acids) screened for in the systems of the invention. Additional diversity can be introduced by methods which result in the alteration of individual nucleotides or groups of contiguous or non-contiguous nucleotides, i.e., mutagenesis methods.
- Mutagenesis methods include, for example, those described in PCT/US98/05223; Publ. No. WO98/42727; site-directed mutagenesis (Ling et al. (1997) "Approaches to DNA mutagenesis: an overview" Anal Biochem. 254(2): 157-178; Dale et al. (1996) "Oligonucleotide-directed random mutagenesis using the phosphorothioate method” Methods Mol. Biol. 57:369-374; Smith (1985) "In vitro mutagenesis” Ann. Rev. Genet.
- in error-prone PCR, the reaction is performed under conditions where the copying fidelity of the DNA polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product.
- Examples of such techniques are found in the references above and, e.g., in Leung et al. (1989) Technique 1:11-15 and Caldwell et al. (1992) PCR Methods Applic. 2:28-33.
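The effect of the low-fidelity PCR described above, point mutations scattered along the whole product, can be modeled with a small simulation. This is a sketch under simplifying assumptions: every base is miscopied independently with the same probability, only substitutions occur, and the `error_rate` value is a hypothetical parameter rather than a measured polymerase property.

```python
import random

DNA_BASES = "ACGT"


def error_prone_copy(template, error_rate, rng):
    """Copy a template, substituting each base with probability error_rate."""
    out = []
    for base in template:
        if rng.random() < error_rate:
            # Substitute with one of the three other bases.
            out.append(rng.choice([b for b in DNA_BASES if b != base]))
        else:
            out.append(base)
    return "".join(out)


def mutant_library(template, size, error_rate=0.01, seed=0):
    """Generate `size` independently mutated copies of the template."""
    rng = random.Random(seed)
    return [error_prone_copy(template, error_rate, rng) for _ in range(size)]


def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


if __name__ == "__main__":
    template = "ATGGCTAGCTAGGATCCGATTACA"
    for mutant in mutant_library(template, size=3, error_rate=0.05):
        print(mutant, hamming(template, mutant))
```

Under this model the expected number of substitutions per copy is the sequence length multiplied by `error_rate`, so longer products accumulate proportionally more mutations.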
- assembly PCR can be used, in a process which involves the assembly of a PCR product from a mixture of small DNA fragments. A large number of different PCR reactions can occur in parallel in the same vial, with the products of one reaction priming the products of another reaction.
- Sexual PCR mutagenesis can be used in which homologous recombination occurs between DNA molecules of different but related DNA sequence in vitro, by random fragmentation of the DNA molecule based on sequence homology, followed by fixation of the crossover by primer extension in a PCR reaction. This process is described in the references above, e.g., in Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751.
- Recursive ensemble mutagenesis can be used in which an algorithm for protein mutagenesis is used to produce diverse populations of phenotypically related mutants whose members differ in amino acid sequence. This method uses a feedback mechanism to control successive rounds of combinatorial cassette mutagenesis. Examples of this approach are found in Arkin & Youvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815.
- oligonucleotide directed mutagenesis can be used in a process which allows for the generation of site-specific mutations in any nucleic acid sequence of interest. Examples of such techniques are found in the references above and, e.g., in Reidhaar-Olson et al. (1988) Science 241:53-57.
- cassette mutagenesis can be used in a process which replaces a small region of a double stranded DNA molecule with a synthetic oligonucleotide cassette that differs from the native sequence.
- the oligonucleotide can contain, e.g., completely and/or partially randomized native sequence(s).
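Cassette mutagenesis with a completely randomized cassette can be sketched as follows. The function name `cassette_library`, the 0-based half-open coordinates, and the uniform choice of bases are assumptions for illustration only; a partially randomized cassette would bias the draw toward the native base instead.

```python
import random

def cassette_library(parent, start, end, n_variants, seed=0):
    """Replace parent[start:end] with fully randomized cassettes.

    Coordinates are 0-based and half-open (an illustrative convention).
    The flanking sequence is left untouched in every variant.
    """
    if not 0 <= start <= end <= len(parent):
        raise ValueError("cassette region must lie within the parent sequence")
    rng = random.Random(seed)
    variants = []
    for _ in range(n_variants):
        cassette = "".join(rng.choice("ACGT") for _ in range(end - start))
        variants.append(parent[:start] + cassette + parent[end:])
    return variants

if __name__ == "__main__":
    for variant in cassette_library("AAAACCCCGGGG", 4, 8, n_variants=3):
        print(variant)
```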
- In vivo mutagenesis can be used in a process of generating random mutations in any cloned DNA of interest which involves the propagation of the DNA, e.g., in a strain of E. coli that carries mutations in one or more of the DNA repair pathways. These "mutator" strains have a higher random mutation rate than that of a wild-type parent. Propagating the DNA in one of these strains will eventually generate random mutations within the DNA.
- Exponential ensemble mutagenesis can be used for generating combinatorial libraries with a high percentage of unique and functional mutants, where small groups of residues are randomized in parallel to identify, at each altered position, amino acids which lead to functional proteins.
- Kits for mutagenesis are also commercially available.
- kits are available from, e.g., Stratagene (e.g., QuickChange™ site-directed mutagenesis kit; and Chameleon™ double-stranded, site-directed mutagenesis kit), Bio/Can Scientific, Bio-Rad (e.g., using the Kunkel method described above), Boehringer Mannheim Corp., Clontech Laboratories, DNA Technologies, Epicentre Technologies (e.g., 5 prime 3 prime kit), Genpak Inc, Lemargo Inc, Life Technologies (Gibco BRL), New England Biolabs, Pharmacia Biotech, Promega Corp., Quantum Biotechnologies, Amersham International plc (e.g., using the Eckstein method above), and Boothn Biotechnology Ltd (e.g., using the Carter/Winter method above).
- any of the described shuffling or mutagenesis techniques can be used in conjunction with procedures which introduce additional diversity into a genome, e.g. a bacterial, fungal, animal or plant genome.
- techniques have been proposed which produce nucleic acid multimers suitable for transformation into a variety of species (see, e.g., Schellenberger U.S. Patent No. 5,756,316 and the references above).
- when such multimers, consisting of genes that are divergent with respect to one another (e.g., derived from natural diversity or through application of site directed mutagenesis, error prone PCR, passage through mutagenic bacterial strains, and the like), are transformed into a suitable host, they provide a source of nucleic acid diversity for DNA diversification.
- Multimers transformed into host species are suitable as substrates for in vivo shuffling protocols.
- a multiplicity of polynucleotides sharing regions of partial sequence similarity can be transformed into a host species and recombined in vivo by the host cell.
- Subsequent rounds of cell division can be used to generate libraries, members of which comprise a single, homogeneous population of monomeric or pooled nucleic acid.
- the monomeric nucleic acid can be recovered by standard techniques and recombined in any of the described shuffling formats.
- Shuffling formats employing chain termination methods have also been proposed (see e.g., U.S. Patent No. 5,965,408 and the references above).
- double stranded DNAs corresponding to one or more genes sharing regions of sequence similarity are combined and denatured, in the presence or absence of primers specific for the gene.
- the single stranded polynucleotides are then annealed and incubated in the presence of a polymerase and a chain terminating reagent (e.g., ultraviolet, gamma or X-ray irradiation; ethidium bromide or other intercalators; DNA binding proteins, such as single strand binding proteins, transcription activating factors, or histones; polycyclic aromatic hydrocarbons; trivalent chromium or a trivalent chromium salt; or abbreviated polymerization mediated by rapid thermocycling; and the like), resulting in the production of partial duplex molecules.
- the partial duplex molecules, e.g., containing partially extended chains, are then denatured and reannealed in subsequent rounds of replication or partial replication, resulting in polynucleotides which share varying degrees of sequence similarity and which are chimeric with respect to the starting population of DNA molecules.
- the products or partial pools of the products can be amplified at one or more stages in the process.
- Polynucleotides produced by a chain termination method, such as described above, are suitable substrates for DNA shuffling according to any of the described formats. Diversity can be further increased by combining methods that are not homology based with DNA shuffling (which, as set forth in the above publications and applications, can be homology or non-homology based, depending on the precise format).
- Multispecies expression libraries are, in general, libraries comprising cDNA or genomic sequences from a plurality of species or strains, operably linked to appropriate regulatory sequences, in an expression cassette.
- the cDNA and/or genomic sequences are optionally randomly concatenated to further enhance diversity.
- the vector can be a shuttle vector suitable for transformation and expression in more than one species of host organism, e.g., bacterial species, eukaryotic cells.
- the library is biased by preselecting sequences which encode a protein of interest, or which hybridize to a nucleic acid of interest.
- Any such libraries can be provided as substrates for any of the methods herein described.
- it is desirable to preselect or prescreen libraries (e.g., an amplified library, a genomic library, a cDNA library, a normalized library, etc.) or other substrate nucleic acids prior to shuffling, or to otherwise bias the substrates towards nucleic acids that encode functional products (shuffling procedures can also, independently, have these effects).
- Libraries can be biased towards nucleic acids which encode proteins with desirable enzyme activities.
- the clone can be mutagenized using any known method for introducing DNA alterations, including, but not restricted to, DNA shuffling.
- a library comprising the mutagenized homologues is then screened for a desired activity, which can be the same as or different from the initially specified activity.
- Desired activities can be identified by any method known in the art.
- WO 99/10539 proposes that gene libraries can be screened by combining extracts from the gene library with components obtained from metabolically rich cells and identifying combinations which exhibit the desired activity.
- clones with desired activities can be identified by inserting bioactive substrates into samples of the library, and detecting bioactive fluorescence corresponding to the product of a desired activity using a fluorescent analyzer, e.g., a flow cytometry device, a CCD, a fluorometer, or a spectrophotometer.
- Libraries can also be biased towards nucleic acids which have specified characteristics, e.g., hybridization to a selected nucleic acid probe.
- polynucleotides encoding a desired activity (e.g., an enzymatic activity, for example: a lipase, an esterase, a protease, a glycosidase, a glycosyl transferase, a phosphatase, a kinase, an oxygenase, a peroxidase, a hydrolase, a hydratase, a nitrilase, a transaminase, an amidase or an acylase) can be identified from among genomic DNA sequences in the following manner.
- Single stranded DNA molecules from a population of genomic DNA are hybridized to a ligand-conjugated probe.
- the genomic DNA can be derived from either a cultivated or uncultivated microorganism, or from an environmental sample. Alternatively, the genomic DNA can be derived from a multicellular organism, or a tissue derived therefrom.
- Second strand synthesis can be conducted directly from the hybridization probe used in the capture, with or without prior release from the capture medium or by a wide variety of other strategies known in the art.
- the isolated single-stranded genomic DNA population can be fragmented without further cloning and used directly in a shuffling format that employs a single-stranded template.
- Assembly of complex chimeric genes from this population is then mediated by nuclease-based removal of non-hybridizing fragment ends, polymerization to fill gaps between such fragments and subsequent single stranded ligation.
- the parental strand can be removed by digestion (if RNA or uracil-containing), magnetic separation under denaturing conditions (if labeled in a manner conducive to such separation) and other available separation/purification methods.
- the parental strand is optionally co-purified with the chimeric strands and removed during subsequent screening and processing steps.
- single-stranded molecules are converted to double-stranded DNA (dsDNA) and the dsDNA molecules are bound to a solid support by ligand-mediated binding.
- the selected DNA molecules are released from the support and introduced into a suitable host cell to generate a library enriched for sequences which hybridize to the probe.
- a library produced in this manner provides a desirable substrate for further shuffling using any of the shuffling reactions described herein.
- the shuffling of a single gene and the shuffling of a family of genes provide two of the most powerful methods available for improving and "migrating" (gradually changing the type of reaction, substrate or activity of a selected enzyme) the functions of biocatalysts.
- in family shuffling, homologous sequences, e.g., from different species or chromosomal positions, are recombined.
- in single gene shuffling, a single sequence is mutated or otherwise altered and then recombined.
- the breeding procedure starts with at least two substrates that generally show substantial sequence identity to each other (i.e., at least about 30%, 50%, 70%, 80% or 90% sequence identity), but differ from each other at certain positions.
- the difference can be any type of mutation, for example, substitutions, insertions and deletions.
- different segments differ from each other in about 5-20 positions.
- the starting materials must differ from each other in at least two nucleotide positions. That is, if there are only two substrates, there should be at least two divergent positions. If there are three substrates, for example, one substrate can differ from the second at a single position, and the second can differ from the third at a different single position.
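The two requirements above (substantial sequence identity, and at least two divergent positions across the pool) can be checked with a short script. This is a minimal sketch that assumes the substrates are already aligned and of equal length, which real substrates containing insertions or deletions would not be; the function names are illustrative.

```python
def differing_positions(a, b):
    """Indices at which two pre-aligned, equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]

def percent_identity(a, b):
    """Percentage of aligned positions at which the two sequences agree."""
    return 100.0 * (len(a) - len(differing_positions(a, b))) / len(a)

def pool_divergent_positions(substrates):
    """Indices at which the substrates do not all carry the same residue."""
    return [i for i, column in enumerate(zip(*substrates)) if len(set(column)) > 1]

if __name__ == "__main__":
    # Three substrates: the first and second differ at one position,
    # the second and third differ at a different single position.
    pool = ["AAAA", "AAAT", "CAAT"]
    print(percent_identity(pool[0], pool[1]))       # 75.0
    print(pool_divergent_positions(pool))           # [0, 3]
    print(len(pool_divergent_positions(pool)) >= 2) # True
```

The three-substrate pool in the usage example mirrors the case described in the text: no pair needs to differ at two positions as long as the pool as a whole carries at least two divergent positions.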
- the starting DNA segments can be natural variants of each other, for example, allelic or species variants.
- the segments can also be from nonallelic genes showing some degree of structural and usually functional relatedness (e.g., different genes within a superfamily, such as the arene dioxygenase super family).
- the starting DNA segments can also be induced variants of each other.
- one DNA segment can be produced by error-prone PCR replication of the other, or by substitution of a mutagenic cassette. Induced mutants can also be prepared by propagating one (or both) of the segments in a mutagenic strain. In these situations, strictly speaking, the second DNA segment is not a single segment but a large family of related segments.
- the different segments forming the starting materials are often the same length or substantially the same length. However, this need not be the case; for example, one segment can be a subsequence of another.
- the segments can be present as part of larger molecules, such as vectors, or can be in isolated form.
- the starting DNA segments are recombined by any of the sequence recombination formats provided herein to generate a diverse library of recombinant DNA segments.
- a library can vary widely in size from having fewer than 10 to more than 10^5, 10^9, 10^12 or more members.
- the starting segments and the recombinant libraries generated will include full-length coding sequences and any essential regulatory sequences, such as a promoter and polyadenylation sequence, required for expression.
- the recombinant DNA segments in the library can be inserted into a common vector providing sequences necessary for expression before performing screening/selection.
- one technique uses restriction enzyme sites in nucleic acids to direct the recombination of mutations in a nucleic acid sequence of interest. These techniques are particularly preferred in the evolution of fragments that cannot readily be shuffled by existing methods due to the presence of repeated DNA or other problematic primary sequence motifs. These situations also include recombination formats in which it is preferred to retain certain sequences unmutated.
- the use of restriction enzyme sites is also preferred for shuffling large fragments (typically greater than 10 kb), such as gene clusters that cannot be readily shuffled and "PCR-amplified" because of their size. Although fragments up to 50 kb have been reported to be amplified by PCR (Barnes, Proc. Natl. Acad. Sci.
- the restriction endonucleases used are of the Class II type (Sambrook, Ausubel and Berger, supra) and of these, preferably those which generate nonpalindromic sticky end overhangs such as AlwN I, Sfi I or BstX I. These enzymes generate nonpalindromic ends that allow for efficient ordered reassembly with DNA ligase.
- restriction enzyme (or endonuclease) sites are identified by conventional restriction enzyme mapping techniques (Sambrook, Ausubel, and Berger, supra.), by analysis of sequence information for that gene, or by introduction of desired restriction sites into a nucleic acid sequence by synthesis (i.e. by incorporation of silent mutations).
- the DNA substrate molecules to be digested can either be from in vivo replicated DNA, such as a plasmid preparation, or from PCR amplified nucleic acid fragments harboring the restriction enzyme recognition sites of interest, preferably near the ends of the fragment.
- at least two variants of a gene of interest, each having one or more mutations are digested with at least one restriction enzyme determined to cut within the nucleic acid sequence of interest.
- the restriction fragments are then joined with DNA ligase to generate full length genes having shuffled regions. The number of regions shuffled will depend on the number of cuts within the nucleic acid sequence of interest.
- the shuffled molecules can be introduced into cells as described above and screened or selected for a desired property as described herein. Nucleic acids can then be isolated from pools (libraries), or clones having desired properties and subjected to the same procedure until a desired degree of improvement is obtained.
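The digestion and ordered religation described above can be sketched in a few lines. This is a toy model under stated assumptions: the recognition site is a placeholder string (`"GGCC"` in the usage example is not meant to represent any of the enzymes named in the text), the cut is placed immediately before the site, every variant carries the same number of sites, and ligation is assumed to restore the original fragment order.

```python
from itertools import product

def digest(seq, site):
    """Cut `seq` immediately before every occurrence of `site`."""
    cuts = []
    i = seq.find(site)
    while i != -1:
        cuts.append(i)
        i = seq.find(site, i + 1)
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]

def shuffle_by_restriction(variants, site):
    """Ordered reassembly: region k of a product may come from any variant."""
    digests = [digest(v, site) for v in variants]
    n_regions = len(digests[0])
    if any(len(d) != n_regions for d in digests):
        raise ValueError("all variants must carry the same number of sites")
    regions = [[d[k] for d in digests] for k in range(n_regions)]
    return sorted({"".join(parts) for parts in product(*regions)})

if __name__ == "__main__":
    # One cut gives two shuffled regions; two variants give up to 2**2 products.
    for gene in shuffle_by_restriction(["AAGGCCTT", "ATGGCCTA"], "GGCC"):
        print(gene)
```

With c cuts there are c + 1 regions, so v variants yield at most v^(c+1) distinct full-length products, which is why the number of regions shuffled depends on the number of cuts.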
- At least one DNA substrate molecule or fragment thereof is isolated and subjected to mutagenesis.
- the pool or library of religated restriction fragments is subjected to mutagenesis before the digestion-ligation process is repeated.
- "Mutagenesis" as used herein includes such techniques known in the art as PCR mutagenesis, oligonucleotide-directed mutagenesis, site-directed mutagenesis, etc., and recursive sequence recombination by any of the techniques described herein.
- a further technique for recombining mutations in a nucleic acid sequence utilizes "reassembly PCR." This method can be used to assemble multiple segments that have been separately evolved into a full length nucleic acid template such as a gene. This technique is performed when a pool of advantageous mutants is known from previous work or has been identified by screening mutants that may have been created by any mutagenesis technique known in the art, such as PCR mutagenesis, cassette mutagenesis, doped oligo mutagenesis, chemical mutagenesis, or propagation of the DNA template in vivo in mutator strains.
- Boundaries defining segments of a nucleic acid sequence of interest preferably lie in intergenic regions, introns, or areas of a gene not likely to have mutations of interest.
- oligonucleotide primers are synthesized for PCR amplification of segments of the nucleic acid sequence of interest, such that the sequences of the oligonucleotides overlap the junctions of two segments.
- the overlap region is typically about 10 to 100 nucleotides in length.
- Each of the segments is amplified with a set of such primers.
- the PCR products are then "reassembled" according to assembly protocols such as those discussed herein to assemble randomly fragmented genes.
- the PCR products are first purified away from the primers, by, for example, gel electrophoresis or size exclusion chromatography. Purified products are mixed together and subjected to about 1-10 cycles of denaturing, reannealing, and extension in the presence of polymerase and deoxynucleoside triphosphates (dNTP's) and appropriate buffer salts in the absence of additional primers ("self-priming"). Subsequent PCR with primers flanking the gene is used to amplify the yield of the fully reassembled and shuffled genes.
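The self-priming step relies on each amplified segment sharing an overlap with its neighbor at the segment junction. The following minimal sketch joins segments through their longest exact terminal overlap; exact matching, the function names, and the default `min_overlap` of 10 (the lower end of the overlap range given above) are illustrative assumptions, and real annealing tolerates mismatches.

```python
def join_overlapping(left, right, min_overlap=10):
    """Join two segments through the longest exact suffix/prefix overlap."""
    for k in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    raise ValueError("no overlap of at least %d nucleotides" % min_overlap)

def reassemble(segments, min_overlap=10):
    """Assemble an ordered list of overlapping segments into one sequence."""
    assembled = segments[0]
    for segment in segments[1:]:
        assembled = join_overlapping(assembled, segment, min_overlap)
    return assembled

if __name__ == "__main__":
    # Toy segments sharing the 9-nucleotide junction "CTAGGATCC".
    segments = ["ATGGCTAGCTAGGATCC", "CTAGGATCCGGTTAA"]
    print(reassemble(segments, min_overlap=5))  # ATGGCTAGCTAGGATCCGGTTAA
```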
- the resulting reassembled genes are subjected to mutagenesis before the process is repeated.
- the PCR primers for amplification of segments of the nucleic acid sequence of interest are used to introduce variation into the gene of interest as follows. Mutations at sites of interest in a nucleic acid sequence are identified by screening or selection, by sequencing homologues of the nucleic acid sequence, and so on. Oligonucleotide PCR primers are then synthesized which encode wild type or mutant information at sites of interest. These primers are then used in PCR mutagenesis to generate libraries of full length genes encoding permutations of wild type and mutant information at the designated positions. This technique is typically advantageous in cases where the screening or selection process is expensive, cumbersome, or impractical relative to the cost of sequencing the genes of mutants of interest and synthesizing mutagenic oligonucleotides.
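A library encoding every permutation of wild type and mutant information at the designated positions can be enumerated directly. This is a minimal sketch at the sequence level only; the function name and the `mutations` mapping (0-based position to mutant residue) are assumptions introduced for illustration.

```python
from itertools import product

def combinatorial_library(wild_type, mutations):
    """Enumerate all wild-type/mutant combinations at the given positions.

    `mutations` maps a 0-based position to the mutant residue at that site.
    """
    positions = sorted(mutations)
    library = []
    for choice in product([False, True], repeat=len(positions)):
        seq = list(wild_type)
        for pos, use_mutant in zip(positions, choice):
            if use_mutant:
                seq[pos] = mutations[pos]
        library.append("".join(seq))
    return library

if __name__ == "__main__":
    # Two sites of interest give 2**2 = 4 library members.
    print(combinatorial_library("ACGT", {0: "T", 3: "A"}))
    # ['ACGT', 'ACGA', 'TCGT', 'TCGA']
```

The library size grows as 2^n for n designated positions, which is one reason the approach suits cases where sequencing and oligonucleotide synthesis are cheap relative to screening.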
- sequence information from one or more substrate sequences is added to a given "parental" sequence of interest, with subsequent recombination between rounds of screening or selection.
- this is done with site-directed mutagenesis performed by techniques well known in the art (e.g., Berger, Ausubel and Sambrook, supra.) with one substrate as template and oligonucleotides encoding single or multiple mutations from other substrate sequences, e.g. homologous genes.
- the selected recombinant(s) can be further evolved using RSR techniques described herein.
- site-directed mutagenesis can be done again with another collection of oligonucleotides encoding homologue mutations, and the above process repeated until the desired properties are obtained.
- degenerate oligonucleotides can be used that encode the sequences in both homologues.
- One oligonucleotide can include many such degenerate codons and still allow one to exhaustively search all permutations over that block of sequence.
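The idea of a single degenerate oligonucleotide covering both homologues can be sketched with the standard IUPAC nucleotide ambiguity codes. The helper names are illustrative, and the homologues are assumed to be pre-aligned and of equal length.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
BASES_TO_CODE = {frozenset(bases): code for code, bases in IUPAC.items()}

def degenerate_oligo(homologues):
    """Smallest degenerate sequence that encodes every aligned homologue."""
    return "".join(BASES_TO_CODE[frozenset(col)] for col in zip(*homologues))

def expand(oligo):
    """All concrete sequences encoded by a degenerate oligonucleotide."""
    return ["".join(p) for p in product(*(IUPAC[c] for c in oligo))]

if __name__ == "__main__":
    oligo = degenerate_oligo(["ACGT", "ATGA"])
    print(oligo)          # AYGW
    print(expand(oligo))  # ['ACGA', 'ACGT', 'ATGA', 'ATGT']
```

In the usage example, two degenerate positions yield four sequences: both parents plus the two recombinants, so the one oligonucleotide searches every permutation over that block.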
- When the homologue sequence space is very large, it can be advantageous to restrict the search to certain variants.
- computer modeling tools (Lathrop et al., J. Mol. Biol. 255:641-665 (1996)) can be used to model each homologue mutation onto the target protein and discard any mutations that are predicted to grossly disrupt structure and function.
- the initial substrates for recombination are a pool of related sequences, e.g., different variant forms, as homologs from different individuals, strains, or species of an organism, or related sequences from the same organism, as allelic variations.
- the sequences can be DNA or RNA and can be of various lengths depending on the size of the gene or DNA fragment to be recombined or reassembled.
- the sequences are from 50 base pairs (bp) to 50 kilobases (kb).
- the pool of related substrates is converted into overlapping fragments, e.g., from about 5 bp to 5 kb or more.
- the size of the fragments is from about 10 bp to 1000 bp, and sometimes the size of the DNA fragments is from about 100 bp to 500 bp.
- the conversion can be effected by a number of different methods, such as DNase I or RNase digestion, random shearing or partial restriction enzyme digestion.
- the concentration of nucleic acid fragments of a particular length and sequence is often less than 0.1% or 1% by weight of the total nucleic acid.
- the number of different specific nucleic acid fragments in the mixture is usually at least about 100, 500 or 1000.
- the mixed population of nucleic acid fragments is converted to at least partially single-stranded form using a variety of techniques, including, for example, heating, chemical denaturation, use of DNA binding proteins, and the like. Conversion can be effected by heating to about 80 °C to 100 °C, more preferably from 90 °C to 96 °C, to form single-stranded nucleic acid fragments and then reannealing. Conversion can also be effected by treatment with single-stranded DNA binding protein (see Wold, Annu. Rev. Biochem. 66:61-92 (1997)) or recA protein (see, e.g., Kiianitsa, Proc. Natl. Acad. Sci. USA 94:7837-7840 (1997)).
- Single-stranded nucleic acid fragments having regions of sequence identity with other single-stranded nucleic acid fragments can then be reannealed by cooling to 20 °C to 75 °C, and preferably from 40 °C to 65 °C. Renaturation can be accelerated by the addition of polyethylene glycol (PEG), other volume-excluding reagents or salt.
- the salt concentration is preferably from 0 mM to 200 mM, more preferably the salt concentration is from 10 mM to 100 mM.
- the salt may be KCl or NaCl.
- the concentration of PEG is preferably from 0% to 20%, more preferably from 5% to 10%.
- the fragments that reanneal can be from different substrates.
- the annealed nucleic acid fragments are incubated in the presence of a nucleic acid polymerase, such as Taq or Klenow, and dNTP's (i.e. dATP, dCTP, dGTP and dTTP). If regions of sequence identity are large, Taq polymerase can be used with an annealing temperature of between 45-65 °C. If the areas of identity are small, Klenow polymerase can be used with an annealing temperature of between 20-30 °C. The polymerase can be added to the random nucleic acid fragments prior to annealing, simultaneously with annealing or after annealing.
- the process of denaturation, renaturation and incubation in the presence of polymerase of overlapping fragments to generate a collection of polynucleotides containing different permutations of fragments is sometimes referred to as shuffling of the nucleic acid in vitro.
- This cycle is repeated for a desired number of times. Preferably the cycle is repeated from 2 to 100 times, more preferably the sequence is repeated from 10 to 40 times.
- the resulting nucleic acids are a family of double-stranded polynucleotides of from about 50 bp to about 100 kb, preferably from 500 bp to 50 kb.
- the population represents variants of the starting substrates showing substantial sequence identity thereto but also diverging at several positions.
- the population has many more members than the starting substrates.
- the population of fragments resulting from shuffling is used to transform host cells, optionally after cloning into a vector.
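The outcome of fragmentation and reassembly, a population of chimeras that resemble the parents but combine their differences, can be illustrated with a deliberately simplified simulation. It assumes pre-aligned parents of equal length and models each chimera as a series of consecutive fragments, each drawn from a randomly chosen parent at the same coordinates; annealing kinetics, polymerase errors and out-of-register priming are ignored. All names and parameters are illustrative.

```python
import random

def shuffle_once(parents, min_frag, max_frag, rng):
    """Build one chimera from position-matched fragments of random parents."""
    length = len(parents[0])
    pieces = []
    pos = 0
    while pos < length:
        size = rng.randint(min_frag, max_frag)
        donor = rng.choice(parents)
        pieces.append(donor[pos:pos + size])  # slicing clips at the 3' end
        pos += size
    return "".join(pieces)

def shuffle_library(parents, n, min_frag, max_frag, seed=0):
    """Generate `n` chimeras from pre-aligned, equal-length parents."""
    if len({len(p) for p in parents}) != 1:
        raise ValueError("parents must be pre-aligned to equal length")
    rng = random.Random(seed)
    return [shuffle_once(parents, min_frag, max_frag, rng) for _ in range(n)]

if __name__ == "__main__":
    parents = ["AAAAAAAAAAAAAAAAAAAA", "CCCCCCCCCCCCCCCCCCCC"]
    for chimera in shuffle_library(parents, n=5, min_frag=3, max_frag=6):
        print(chimera)
```

Even with two parents, the number of possible chimeras far exceeds the number of starting substrates, consistent with the statement that the shuffled population has many more members than the starting materials.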
- subsequences of recombination substrates can be generated by amplifying the full-length sequences under conditions which produce a substantial fraction, typically at least 20 percent or more, of incompletely extended amplification products.
- Another embodiment uses random primers to prime the entire template DNA to generate less than full length amplification products.
- the amplification products, including the incompletely extended amplification products are denatured and subjected to at least one additional cycle of reannealing and amplification.
- This variation, in which at least one cycle of reannealing and amplification provides a substantial fraction of incompletely extended products, is termed "stuttering."
- the partially extended (less than full length) products reanneal to and prime extension on different sequence-related template species.
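Incomplete extension followed by repriming on a different template can be sketched as a growing strand that is extended a few nucleotides per cycle on a randomly chosen template. This is a toy model, assuming pre-aligned templates of equal length and in-register priming; `max_step`, the uniform step length and the function name are illustrative assumptions.

```python
import random

def stutter_extend(templates, cycles, max_step, rng):
    """Grow one strand by short extensions, switching template each cycle."""
    full_length = len(templates[0])
    strand = ""
    for _ in range(cycles):
        if len(strand) >= full_length:
            break
        template = rng.choice(templates)  # the partial product reanneals
        step = rng.randint(1, max_step)   # abbreviated extension
        strand += template[len(strand):len(strand) + step]
    return strand

if __name__ == "__main__":
    rng = random.Random(0)
    templates = ["AAAAAAAAAAAA", "CCCCCCCCCCCC"]
    for _ in range(4):
        print(stutter_extend(templates, cycles=12, max_step=4, rng=rng))
```

Because every cycle adds at least one nucleotide, running as many cycles as the template length guarantees a full-length product in this model; fewer cycles leave a population of partially extended strands.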
- the conversion of substrates to fragments can be effected by partial PCR amplification of substrates.
- a mixture of fragments is spiked with one or more oligonucleotides.
- the oligonucleotides can be designed to include precharacterized mutations of a wildtype sequence, or sites of natural variations between individuals or species.
- the oligonucleotides also include sufficient sequence or structural homology flanking such mutations or variations to allow annealing with the wildtype fragments. Annealing temperatures can be adjusted depending on the length of homology.
- recombination occurs in at least one cycle by template switching, such as when a DNA fragment derived from one template primes on the homologous position of a related but different template.
- Template switching can be induced by addition of recA (see, Kiianitsa (1997) supra), rad51 (see, Namsaraev, Mol. Cell. Biol. 17:5359-5368 (1997)), rad55 (see, Clever, EMBO J. 16:2535-2544 (1997)), rad57 (see, Sung, Genes Dev. 11:1111-1121 (1997)) or other polymerases (e.g., viral polymerases, reverse transcriptase) to the amplification mixture.
- Template switching can also be increased by increasing the DNA template concentration.
- Another embodiment utilizes at least one cycle of amplification, which can be conducted using a collection of overlapping single-stranded DNA fragments of related sequence, and different lengths. Fragments can be prepared using a single stranded DNA phage, such as M13 (see, Wang, Biochemistry 36:9486-9492 (1997)). Each fragment can hybridize to and prime polynucleotide chain extension of a second fragment from the collection, thus forming sequence-recombined polynucleotides.
- ssDNA fragments of variable length can be generated from a single primer by Pfu, Taq, Vent, Deep Vent, UlTma DNA polymerase or other DNA polymerases on a first DNA template (see, Cline, Nucleic Acids Res. 24:3546-3551 (1996)).
- the single stranded DNA fragments are used as primers for a second, Kunkel-type template, consisting of a uracil-containing circular ssDNA. This results in multiple substitutions of the first template into the second. See, Levichkin, Mol. Biology 29:572-577 (1995); Jung, Gene 121:17-24 (1992).
- shuffled nucleic acids obtained by use of the recursive recombination methods of the invention are put into a cell and/or organism for screening.
- Shuffled dioxygenase genes can be introduced into, for example, bacterial cells (including cyanobacteria), yeast cells, fungal cells, vertebrate cells, invertebrate cells or plant cells for initial screening.
- Bacterial species such as E. coli, Pseudomonas sp., Bacillus subtilis, Burkholderia cepacia, Alcaligenes, Acinetobacter, Rhodococcus, Arthrobacter, and Sphingomonas are preferred examples of suitable bacterial cells into which one can insert and express shuffled dioxygenase genes which provide for convenient shuttling to other cell types (a variety of vectors for shuttling material between these bacterial cells and eukaryotic cells are available; see, Sambrook, Ausubel and Berger, all supra).
- the shuffled genes can be introduced into bacterial, fungal or yeast cells either by integration into the chromosomal DNA or as plasmids.
- shuffled genes can also be introduced into plant cells for production purposes (it will be appreciated that transgenic plants are, increasingly, an important source of industrial enzymes).
- a transgene of interest can be modified using the recursive sequence recombination methods of the invention in vitro and reinserted into the cell for in vivo/in situ selection for the new or improved dioxygenase property, in bacteria, eukaryotic cells, or whole eukaryotic organisms.
- DNA substrate molecules are introduced into cells, wherein the cellular machinery directs their recombination.
- a library of mutants is constructed and screened or selected for mutants with improved phenotypes by any of the techniques described herein.
- the DNA substrate molecules encoding the best candidates are recovered by any of the techniques described herein, then fragmented and used to transfect a plant host and screened or selected for improved function. If further improvement is desired, the DNA substrate molecules are recovered from the host cell, such as by PCR, and the process is repeated until a desired level of improvement is obtained.
- the fragments are denatured and reannealed prior to transfection, coated with recombination stimulating proteins such as recA, or co-transfected with a selectable marker such as Neo to allow the positive selection for cells receiving recombined versions of the gene of interest.
- the efficiency of in vivo shuffling can be enhanced by increasing the copy number of a gene of interest in the host cells.
- the majority of bacterial cells in stationary phase cultures grown in rich media contain two, four or eight genomes. In minimal medium the cells contain one or two genomes.
- the number of genomes per bacterial cell thus depends on the growth rate of the cell as it enters stationary phase. This is because rapidly growing cells contain multiple replication forks, resulting in several genomes in the cells after termination.
- the number of genomes is strain dependent, although all strains tested have more than one chromosome in stationary phase.
- the number of genomes in stationary phase cells decreases with time. This appears to be due to fragmentation and degradation of entire chromosomes, similar to apoptosis in mammalian cells.
- This fragmentation of genomes in cells containing multiple genome copies results in massive recombination and mutagenesis.
- the presence of multiple genome copies in such cells results in a higher frequency of homologous recombination in these cells, both between copies of a gene in different genomes within the cell, and between a genome within the cell and a transfected fragment.
- the increased frequency of recombination allows one to evolve a gene more quickly to acquire optimized characteristics.
- the existence of multiple genomic copies in a cell type would usually not be advantageous due to the greater nutritional requirements needed to maintain this copy number.
- artificial conditions can be devised to select for high copy number.
- Modified cells having recombinant genomes are grown in rich media (in which conditions, multicopy number should not be a disadvantage) and exposed to a mutagen, such as ultraviolet or gamma irradiation or a chemical mutagen, e.g., mitomycin, nitrous acid, photoactivated psoralens, alone or in combination, which induces DNA breaks amenable to repair by recombination.
- individual cells can be sorted using a cell sorter for those cells containing more DNA, e.g., using DNA specific fluorescent compounds or sorting for increased size using light dispersion. Some or all of the collection of cells surviving selection are tested for the presence of a gene that is optimized for the desired property.
- phage libraries are made and recombined in mutator strains such as cells with mutant or impaired gene products of mutS, mutT, mutH, mutL, uvrD, dcm, vsr, umuC, umuD, sbcB, recJ, etc.
- the impairment is achieved by genetic mutation, allelic replacement, selective inhibition by an added reagent such as a small compound or an expressed antisense RNA, or other techniques.
- High multiplicity of infection (MOI) libraries are used to infect the cells to increase recombination frequency. Additional strategies for making phage libraries and/or for recombining DNA from donor and recipient cells are set forth in U.S. Pat. No. 5,521,077. Additional recombination strategies for recombining plasmids in yeast are set forth in WO 97/07205.
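Why a high MOI raises recombination frequency can be illustrated with a small calculation. The sketch below assumes that the number of phage entering each cell follows a Poisson distribution with mean equal to the MOI; this model and the function name `fraction_coinfected` are illustrative assumptions and are not taken from the source. Only cells that receive two or more phage offer an opportunity for phage-phage recombination.

```python
import math


def fraction_coinfected(moi: float) -> float:
    """Fraction of cells receiving two or more phage.

    Assumes Poisson-distributed infection with mean `moi` (an assumption
    made for illustration, not a statement from the source).
    """
    if moi < 0:
        raise ValueError("MOI must be non-negative")
    p_zero = math.exp(-moi)       # cells with no phage
    p_one = moi * math.exp(-moi)  # cells with exactly one phage
    return 1.0 - p_zero - p_one


if __name__ == "__main__":
    for moi in (0.1, 1.0, 5.0):
        print(f"MOI {moi}: fraction co-infected = {fraction_coinfected(moi):.3f}")
```

Under this assumed model, the co-infected fraction rises steeply with MOI, which is consistent with the use of high-MOI libraries described above.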
- the selection methods herein are utilized in a "whole genome shuffling" format.
- An extensive guide to the many forms of whole genome shuffling is found in the pioneering application to the inventors and their co-workers entitled “Evolution of Whole Cells and Organisms by Recursive Sequence Recombination," Attorney Docket No. 018097-020720US filed July 15, 1998 by del Cardayre et al. (USSN 09/161,188).
- whole genome shuffling makes no presuppositions at all regarding what nucleic acids may confer a desired property. Instead, entire genomes (e.g., from a genomic library, or isolated from an organism) are shuffled in cells and selection protocols applied to the cells.
- the methods herein allow dioxygenase biocatalysts to be improved at a faster pace than conventional methods.
- Whole genome shuffling can at least double the rate of strain improvement for microorganisms used in fermentation as compared to traditional methods. This provides for a relative decrease in the cost of fermentation processes. New products can enter the market sooner, producers can increase profits as well as market share, and consumers gain access to more products of higher quality and at lower prices. Further, increased efficiency of production processes translates to less waste production and more frugal use of resources.
- Whole genome shuffling provides a means of accumulating multiple useful mutations per cycle and thus eliminates the inherent limitation of current strain improvement programs (SIPs).
- Nucleic acid shuffling provides recursive mutagenesis, recombination, and selection of DNA sequences.
- a key difference between nucleic acid shuffling-mediated recombination and natural sexual recombination is that nucleic acid shuffling effects both the pairwise (two parents) and the poolwise (multiple parents) recombination of parent molecules.
- Natural recombination is more conservative and is limited to pairwise recombination. In nature, pairwise recombination provides stability within a population by preventing large leaps in sequences or genomic structure that can result from poolwise recombination.
- poolwise recombination is appealing since the beneficial mutations of multiple parents can be combined during a single cross to produce a superior offspring.
- Poolwise recombination is analogous to the crossbreeding of inbred strains in classic strain improvement, except that the crosses occur between many strains at once.
- poolwise recombination is a sequence of events that effects the recombination of a population of nucleic acid sequences and results in the generation of new nucleic acids that contain genetic information from more than two of the original nucleic acids.
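The distinction between pairwise and poolwise recombination can be made concrete with a toy string model. The sketch below is illustrative only: the function names, the random choice of crossover points, and the rule that each segment uses a different template from the previous one are all assumptions, not the procedure of the source. A child counts as poolwise when its segments derive from more than two parents.

```python
import random


def poolwise_recombine(parents, n_crossovers, rng):
    """Build one chimera from pre-aligned, equal-length parent strings.

    Returns the child and the list of parent indices used per segment.
    """
    if len(parents) < 2:
        raise ValueError("need at least two parents")
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("parents must be pre-aligned to equal length")
    # Random crossover points split the sequence into n_crossovers + 1 segments.
    points = sorted(rng.sample(range(1, length), n_crossovers))
    bounds = [0] + points + [length]
    segments, sources = [], []
    current = None
    for start, end in zip(bounds, bounds[1:]):
        # Switch to a template different from the one used for the previous segment.
        current = rng.choice([i for i in range(len(parents)) if i != current])
        segments.append(parents[current][start:end])
        sources.append(current)
    return "".join(segments), sources


def is_poolwise(sources):
    """True when the child carries information from more than two parents."""
    return len(set(sources)) > 2


if __name__ == "__main__":
    parents = ["A" * 12, "C" * 12, "G" * 12, "T" * 12]
    child, sources = poolwise_recombine(parents, 3, random.Random(0))
    print(child, sources, "poolwise" if is_poolwise(sources) else "pairwise")
```

With only two parents every child is necessarily pairwise; with a pool of four, a single cross can combine segments from three or four parents.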
- Bacteria have no known sexual cycle per se, but there are natural mechanisms by which the genomes of these organisms undergo recombination. These mechanisms include natural competence, phage-mediated transduction, and cell-cell conjugation.
- Bacteria that are naturally competent are capable of efficiently taking up naked DNA from the environment. If homologous, this DNA undergoes recombination with the genome of the cell, resulting in genetic exchange.
- Bacillus subtilis, the primary production organism of the enzyme industry, is known for the efficiency with which it carries out this process.
- a bacteriophage mediates genetic exchange. A transducing phage will often package headfuls of the host genome. These phage can infect a new host and deliver a fragment of the former host genome, which is frequently integrated via homologous recombination. Cells can also transfer DNA between themselves by conjugation.
- Cells containing the appropriate mating factors transfer episomes as well as entire chromosomes to an appropriate acceptor cell, where they can recombine with the acceptor genome.
- Conjugation resembles sexual recombination for microbes and can be intraspecific, interspecific, and intergeneric.
- an efficient means of transforming Streptomyces sp., a genus responsible for producing many commercial antibiotics, is by the conjugal transfer of plasmids from Escherichia coli.
- knowledge of competence, transducing phage, or fertility factors is lacking.
- Protoplast fusion has been developed as a versatile and general alternative to these natural methods of recombination.
- Protoplasts are prepared by removing the cell wall by treating cells with lytic enzymes in the presence of osmotic stabilizers. In the presence of a fusogenic agent, such as polyethylene glycol (PEG), protoplasts are induced to fuse and form transient hybrids or "fusants.” During this hybrid state, genetic recombination occurs at high frequency allowing the genomes to reassort. The final step is the successful segregation and regeneration of viable cells from the fused protoplasts.
- Protoplast fusion can be intraspecific, interspecific, and intergeneric and has been applied to both prokaryotes and eukaryotes. In addition, it is possible to fuse more than two cells, thus providing a mechanism for effecting poolwise recombination. While no fertility factors, transducing phages or competency development is needed for protoplast fusion, a method for the formation, fusing, and regeneration of protoplasts is typically optimized for each organism.
- ADO for clarity of illustration. Those of skill in the art will recognize that this discussion is illustrative of DOs in general, and is not limited to ADOs. Because all known ADOs are multicomponent enzymes having from 2 to 4 functionally different subunits, any of the sequence components comprising a particular ADO or a family of homologous ADOs can be shuffled.
- ADOs with changes in specificity, regioselectivity and mode of action (e.g., dioxygenase or monooxygenase activity)
- where the terminal oxygenase is made of two polypeptides, the large subunit is the preferred target sequence for shuffling.
- shuffling of the polynucleotide sequences encoding other functional polypeptides is a preferred embodiment.
- These sequences can be shuffled individually, in sub-sets or as a part of a gene cluster which encodes all of the ADO polypeptides. This allows for changes in both the coding polynucleotide sequences, and also for generating combinations of functional chimeric ADOs with various functions assembled from two or more parental sequences (family gene cluster shuffling).
- one or more of the more than 50 members of this superfamily is selected, aligned with similar homologous sequences, shuffled against these homologous sequences and screened.
- DNA from clones with improved activity can be shuffled together in subsequent rounds of shuffling and screened for further improvement.
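The recursive cycle of shuffling improved clones together and rescreening can be sketched as a simple loop. Everything in this toy model is a stand-in: `shuffle_pool` mimics shuffling by sampling each position from a random pooled parent, and the `score` function (matches to a hypothetical ideal string) stands in for a real activity assay. None of these names or parameters come from the source.

```python
import random


def shuffle_pool(pool, rng, library_size):
    """Crude stand-in for DNA shuffling: each position is drawn from a random pooled parent."""
    length = len(pool[0])
    return [
        "".join(rng.choice(pool)[i] for i in range(length))
        for _ in range(library_size)
    ]


def recursive_improvement(parents, score, target_score, rng,
                          library_size=200, keep=5, max_rounds=20):
    """Repeat shuffle-and-screen rounds until the target score or the round limit is reached."""
    pool = list(parents)
    best = max(pool, key=score)
    rounds = 0
    while score(best) < target_score and rounds < max_rounds:
        library = shuffle_pool(pool, rng, library_size)
        # Current pool members are retained so the best score never regresses.
        ranked = sorted(library + pool, key=score, reverse=True)
        pool = ranked[:keep]
        best = pool[0]
        rounds += 1
    return best, rounds


# Hypothetical "ideal" sequence; the score is a placeholder for an activity screen.
IDEAL = "ACGTACGTAC"


def toy_score(seq):
    return sum(a == b for a, b in zip(seq, IDEAL))


if __name__ == "__main__":
    parents = ["ACGTTTTTTT", "TTTTACGTTT", "TTTTTTTTAC"]
    best, rounds = recursive_improvement(parents, toy_score, len(IDEAL), random.Random(0))
    print(best, toy_score(best), rounds)
```

The design choice worth noting is that the best clones of each round become the parents of the next, so beneficial differences from separate parents can accumulate in one sequence over successive rounds.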
- a first nucleic acid sequence encoding a first polypeptide sequence is selected.
- a plurality of codon altered nucleic acid sequences, each of which encodes the first polypeptide or a modified or related polypeptide, is then selected (e.g., a library of codon altered nucleic acids can be selected in a biological assay which recognizes library components or activities), and the plurality of codon-altered nucleic acid sequences is recombined to produce a target codon altered nucleic acid encoding a second protein.
- the target codon altered nucleic acid is then screened for a detectable functional or structural property, optionally including comparison to the properties of the first polypeptide and/or related polypeptides.
- a nucleic acid encoding such a polypeptide can be used in essentially any procedure desired, including introducing the target codon altered nucleic acid into a cell, vector, virus, attenuated virus (e.g., as a component of a vaccine or immunogenic composition), transgenic organism, or the like.
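A codon-altered nucleic acid encodes the same polypeptide through different synonymous codons. The sketch below illustrates that idea with a deliberately partial codon table covering only the amino acids of a hypothetical demo peptide; the peptide, the function names, and the uniform random codon choice are assumptions for illustration.

```python
import random

# Partial standard genetic code: only the amino acids used in the demo peptide.
SYNONYMOUS_CODONS = {
    "M": ["ATG"],
    "K": ["AAA", "AAG"],
    "F": ["TTT", "TTC"],
    "G": ["GGT", "GGC", "GGA", "GGG"],
    "D": ["GAT", "GAC"],
}
CODON_TO_AA = {
    codon: aa for aa, codons in SYNONYMOUS_CODONS.items() for codon in codons
}


def translate(dna):
    """Translate a DNA string using the partial table above."""
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def codon_altered_variant(peptide, rng):
    """Return one DNA sequence encoding `peptide` with randomly chosen synonymous codons."""
    return "".join(rng.choice(SYNONYMOUS_CODONS[aa]) for aa in peptide)


if __name__ == "__main__":
    rng = random.Random(0)
    peptide = "MKFGD"  # hypothetical demo peptide
    library = {codon_altered_variant(peptide, rng) for _ in range(50)}
    # Every library member differs at the DNA level but encodes the same peptide.
    assert all(translate(dna) == peptide for dna in library)
    print(len(library), "distinct codon-altered sequences")
```

Such a set of sequences could then serve as the input to any of the recombination formats described here, since the members share protein sequence while differing in nucleotide sequence.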
- in silico shuffling utilizes computer algorithms to perform “virtual” shuffling using genetic operators in a computer.
- gene sequence strings are recombined in a computer system and desirable products are made, e.g., by reassembly PCR of synthetic oligonucleotides.
- in silico shuffling is described in detail in Selifonov and Stemmer, "METHODS FOR MAKING CHARACTER STRINGS, POLYNUCLEOTIDES & POLYPEPTIDES HAVING DESIRED CHARACTERISTICS," filed February 5, 1999, USSN 60/118854.
- genetic operators are algorithms which represent given genetic events such as point mutations, recombination of two strands of homologous nucleic acids, etc.
- genetic operators are used to model recombinational or mutational events which can occur in one or more nucleic acids, e.g., by aligning nucleic acid sequence strings (using standard alignment software, or by manual inspection and alignment) and predicting recombinational outcomes.
- the predicted recombinational outcomes are used to produce corresponding molecules, e.g., by oligonucleotide synthesis and reassembly PCR.
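Two genetic operators of the kind described above, a point mutation and a recombination of two aligned strings, can be sketched on character strings. This is a minimal illustration under stated assumptions: the sequences are already aligned to equal length, crossovers are allowed only immediately after a short run of identical characters (`min_identity`, a hypothetical parameter standing in for a homology requirement), and the example sequences are invented.

```python
import random


def point_mutation(seq, rng, alphabet="ACGT"):
    """Point-mutation operator: replace one randomly chosen character with a different one."""
    pos = rng.randrange(len(seq))
    new_base = rng.choice([b for b in alphabet if b != seq[pos]])
    return seq[:pos] + new_base + seq[pos + 1:]


def crossover(a, b, rng, min_identity=3):
    """Recombination operator for two pre-aligned, equal-length strings.

    A crossover is only placed directly after `min_identity` identical
    characters; if no such site exists, `a` is returned unchanged.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    sites = [
        i for i in range(min_identity, len(a))
        if a[i - min_identity:i] == b[i - min_identity:i]
    ]
    if not sites:
        return a
    cut = rng.choice(sites)
    return a[:cut] + b[cut:]


if __name__ == "__main__":
    rng = random.Random(0)
    parent_a = "ATGGCTAAAGGTCTGACC"  # invented example strings
    parent_b = "ATGGCAAAAGGTTTGACG"
    child = crossover(parent_a, parent_b, rng)
    print(child)
    print(point_mutation(child, rng))
```

The predicted strings would then be turned into physical molecules, for example by oligonucleotide synthesis and reassembly PCR as the text describes; that step is outside the scope of this sketch.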
- oligonucleotide mediated shuffling, in which oligonucleotides corresponding to a family of related homologous nucleic acids (e.g., as applied to the present invention, interspecific or allelic variants of a dioxygenase nucleic acid) are recombined to produce selectable nucleic acids.
- This format is described in detail in Crameri et al., "OLIGONUCLEOTIDE MEDIATED NUCLEIC ACID RECOMBINATION," filed February 5, 1999, USSN 60/118,813, and Crameri et al., "OLIGONUCLEOTIDE MEDIATED NUCLEIC ACID RECOMBINATION," filed June 24, 1999, USSN 60/141,049.
- the technique can be used to recombine homologous or even non-homologous nucleic acid sequences.
- One advantage of oligonucleotide-mediated recombination is the ability to recombine homologous nucleic acids with low sequence similarity, or even non-homologous nucleic acids.
- in these low-homology oligonucleotide shuffling methods, one or more sets of fragmented nucleic acids are recombined, e.g., with a set of crossover family diversity oligonucleotides.
- Each of these crossover oligonucleotides has a plurality of sequence diversity domains corresponding to a plurality of sequence diversity domains from homologous or non-homologous nucleic acids with low sequence similarity.
- the fragmented oligonucleotides, which are derived by comparison to one or more homologous or non-homologous nucleic acids, can hybridize to one or more regions of the crossover oligos, facilitating recombination.
- sets of overlapping family gene oligonucleotides (which are derived by comparison of homologous nucleic acids and synthesis of oligonucleotide fragments) are hybridized and elongated (e.g., by reassembly PCR), providing a population of recombined nucleic acids, which can be selected for a desired trait or property.
- the set of overlapping family gene oligonucleotides includes a plurality of oligonucleotide member types which have consensus region subsequences derived from a plurality of homologous target nucleic acids.
- family gene shuffling oligonucleotides are provided by aligning homologous nucleic acid sequences to select conserved regions of sequence identity and regions of sequence diversity.
- a plurality of family gene shuffling oligonucleotides are synthesized (serially or in parallel) which correspond to at least one region of sequence diversity.
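The alignment step that separates conserved regions from regions of sequence diversity can be sketched as follows. The example assumes the homologous sequences are already aligned to equal length without gaps; the function names, the `flank` parameter, and the three short sequences are hypothetical and chosen only to show the idea of designing one oligonucleotide per variant of each diverse region.

```python
def classify_columns(aligned):
    """Return one flag per alignment column: True if all sequences agree there."""
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("sequences must be aligned to equal length")
    return [len({s[i] for s in aligned}) == 1 for i in range(length)]


def regions(flags):
    """Collapse per-column flags into (start, end, is_conserved) runs, end exclusive."""
    runs = []
    start = 0
    for i in range(1, len(flags) + 1):
        if i == len(flags) or flags[i] != flags[start]:
            runs.append((start, i, flags[start]))
            start = i
    return runs


def diversity_oligos(aligned, flank=3):
    """One oligo per variant of each diverse region, padded with flanking sequence."""
    oligos = set()
    for start, end, conserved in regions(classify_columns(aligned)):
        if not conserved:
            lo, hi = max(0, start - flank), min(len(aligned[0]), end + flank)
            oligos.update(s[lo:hi] for s in aligned)
    return oligos


if __name__ == "__main__":
    family = ["ATGGCTAAAGGT", "ATGGCAAAAGGT", "ATGGCGAAAGGT"]  # invented example
    print(regions(classify_columns(family)))
    print(sorted(diversity_oligos(family)))
```

The conserved flanks give each oligonucleotide a region where it can hybridize to fragments from any family member, while the central positions carry the diversity.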
- Sets of fragments, or subsets of fragments, used in oligonucleotide shuffling approaches can be provided by cleaving one or more homologous nucleic acids (e.g., with a DNase), or, more commonly, by synthesizing a set of oligonucleotides corresponding to a plurality of regions of at least one nucleic acid (typically oligonucleotides corresponding to a full-length nucleic acid are provided as members of a set of nucleic acid fragments).
- these cleavage fragments can be used in conjunction with family gene shuffling oligonucleotides, e.g., in one or more recombination reaction to produce recombinant dioxygenase nucleic acids.
- polynucleotides encoding chimeric polypeptides can be used as substrates for shuffling in any of the above-described shuffling formats.
- Preferred chimeras have a shuffled active site or a shuffled active site region.
- Art-recognized methods for preparing chimeras are applicable to the methods described herein (see, for example, Shimoji et al, Biochemistry 37: 8848-8852 (1998)).
- the invention provides a method for obtaining a polynucleotide encoding an improved dioxygenase polypeptide acting on an organic substrate.
- Presently preferred substrates include a target group selected from classes of substrates shown in Figures 14A-14R.
- the improved polypeptide exhibits one or more improved properties, compared to a naturally occurring polypeptide acting on the substrate(s).
- the method involves: (a) creating a library of recombinant polynucleotides encoding a dioxygenase polypeptide acting on the substrate; and (b) screening the library to identify a recombinant polynucleotide encoding an improved polypeptide that exhibits one or more improved properties compared to a naturally occurring dioxygenase polypeptide.
- the library of recombinant polynucleotides is created by recombining at least a first form and a second form of a nucleic acid. At least one of these forms encodes the naturally occurring polypeptide or a fragment thereof.
- the first form and the second form differ from each other in two or more nucleotides.
- the first and second forms of the nucleic acid are homologous.
- the present invention also provides the polypeptides encoded by these polynucleotides and methods of using these peptides for synthesizing valuable organic compounds. Some of these polypeptides and methods of using them are set forth below.
- Shuffling approaches, such as shuffling a family of genes, apply to enhancing performance of dioxygenase polypeptides useful in each of the following classes of industrial chemical transformation.
- Other dioxygenase enzyme classes are also useful in practicing the present invention.
- other polypeptides accessible through the present invention, and methods of using these polypeptides, will be apparent to those of skill in the art.
- the present invention provides improved polypeptides that can mediate the oxidation of π-bonds to vicinal diols.
- host organisms expressing such improved polypeptides and methods of using these polypeptides and organisms in synthetic processes.
- the enzymes known to oxidize π-bonds to the corresponding vicinal diols are the bacterial arene dioxygenases (ADOs). In the presence of oxygen, and of a reducing compound such as NAD(P)H, these enzymes catalyze the reductive dioxygenation of compounds as diverse as aromatic rings and non-aromatic multiple bonds.
- Arene dioxygenases such as toluene 2,3-dioxygenase, isopropylbenzene 2,3-dioxygenase, benzene-1,2-dioxygenase, biphenyl-2,3-dioxygenase, naphthalene-1,2-dioxygenase, and many homologous and/or functionally similar enzymes, constitute members of a class of enzymes useful in the manufacture of vicinal diols in a highly regioselective fashion. Moreover, the action of this class of enzymes on aromatic substrates does not involve formation of reactive arene epoxides or phenols.
- While potentially interesting from an academic standpoint, these enzymes have not been generally utilized due to several shortcomings. For example, these enzymes do not exhibit sufficient turnover numbers, nor are they known to provide satisfactory regioselectivity for dihydroxylation of π-bonds in substrates having more than one π-bond, or with more than one type of π-bond (e.g., styrene).
- Arene dioxygenases of various specificity, regioselectivity and enantiospecificity are capable of forming vicinal diols from a large array of substituted aromatic compounds and non-aromatic alkenes.
- toluene dioxygenase has recently been implicated in dihydroxylation of several non-aromatic alkenes with concomitant formation of the glycol compounds of known and unknown stereochemistry (Lange and Wackett, J. Bacteriol., 179(12):3858-3865 (1997)).
- naphthalene dioxygenase has been shown to catalyze dihydroxylation of styrene to (R)-1-phenyl-1,2-ethanediol with enantiomeric excess of about 79% (Lee et al., Appl. Environ. Microbiol., 62(9):3101-3106 (1996); Lee et al., J. Bacteriol., 178(11):3353-3356 (1996)).
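To make the enantiomeric excess figure concrete: by the standard definition, ee is the difference between the major and minor enantiomer fractions, expressed as a percentage of their sum. An ee of about 79% therefore corresponds to roughly 89.5% of the (R)-diol and 10.5% of the (S)-diol. The short sketch below performs this conversion; the function names are illustrative and are not taken from the source.

```python
def enantiomer_fractions(ee_percent):
    """Return (major, minor) mole fractions for an enantiomeric excess given in percent."""
    if not 0 <= ee_percent <= 100:
        raise ValueError("ee must be between 0 and 100 percent")
    major = (100 + ee_percent) / 200
    return major, 1 - major


def enantiomeric_excess(major, minor):
    """Return ee in percent from the amounts of the major and minor enantiomers."""
    return 100 * (major - minor) / (major + minor)


if __name__ == "__main__":
    major, minor = enantiomer_fractions(79)
    print(f"major: {major:.3f}, minor: {minor:.3f}")
```

A screen for improved enantioselectivity would compare this quantity between a shuffled variant and the parental enzyme.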
- the present invention also provides improved polypeptides capable of oxidizing exocyclic and acyclic π-bonds for producing oxygen-containing species.
- the following discussion focuses on the oxidation of olefins. This focus is intended to be illustrative and not limiting of the scope of the invention. Many other appropriate substrates for oxidation using the methods of the invention will be apparent to those of skill in the art.
- An exemplary oxidation of an olefin to the corresponding vicinal diol uses a dioxygenase to obtain the glycol directly from the olefin. This is best accomplished by recruiting a dioxygenase, such as an arene dioxygenase.
- Arene dioxygenases are multi-component enzymes for which substrate specificity is primarily determined by the non-heme iron-sulfur cluster containing terminal oxidase protein(s), and, in the cases where terminal oxidase is comprised of two proteins, by the large subunit of the terminal oxidase.
- Other proteins such as ferredoxins and ferredoxin reductases provide transfer of electrons from reducing equivalents such as NAD(P)H to the terminal oxidase.
- nucleic acids that encode the terminal oxidase component(s) are the preferred substrate for the recombination and selection methods of the invention.
- Upon producing a library of recombinant polynucleotides as described herein one can then select to identify those polynucleotides that encode an enzyme that has the desired change in substrate specificity.
- Such changes in the substrate specificity can include, for example, a gain of turnover of a novel substrate, a change in regioselectivity of oxidation, and a change in chirality of product formed. If the goal is primarily to obtain enzymes that have an increased catalytic turnover with an already acceptable substrate, shuffling of the nucleic acids encoding all of the components of arene dioxygenase is preferred.
- arene dioxygenase genes that can be used as substrates for the recombination and selection methods of the invention are described in the art.
- Suitable arene dioxygenase-encoding polynucleotides can be obtained from many organisms using cloning methods known to one skilled in the art.
- the following list provides examples of polynucleotides that encode arene dioxygenases and are suitable for use in the methods of the invention.
- the loci are identified by GenBank ID and encode complete or partial protein components of the arene dioxygenases.
- Suitable loci include: [PSETODC1C] toluene-1,2-dioxygenase; [AF006691], [PJU53507], [PSECUMA], [REU24277] isopropylbenzene-2,3-dioxygenase; [E04215], [PSEBDO] benzene-1,2-dioxygenase; [U78099] tetrachlorobenzene dioxygenase;
- nucleic acids encoding arene dioxygenases are preferably selected from the following non-limiting set of genes and organisms: naphthalene 1,2-dioxygenase, 2,4-dinitro-toluene-4,5-dioxygenase, 2-nitrotoluene 2,3-dioxygenase, toluene-2,3-dioxygenase, isopropylbenzene-2,3-dioxygenase, benzene-1,2-dioxygenase and biphenyl-2,3-dioxygenase, chlorobenzene and tetrachlorobenzene dioxygenases.
- homologous arene dioxygenase genes can be found in many microorganisms which one skilled in the art can isolate from various sources including, for example, soil, sediment, air, and aqueous samples by enrichment culture techniques in mineral media using aromatic compounds such as alkyl and halogen-substituted benzenes, biphenyls, indans, naphthalenes and tetralins as carbon sources.
- the present invention also provides improved polypeptides capable of oxidizing aromatic π-bonds for producing oxygen-containing aromatic species.
- substantially any oxidized aromatic or heteroaromatic species can be obtained using the polypeptides of the invention. In a presently preferred embodiment, the species include hydroxylated aromatic carboxylic acids and hydroxy alkyl arenes.
- Hydroxylated aromatic compounds such as hydroxylated aromatic carboxylic acids, and alkyl hydroxyarenes (e.g., di- and tri-methyl phenols) are an important group of industrial chemicals.
- the methylphenols find use in the industrial synthesis of vitamin E, and, also in the synthesis of various polymers and resins, where they can be used individually or as part of more complex compositions that include other phenolic and non-phenolic compounds.
- dimethylphenols and trimethylphenols are generated by successive methylation of phenol and cresols.
- current synthetic chemical methods based on methylation of phenol do not offer sufficiently high selectivity for preparing isomeric dimethyl- and trimethyl-phenols individually.
- Hydroxylated aromatic carboxylic acids (HCAs) and their esters and lactones are useful components of various polymers and co-polymers, such as polyesters.
- Their utility stems largely from their bifunctional reactive nature (e.g., hydroxyl and carboxyl groups) and the hydrophobic aromatic ring, which often imparts desirable physical and chemical properties to the polymers (p-hydroxybenzoate, m-hydroxybenzoate).
- HCAs are also used in, for example, anti-microbial additives to pharmaceuticals (esters of para-hydroxybenzoic acid, parabens) and to fragrances (esters of salicylic acid, coumarins and 3,4-dihydrocoumarin). While many chemical synthetic methods for HCAs and their derivatives are known in the art, these compounds are typically manufactured from a non-oxidized aromatic precursor by multistep processes that require both harsh conditions and extensive purification of the product from by-products arising from non-selective reactions.
- the present invention solves many of these problems by providing a dioxygenase polypeptide that oxidizes one or more aromatic π-bonds to the corresponding diol. The diols can, if desired, be subsequently dehydrated to restore the aromatic system and yield a hydroxylated aromatic ring.
- In conjunction with the oxidative pathways utilizing polypeptides having dioxygenase activity, as discussed above, the present invention provides accessory non-dioxygenase polypeptides.
- "accessory polypeptides" refers to those polypeptides that do not carry out the initial dioxidation step in the methods of the invention.
- Exemplary accessory polypeptides include ligases, transferases, dehydrogenases, and the like. Although both shuffled and non-shuffled accessory polypeptides can be used, preferred accessory polypeptides are those that have been shuffled.
- the non-dioxygenase polypeptides can be used at any step of a pathway using a dioxygenase of the invention.
- the accessory polypeptides are used to further transform an oxidation product.
- oxidized substrates that are produced by a dioxygenase of the invention, those of skill will appreciate that these routes can be practiced with analogous substrates that are, for example chemically synthesized, commercially available, etc.
- the present invention provides methods using both the improved accessory peptides and unimproved accessory peptides to further elaborate the dioxygenase-mediated reaction product.
- the method involves contacting the product of the dioxygenase-mediated reaction with one or more of the accessory polypeptides.
- the product is contacted with an organism that expresses the accessory polypeptide(s).
- when the accessory polypeptides are improved polypeptides, they will generally be produced by the methods described herein.
- the improved dioxygenase and the accessory polypeptide(s) can be expressed by the same host cell, or they can be expressed by different host cells.
- the accessory polypeptide and the improved dioxygenase are expressed by the same host cell.
- the present invention makes possible the synthesis of a great variety of industrially valuable compounds via the methods disclosed herein.
- an alcohol or diol is converted to an aldehyde or carboxylic acid by the action of a dehydrogenase.
- the substrate for the dehydrogenase is preferably the product of an improved oxygenase of the invention.
- Polynucleotides encoding many known dehydrogenases can be used as substrates for nucleic acid shuffling.
- Exemplary dehydrogenases useful in practicing the present invention include, but are not limited to:
- a method for converting carboxylic acid and hydroxyl groups to adducts such as esters and ethers.
- Useful polypeptides include, for example, ligases and transferases (see Fig. 13). For the purposes of the discussion below, these polypeptides are referred to as "adduct-forming" polypeptides.
- the adduct-forming polypeptides are useful for enhancing and controlling the production of biotransformation products.
- These polypeptides which convert a diol, for example, to a monoacyl or monoglycosyl derivative can enhance control over the regioselectivity of subsequent reactions (e.g., chemical dehydration).
- the regioselectivity of chemical dehydration in certain cases can be controlled by converting the compounds to their diacyl derivatives by means of chemical reaction, and then selectively removing one of the acyl groups using a polypeptide of the invention.
- the isolation of certain products is simplified by their conversion to more hydrophobic species.
- the acylation of a diol to the corresponding carboxylic ester provides a more efficient recovery of such diols, in the form of an ester, by organic solvent extraction of the adduct.
- Preferred organic solvents are those that can be used in an immiscible biphasic organic-aqueous biotransformation with whole cells, whether in a batch or in a continuous mode.
- An adduct-forming polypeptide is optionally expressed by the same host cell that expresses the dioxygenase, dehydrogenase, racemase, etc., or by a different host cell. Moreover, an adduct-forming polypeptide can be a naturally occurring polypeptide, or it can be improved by the method of the invention.
- When the adduct-forming polypeptide is an improved polypeptide, in presently preferred embodiments, the polypeptide demonstrates increased efficiency in the formation of the monoacyl- or monoglycosyl-derivatives of a desired compound (e.g., a glycol, carboxylic acid, etc.).
- Other improved adduct-forming polypeptides include transferases and ligases that selectively modify only one of the hydroxyl groups of a diol, thus providing a means for controlling the regioselectivity of dehydration of such derivatives to either of two possible isomeric ⁇ -hydroxycarboxylic acid compounds. a. Acyltransferases
- Other enzymes useful in practicing the present invention are the acyltransferases. These polypeptides are optionally evolved to enhance certain catalytic properties of the encoded polypeptides, such as specificity for a particular hydroxyl and/or acid, or enantiomeric and/or diastereomeric selectivity.
- these polypeptides catalyze acyl transfer reactions as shown in Fig. 13.
- Acyltransferases are ubiquitous in nature, and many organisms (e.g., microbes, plants, mammals, etc.) can be used as sources of genes encoding these polypeptides.
- the acyltransferase genes are preferably selected from those encoding functional polypeptides that catalyze active (CoA) ester transfer reactions in the biocatalytic processes described herein.
- Preferred acyltransferase genes are selected from those encoding functional polypeptides catalyzing reactions of small non-biopolymeric molecules.
- exemplary polynucleotides that can be recruited for this purpose are listed below by the corresponding GenBank identification:
- acetyl-CoA benzylalcohol acetyltransferase of Clarkia breweri, and benzoyl-CoA benzyl alcohol acetyltransferase present in the same organism (Dudareva et al., Plant Physiol. 116(2): 599-604 (1998));
- an accessory polypeptide having acyl CoA ligase activity is provided.
- Specificity of acyl-CoA ligases towards a particular exogenous substrate or a group of substrates is preferably optimized by screening or selecting for the acylation of a substrate by shuffled and co-expressed acyl-CoA ligases and acyltransferases. Utilizing these polypeptides in tandem allows the combined effect of both polypeptides to be exploited.
- one or more of the members of the corresponding superfamilies of these polypeptides are selected, aligned with similar homologous sequences, and shuffled against these homologous sequences.
- a carboxylic acid is fed exogenously to an organism that expresses the ligase or transferase.
- the carboxylic acid is selected from those compounds that cannot be altered by the polypeptide used to produce the substrate acted upon by the adduct forming polypeptide.
- Such carboxylic acids include, for example, both substituted and non-substituted benzoic acid, phenylacetic acid, naphthoic acid, phenylpropionic acid, phenoxyacetic acid, cycloalkanoic acid, carboxylic acids derived from terpenes, pivalic acid, substituted acrylic acids, and the like.
- the invention also provides microorganisms in which one or more mutations are introduced.
- Preferred mutations are those that effectively block metabolic modifications of such acids beyond their conversion to a suitable active ester (e.g., as a derivative of coenzyme A).
- Such mutations in the host organism are optionally introduced by classical mutagenesis methods, by site-directed mutagenesis, by whole genome shuffling, and other methods known to those of skill in the art.
- the acyl transferase-encoding nucleic acids used as substrates for creating recombinant libraries encode polypeptides that transfer an acetyl group from an endogenous pool of acetyl-CoA in the cells of the host.
- the endogenous pools of acetyl-CoA can also be enhanced by nucleic acid shuffling of an acetyl-CoA ligase and by supplying an exogenous acetate in the medium.
- the organisms produce a sufficient amount of an acyl-CoA ligase so as to activate the carboxylic acids to CoA thioesters, which in turn serve as substrates for acyl-CoA transferases that utilize the oxidation products as substrates.
- the specificity of an acyl-CoA ligase towards a desired exogenous carboxylic acid can be optimized using the recombination and screening/selection methods of the invention.
- the screening or selecting is performed using co-expressed acyl-CoA ligases and acyltransferases, thus permitting one to screen on the basis of the combined effect of both polypeptides in the pathway for provision of monoacylated derivatives of the oxidation products.
- Nucleic acids that encode acyl-CoA ligases and other acyltransferases useful as substrates for the recombination and selection/screening methods of the invention include, for example, one or more members of the superfamilies of these polypeptides.
- the nucleic acids are selected, aligned with similar homologous sequences, and shuffled against these homologous sequences.
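- The select, align and shuffle sequence described above can be illustrated in silico. The following is a minimal sketch, not the laboratory procedure: it assumes the homologs have already been aligned to equal length, and it builds chimeras by copying segments between randomly placed crossover points. The function name `shuffle_aligned` and the toy parent sequences are hypothetical.

```python
import random


def shuffle_aligned(parents, n_crossovers, rng):
    """Build one chimera from pre-aligned, equal-length parent sequences."""
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("parents must be aligned to the same length")
    # Pick distinct crossover positions, then copy each segment
    # from a randomly chosen parent.
    cuts = sorted(rng.sample(range(1, length), n_crossovers))
    bounds = [0] + cuts + [length]
    segments = []
    for start, end in zip(bounds, bounds[1:]):
        donor = rng.choice(parents)
        segments.append(donor[start:end])
    return "".join(segments)


# Hypothetical aligned homologs (toy sequences, not real genes).
parents = ["ATGGCTAAAGGTCTG", "ATGGCAAAGGGCCTC", "ATGTCTAAAGGACTG"]
library = [shuffle_aligned(parents, 3, random.Random(seed)) for seed in range(5)]
```

Every position of each chimera is inherited from one of the parents at the same aligned position, which is the property that makes the resulting library suitable for subsequent screening or selection.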
- one or more glycosyltransferases can be expressed by the host cells of the invention.
- one or more glycosyltransferases can be selected from the glycosyltransferase superfamily, aligned with similar homologous sequences, and shuffled against these homologous sequences.
- Glycosyl transfer reactions are ubiquitous in nature, and one of skill in the art can isolate such genes from a variety of organisms, using one or more of several art-recognized methods.
- the following are illustrative examples of glycosyltransferase-encoding nucleic acids that can be used as substrates for creation of the recombinant libraries.
- the libraries are then screened to identify those polypeptides that exhibit an improvement in the glycosylation of compounds such as alcohols, diols and α-hydroxycarboxylic acids:
- glycosyltransferases are selected from those which transfer hexose residues from UDP-hexose derivatives.
- Preferred hexoses include, for example, D-glucose, D-galactose and D-N-acetylglucosamine.
- the host cells of the present invention express a polypeptide capable of converting a carboxylic acid to a carboxylic acid methyl ester.
- Presently preferred polypeptides capable of converting a carboxylic acid to a carboxylic acid methyl ester include methyltransferases.
- genes encoding S-adenosylmethionine-dependent methyltransferases are preferred.
- these polypeptides are evolved to enhance selected properties of the encoded polypeptides, such as specificity for a particular substrate, enantiomeric and/or diastereomeric selectivity, and/or solvent resistance.
- these polypeptides can be evolved to catalyze the O-methylation of carboxyl groups of a carboxylic acid substrate, thus forming the corresponding methyl esters.
- Methyltransferases are ubiquitous in nature, and many organisms (e.g., microbes, plants, mammals, etc.) can be used as sources of genes encoding these polypeptides. No matter their origin, the methyltransferase genes are preferably selected from those which encode functional polypeptides that catalyze the methylation of small non-biopolymeric molecules.
- the methyltransferases are those which act on the carboxyl groups of organic acids.
- Examples of various methyltransferases that can be expressed by host cells of the invention and which are useful for nucleic acid shuffling-based directed evolution of polypeptides catalyzing the methylation of carboxylic acids are listed below by the corresponding GenBank identification:
- NTDIMET o-diphenol-O-methyltransferase of N. tabacum
- PCCCOAMTR, PUMCCOAMT trans-caffeoyl-CoA 3-O-methyltransferase of Petroselinum crispum
- PTOMTI caffeic acid/5-hydroxyferulic acid O-methyltransferase (PTOMTI) of Populus tremuloides
- PBTAJ4894-PBTAJ4896 caffeoyl-CoA 3-O-methyltransferases of Populus balsamifera subsp.
- the present invention provides a nucleic acid encoding a polypeptide capable of converting a particular enantiomer of a chiral compound such as an alcohol, diol or α-hydroxycarboxylic acid or a precursor or analogue thereof to its antipode.
- Presently preferred polypeptides include racemases, such as the mandelate racemase of Pseudomonas putida (PSEMDLABC). These polypeptides can be expressed by hosts of the invention in their natural form or, alternatively, they can be evolved to enhance certain catalytic properties of the encoded polypeptides, such as specificity for a particular substrate and enantiomeric and/or diastereomeric selectivity.
- the nucleic acid encoding the mandelate racemase of Pseudomonas putida, which catalyzes the interconversion of the R and S enantiomers of mandelate, is a typical preferred example of the genes selected for use in this invention.
- the nucleic acids encoding this gene, and any homologs thereof, are subjected to nucleic acid shuffling to evolve polypeptides having improved or optimal performance and specificity towards particular substrates such as α-hydroxycarboxylic acids.
- the polypeptide has a performance and/or specificity that is enhanced over the wild type.
- Preferred polypeptides act on α-hydroxycarboxylic acid substrates, such as those displayed in Fig. 11.
- 4. Solvent resistance polypeptides
- the invention also provides organisms expressing one or more of the improved polypeptides of the invention and that are also resistant to solvents, organic substrates and reaction products (e.g., epoxides, glycols, α-hydroxyaldehydes, α-hydroxycarboxylic acids and α-hydroxycarboxylic acid derivatives (e.g., esters)) according to the methods of the invention.
- the solvent resistance of organisms and polypeptides used in the biocatalytic conversion of organic compounds is important for enhancing the productivity of such processes. Increased solvent resistance of the organisms can enhance longevity, viability and catalytic activity of the microbial cells, and can simplify the administration of the feedstock compounds to the reactor and the recovery or separation of desired products by means of, for example, continuous or semi-continuous liquid-liquid extraction.
- the invention provides microbial cells that are useful in the synthetic methods described herein, which express proteins conferring resistance to solvents (in particular, organic solvents) upon the microbial cells. This allows the use of whole microbial cells in an organic-aqueous mixture (e.g., a biphasic mixture).
- the invention provides microbial strains including at least two of the polypeptide systems described herein.
- a microorganism of the invention can contain both a dioxygenase gene and a transferase gene.
- the microorganism can contain both an arene dioxygenase gene and a solvent resistance gene.
- the microbial cells thus provide a significant improvement in the productivity of the synthesis processes, selectivity of product formation, operational simplicity, ease of product recovery and minimization of by-product streams.
- Several microorganisms are known to possess high resistance to hydrophobic compounds such as benzene and lower alkylbenzenes.
- genes encoding a solvent efflux pump (srpPXRC) have been identified in Pseudomonas putida strains (Kieboom et al., J. Biol. Chem. 273:85-91 (1998)).
- genes such as those that encode many proton-dependent multidrug efflux systems, e.g., MexA-MexB-OprM, MexC-MexD-OprJ, and MexE-MexF-OprN of Pseudomonas aeruginosa (Li et al, J. Bacteriol. 180: 2987-2991 (1998)), or the tolC, acrAB, marA, soxS, and robA loci of Escherichia coli (Aono et al, J. Bacteriol. 180: 938-944 (1998); White et al, J. Bacteriol. 179: 6122-6126 (1997)), and in many other microorganisms, can be used to confer solvent resistance upon a host microbial strain used in the oxidative biocatalytic conversion of olefins by action of dioxygenases.
- the ability of a polypeptide to confer solvent resistance is enhanced by subjecting nucleic acids encoding solvent resistance polypeptides, or the genomes of the microorganisms themselves, to the recombination and selection/screening methods described herein.
- the nucleic acids listed above, as well as similar genes, provide a source of substrates for incorporation into organisms of the invention and/or use in nucleic acid shuffling and other methods of constructing libraries of recombinant polynucleotides. The libraries can then be screened to identify those nucleic acids that encode polypeptides conferring improved solvent tolerance on a host.
- biotransformation e.g., two-phase oxidation
- shuffling of nucleic acids that encode these polypeptides can be used to confer and to improve resistance of the microbial cell to high concentrations of biotransformation substrates, intermediates and endproducts, thus improving biocatalyst performance and productivity.
- the present invention provides polypeptides produced according to these disclosed methods. Moreover, the invention provides organisms that express the polypeptides produced by the method of the invention. The organisms of the invention can express one or more of the improved polypeptides. Also provided by the present invention are methods of synthesizing a desired compound. This method involves contacting an appropriate substrate with a polypeptide of the invention. In a preferred embodiment, the substrate is contacted with an organism of the invention that expresses a polypeptide of the invention.
- D. Methods of Using Improved Polypeptides to Prepare Organic Compounds
- the present invention provides a range of methods for preparing useful organic compounds by the oxidation and further elaboration of appropriate precursors.
- the methods provided by the present invention include, for example, the oxidation of alkylarene compounds to the corresponding unsaturated diols and the subsequent dehydration of these diols to hydroxy alkylarenes.
- there is also provided an analogous method for preparing hydroxylated aromatic carboxylic acids.
- the invention provides methods for preparing exocyclic and/or acyclic diols from molecules having alkene bonds. These diols can be readily converted to α-hydroxycarboxylic acids.
- reaction types and sequences set forth below are illustrative of the scope of the invention.
- the dioxygenases of the invention are capable of oxidizing any organic substrate comprising an oxidizable moiety. Additional reaction sequences utilizing the polypeptides of the invention will be apparent to those of skill in the art.
- the preparation of vicinal diols by oxidizing a π-bond using a dioxygenase of the invention provides ready access to a wide array of compounds that are useful both as final products and as intermediates in multi-step reaction pathways.
- the dioxygenases of the invention are capable of converting to vicinal diols an array of structurally distinct compounds comprising one or more π-bonds.
- although the method can be practiced with essentially any π-bond, in essentially any compound, in a preferred embodiment the method involves preparing a vicinal diol group by contacting a substrate comprising a carbon-carbon double bond with an improved dioxygenase polypeptide, or an organism expressing an improved dioxygenase polypeptide.
- the substrate comprising the carbon-carbon π-bond is selected from styrene, substituted styrene, divinylbenzene, substituted divinylbenzene, isoprene, butadiene, diallyl ether, allyl phenyl ether, substituted allyl phenyl ether, allyl alkyl ether, allyl aralkyl ether, vinylcyclohexene, vinylnorbornene, and acrolein.
- R 1 is selected from phenyl, substituted phenyl, pyridyl, substituted pyridyl, — NR 2 R 3 , —OR 2 , — CN, C(R )NR 2 R 3 and C(R 4 )OR 2 groups
- R 2 and R 3 are members independently selected from H, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic and substituted heterocyclic groups
- the diol includes a six-member ring having at least one endocyclic double bond and at least one substituent selected from methyl, carboxyl and combinations thereof.
- Preferred diols having this structure are displayed in Fig. 1 (the compounds having the structures III, IV, V, VI, VII and VIII) and in Fig. 4 (the compounds having the structures XXIII, XXIV, XXV, XXVI, XXVII, XXVIII, XXIX and XXX).
- the invention provides methods for preparing hydroxy arenes and hydroxy arenes that are further functionalized with, for example, alkyl and substituted alkyl groups.
- the method involves contacting a substrate comprising an aryl group with an improved dioxygenase of the invention to form a diol.
- the diol intermediate is subsequently dehydrated, thereby producing an aromatic ring functionalized with a hydroxy radical.
- Methods for carrying out the dehydration are known in the art. Both enzymatic and chemical/physical means are appropriate. Preferred chemical/physical techniques include acid, base, heat and combinations thereof.
- the substrate includes a member selected from arylalkyl groups, substituted arylalkyl groups, heteroarylalkyl groups, and substituted heteroaryl alkyl groups.
- the substrate has the structure
- each of the n R groups is a member selected from the group consisting of alkyl groups, substituted alkyl groups, alkynyl groups, aralkyl groups, alkoxy groups, aryloxy, alkylthio groups, cycloalkyl groups, alkenyl groups, halogens, CF 3 , CN, NO 2 , trimethylsilyl, trimethylgermanyl, trimethylstannyl, and alkylamines; and n is an integer from 0 to 5, inclusive.
- R contains about 1 to about 15 carbons, preferably R is a lower alkyl group, more preferably methyl; and n is an integer from 1 to 5, inclusive, more preferably, n is an integer from 1 to 4, inclusive.
- the enzymes, bioengineered pathways, and microorganisms of the invention are useful for the synthesis of a wide variety of compounds, including many that are of commercial importance for purposes such as vitamin production.
- the methods and enzymes of the invention provide a means by which commercially valuable compounds can be formed using relatively inexpensive compounds as precursors.
- An illustrative example of the use of the enzymes and methods of the invention is a new selective process for making isomeric trimethylphenols, which can further be converted to trimethylhydroquinone (a key vitamin E intermediate). In the case of 2,4,5-trimethylphenol, the new route uses conditions similar to those typically applied to the conversion of 2,4,6-trimethylphenol (Fig. 2).
- tocopherols are isolated from common oils such as soybean, corn, canola, cottonseed, safflower, and the like, by means of repeated vacuum distillation and alkali treatment of the oils, followed by multi-step processes for removal of impurities such as sterols and fatty acids. Microbial production of tocopherols has also been reported, e.g., from Aspergillus, Lactobacter, Euglena and Mycobacterium, but fermentation titers are low, rendering these processes commercially insignificant.
- vitamin E (D-α-tocopherol)
- 2,3,5-trimethylphenol (Compound IX in Fig. 2)
- 2,4,5-trimethylphenol (Compound X)
- 2,3,6-trimethylphenol (Compound XI)
- 2,4,6-trimethylphenol (Compound XII)
- a natural or synthetic phytol side chain can be attached to the 2,3,5-trimethylhydroquinone using Friedel-Crafts acidic catalytic conditions (see, e.g., US Patent No. 5,468,883).
- the 2,3,5-, 2,4,5-, 2,4,6- and 2,3,6-trimethylphenols can be synthesized using the arene dioxygenases and microbial cells of the invention as shown in Fig. 1.
- 1,2,4-trimethylbenzene (Compound I) is oxidized to any of Compounds III, IV, V, VI, or VII using an arene dioxygenase.
- These arene cis-dihydrodiols can then be subjected to chemical dehydration to obtain the corresponding trimethylphenols as shown in Fig. 1.
- 1,3,5-trimethylbenzene (Compound II) can be oxidized using an arene dioxygenase to obtain Compound VIII, which in turn can be dehydrated to obtain 2,4,6-trimethylphenol (Compound XII).
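- The atom bookkeeping for the route from Compound II to Compound XII can be checked with a short calculation. This is a sketch under stated assumptions: the molecular formulas are inferred from the compound names rather than taken from the figures, the atomic masses are rounded standard values, and the "theoretical yield" ignores all practical losses.

```python
from collections import Counter

# Rounded standard atomic masses (g/mol); an assumption of this sketch.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}


def mass(formula):
    """Molar mass of a formula given as an element -> count mapping."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


# Formulas inferred from the compound names (not stated in the source).
arene = Counter(C=9, H=12)         # 1,3,5-trimethylbenzene (Compound II)
diol = Counter(C=9, H=14, O=2)     # cis-dihydrodiol (Compound VIII)
phenol = Counter(C=9, H=12, O=1)   # 2,4,6-trimethylphenol (Compound XII)
water = Counter(H=2, O=1)

# Dioxygenation adds two hydroxyl groups to the ring.
dioxygenation_balanced = diol == arene + Counter(H=2, O=2)
# Dehydration of the diol releases one water per phenol formed.
dehydration_balanced = diol == phenol + water

# Maximum grams of phenol obtainable per gram of arene substrate.
theoretical_yield_g_per_g = mass(phenol) / mass(arene)
```

Because one oxygen atom is retained in the product, the mass of phenol formed can exceed the mass of arene consumed when the conversion is complete.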
- an arene dioxygenase exhibits enhanced regiospecificity for the addition of the hydroxyl residues to the appropriate carbon atoms and/or enhanced enantiospecificity to obtain the desired chirality.
- the invention also provides methods in which acyltransferases or glycosyltransferases are used to facilitate the production of a desired isomer of a dialkylphenol or a trialkylphenol from the cis-dihydrodiol intermediates that are formed upon arene dioxygenase-mediated biocatalysis.
- An example of this reaction is shown in Fig. 3, in which an acyltransferase or a glycosyltransferase is employed to acylate or glycosylate one of the cis-hydroxyl groups on a cis-dihydrodiol (Compound IV).
- acylation can also be used in conjunction with an esterase to convert a cis-dihydrodiol di- or trialkylbenzene intermediate into a desired isomer of a trialkyl- or dialkylphenol.
- the cis-dihydrodiol product of an arene dioxygenase reaction is subjected to chemical acylation with an anhydride, resulting in acylation of both hydroxyl groups (e.g., Compound XVIII in Fig. 3).
- an esterase is then employed to release one of the acyl groups, thus producing the monohydroxyl derivative (e.g., Compound XVI), which can then be converted to a desired dialkyl- or trialkylphenol by chemical dehydration (e.g., Compound IX).
- the esterase is a recombinant esterase that has been enhanced, using the methods of the invention, for improved properties such as regiospecificity and enantiospecificity, and the like.
- Such compounds can be synthesized using the methods, enzymes, and microorganisms of the invention.
- Fig. 4 shows the various dimethylphenol compounds that one can produce by arene dioxygenase-catalyzed oxidation of xylenes, preferably in conjunction with whole cell biocatalysis, followed by chemical dehydration.
- the invention provides methods in which o-xylene (Compound XXI) is oxidized by an arene dioxygenase to form one or more of the cis-dihydrodiols shown as Compounds XXV, XXVI, and XXVII.
- Chemical dehydration of Compounds XXV and XXVII can then be used to obtain 2,3-dimethylphenol (Compound XXXII) and 3,4-dimethylphenol (Compound XXXIV), respectively, while dehydration of Compound XXVI results in both of these compounds being produced.
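- The relationship between the o-xylene dihydrodiol intermediates and their dehydration products can be written as a small lookup table, which makes clear why the regiospecificity of the dioxygenase determines product purity. This is an illustrative sketch only; the names `DEHYDRATION_PRODUCTS`, `phenols_from` and `selective_diols` are hypothetical, and the mapping simply restates the text above.

```python
# Dihydrodiol intermediate -> dimethylphenol(s) formed on chemical dehydration,
# as stated in the description of the o-xylene route.
DEHYDRATION_PRODUCTS = {
    "XXV": ["2,3-dimethylphenol (XXXII)"],
    "XXVI": ["2,3-dimethylphenol (XXXII)", "3,4-dimethylphenol (XXXIV)"],
    "XXVII": ["3,4-dimethylphenol (XXXIV)"],
}


def phenols_from(diols):
    """Return the sorted set of phenols expected from a mixture of dihydrodiols."""
    return sorted({p for d in diols for p in DEHYDRATION_PRODUCTS[d]})


def selective_diols():
    """Return the dihydrodiols whose dehydration gives a single phenol isomer."""
    return sorted(d for d, products in DEHYDRATION_PRODUCTS.items() if len(products) == 1)
```

For example, `phenols_from(["XXV"])` lists only 2,3-dimethylphenol, whereas any mixture containing Compound XXVI yields both isomers.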
- the arene dioxygenase that is employed in the reaction is one that is optimized for the desired regiospecificity and/or enantiospecificity.
- the invention provides methods in which an arene dioxygenase is used to catalyze the oxidation of m-xylene (Compound XXII) to one or more of the arene cis-dihydrodiols Compound XXVIII, Compound XXIX, and Compound XXX.
- the arene cis-dihydrodiols can then, in turn, be subjected to chemical dehydration to obtain one or more dihydrophenols.
- the resulting arene cis-dihydrodiols can be chemically dehydrated to obtain 2,5-dimethylphenol.
- the arene dioxygenase is expressed by a cell that is of a species other than that from which the arene dioxygenase gene was obtained, or the arene dioxygenase is expressed from a recombinant arene dioxygenase-encoding polynucleotide that has been optimized for improved properties using the recombination and selection/screening methods of the invention.
- Hydroxylated aromatic carboxylic acids have many diverse uses, including as antimicrobial additives, as UV protectants (e.g., esters of p-hydroxybenzoic acid, parabens), and in pharmaceutical compositions (e.g., esters of salicylic acid, coumarins and 3,4-dihydrocoumarin).
- the present invention provides a method for preparing hydroxylated aromatic carboxylic acids.
- the method involves contacting a substrate comprising an aryl carboxylic acid with a dioxygenase polypeptide of the invention.
- the polypeptide is preferably expressed by an organism of the invention.
- The carboxylic acids used as substrates in the present invention can be obtained from commercial sources, or they can be prepared by methods known in the art. In a preferred embodiment, the carboxylic acids are prepared by contacting a substrate comprising an aryl alkyl group with an oxygenase polypeptide to produce the corresponding aryl alkyl alcohol. The alcohol is subsequently acted upon by a dehydrogenase polypeptide to produce the desired carboxylic acid. Alternatively, the alcohol can be converted to the carboxylic acid by chemical means.
- the first step in the biotransformation processes for conversion of methylaryl compounds, such as toluene and isomeric xylenes, involves the selective oxidation of at least one methyl group present in the aromatic substrate to the corresponding carboxylic acid.
- the substrate is toluene, p-, m- or o-xylene, or 1,2,4-trimethylbenzene, or a mixture thereof, and preferably, only one of the methyl groups is oxidized.
- the resulting alcohol is dehydrogenated, generally by the action of a dehydrogenase polypeptide to produce the desired carboxylic acid.
- the invention provides for polypeptides that selectively oxidize only one alkyl group of an arene bearing two or more alkyl substituents.
- This embodiment is illustrated in Fig. 6, with the monooxidation of various xylenes.
- p-xylene (2) is selectively converted to a monocarboxylic acid (22).
- the invention provides polypeptides that are capable of oxidizing more than one alkyl substituent of a species substituted with two or more alkyl groups.
- certain polypeptides of the invention are capable of oxidizing both of the methyl substituents of a xylene, such as o-xylene (4), to the corresponding benzenedimethanol (4a).
- the monooxygenation/dehydrogenation pathway produces a carboxylic acid having the structure:
- each of the n R groups is independently selected from H, alkyl and substituted alkyl groups; and n is an integer from 1 to 5, inclusive, more preferably R is methyl, and more preferably still, n is an integer from 1 to 3, inclusive.
- the carboxylic acid group is selected from:
- enzymes for effecting these reactions are well known in the art, and are suitable for use in the construction of useful polypeptides and host strains.
- certain enzymes are presently preferred, including non-heme multicomponent monooxygenases of toluene and xylenes, and p-cymene, as well as certain arene dioxygenases which act on these substrates in a monooxygenase mode.
- the latter are exemplified by naphthalene dioxygenase, 2-nitrotoluene 2,3-dioxygenase and 2,4-dinitrotoluene 4,5-dioxygenase.
- loci are identified by GenBank ID and encode complete or partial protein components of the arene dioxygenases. Suitable loci include:
- the monooxygenase used is actually a dioxygenase that exhibits monooxygenase activity.
- the ability of a dioxygenase to act as a monooxygenase is a property that can be optimized by shuffling the nucleic acids encoding these dioxygenases.
- loci are identified by GenBank ID and encode complete or partial protein components of the arene dioxygenases. Suitable loci include:
- a polypeptide that catalyzes monooxygenation can be a naturally occurring polypeptide, or it can have one or more properties that are improved relative to an analogous naturally occurring polypeptide.
- the polypeptides are expressed by one or more host organisms.
- the polypeptide that catalyzes the monooxygenation can be co-expressed by the same host expressing a polypeptide used for further structural elaboration of the oxidation substrate or product (e.g., a dioxygenase polypeptide that oxidizes the π-bond).
- the mono- and di-oxygenase polypeptides can be expressed in different hosts.
- At least one alkyl group of the alkylarene has at least two carbon atoms.
- Preferred species produced in the monooxygenation step have the structure:
- each of the m R groups is selected from H, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic and substituted heterocyclic; m is an integer from 0 to 5, inclusive; and n is an integer from 1 to 10, inclusive.
- Preferred aryl groups are those substituted on the aryl group with at least one methyl moiety.
- the compound has the structure:
- n is an integer from 1 to 6, inclusive.
- Fig. 7 illustrates the oxidation to a carboxylic acid of the terminal methyl groups of alkylbenzene compounds 5-9.
- the oxidation is accomplished by recruiting one or more genes encoding oxygenase activity. Generally, this is best accomplished by expressing a suitable cytochrome P450 type enzyme system.
- the enzymes of this class are ubiquitous in nature, and they can be found in a variety of organisms.
- n-propylbenzene is known to undergo ω-oxidation in strains of Pseudomonas desmolytica S449B1 and Pseudomonas convexa S107B1 (Jigami et al, Appl Environ Microbiol 38(5):783-788 (1979)), which can utilize this hydrocarbon in either of two alternative oxidation pathways.
- alkane monooxygenases of bacterial origin, or cytochromes P450 for camphor oxidation, whether wild-type or mutant, can be recruited for the purpose of introducing the oxygen at the terminal methyl group of alkylarenes (Lee et al, Biochem. Biophys. Res. Commun. 218(1): 17-21 (1996); van Beilen et al., Mol. Microbiol. 6(21):3121-3136 (1992); Kok et al, J. Biol. Chem. 264(10):5435-5441 (1989); Kok et al, J. Biol. Chem. 264(10): 5442-5451 (1989); Loida et al, Protein Eng. 6(2):207-212 (1993)).
- (iii) Oxygenation of arenes with exocyclic π-bonds
- the starting material for the carboxylic acid is an arene bearing an exocyclic π-bond.
- This class of compounds is exemplified by styrene.
- Other analogous species are set forth in Fig. 11.
- the conversion of the exocyclic π-bond is best accomplished by recruiting a cluster of bacterial styrene oxidation genes well known in the art (Marconi et al, Appl. Environ. Microbiol. 62(1): 121-127 (1996); Beltrametti et al, Appl. Environ. Microbiol. 63(6):2232-2239 (1997); O'Connor et al, Appl. Environ. Microbiol.
- the styrene epoxidation step can be accomplished by using monooxygenases of methyl substituted aromatic compounds, such as toluene or xylenes (Wubbolts, et al, Enzyme Microb. Technol. 16(7):608-615 (1994)).
- the alcohol from (i-iii), above is preferably treated with a dehydrogenase polypeptide.
- the dehydrogenase enzyme can be endogenous to a host that expresses one or more of the oxygenase polypeptides, or it can exhibit properties that are improved relative to an endogenously expressed dehydrogenase.
- the polypeptide that catalyzes the dehydrogenation can be a naturally occurring polypeptide, or it can have one or more properties that are improved relative to an analogous naturally occurring polypeptide.
- the polypeptides are expressed by one or more host organisms.
- the polypeptide that catalyzes the dehydrogenation can be co-expressed by the same host expressing one or more of the dioxygenase polypeptides.
- the dehydrogenase and oxygenase polypeptides can be expressed in different hosts.
- the invention provides a method for altering or controlling the regiospecificity of the dehydrogenation reaction of a vicinal diol.
- This method "blocks" one of the vicinal diol hydroxyl groups by forming an ester, for example.
- the method involves contacting the vicinal diol with a polypeptide, preferably expressed by a host organism, having an activity selected from ligase, transferase and combinations thereof, thereby forming an α-hydroxycarboxylic acid adduct.
- this polypeptide can be expressed by the same host cell that expresses other polypeptides of the reaction cascade.
- this polypeptide can be a naturally occurring polypeptide, or it can be improved using the method of the invention.
- the molecule is submitted to a dioxygenation cycle.
- the dioxygenation of the aromatic ring is preferably accomplished by recruiting one or more arene dioxygenase genes, preferably of bacterial origin. Exemplary dioxygenase genes are disclosed herein.
- the method of the invention can be practiced using essentially any type of aromatic ring system. Exemplary aromatic systems include benzenoid and fused benzenoid ring systems (e.g., benzene, naphthalene, pyrene, benzopyran, benzofuran, etc.) and heteroaryl systems (pyridine, pyrrole, furan, etc.).
- the substrate includes a benzenoid hydrocarbon.
- the polypeptide that catalyzes the dioxygenation can be coexpressed with one or more polypeptides used in this synthetic pathway.
- the monooxygenase, dehydrogenase and dioxygenase polypeptides can all be coexpressed in a single host.
- Other functional combinations of coexpression will be apparent to those of skill in the art.
- benzoate-1,2-dioxygenase or toluate-1,2-dioxygenase are used to catalyze the formation of compound 14, p-cumate 2,3-dioxygenase to catalyze formation of compounds 13, 25, 26, and phthalate 4,5-dioxygenase or phthalate 3,4-dioxygenase to catalyze the formation of compound 12 (see, Fig. 5).
- the present invention provides both a chemoenzymatic route to 3,4-dihydrocoumarin (43) from n-propylbenzene (steps as shown in Fig. 8), and means for the subsequent conversion of this compound to other lactone derivatives, such as coumarin (58) and 4-oxygenated derivatives of coumarin.
- although compound 43 can be converted to 58 by chemical methods known in the art (e.g., by reaction with sulfur, or by catalytic dehydrogenation over a Pd or Pt catalyst), this invention also provides alternative biocatalytic means for effecting this reaction.
- one or more arene dioxygenases, such as naphthalene dioxygenase, toluene 2,3-dioxygenase or other structurally and functionally related dioxygenases, are shuffled, as described herein, to produce an improved polypeptide.
- the improved polypeptide is used to catalyze benzylic monooxygenation reactions with a variety of benzocycloalkanes, and benzylic desaturation reactions of compounds exemplified by 1,2-dihydronaphthalene, indan, and ethylbenzene.
- the invention preferably uses a subset of genes encoding catabolism of naphthalene by bacteria.
- at least four polypeptides are used to catalyze a series of reactions.
- These polypeptides include naphthalene 1,2-dioxygenase (compound 10 to 61), NahA (a multicomponent enzyme); cis-1,2-dihydro-1,2-dihydroxynaphthalene dehydrogenase (compound 61 to 62), NahB; and 1,2-dihydroxynaphthalene 1,1a dioxygenase (compound 62 to 63), NahC.
- Compound 63 is known to be labile, readily undergoing a series of tautomerization and intramolecular ring closure reactions.
- the cis-trans equilibrium between compounds 63 and 64 is preferably effected by 2-hydroxybenzalpyruvate isomerase (NahD) which can be used to impose a degree of control on the isomerization of the double bond.
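- The enzyme sequence described above can be summarized as an ordered list of steps, which makes it straightforward to work out which polypeptides a host must express to reach a given intermediate. This is a minimal sketch: the names `PATHWAY` and `enzymes_needed` are hypothetical, and the NahD equilibrium between compounds 63 and 64 is modeled in one direction only for simplicity.

```python
# (enzyme, substrate compound, product compound), in pathway order,
# using the compound numbers given in the description.
PATHWAY = [
    ("NahA", "10", "61"),  # naphthalene 1,2-dioxygenase
    ("NahB", "61", "62"),  # cis-dihydrodiol dehydrogenase
    ("NahC", "62", "63"),  # 1,2-dihydroxynaphthalene dioxygenase
    ("NahD", "63", "64"),  # 2-hydroxybenzalpyruvate isomerase
]


def enzymes_needed(start, target):
    """List the enzymes required to convert compound `start` into `target`."""
    route = []
    current = start
    for enzyme, substrate, product in PATHWAY:
        if current == target:
            break
        if substrate == current:
            route.append(enzyme)
            current = product
    if current != target:
        raise ValueError(f"no route from compound {start} to compound {target}")
    return route
```

For instance, `enzymes_needed("10", "63")` returns the three polypeptides NahA, NahB and NahC, matching the series of reactions from compound 10 to compound 63.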
- NahE 2-hydroxybenzalpyruvate hydratase/aldolase
- preferred host strains are those essentially lacking activity of this enzyme.
- the step of this process which allows for the preparation of either coumarin 58, or cis/trans o-hydroxycinnamic acids (66, 67) is the provision of an alpha-ketoacid decarboxylase enzyme with specific activity towards either compound 63 or 64 or both.
- alpha-ketoacid decarboxylase enzymes are known, and in the preferred embodiment, the benzoylformate decarboxylase of Pseudomonas putida, or an enzyme structurally or functionally similar to it, is used (GenBank PSEMDLABC, benzoylformate decarboxylase (mdlC)).
- Ring closure of compound 66 to 58 can be effected by enzymatic or chemical means (e.g. extraction under acidic conditions).
- cis-trans isomerization of 67 to 66 can be effected by enzymatic or chemical means (e.g. by Pt- or Pd-catalyzed hydrogenation/dehydrogenation under acidic conditions).
- Biocatalytic variations of these processes, which can be used to produce coumarin from o-hydroxycinnamic acids, preferably involve the use of acyl-CoA ligase and transferase enzymes (pathway for conversion of 66 to 68 to 58), or conversion of 67 to glycoside 69 with subsequent isomerization later in an enzymatic process akin to that of coumarin biosynthesis in plants.
4. Preparation of α-hydroxycarboxylic acids
- α-Hydroxycarboxylic acids (AHAs) are an important group of industrial chemicals.
- One of the simplest representatives of this class of compounds is lactic acid, which finds many uses, including synthesis of polyester polymers (polylactic acid).
- AHAs such as mandelic acid can also be used as a constituent of polymers or co-polymers with lactic acid.
- Enantiomerically pure AHAs are also used as resolving reagents for separating racemates of chiral molecules.
- AHAs are typically generated chemically by hydrolysis of a cyanohydrin, generally prepared from an aldehyde.
- Aldehydes are relatively expensive starting materials, and the cyanohydrin pathway does not readily provide for direct preparation of AHAs in high enantiomeric excess.
- One pathway for the synthesis of AHAs in high enantiomeric excess is through the use of one or more enzymatic reactions starting from an inexpensive and readily available starting material such as an alkene.
- Arene dioxygenases (ADOs) are known to oxidize alkenes to the corresponding vicinal diols.
- ADOs such as toluene 2,3-dioxygenase, isopropylbenzene 2,3-dioxygenase, benzene-1,2-dioxygenase, biphenyl-2,3-dioxygenase, naphthalene-1,2-dioxygenase, and many homologous and/or functionally similar enzymes can be used to manufacture AHAs in a highly regioselective fashion.
- An example of this dioxygenation reaction is provided in Fig. 12, with the conversion of alkene (I) to the vicinal diol (II).
- the present invention provides general methods for the biocatalytic manufacture of AHAs and their esters, and also for the construction and optimization of the biocatalytic properties of enzymes and host strains that effect oxidative cis-dihydroxylation or epoxidation reactions of a variety of alkenes. Moreover, the invention provides for the subsequent enzymatic conversion of the dioxygenation products to AHAs and ester derivatives of AHAs.
- the invention provides a method for converting an olefin into an α-hydroxyacid.
- the method involves: (a) contacting the olefin with an improved dioxygenase polypeptide to form a vicinal diol; and (b) contacting the vicinal diol with a dehydrogenase polypeptide to form the α-hydroxyacid.
- the polypeptide that catalyzes the dehydrogenation can be a naturally occurring polypeptide, or it can have one or more properties that are improved relative to an analogous naturally occurring polypeptide.
- the polypeptides are expressed by one or more host organisms.
- the polypeptide that catalyzes the dehydrogenation can be co-expressed by the same host expressing the dioxygenase polypeptide that oxidizes the π-bond.
- the dehydrogenase and dioxygenase polypeptides can be expressed in different hosts. An example of the dehydrogenation is provided in Fig.
- the method of the invention can be used to produce AHAs having substantially any structure.
- the α-hydroxycarboxylic acid has the structure:
- R¹ is selected from aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, substituted heterocyclic, -NR²R³, -OR², -CN, C(R⁴)NR²R³ and C(R⁴)OR² groups;
- R and R are members independently selected from H, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic and substituted heterocyclic groups;
- AHAs are bifunctional molecules with two chemically and enzymatically distinguishable functional groups, carboxyl and hydroxyl. In the biocatalytic modifications of AHAs described in this invention, either of these groups can be derivatized by bond formation. While these reactions do not change the oxidation state of the AHA molecule, recruitment of the enzymes effecting modification of AHAs provides the opportunity to generate biotransformation endproducts with substantially different physical and chemical properties than that of a free AHA. Generally desirable properties include an increase of hydrophobicity, a decrease of aqueous solubility and, for an ester formed through a carboxylic group of an AHA, a decrease in acidity of the process end-products.
- the adduct-forming polypeptide produces an α-hydroxycarboxylic acid adduct selected from esters and ethers.
- the method involves contacting an α-hydroxycarboxylic acid with a polypeptide having an activity selected from ligase, transferase and combinations thereof, thereby forming an α-hydroxyacid adduct.
- the adduct forming polypeptides useful in this embodiment can be naturally occurring polypeptides or, alternatively, they can be polypeptides improved using the methods of the invention, as discussed generally, above. Exemplary adduct forming reactions are provided in Fig. 13.
- This Figure shows the use of a methyltransferase to convert carboxylic acid (X) to the corresponding methyl ester (XI), acyltransferase I to convert compound X to ester XIII, and acyl-CoA ligase to convert X to intermediate XIV.
- This intermediate can then be transformed into a simple alkyl ester (XIX) or into structures having greater complexity in the alcohol-derived component (e.g., XV).
- Species such as XV can be further elaborated using other polypeptides including, for example, acyltransferase III to produce compound XVII, thioesterase II to produce compound XVIII and thioesterase I to produce compound XVI.
- the α-hydroxycarboxylic acid adduct has the structure:
- R is selected from aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, substituted heterocyclic, -NR²R³(R⁴)ₘ, -OR², -CN, C(R⁵)NR²R³ and C(R⁵)OR² groups
- R², R³ and R⁴ are members independently selected from the group consisting of H, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic and substituted heterocyclic groups
- R⁶ is selected from H, alkyl and substituted alkyl groups
- R⁷ is C(O)R⁸, wherein R⁸ is selected from H, alkyl and substituted alkyl groups and R⁷ and R⁸ are not both H
- m is 0 or 1, such that when m is 1, an ammonium salt is provided
- n is an integer between 0 and 10, inclusive.
- the described reactions and pathways are utilized for biocatalytic whole-cell conversion of styrene to mandelic acid and its ester derivatives. The pathway for styrene conversion, all of its intermediates and reactions are shown in Figs. 14A- 14R.
- the esterified adducts provide an increase in the overall efficiency of the biotransformation process as they simplify end-product recovery.
- the esters are easily isolated by organic solvent extraction and partitioning.
- the adducts obviate the need for pH adjustment in the aqueous fermentation media to prevent the accumulation of the high levels of acidic biotransformation products.
- AHAs can be biocatalytically esterified in a substantially aqueous environment.
- expression of genes encoding an S-adenosylmethionine (SAM)-dependent O-methyltransferase is used to effect conversion of AHAs to their methyl esters (e.g., Fig. 13, conversion of compound X to compound XI).
- SAM-dependent methyltransferases of differing substrate specificity are common in nature, and suitable enzymes and corresponding genes can be found and used directly for the purpose of this invention.
- these species can be further evolved and optimized for specific activity with the AHAs using one or more nucleic acid shuffling methods described herein.
- the invention also provides means for HTP screening for the presence, and quantitative determination, of the AHA-specific O-methyltransferase catalytic activities in microorganisms, cells, tissues or extracts of tissues of higher eukaryotic organisms. These methods can be used either to identify sources of corresponding genes or to evolve the desired specificity of known methyltransferases towards the AHAs by nucleic acid shuffling as described herein.
- acyltransferase enzymes which specifically esterify the sec-hydroxyl of AHAs by means of active carboxyl transfer from either acyl-coenzyme A or acylated acyl carrier protein (ACP) are incorporated into the reaction pathway.
- This pathway is depicted in Fig. 13, as shown by the coupling of compounds X and XII to yield compound XIII.
- a preferred embodiment of this pathway involves recruiting and expressing gene(s) encoding acyl-CoA-dependent acyltransferases, including those which utilize as substrates acetyl-CoA and CoA derivatives of fatty acids, as well as lactoyl-CoA, CoA-thioesters with other AHAs, and CoA derivatives of aromatic, arylalkanoic, branched chain alkanoic carboxylic acids, and alpha-aminoacids.
- carboxylic acids either in the form of a free acid, salt or ester
- the invention provides a means for facilitating ester formation by recruiting and co-expressing those acyl-CoA ligases or ACPs which effect in-vivo activation of these acids, forming suitable substrates for the acyl transferase enzymes that act on the AHAs.
- the invention also provides for another type of biochemical transformation of AHAs to AHA carboxylic esters wherein free AHAs are first converted to their active ester form by means of the enzymatic formation of a derivative with CoA or ACP (Fig. 13, compound XIV).
- acyltransferase enzymes and genes encoding them can be recruited for effecting subsequent transformations of compound XIV to esters of different compositions.
- the activated forms of these oligomeric esters can be converted to free carboxylic oligomers (e.g., XVIII) or to the cyclic substituted glycolides (XVI).
- the formation of an α-hydroxycarboxylic acid ester is catalyzed by an acyl-CoA ligase that is evolved by nucleic acid shuffling.
- shuffling of nucleic acids encoding acyl-CoA ligase activities results in an increase in the synthesis of esters.
- the esters are selected from structures XIII-XVIII (Fig. 13). The synthesis of these and other esters will generally rely on the provision of a corresponding α-hydroxycarboxylic acid precursor.
- the α-hydroxycarboxylic acid precursor is present in an amount sufficient to establish intracellular pools of CoA-activated carboxylic derivatives of α-hydroxycarboxylic acids.
- the transferase polypeptide is selected from glycosyltransferase and methyltransferase, more preferably methyltransferase and more preferably still an S-adenosylmethionine-dependent O-methyltransferase.
5. Enzymes effecting chiral switch at the level of AHAs
- Another object of this invention is the effective control of the enantiomeric composition of the compounds prepared by the methods of the invention.
- AHA esters made by the biotransformation process from alkenes This focus is intended to be illustrative and not limiting of the scope of this embodiment of the invention.
- A means of enantiomeric control, when integrated as part of the multistep biocatalytic pathway, constitutes an important advantage, as it allows selective production of either enantiomer of the AHA.
- the enantiomerically pure AHAs can be used as resolving reagents, chiral synthons, or monomers for polyesters or co-polyesters with lactic acid.
- the AHA is mandelic acid, or an analogue thereof, and the chiral switch is effected by recruiting a mandelate racemase gene. Mandelate racemase catalyzes the interconversion of the R and S enantiomers of mandelic acid and its derivatives.
- a preferred mandelate racemase is that of Pseudomonas putida (the sequence of the gene can be found in the GenBank database under the locus [PSEMDLABC]).
- Preferred mandelate racemases are those of the P. putida strain ATCC 12633; however, mandelate racemases from any other organism can be used.
- although the chiral switch is made at the level of the AHA, this switch can be made with any of the precursors or adducts of the AHA as well.
- the AHA is modified by at least one of the ester-forming enzymes discussed herein.
- Preferred ester forming enzymes are those which specifically, or preferentially, act on one enantiomer of the AHA, thus allowing enantiospecific resolution of the racemate in-vivo.
- the activity of the above racemases provides an enantiomeric equilibrium at the expense of the non-esterified enantiomer.
- the combined action of the racemase and the AHA esterifying enzymes provides a chiral switch which allows preparation of one desired enantiomer, whether R or S, from AHAs of any enantiomeric composition.
- the invention provides methods of degrading or modifying organic materials which lead to their detoxification.
- exemplary compounds include stabilizing agents, antioxidizing agents, environmental pollutants and the like. This method is applicable to substantially any compound that can be detoxified by, for example, oxidation, either with or without additional structural elaboration.
- the discussion below focuses on the detoxification of agents commonly found in organic solvents and in π-bonded compounds of use in the present invention.
- antioxidants such as 4-tert-butylcatechol or alkylphenols (e.g. BHT) to prevent polymerization during storage and transportation. While the amount of these compounds is usually relatively small (10-15 ppm), they can inhibit biocatalyst performance as they accumulate in aqueous fermentation medium during prolonged incubations required to obtain satisfactory endproduct concentrations.
- enzymes that modify phenolic stabilizing compounds can be used to alleviate any negative effects of these compounds on the whole cell biocatalyst performance. Their genes can be introduced in the same host organism used to produce endproducts or intermediates of relevance to this invention. Alternatively, they can be incorporated into a separate host organism. This obviates any need for additional steps in the process to remove these stabilizers. Optimization of one or several of these enzymes for the efficient removal of these stabilizing compounds is a target for nucleic acid shuffling.
- Exemplary enzymes for modifying phenolic and diphenolic stabilizers include, but are not limited to, acyltransferase, methyltransferase, glycosyltransferase, laccase and peroxidase.
- catecholic stabilizers also can be modified to innocuous products by catechol dioxygenases effecting meta- or ortho-ring cleavage. Many of these enzymes show a significant breadth of activity towards compounds related to phenolic stabilizers.
- nucleic acid shuffling can be applied to optimize enzyme parameters such as: a) increased turnover with a particular phenolic stabilizer; b) increased functional expression, by obviating the requirements for certain post-translational modifications of those enzymes which require such modifications (e.g. glycosylation of peroxidases and laccases); and c) alleviation of inhibition of these enzymes by high concentrations of co-occurring feedstock compounds and intermediates and endproducts of the biocatalytic process.
- a number of analytical techniques are useful in practicing the present invention. These analytical techniques are used to measure the extent of conversion of a particular substrate to product. These techniques are also used to analyze the regioselectivity and/or the enantiomeric selectivity of a particular reaction catalyzed by a polypeptide of the invention. Moreover, these techniques are employed to assess the effect of nucleic acid shuffling experiments on the efficiency and selectivity of the polypeptides produced following the shuffling.
- the discussion below focuses on those aspects and embodiments of the invention in which an olefin precursor is oxidized by a dioxygenase.
- the analytical techniques discussed in this context are generally of broad applicability to other aspects and embodiments of the invention.
- Dioxygenase activity can be monitored by HPLC, chiral HPLC, gas chromatography, NMR spectrometry, and mass spectrometry, as well as a variety of other
- epoxide formation can be indirectly measured by various reactive colorimetric reactions.
- H 2 O 2 is used as the oxidant
- disappearance of peroxide over time can be monitored directly either potentiometrically or colorimetrically using a number of commercially available peroxide reactive dyes.
- a preferred method is high-throughput MS, or MS operating in a coordination ion spray and/or electrospray-based mode.
- selection protocols in which the organism uses a given alkene or aromatic system as a sole carbon source can be used. In some systems this will be most readily accomplished by using the dioxygenase to generate a metabolizable diol.
- a key to strain improvement is having an assay that can be dependably used to identify a few mutants out of thousands that have potentially subtle increases in product yield.
- the limiting factor in many assay formats is the uniformity of library cell (or viral) growth. This variation is the source of baseline variability in subsequent assays. Inoculum size and culture environment (temperature/humidity) are sources of cell growth variation. Automation of all aspects of establishing initial cultures and state-of-the-art temperature and humidity controlled incubators are useful in reducing variability.
- library members, e.g., cells, viral plaques, spores or the like, are separated on solid media to produce individual colonies (or plaques).
- colonies are identified, picked, and 10,000 different mutants inoculated into 96 well microtitre dishes containing two 3 mm glass balls/well.
- the Q-bot does not pick an entire colony but rather inserts a pin through the center of the colony and exits with a small sampling of cells (or mycelia) and spores (or viruses in plaque applications).
- the time the pin is in the colony, the number of dips to inoculate the culture medium, and the time the pin is in that medium each affect inoculum size, and each can be controlled and optimized.
- the uniform process of the Q-bot decreases human handling error and increases the rate of establishing cultures (roughly 10,000/4 hours). These cultures are then shaken in a temperature and humidity controlled incubator. Glass or, preferably, stainless steel balls in the microtiter plates act to promote uniform aeration of cells and the dispersal of mycelial fragments similar to the blades of a fermenter.
Prescreen
- The ability to detect a subtle increase in the performance of a shuffled library member over that of a parent strain relies on the sensitivity of the assay. The chance of finding the organisms having an improvement is increased by the number of individual mutants that can be screened by the assay. To increase the chances of identifying a pool of sufficient size, a prescreen that increases the number of mutants processed by 10-fold can be used. The goal of the primary screen will be to quickly identify mutants having equal or better product titres than the parent strain(s) and to move only these mutants forward to liquid cell culture for subsequent analysis.
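The selection rule of the primary screen (keep only mutants whose titre equals or exceeds that of the parent) can be sketched as follows. This is a minimal illustration, assuming product titres have already been measured for each well; the function name, the well identifiers and the titre values are hypothetical and are not taken from the source.

```python
def select_for_secondary_screen(titres, parent_titre):
    """Return IDs of mutants whose titre is equal to or better than the parent's,
    ordered from highest to lowest titre."""
    hits = [clone for clone, titre in titres.items() if titre >= parent_titre]
    return sorted(hits, key=lambda clone: titres[clone], reverse=True)


# Hypothetical titres (arbitrary units) from one prescreen plate.
plate = {"A1": 0.82, "A2": 1.10, "A3": 1.00, "A4": 1.35}
hits = select_for_secondary_screen(plate, parent_titre=1.00)
print(hits)
```

In practice the threshold would probably be set relative to replicate parent wells on the same plate, so that plate-to-plate growth variation does not masquerade as an improvement.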
ADOs
- For the purpose of screening a shuffled ADO library and sorting out non-functional variants, several general activity detection methods for ADOs can be used, including cases for direct screening and colony picking on agar medium plates. Certain presently preferred examples of general methods are: (a) the formation of indigo from indole, and similarly, from substituted indoles and indole-carboxylic acids (to produce indigo or substituted indigo). The development of blue or blue-grey hued growing colonies signifies the expression of catalytically functional ADOs. Most of the ADOs exhibit some activity with either indole (e.g. toluene dioxygenase, biphenyl dioxygenase, naphthalene dioxygenase and homologous enzymes) or indole carboxylates (e.g. toluate-1,2-dioxygenase, p-cymate 2,3-dioxygenase).
- catechol formation can be enhanced in the presence of an aromatic amine (e.g. p-toluidine) and iron salts.
- iron salts oxidize readily under oxygen, or in the presence of other oxidants, forming various colored products in the media surrounding the colonies expressing catalytically active ADOs.
- This assay method is applicable for cases where a cis-diol is unstable (angular dihydroxylation products at aromatic π-bonds substituted with heteroatoms such as N, O, S, and halogens) and rearomatizes spontaneously with concomitant elimination of a leaving substituent.
- the leaving substituent e.g. halide, nitrite or ammonia
- pH indicator pH indicator
- the assay can also be used in cases of stable dihydrodiol formation, where a suitable gene of arene cis-diol dehydrogenase is co-expressed (in many cases such genes are indeed available and well known in the art).
- the accumulated catechol can be detected by the presence of colored oxidation products, whether enhanced by p-toluidine/Fe or not; and (c) color formation due to enzyme activities encoded by accessory genes for subsequent metabolism of arene cis-dihydrodiols.
- a yellow (or orange) color is developed by colonies in the medium which signifies the expression of catalytically active ADO.
- a combination of the methods can be used.
- the advantage of the indole/indigo assay is that the color does not diffuse into areas of medium surrounding the colonies.
- the advantage of other assays is that they often can provide information about both the general activity and also about the regioselectivity of the reaction catalyzed by ADO.
- Positive clones selected by either of the above-described exemplary methods can be examined by subsequent tier assay.
- the invention provides a screening process comprising: (a) introducing the library of recombinant polynucleotides into a population of test microorganisms such that the recombinant polynucleotides are expressed;
- the invention provides several methods for detecting and measuring catalytic properties encoded by the recombinant polynucleotides. These are exemplified by the following methods.
- Optimizing individual reactions and whole pathways for producing oxidized compounds, their derivatives, analogues and precursor compounds described in this invention can be monitored by virtually any analytic technique known in the art.
- the production of the desired compound is monitored using one or more techniques selected from thin layer chromatography (TLC), high performance liquid chromatography (HPLC), chiral HPLC, mass spectrometry, mass spectrometry coupled with a chromatographic separation modality, NMR spectroscopy, radioactivity detection from radioactively labeled compounds (e.g., olefins, diols, carboxylic acids, aldehydes,
- the preferred methods are selected from one or any combination of these methods.
- the methods of the invention are used to improve polypeptides that catalyze the initial oxidation of π-bonded species. Methods using dioxygenase-based pathways are encompassed herein.
- the oxidation product from the conversion of a substrate comprising a π-bond (e.g., arenes, alkylarenes, alkenes, etc.)
- the vicinal diol derived from oxidation of an olefin is quantitated using a radioactively labeled substrate.
- although any radioactive isotope commonly used in the art can be incorporated into a substrate, preferred isotopic labels include, for example, ¹⁴C and/or ³H. Differences in the volatility of the olefin substrate and the corresponding diol can be exploited to quantitate the radioactively labeled product.
- This method can easily be applied to aqueous samples of culture fluids obtained by incubating individual clones of cells expressing libraries of a recombinant polynucleotide obtained using the methods of the invention.
- cells expressing libraries of recombinant polynucleotides encoding a dioxygenase can be grown in a multiwell dish with a radioactive substrate administered directly to the aqueous medium. After incubation of the cells with the radioactive olefin substrate, any residual unconverted substrate is removed by evaporation, with or without application of vacuum. After removing the unconverted substrate, the culture fluid (or aliquots thereof) is mixed with a suitable scintillation cocktail, and the radioactivity in the samples is quantitatively measured. In a preferred embodiment, selection of the most active clones is based on the amount of radioactivity incorporated into the compounds produced by the organisms expressing the clone.
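Selecting the most active clones from the scintillation counts can be sketched as below. This is an illustrative sketch only: it assumes that some wells of the plate are no-cell controls used to estimate the background, and the function name, well labels and count values are hypothetical.

```python
def rank_clones_by_activity(cpm_by_well, background_wells, top_n):
    """Rank wells by background-corrected counts per minute (CPM).

    cpm_by_well: mapping of well ID to measured CPM after substrate evaporation.
    background_wells: IDs of assumed no-cell control wells.
    """
    background = sum(cpm_by_well[w] for w in background_wells) / len(background_wells)
    corrected = {
        well: max(cpm - background, 0.0)
        for well, cpm in cpm_by_well.items()
        if well not in background_wells
    }
    ranked = sorted(corrected.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_n]


# Hypothetical counts; A1 and A4 are assumed to be no-cell controls.
counts = {"A1": 120.0, "A2": 5200.0, "A3": 980.0, "A4": 80.0, "B1": 7600.0}
best = rank_clones_by_activity(counts, background_wells=["A1", "A4"], top_n=2)
print(best)
```

Because the unconverted olefin has been evaporated, the remaining counts are taken here as a proxy for non-volatile oxidation products; normalizing to cell density would be a natural refinement.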
- radioactively labeled substrate can be administered as a vapor phase to colonies growing on a surface of a membrane filter overlaying agar-solidified medium. After incubation, the membrane is removed from the agar surface, and any residual hydrocarbon is evaporated from the membrane. The membrane is autoradiographed, or a scintillation dye is sprayed over the membrane for radioactivity detection.
- a modification of this assay that is particularly suitable for ¹⁴C label detection in and/or around colonies capable of oxidizing π-bonds to the corresponding glycols involves using a porous membrane that has scintillation dye incorporated in the membrane composition by covalent or adsorption means. This assay is termed "scintillation proximity assay on membrane" or "SPA."
- a variation of SPA is used to selectively quantify the glycol derived from the substrate.
- This variation involves adding beads for scintillation proximity assay to the samples of culture fluids or extracts obtained by incubation of cells with radiolabeled substrate as described above.
- the sample can be applied to a membrane.
- the beads or membrane are functionalized with groups that interact with a glycol.
- the beads or membranes contain a suitable scintillating dye and their surfaces are modified by chemical groups that interact readily with diols.
- such beads and membranes can be prepared by known chemical methods from commercially available SPA materials, and they can be used to trap free diols directly in the aqueous medium or culture broths obtained by incubation of the microbial cells with the radiolabeled substrates.
- the surface of the beads used in this assay is functionalized with a sufficient amount of a compound that interacts with a glycol, such as compounds containing aryl or alkylboronate (boronic acid).
- Such beads can be obtained by chemical modification of commercially available SPA beads by reactions known to one skilled in the art.
- the reactions used to modify the beads are analogous to those used for the preparation of arylboronate-modified resins for solid-phase extraction or chromatography. After incubation, the beads are washed with a sufficient amount of water or other suitable solvent and subjected to quantitative determination of radioactivity.
- Samples of culture fluids, or extracts in an appropriate solvent, can be treated with known excess amounts of dilute solutions of, for example, a halogen (Cl₂, Br₂, I₂) or permanganate salts.
- the residual excess amount of those reagents, left after reaction with any substrate present, can be measured by chemical methods known in the art for determination of these compounds (see, for example, VOGEL'S PRACTICAL ORGANIC CHEMISTRY, 5th Ed., Furniss et al., Eds., Longman Scientific and Technical, Essex, 1989).
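The back-titration arithmetic implied above can be sketched as follows. This is an illustration under stated assumptions, not a validated procedure: it assumes one mole of reagent is consumed per mole of olefin (as for addition of Br₂ across a single C=C bond), that nothing else in the sample consumes the reagent, and that the amounts are expressed in micromoles. The function names and numbers are hypothetical.

```python
def olefin_remaining_umol(reagent_added, reagent_residual, reagent_per_olefin=1.0):
    """Residual olefin (umol) from a back-titration with an excess reagent.

    reagent_per_olefin is the assumed stoichiometry (1.0 for Br2 addition to
    one C=C bond); other oxidants such as permanganate need a different value.
    """
    consumed = reagent_added - reagent_residual
    if consumed < 0:
        raise ValueError("residual reagent exceeds the amount added")
    return consumed / reagent_per_olefin


def conversion_fraction(olefin_initial, olefin_remaining):
    """Fraction of the initial olefin that is no longer present."""
    return (olefin_initial - olefin_remaining) / olefin_initial


remaining = olefin_remaining_umol(reagent_added=50.0, reagent_residual=38.0)
converted = conversion_fraction(olefin_initial=20.0, olefin_remaining=remaining)
print(remaining, converted)
```

Note that the loss of olefin measured this way includes evaporation as well as enzymatic oxidation, so a no-cell control would be needed to separate the two.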
- Mass spectrometry can also be used to determine the amount of a vicinal glycol formed due to species encoded by the libraries of shuffled oxygenase genes. Mass spectrometric methods allow ion peaks to be detected. The ion peaks derived from the vicinal glycol can be readily distinguished from peaks derived from olefin substrates. In a preferred embodiment, coordination ion spray or electrospray mass spectrometry is utilized.
- a compound that interacts with a component of the mixture preferably the glycol
- the sample analyzed contains excess arylboronic or alkylboronic acid.
- Preferred boronic acids are those containing at least one nitrogen atom and include, but are not limited to, dansylaminophenylboronic acid, aminophenylboronic acid, and pyridylboronic acid.
- the ions detected in the mass spectrum derive from cyclic boronate ester derivatives of the glycols with a boronic acid.
- the samples are preferably analyzed in non-acidic and non-basic organic solvent or aqueous phase, substantially free of alcohols and other glycols.
- Other appropriate analytical conditions will be apparent to those of skill in the art.
- vicinal diols other than the analyte e.g., carbohydrates
- the periodate reagent can be used in solution, or preferably, immobilized on a solid phase (e.g. anion exchange resin).
- the amount of free aldehyde groups can be measured by a variety of assays known in the art.
- the aldehydes are quantitated by a method based on the formation of a colored hydrazone derivative.
- the free aldehydes obtained by this method can be trapped by aldehyde-reactive groups (e.g., free amines) on the surface of appropriately modified SPA beads or membranes.
- the substrate includes more than one π-bond (e.g., styrene, butadiene, etc.).
- one of the π-bonds undergoes reaction more readily than the other.
- the preferred method for making this determination is ¹H or ¹³C NMR, although other methods can be used.
- Other methods include, for example, chromatography (e.g., TLC, GC, HPLC, etc.), UV/vis spectroscopy and IR spectroscopy.
- the method of choice is flow-through ¹H or ¹³C NMR spectroscopy.
- the substrates are preferably labeled with ¹³C.
- π-bonded species can be synthesized from a ¹³C-enriched material to incorporate one, or any combination of several, labeled carbon atom(s) into the structure of these compounds (by synthetic methods known in the art as exemplified by Selifonov et al., in Appl. Environ. Microbiol. 64(4): 1447-53 (1998)).
- the enrichment levels for the labeled positions are preferably at least 5% of ¹³C, more preferably 50% and more preferably still 95% for any given labeled position. Incorporation of a ¹³C label provides a number of advantages, such as increasing the NMR signal and decreasing the time required for spectral acquisition.
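The benefit of enrichment can be estimated with a back-of-the-envelope calculation. The sketch below is not from the source: it assumes a natural ¹³C abundance of about 1.1%, that the signal at a labeled position scales linearly with the ¹³C fraction, and that signal-to-noise grows with the square root of the number of scans, so the acquisition time for a fixed signal-to-noise ratio falls with the square of the signal gain. Effects such as ¹³C-¹³C coupling in multiply labeled compounds are ignored.

```python
NATURAL_ABUNDANCE_13C = 0.011  # approximate natural 13C fraction (assumption)


def signal_gain(enrichment):
    """Signal at a labeled position relative to natural abundance."""
    return enrichment / NATURAL_ABUNDANCE_13C


def relative_acquisition_time(enrichment):
    """Acquisition time for equal signal-to-noise, relative to an unlabeled
    sample, assuming S/N scales with sqrt(number of scans)."""
    return 1.0 / signal_gain(enrichment) ** 2


for level in (0.05, 0.50, 0.95):
    print(f"{level:.0%} 13C: gain x{signal_gain(level):.1f}, "
          f"time x{relative_acquisition_time(level):.5f}")
```

Under these assumptions even the lowest preferred enrichment level (5%) shortens acquisition roughly twenty-fold, which is consistent with the use of unpurified, flow-through samples described in this section.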
- labeled compounds allow for a quantitative or semi-quantitative interpretation of the composition of a mixture of isomeric oxidation products.
- incubations with ¹³C-labeled olefins are conducted in multi-well plates, and aliquots of culture fluids or their extracts are sampled with an autosampler communicating with the NMR probe.
- the reaction components are not chromatographed or otherwise purified prior to obtaining an NMR spectrum.
- Determining the absolute configuration and the enantiomeric composition of the glycols formed from π-bonded species preferably employs a variation of the method described above for determining regioselectivity of dihydroxylation of the olefinic substrates by a dioxygenase using ¹H or ¹³C NMR.
- the substrates are labeled with ¹³C, and ¹³C NMR is employed.
- This method preferably involves the use of a chiral and essentially enantiomerically pure derivatizing reagent such as a substituted arylboronic acid which forms cyclic boronate derivatives with vicinal glycols, as known in the art (Burgess and Porte, Angew.
- both the substrates and one or more carbon atoms of the boronic acid are labeled with ¹³C.
- although many boronic acids are of use in the present invention, a currently preferred boronic acid is shown below:
- the absolute configuration of any chiral center of the compounds produced by the methods of the invention can be either R or S.
- the enantiomeric excess of the product is preferably 98% or more.
- NMR signals of different enantiomers of the reaction products can be distinguished in diastereomeric products using substantially enantiomerically pure boronate compounds as discussed above.
- the relative intensity of the NMR signals arising from corresponding atoms of the diastereomeric products can be used for estimating the enantiomeric composition of the product(s) present in the sample.
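The estimate of enantiomeric composition from relative signal intensities can be sketched as follows. This is a minimal illustration, not the patent's own procedure. It assumes the two integrals come from corresponding atoms of the two diastereomeric boronate derivatives and are directly proportional to the amounts of each enantiomer. The function names and the default 98% target (taken from the preferred enantiomeric excess stated above) are illustrative.

```python
def enantiomeric_excess(integral_a: float, integral_b: float) -> float:
    """Enantiomeric excess (%) from two diastereomer signal integrals."""
    total = integral_a + integral_b
    if integral_a < 0 or integral_b < 0 or total <= 0:
        raise ValueError("integrals must be non-negative with a positive sum")
    return 100.0 * abs(integral_a - integral_b) / total


def meets_target(integral_a: float, integral_b: float, target: float = 98.0) -> bool:
    """True if the estimated enantiomeric excess reaches the target (%)."""
    return enantiomeric_excess(integral_a, integral_b) >= target


# Hypothetical integrals for two diastereomeric signals.
print(enantiomeric_excess(99.0, 1.0))
print(meets_target(97.0, 3.0))
```

Integrals from well-resolved, fully relaxed signals give the most reliable ratio; overlapping peaks would need deconvolution before this calculation.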
- AHA formation from glycols: Among methods for specifically measuring the free AHAs produced in the biocatalytic process, those which are particularly preferred are methods using a variation of the scintillation proximity assay described above. These methods preferably use an excess of beads or membranes bearing one or more positively charged functional groups (e.g., quaternary, tertiary or primary amines). In preferred embodiments, these beads or membranes act as an anion exchange medium and they selectively trap free AHAs, thereby removing them from aqueous culture broths. In another preferred embodiment, this method employs a radioactively labeled starting material, or subsequent intermediate (e.g., glycol, epoxide, etc.). The radioactively labeled compound interacts with the beads or membrane.
- Prior to measuring the radioactivity associated with the beads or the membrane, non-specifically adsorbed label is preferably removed by evaporating excess radioactive compound and/or washing with an aqueous solution which does not cause elution of the AHAs from the anion-exchange beads or membrane.
- Preferred methods for determining the chirality and absolute configuration of AHAs formed in the described biotransformation process are substantially similar to those methods employed in making these determinations with respect to the glycols, as discussed above.
- a preferred analytical method is flow-through ¹H or ¹³C NMR spectroscopy.
- the aromatic substrate for oxidation by a dioxygenase is preferably labeled with the ¹³C isotope.
- Alkylaryl compounds or the corresponding arylalkanoic acids are synthesized by methods known in the art from a ¹³C-enriched material to incorporate one, or any combination of several, labeled carbon atom(s) into the structure of these compounds.
- the enrichment levels for any labeled position are preferably at least 5% ¹³C, and more preferably at least 95%.
- Incorporation of a ¹³C label increases the sensitivity of the NMR measurement, decreases the time required to acquire a spectrum per sample, and allows for quantitative or semi-quantitative interpretation of compositions of mixtures of isomeric oxidation products.
- incubations with ¹³C-labeled precursors are conducted in multi-well plates, and aliquots of culture fluids or their extracts are sampled with an autosampler connected to the solvent line passing through the NMR probe without any column separation.
- the absolute configuration of any chiral center may be either R or S.
- the enantiomeric excess is 98% or more.
- NMR signals of different enantiomers of HCAs can be distinguished in diastereomeric products using known methods, such as NMR in conjunction with lanthanide shift reagents.
- a variation of the SPA method is used.
- a solid support such as beads or a membrane containing a suitable scintillation dye is used.
- the solid support is modified with positively charged groups such that it acts like an anion-exchange material.
- These materials can be prepared from commercially available SPA materials and they can be used to trap free acids directly in the aqueous medium or culture broths obtained by incubation of the host cells with a radiolabeled alkylarene.
- Esters of AHAs: In the interest of brevity, the following discussion focuses on the determination of esters of AHAs. One of skill will appreciate that the same, or similar, methods can be used to determine esters of other compounds formed using the methods of the invention.
- Both spectroscopic and non-spectroscopic methods can be used to quantitate the extent of ester synthesis and to characterize the esters.
- the preferred non-spectroscopic method for assaying AHA methyl ester formation catalyzed by methyl transferases is based on the use of radioactively labeled precursors to AHA methyl esters.
- ¹⁴C or ³H methyl-labeled SAM (or its in-vivo precursor, methionine) can be used as a probe.
- the labeled substrate is the free α-hydroxycarboxylic acid itself.
- methyltransferases that are selective for a particular AHA enantiomer can be selected and further improved by iterative cycles of shuffling and this assay.
- the selectivity of the methyltransferases of the invention towards a particular enantiomeric configuration of an AHA is preferably measured using samples of the α-hydroxycarboxylic acids that are substantially enantiomerically pure.
- Host cells employed in this biocatalytic cycle will preferably lack AHA racemase activity (e.g., mandelate racemase).
- the two AHA enantiomers carry different radioactive labels, e.g., one enantiomer is labeled with ¹⁴C and the other with ³H (at one or more H positions which do not readily exchange with water).
- Measurement of the radioactivity incorporated into the product is performed using a radioactivity detector that allows for the selective measurement of at least two different isotopes. This variation allows the evaluation of the enantioselectivity of a methyltransferase in a single sample.
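The single-sample, dual-isotope readout can be turned into an enantioselectivity estimate as sketched below. This is an illustration under stated assumptions, not the patent's protocol. It assumes one enantiomer carries ¹⁴C and the other ³H, that the counts are already background-corrected and free of channel crossover, and that the specific activity of each labeled acid (dpm per nmol) is known. All names and numbers are hypothetical.

```python
def nmol_from_dpm(dpm: float, specific_activity: float) -> float:
    """Convert corrected counts (dpm) to nmol, given dpm per nmol."""
    if specific_activity <= 0:
        raise ValueError("specific activity must be positive")
    return dpm / specific_activity


def dual_label_selectivity(dpm_14c, dpm_3h, sa_14c, sa_3h):
    """Return (fraction of the 14C-labeled enantiomer, enantiomeric excess in %).

    Each isotope channel reports the ester formed from one enantiomer.
    """
    nmol_a = nmol_from_dpm(dpm_14c, sa_14c)
    nmol_b = nmol_from_dpm(dpm_3h, sa_3h)
    total = nmol_a + nmol_b
    if total <= 0:
        raise ValueError("no labeled product detected")
    fraction_a = nmol_a / total
    ee = 100.0 * abs(nmol_a - nmol_b) / total
    return fraction_a, ee


# Hypothetical counts and specific activities for one extracted well.
fraction, ee = dual_label_selectivity(9000.0, 2000.0, sa_14c=100.0, sa_3h=200.0)
print(f"14C-labeled enantiomer: {fraction:.2f} of product, ee = {ee:.1f}%")
```

Normalizing by specific activity matters because the two labeled acids are rarely supplied at the same dpm per nmol, so raw count ratios would misstate the product ratio.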
- the radioactivity associated with methyl esters of AHAs is preferably measured in samples which are obtained by selective extraction or partitioning of the methyl esters from neutral or moderately basic (pH about 6-10) aqueous culture samples. These samples can contain varying amounts of free, labeled AHA, AHA salts and other non-labeled organic compounds.
- the samples are preferably obtained by incubating individual clones expressing methyltransferase libraries with the labeled AHAs.
- the incubation medium is subsequently extracted by adding a defined amount of a preferably water-immiscible organic solvent, or by contacting the broth with an extraction medium (e.g., XAD-1180, or similar beads, or membrane).
- following its removal from contact with the broth, the extraction medium is preferably washed to remove adventitiously bound compounds.
- Preferred wash solutions are aqueous solutions that do not elute the AHA methyl esters from the extraction medium, but which remove other molecules adsorbed onto the medium.
- the radioactivity of the extracted material is then measured by methods well known in the art. In embodiments using beads or a membrane an appropriate scintillating dye is preferably used for detecting the radioactivity.
- Substantially similar methods can also be employed for detecting other neutral esters of AHAs, such as those exemplified by glycolides (e.g., XVI, Fig. 13) and esters of type XX.
- AHA-CoA esters of type XX.
- Variations on this method can include the use of a radioactively labeled alcohol (e.g., XIX) or any of its in-vivo metabolic precursors.
- the method for detecting polypeptide activity leading to the formation of neutral AHA esters employs UV or fluorescence spectroscopy.
- This method is applicable to those embodiments in which the transferase activity yields products exhibiting distinct UV and/or fluorescent characteristics.
- Exemplary compounds include, for example, substituted or non-substituted esters of aromatic carboxylic acids (e.g., mandelic acid).
- a solvent or solid-phase extraction under neutral or moderately basic conditions (pH about 6-12) is performed on the cell culture medium. Compounds thus isolated are detected by measurement of their UV absorption or fluorescence. These spectral parameters are evaluated to determine relative amounts and identities of the products formed by the transferase reactions.
- a. Screening for improved transferase activity
- the methods for detection of increased formation of monoacyl- and monoglycosyl-derivatives of, for example, glycols and α-hydroxycarboxylic acids include methods in which physical differences between the substrates, the cis-diols and the derivatives arising from the transferase-catalyzed reactions are measured.
- Preferred methods include HPLC and mass-spectrometry.
- In a high-throughput modality, a method of choice is mass-spectrometry, preferably coordination ion and/or electrospray mass-spectrometry.
- For acyl transferases, another presently preferred method uses a labeled acyl-donor precursor, e.g., a labeled carboxylic acid or its derivative, administered to the cells that express libraries of shuffled genes encoding acyl ligases and/or acyl transferases, e.g., acyl-CoA ligases and acyl-CoA transferases.
- the amount of label in the hydrophobic reaction products is measured after extraction of the labeled derivatives into a suitable organic solvent, or after solid-phase extraction of these compounds by addition of a sufficient amount of hydrophobic porous resin beads (e.g., XAD 1180, XAD-2, -4, -8).
- scintillating dye can be present in the organic solvent, added to the samples, or chemically incorporated in the bead polymer. The latter constitutes a modification of the scintillation proximity assay method.
- Methods for detecting regioselectivity of transferase reactions include HPLC, and in an HTP modality, flow-through NMR spectroscopy.
- when NMR spectroscopy is used for determining relative amounts of different regiomeric monoacyl or monoglycosyl derivatives of oxidized substrates, the latter are preferably obtained by action of the arene dioxygenases on isotopically (¹³C and/or ²H) labeled substrate.
- Another variation of the NMR technique includes use of isotopically labeled precursors of acyl- or glycosyl-donor intermediates.
- 7. Selecting for enhanced organic solvent resistance.
- Selection for recombinant polynucleotides that provide improved organic solvent resistance can be accomplished by introducing a library of recombinant polynucleotides into a population of microorganism cells and subjecting the population to a medium that contains various concentrations of the organic hydrophobic compounds of interest.
- the medium can contain, for example, carbon, nitrogen and minerals, and preferably does not otherwise limit growth and viability of the cells in the absence of the solvent, thus ensuring that solvent resistance is essentially the only limiting factor affecting growth of the cells expressing variants of the genes encoding solvent resistance traits.
- one can employ a screening strategy to identify those recombinant polynucleotides that encode polypeptides that confer improved solvent resistance.
- reporter gene such as those encoding fluorescent proteins (exemplified by the green fluorescent protein, GFP).
- those reporter genes are used which display their function in a fashion dependent on availability of intracellular reducing pools, such as NADH and NADPH, and essentially unimpaired ribosomal biosynthesis of proteins.
- lux bacterial luciferase gene clusters
- a variety of methods can be used to detect and to pick or to enrich for the clones with the most efficient solvent-resistance traits as judged by display of the properties associated with the in-vivo reporter genes. These methods include, for example, fluorescence-activated cell sorting of liquid cell suspensions (e.g., cells that express GFP) and CCD camera imaging of individual colonies grown on a solid(ified) medium (e.g., for cells that express lux).
- the invention provides a bioreactor system for carrying out biotransformations using the improved polypeptides of the invention.
- the bioreactor includes: (a) an improved dioxygenase polypeptide of the invention; (b) a redox partner source; (c) oxygen; and (d) a substrate for oxidation.
- the dioxygenase polypeptide is an arene dioxygenase polypeptide.
- the bioreactor further includes another useful polypeptide, such as a transferase, ligase, dehydrogenase and the like.
- the additional useful polypeptide(s) can be co-expressed by a host cell also expressing the improved dioxygenase or expressed by a host cell that does not express the improved dioxygenase.
- each of the polypeptides incorporated into the reactor can be provided as a constituent of a whole cell preparation, a polypeptide extract or as a substantially pure polypeptide.
- the cells and/or polypeptides are optionally in suspension, in solution, or immobilized on an insoluble matrix, bead or other particle. Additional considerations are discussed below. This discussion is intended as illustrative and not limiting. Other bioreactor formats, conditions, etc. will be apparent to those of skill in the art. General growth conditions for culturing the particular organisms are obtained from depositories and from texts known in the art such as BERGEY'S MANUAL OF SYSTEMATIC BACTERIOLOGY, Vol. 1, N. R. Krieg, ed., Williams and Wilkins, Baltimore/London (1984).
- the nutrient medium for the growth of any oxidizing microorganism should contain sources of assimilable carbon and nitrogen, as well as mineral salts.
- Suitable sources of assimilable carbon and nitrogen include, but are not limited to, complex mixtures, such as those constituted by biological products of diverse origin, for example soy bean flour, cotton seed flour, lentil flour, pea flour, soluble and insoluble vegetable proteins, corn steep liquor, yeast extract, peptones and meat extracts.
- Additional sources of nitrogen are ammonium salts and nitrates, such as ammonium chloride, ammonium sulfate, sodium nitrate and potassium nitrate.
- the nutrient medium should include, but is not limited to, the following ions: Mg²⁺, Na⁺, K⁺, Ca²⁺, NH₄⁺, Cl⁻, SO₄²⁻, PO₄³⁻ and NO₃⁻, and also ions of the trace elements such as Cu, Fe, Mn, Mo, Zn, Co and Ni.
- the preferred sources of these ions are mineral salts. If these salts and trace elements are not present in sufficient amounts in the complex constituents of the nutrient medium or in the water used, it is appropriate to supplement the nutrient medium accordingly.
- the microorganisms employed in the process of the invention can be in the form of fermentation broths, whole washed cells, concentrated cell suspensions, polypeptide extracts, and immobilized polypeptides and/or cells.
- concentrated cell suspensions, polypeptide extracts, and whole washed cells are used with the process of the invention (S. A. White and G. W. Claus, J. Bacteriology, 150:934-943 (1982)).
- Methods for immobilizing polypeptides and cells are well known in the art and include such techniques as microencapsulation, attachment to alginate beads, cross-linked polyurethane, starch particles, polyacrylamide gels and the use of coacervates, which are aggregates of colloidal droplets.
- the polypeptide and/or cell is immobilized onto glass particles having a porous outer surface, such as those described in Dubin et al., U.S. Patent No. 5,922,531, issued July 13, 1999.
- Concentrated washed cell suspensions may be prepared as follows: the microorganisms are cultured in a suitable nutrient solution, harvested (for example by centrifuging) and suspended in a smaller volume (in salt or buffer solutions, such as physiological sodium chloride solution or aqueous solutions of potassium phosphate, sodium acetate, sodium maleate, magnesium sulfate, or simply in tap water, distilled water or nutrient solutions).
- the substrate is then added to a cell suspension of this type and the oxidation reaction according to the invention is carried out under the conditions described.
- the conditions for oxidizing a substrate in growing microorganism cultures or fractionated cell extracts are advantageous for carrying out the process according to the invention with concentrated cell suspensions.
- the temperature range is from about 0 °C to about 45 °C and the pH range is from about 2 to about 10.
- washed or immobilized cells can simply be added to a solution of substrate, without any nutrient medium present.
- the extracts can be crude extracts, such as obtained by conventional digestion of microorganism cells.
- Methods to break up cells include, but are not limited to, mechanical disruption, physical disruption, chemical disruption, and enzymatic disruption. Such means to break up cells include ultrasonic treatments, passages through French pressure cells, grindings with quartz sand, autolysis, heating, osmotic shock, alkali treatment, detergents, or repeated freezing and thawing.
- the processes according to the invention are to be carried out with partially purified polypeptide extract preparations
- the methods of protein chemistry such as ultracentrifuging, precipitation reactions, ion exchange chromatography or adsorption chromatography, gel filtration or electrophoretic methods, can be employed to obtain such preparations.
- additional reactants such as physiological or synthetic electron acceptors, like NAD⁺, NADP⁺, methylene blue, dichlorophenolindophenol, tetrazolium salts and the like.
- When these reactants are used, they can be employed either in equimolar amounts (concentrations which correspond to that of the substrate employed) or in catalytic amounts (concentrations which are markedly below the chosen concentration of substrate). If, when using catalytic amounts, it is to be ensured that the process according to the invention is carried out approximately quantitatively, a system which continuously regenerates the reactant which is present only in a catalytic amount must also be added to the reaction mixture.
- This system can be, for example, a polypeptide which ensures reoxidation (in the presence of oxygen or other oxidizing agents) of an electron acceptor which is reduced in the course of the reaction according to the invention.
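The difference between equimolar and catalytic use of an electron acceptor can be made concrete with a short calculation. The sketch below is illustrative and is not part of the patent. It assumes one acceptor molecule is reduced per substrate molecule oxidized, and the function names and concentrations are hypothetical.

```python
def required_turnovers(substrate_mM: float, acceptor_mM: float,
                       conversion: float = 1.0) -> float:
    """Number of times each acceptor molecule must be regenerated on average.

    Assumes 1:1 stoichiometry between substrate oxidized and acceptor reduced.
    """
    if substrate_mM <= 0 or acceptor_mM <= 0:
        raise ValueError("concentrations must be positive")
    if not 0.0 < conversion <= 1.0:
        raise ValueError("conversion must be a fraction in (0, 1]")
    return substrate_mM * conversion / acceptor_mM


def needs_regeneration(substrate_mM: float, acceptor_mM: float,
                       conversion: float = 1.0) -> bool:
    """True when the acceptor supply cannot cover the target conversion."""
    return required_turnovers(substrate_mM, acceptor_mM, conversion) > 1.0


# Hypothetical case: 10 mM substrate with 0.1 mM acceptor (catalytic amount).
print(required_turnovers(10.0, 0.1))
print(needs_regeneration(10.0, 10.0))
```

A turnover requirement above one is the situation in which a regenerating system, such as the reoxidizing polypeptide mentioned above, has to be present for near-quantitative conversion.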
- nutrient media can be solid, semi-solid or liquid.
- Aqueous-liquid nutrient media are preferably employed when media is used.
- Suitable media and suitable conditions for cultivation include known media and known conditions to which substrate can be added.
- the substrate to be oxidized in the process of the invention can be added to the base nutrient medium either on its own or as a mixture with one or more oxidizable compounds. Additional oxidizable compounds which can be used include polyols, such as sorbitol or glycerol.
- the substrate to be oxidized can be added either prior to inoculation or at any desired subsequent time (between the early log phase and the late stationary growth phase).
- the oxidizing organism is preferably pre-cultured with the oxidizable compounds.
- the inoculation of the nutrient media is effected by a variety of methods including slanted tube cultures and flask cultures.
- Contamination of the reaction solution should be avoided.
- sterilization of the nutrient media, sterilization of the reaction vessels and sterilization of the air required for aeration are preferably undertaken. It is possible to use, for example, steam sterilization or dry sterilization for sterilization of the reaction vessels.
- the air and the nutrient media can likewise be sterilized by steam or by filtration. Heat sterilization of the reaction solution containing the substrate is also possible.
- the process of the invention can be carried out under aerobic conditions using shake flasks or aerated and agitated tanks.
- the process is carried out by the aerobic submersion procedure in tanks, for example in conventional fermentors. It is possible to carry out the process continuously or with batch or fed batch modes, preferably the batch mode.
- foam control agents such as liquid fats and oils, oil-in-water emulsions, paraffins, higher alcohols (such as octadecanol), silicone oils, polyoxyethylene compounds and polyoxypropylene compounds, can be added. Foam can also be suppressed or eliminated with the aid of mechanical devices.
- the toluene dioxygenase genes todC1C2BA (Zylstra G. J. and D. T. Gibson. (1989) J. Biol. Chem. 264: 14940-14946) were used as a substrate for shuffling a family of genes.
- the transformed cells were plated onto LB (Luria Bertani) agar plates containing ampicillin and incubated at 37°C for 20 hr. Restriction digests were used to determine that 23 out of the 24 randomly selected library clones contained sequences that were chimeras of the two parental sequences.
- TIER 1 ASSAY: The library was screened for active clones using a plate assay which detects the oxidation of indole to indigo by an active dioxygenase enzyme (Ensley et al. (1983) Science 222: 167-169).
- the library was plated onto LB agar plates containing ampicillin and incubated at 37°C for 20 hr, after which the plates were placed at 23°C and incubated for an additional 24-48 hr.
- Clones expressing active dioxygenases produced colored colonies from accumulation of indigo. The color ranged from blue to blue-grey and the intensity of the color varied from clone to clone depending on the level of enzyme activity.
- the library was determined to be 70% active.
- the colored colonies were picked using a Q-bot (Genetix) into 384-well plates containing LB ampicillin and incubated at 37°C with shaking for 20 hr.
- the library plates were stored after the addition of sterile glycerol (10% final concentration) at -20°C until further screening.
- the following high-throughput (HTP) assay was developed and used to identify shuffled dioxygenases with improved oxidation of p-xylene to p-xylene diol.
- the assay is not limited to the substrate, p-xylene, and can be used for any volatile or toxic (to the test organism) substrate.
- Library clones are grown to saturation in 96-well plate(s) with 250 µl 2xYT containing ampicillin and 0.2% glucose.
- the plate(s) are inoculated with 3 µl/well inoculum.
- the plate(s) are incubated at 250 rpm, 37°C, with 85-90% humidity for 20 hr.
- the cultures are subcultured into deep 96-well induction plate(s) containing 0.5 ml 2xYT containing ampicillin and 1 mM IPTG/well.
- the plate(s) are incubated at 37°C, 250 rpm, with 85-90% humidity, 6 hr.
- the induced cells are harvested by centrifugation at 3000 rpm, 10 min., 10°C.
- the cells are washed once with minimal media and resuspended to a final volume of 0.4 ml.
- the cells are transferred to an assay plate(s) containing the volatile or toxic substrate to be tested embedded in solidified 1% agarose on the bottom of each well of the plate(s).
- the assay plate(s) are sealed and incubated at 23°C for 1 hr with vigorous shaking.
- the cell suspension is transferred to a clean polypropylene 96-well plate(s) and the cells pelleted by centrifugation at 3000 rpm, 10 min, at 10°C.
- the cell-free supernatants are transferred to 96-well plate(s) and the diol product analyzed by spectrophotometric methods, HPLC, or GC-MS.
- Potential positive clones are those clones with improved properties when compared to the best parent.
- the nine clones and the parents were tested after growth in two types of media: 2xYT and minimal media with fructose as the carbon source. Of the nine clones, six were reconfirmed as 1.7-2.8-fold improved over the best parent.
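Ranking clones against the best parent can be sketched as below. This is an illustrative example only. It assumes each clone and parent has a single product signal from the same assay readout (for example an HPLC peak area for the diol), and the clone names, parent names and values are hypothetical rather than data from the patent.

```python
def fold_improvement(clone_signal: float, parent_signals: dict) -> float:
    """Clone signal divided by the signal of the best-performing parent."""
    best_parent = max(parent_signals.values())
    if best_parent <= 0:
        raise ValueError("best parent signal must be positive")
    return clone_signal / best_parent


def confirmed_hits(clone_signals: dict, parent_signals: dict,
                   threshold: float = 1.0) -> dict:
    """Clones whose fold improvement exceeds the threshold, best first."""
    folds = {
        name: fold_improvement(signal, parent_signals)
        for name, signal in clone_signals.items()
    }
    ranked = sorted(folds.items(), key=lambda item: item[1], reverse=True)
    return {name: fold for name, fold in ranked if fold > threshold}


# Hypothetical readouts in arbitrary units.
parents = {"parent_1": 1.0, "parent_2": 0.5}
clones = {"clone_a": 2.8, "clone_b": 0.9, "clone_c": 1.7}
for name, fold in confirmed_hits(clones, parents).items():
    print(f"{name}: {fold:.1f}-fold over best parent")
```

Comparing to the best parent rather than to the parental average is the stricter choice, and repeating the comparison in each growth medium mirrors the reconfirmation step described above.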
- Kits will optionally additionally include instructions for performing methods or assays, packaging materials, one or more containers which contain assay, device or system components, or the like.
- the present invention provides kits embodying the methods and apparatus herein.
- Kits of the invention optionally include one or more of the following: (1) a shuffled component as described herein; (2) instructions for practicing the methods described herein, and/or for operating the selection procedure herein; (3) one or more dioxygenase assay components; (4) a container for holding dioxygenase nucleic acids or polypeptides, other nucleic acids, transgenic plants, animals, cells, or the like; and (5) packaging materials.
- the present invention provides for the use of any composition or kit herein, for the practice of any method or assay herein, and/or for the use of any apparatus or kit to practice any assay or method herein.
- the kit of the invention includes one or more improved dioxygenase polypeptides of the invention.
- the kit includes a library of improved dioxygenase polypeptides.
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MXPA01013201A MXPA01013201A (en) | 1999-08-12 | 2000-08-11 | Dna shuffling of dioxygenase genes for production of industrial chemicals. |
EP00953983A EP1208193A1 (en) | 1999-08-12 | 2000-08-11 | Dna shuffling of dioxygenase genes for production of industrial chemicals |
CA002377669A CA2377669A1 (en) | 1999-08-12 | 2000-08-11 | Dna shuffling of dioxygenase genes for production of industrial chemicals |
IL14738400A IL147384A0 (en) | 1999-08-12 | 2000-08-11 | Dna shuffling of dioxygenase genes for production of industrial chemicals |
AU66343/00A AU6634300A (en) | 1999-08-12 | 2000-08-11 | Dna shuffling of dioxygenase genes for production of industrial chemicals |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14885099P | 1999-08-12 | 1999-08-12 | |
US60/148,850 | 1999-08-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2001012791A1 (en) | 2001-02-22 |
Family
ID=22527694
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2000/022038 WO2001012791A1 (en) | 1999-08-12 | 2000-08-11 | Dna shuffling of dioxygenase genes for production of industrial chemicals |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP1208193A1 (en) |
AU (1) | AU6634300A (en) |
CA (1) | CA2377669A1 (en) |
IL (1) | IL147384A0 (en) |
MX (1) | MXPA01013201A (en) |
WO (1) | WO2001012791A1 (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001092247A2 (en) * | 2000-05-31 | 2001-12-06 | Maxygen, Inc. | Preparation of 4-hydroxy-3[2h]-furanones |
WO2005040376A2 (en) | 2003-10-23 | 2005-05-06 | C-Lecta Gmbh | Method from the selection of biomolecules from biomolecule variant libraries |
EP1616017A2 (en) * | 2003-04-14 | 2006-01-18 | E. I. du Pont de Nemours and Company | A method for producing para-hydroxystyrene and other multifunctional aromatic compounds using two-phase extractive fermentation |
DE102004038154A1 (en) * | 2004-08-06 | 2006-03-16 | Maxens Gmbh | Technologically produced dihydrocoumarin |
WO2018005655A3 (en) * | 2016-06-30 | 2018-02-22 | Zymergen Inc. | Methods for generating a bacterial hemoglobin library and uses thereof |
US10047358B1 (en) | 2015-12-07 | 2018-08-14 | Zymergen Inc. | Microbial strain improvement by a HTP genomic engineering platform |
US10544411B2 (en) | 2016-06-30 | 2020-01-28 | Zymergen Inc. | Methods for generating a glucose permease library and uses thereof |
US11208649B2 (en) | 2015-12-07 | 2021-12-28 | Zymergen Inc. | HTP genomic engineering platform |
US11293029B2 (en) | 2015-12-07 | 2022-04-05 | Zymergen Inc. | Promoters from Corynebacterium glutamicum |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114561432A (en) * | 2020-11-27 | 2022-05-31 | 中国科学院天津工业生物技术研究所 | Ring opening method of aromatic compound |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997035966A1 (en) * | 1996-03-25 | 1997-10-02 | Maxygen, Inc. | Methods and compositions for cellular and metabolic engineering |
WO1998027230A1 (en) * | 1996-12-18 | 1998-06-25 | Maxygen, Inc. | Methods and compositions for polypeptide engineering |
2000
- 2000-08-11 AU AU66343/00A patent/AU6634300A/en not_active Abandoned
- 2000-08-11 MX MXPA01013201A patent/MXPA01013201A/en unknown
- 2000-08-11 EP EP00953983A patent/EP1208193A1/en not_active Withdrawn
- 2000-08-11 IL IL14738400A patent/IL147384A0/en unknown
- 2000-08-11 WO PCT/US2000/022038 patent/WO2001012791A1/en not_active Application Discontinuation
- 2000-08-11 CA CA002377669A patent/CA2377669A1/en not_active Abandoned
Cited By (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001092247A2 (en) * | 2000-05-31 | 2001-12-06 | Maxygen, Inc. | Preparation of 4-hydroxy-3[2h]-furanones |
WO2001092247A3 (en) * | 2000-05-31 | 2002-08-01 | Maxygen Inc | Preparation of 4-hydroxy-3[2h]-furanones |
EP1616017A2 (en) * | 2003-04-14 | 2006-01-18 | E. I. du Pont de Nemours and Company | A method for producing para-hydroxystyrene and other multifunctional aromatic compounds using two-phase extractive fermentation |
EP1646712A2 (en) * | 2003-04-14 | 2006-04-19 | E. I. du Pont de Nemours and Company | A method for preparing para-hydroxystyrene by biocatalytic decarboxylation of para-hydroxycinnamic acid in a biphasic reaction medium |
EP1616017A4 (en) * | 2003-04-14 | 2008-03-19 | Du Pont | A method for producing para-hydroxystyrene and other multifunctional aromatic compounds using two-phase extractive fermentation |
EP1646712A4 (en) * | 2003-04-14 | 2008-10-22 | Du Pont | A method for preparing para-hydroxystyrene by biocatalytic decarboxylation of para-hydroxycinnamic acid in a biphasic reaction medium |
WO2005040376A2 (en) | 2003-10-23 | 2005-05-06 | C-Lecta Gmbh | Method from the selection of biomolecules from biomolecule variant libraries |
WO2005040376A3 (en) * | 2003-10-23 | 2005-07-14 | Univ Leipzig | Method from the selection of biomolecules from biomolecule variant libraries |
DE102004038154A1 (en) * | 2004-08-06 | 2006-03-16 | Maxens Gmbh | Technologically produced dihydrocoumarin |
US10457933B2 (en) | 2015-12-07 | 2019-10-29 | Zymergen Inc. | Microbial strain improvement by a HTP genomic engineering platform |
US11155807B2 (en) | 2015-12-07 | 2021-10-26 | Zymergen Inc. | Automated system for HTP genomic engineering |
US10336998B2 (en) | 2015-12-07 | 2019-07-02 | Zymergen Inc. | Microbial strain improvement by a HTP genomic engineering platform |
US11352621B2 (en) | 2015-12-07 | 2022-06-07 | Zymergen Inc. | HTP genomic engineering platform |
US11312951B2 (en) | 2015-12-07 | 2022-04-26 | Zymergen Inc. | Systems and methods for host cell improvement utilizing epistatic effects |
US11293029B2 (en) | 2015-12-07 | 2022-04-05 | Zymergen Inc. | Promoters from Corynebacterium glutamicum |
US10647980B2 (en) | 2015-12-07 | 2020-05-12 | Zymergen Inc. | Microbial strain improvement by a HTP genomic engineering platform |
US10745694B2 (en) | 2015-12-07 | 2020-08-18 | Zymergen Inc. | Automated system for HTP genomic engineering |
US10808243B2 (en) | 2015-12-07 | 2020-10-20 | Zymergen Inc. | Microbial strain improvement by a HTP genomic engineering platform |
US10883101B2 (en) | 2015-12-07 | 2021-01-05 | Zymergen Inc. | Automated system for HTP genomic engineering |
US10968445B2 (en) | 2015-12-07 | 2021-04-06 | Zymergen Inc. | HTP genomic engineering platform |
US11085040B2 (en) | 2015-12-07 | 2021-08-10 | Zymergen Inc. | Systems and methods for host cell improvement utilizing epistatic effects |
US11155808B2 (en) | 2015-12-07 | 2021-10-26 | Zymergen Inc. | HTP genomic engineering platform |
US10047358B1 (en) | 2015-12-07 | 2018-08-14 | Zymergen Inc. | Microbial strain improvement by a HTP genomic engineering platform |
US11208649B2 (en) | 2015-12-07 | 2021-12-28 | Zymergen Inc. | HTP genomic engineering platform |
US10544390B2 (en) | 2016-06-30 | 2020-01-28 | Zymergen Inc. | Methods for generating a bacterial hemoglobin library and uses thereof |
US10544411B2 (en) | 2016-06-30 | 2020-01-28 | Zymergen Inc. | Methods for generating a glucose permease library and uses thereof |
WO2018005655A3 (en) * | 2016-06-30 | 2018-02-22 | Zymergen Inc. | Methods for generating a bacterial hemoglobin library and uses thereof |
Also Published As
Publication number | Publication date |
---|---|
MXPA01013201A (en) | 2002-06-04 |
IL147384A0 (en) | 2002-08-14 |
AU6634300A (en) | 2001-03-13 |
CA2377669A1 (en) | 2001-02-22 |
EP1208193A1 (en) | 2002-05-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6605430B1 (en) | | DNA shuffling of monooxygenase genes for production of industrial chemicals |
US7670825B2 (en) | | Method for enhancing production of isoprenoid compounds |
US6500639B2 (en) | | DNA shuffling to produce nucleic acids for mycotoxin detoxification |
Wong et al. | | Sensitive assay for laboratory evolution of hydroxylases toward aromatic and heterocyclic compounds |
Gao et al. | | Efficient biosynthesis of (2 S)-eriodictyol from (2 S)-naringenin in Saccharomyces cerevisiae through a combination of promoter adjustment and directed evolution |
US7723498B2 (en) | | Directed evolution of recombinant monooxygenase nucleic acids and related polypeptides and methods of use |
JP2001197895A (en) | | Method and composition for cell technology and metabolism technology |
WO2001042455A1 (en) | | Directed evolution of biosynthetic and biodegration pathways |
US20020072097A1 (en) | | Molecular breeding of transposable elements |
JP6562950B2 (en) | | Dreamenol synthase and method for producing dreammenol |
EP1208193A1 (en) | | Dna shuffling of dioxygenase genes for production of industrial chemicals |
JP2017525380A (en) | | Recombinant microorganism producing alkene from acetyl CoA |
Wang et al. | | Genetic characterization of enzymes involved in the priming steps of oxytetracycline biosynthesis in Streptomyces rimosus |
Hua et al. | | Offloading role of a discrete thioesterase in type II polyketide biosynthesis |
Zielinski et al. | | Generation of novel-substrate-accepting biphenyl dioxygenases through segmental random mutagenesis and identification of residues involved in enzyme specificity |
EP2749644B1 (en) | | Recombinant host cell for biosynthetic production of vanillin |
Zhou et al. | | Versatile CYP98A enzymes catalyse meta‐hydroxylation reveals diversity of salvianolic acids biosynthesis |
US20140370557A1 (en) | | Genetically engineered microbes and methods for producing 4-hydroxycoumarin |
WO2001068803A2 (en) | | Enzymes, pathways and organisms for making a polymerizable monomer by whole cell bioprocess |
US20230051453A1 (en) | | Biosynthetic platform for the production of olivetolic acid and analogues of olivetolic acid |
Stierle et al. | | P450 in C–C coupling of cyclodipeptides with nucleobases |
US20040023348A1 (en) | | Microbiological process for enantioselective (S)-hydroxylation |
CN1378598A (en) | | Evolution and use of enzymes for combinatorial and medicinal chemistry |
WO2019052907A2 (en) | | Method for detecting hydrogen peroxide conversion activity of an enzyme |
Kadow | | Baeyer-Villiger monooxygenases involved in camphor degradation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | AK | Designated states | Kind code of ref document: A1; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
 | AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
 | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
 | DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | |
 | WWE | Wipo information: entry into national phase | Ref document number: 2377669; Country of ref document: CA |
 | WWE | Wipo information: entry into national phase | Ref document number: PA/a/2001/013201; Country of ref document: MX |
 | WWE | Wipo information: entry into national phase | Ref document number: 66343/00; Country of ref document: AU |
 | WWE | Wipo information: entry into national phase | Ref document number: 2000953983; Country of ref document: EP |
 | WWP | Wipo information: published in national office | Ref document number: 2000953983; Country of ref document: EP |
 | REG | Reference to national code | Ref country code: DE; Ref legal event code: 8642 |
 | NENP | Non-entry into the national phase | Ref country code: JP |
 | WWW | Wipo information: withdrawn in national office | Ref document number: 2000953983; Country of ref document: EP |