Methods for the targeted integration of nucleotide sequences into a plant are provided. Transfer cassettes comprising nucleotide sequences of interest flanked by non-identical recombination sites are used to transform a plant comprising a target site. The target site contains at least a set of non-identical recombination sites corresponding to those on the transfer cassette. Exchange of the nucleotide sequences flanked by the recombination sites is effected by a recombinase.

Claim:

That which is claimed:

1. A soybean plant cell having stably incorporated into its genome a transfer cassette comprising a nucleotide sequence of interest flanked by a first and a secondnon-identical recombination site, wherein said first and said second non-identical recombination site can recombine in said soybean plant cell with their identical recombination site in the presence of an appropriate recombinase, wherein said first andsaid second non-identical recombination site comprises a FRT site or a mutant FRT site.

2. The soybean plant cell of claim 1, wherein said first and said second non-identical recombination sites can recombine with their identical recombination sites in the presence of the appropriate recombinase when their identical recombinationsites are sexually crossed into said soybean plant cell.

3. The soybean plant cell of claim 1, wherein said non-identical recombination sites comprises at least a FRT1 site (SEQ ID NO:1), a FRT6 site (SEQ ID NO:4), a FRT7 site (SEQ ID NO:5) or a FRT5 site (SEQ ID NO:3).

5. The soybean plant cell of claim 1, wherein said soybean plant cell has stably incorporated into its genome a polynucleotide comprising in the following order, at least one expression cassette comprising a second nucleotide sequence ofinterest and the transfer cassette.

6. The soybean plant cell of claim 1, wherein said plant cell has stably incorporated into its genome a polynucleotide comprising in the following order, at least one expression cassette comprising a second nucleotide sequence of interest, apromoter, the transfer cassette comprising the nucleotide sequence of interest operably linked to said promoter.

7. The soybean plant cell of claim 1, wherein said nucleotide sequences of interest is selected from the group consisting of a sequence that influences plant growth, the susceptibly of the plant to disease, the susceptibly of the plant toinsects, and the nutritional value of the plant.

8. A soybean plant having stably incorporated into its genome a transfer cassette comprising a nucleotide sequence of interest flanked by a first and a second non-identical recombination site, wherein said first and said second non-identicalrecombination site can recombine in said soybean plant with their identical recombination site in the presence of an appropriate recombinase, wherein said first and said second non-identical recombination site comprises a FRT site or a mutant FRT site.

9. The soybean plant of claim 8, wherein said first and said second non-identical recombination sites can recombine with their identical recombination sites in the presence of the appropriate recombinase when their identical recombination sitesare sexually crossed into said soybean plant.

10. The soybean plant of claim 8, wherein said non-identical recombination sites comprises at least a FRT1 site (SEQ ID NO:1), a FRT6 site (SEQ ID NO:4), a FRT7 site (SEQ ID NO:5) or a FRT5 site (SEQ ID NO:3).

11. The soybean plant of claim 8, wherein the genome further comprises an expression cassette comprising a nucleotide sequence encoding a FLP recombinase.

12. A transformed soybean seed having stably incorporated into its genome a transfer cassette comprising a nucleotide sequence of interest flanked by a first and a second non-identical recombination site, wherein said first and said secondnon-identical recombination site can recombine with their identical recombination site in the presence of an appropriate recombinase, wherein said first and said second non-identical recombination site comprises a FRT site or a mutant FRT site.

13. The soybean plant of claim 8, wherein said soybean plant has stably incorporated into its genome a polynucleotide comprising in the following order, at least one expression cassette comprising a second nucleotide sequence of interest andthe transfer cassette.

14. The soybean plant of claim 8, wherein said soybean plant has stably incorporated into its genome a polynucleotide comprising in the following order, at least one expression cassette comprising a second nucleotide sequence of interest, apromoter, the transfer cassette comprising the nucleotide sequence of interest operably linked to said promoter.

Description:

REFERENCE TO A SEQUENCE LISTING SUBMITTED AS A TEXT FILE VIA EFS-WEB

The official copy of the sequence listing is submitted concurrently with the specification as a text file via EFS-Web, in compliance with the American Standard Code for Information Interchange (ASCII), with a file name of 396930seqlist.txt, acreation date of Oct. 20, 2010, and a size of 2.98 Kb. The sequence listing filed via EFS-Web is part of the specification and is hereby incorporated in its entirety by reference herein.

FIELD OF THE INVENTION

The invention relates to the genetic modification of plants. Particularly, the control of gene integration and expression in plants is provided.

BACKGROUND OF THE INVENTION

Genetic modification techniques enable one to insert exogenous nucleotide sequences into an organism's genome. A number of methods have been described for the genetic modification of plants. All of these methods are based on introducing aforeign DNA into the plant cell, isolation of those cells containing the foreign DNA integrated into the genome, followed by subsequent regeneration of a whole plant. Unfortunately, such methods produce transformed cells that contain the introducedforeign DNA inserted randomly throughout the genome and often in multiple copies.

The random insertion of introduced DNA into the genome of host cells can be lethal if the foreign DNA happens to insert into, and thus mutate, a critically important native gene. In addition, even if a random insertion event does not impair thefunctioning of a host cell gene, the expression of an inserted foreign gene may be influenced by "position effects" caused by the surrounding genomic DNA. In some cases, the gene is inserted into sites where the position effects are strong enough toprevent the synthesis of an effective amount of product from the introduced gene. In other instances, overproduction of the gene product has deleterious effects on the cell.

Transgene expression is typically governed by the sequences, including promoters and enhancers, which are physically linked to the transgene. Currently, it is not possible to precisely modify the structure of transgenes once they have beenintroduced into plant cells. In many applications of transgene technology, it would be desirable to introduce the transgene in one form, and then be able to modify the transgene in a defined manner. By this means, transgenes could be activated orinactivated where the sequences that control transgene expression can be altered by either removing sequences present in the original transgene or by inserting additional sequences into the transgene.

For higher eukaryotes, homologous recombination is an essential event participating in processes like DNA repair and chromatid exchange during mitosis and meiosis. Recombination depends on two highly homologous extended sequences and severalauxiliary proteins. Strand separation can occur at any point between the regions of homology, although particular sequences may influence efficiency. These processes can be exploited for a targeted integration of transgenes into the genome of certaincell types.

Even with the advances in genetic modification of higher plants, the major problems associated with the conventional gene transformation techniques have remained essentially unresolved as to the problems discussed above relating to variableexpression levels due to chromosomal position effects and copy number variation of transferred genes. For these reasons, efficient methods are needed for targeting and control of insertion of nucleotide sequences to be integrated into a plant genome.

SUMMARY OF THE INVENTION

Compositions and methods for the targeted integration of nucleotide sequences into a transformed plant are provided. The compositions comprise transfer cassettes which are flanked by non-identical recombination sites.

The methods find use in targeting the integration of nucleotide sequences of interest to a specific chromosomal site, finding optimal integration sites in a plant genome, comparing promoter activity in transformed plants, engineering chromosomalrearrangements, and other genetic manipulation of plants.

Novel minimal recombination sites (FRT) are provided for use in the methods of the invention. Also provided are targeting cassettes and transgenic plants and plant cells containing corresponding non-identical recombination sites.

Compositions and methods for the directional, targeted integration of exogenous nucleotides into a transformed plant are provided. The methods use novel recombination sites in a gene targeting system which facilitates directional targeting ofdesired genes and nucleotide sequences into corresponding recombination sites previously introduced into the target plant genome.

In the methods of the invention, a nucleotide sequence flanked by two non-identical recombination sites is introduced into the target organism's genome establishing a target site for insertion of nucleotide sequences of interest. Once a stableplant or cultured tissue is established a second construct, or nucleotide sequence of interest, flanked by corresponding recombination sites as those flanking the target site, is introduced into the stably transformed plant or tissues in the presence ofa recombinase protein. This process results in exchange of the nucleotide sequences between the non-identical recombination sites of the target site and the transfer cassette.

It is recognized that the transformed plant may comprise multiple target sites; i.e., sets of non-identical recombination sites. In this manner, multiple manipulations of the target site in the transformed plant are available. By target sitein the transformed plant is intended a DNA sequence that has been inserted into the transformed plant's genome and comprises non-identical recombination sites.

The two-micron plasmid found in most naturally occurring strains of Saccharomyces cerevisiae, encodes a site-specific recombinase that promotes an inversion of the DNA between two inverted repeats. This inversion plays a central role in plasmidcopy-number amplification. The protein, designated FLP protein, catalyzes site-specific recombination events. The minimal recombination site (FRT, SEQ ID NO:1) has been defined and contains two inverted 13-base pair (bp) repeats surrounding anasymmetric 8-bp spacer. The FLP protein cleaves the site at the junctions of the repeats and the spacer and is covalently linked to the DNA via a 3' phosphate.

Site specific recombinases like FLP cleave and relegate DNA at specific target sequences, resulting in a precisely defined recombination between two identical sites. To function, the system needs the recombination sites and the recombinase. Noauxiliary factors are needed. Thus, the entire system can be inserted into and function in plant cells.

The yeast FLP\FRT site specific recombination system has been shown to function in plants. To date, the system has been utilized for excision of unwanted DNA. See, Lyznik et al. (1993) Nucleic Acid Res. 21:969-975. In contrast, the presentinvention utilizes non-identical FRTs for the exchange, targeting, arrangement, insertion and control of expression of nucleotide sequences in the plant genome.

To practice the methods of the invention, a transformed organism of interest, particularly a plant, containing a target site integrated into its genome is needed. The target site is characterized by being flanked by non-identical recombinationsites. A targeting cassette is additionally required containing a nucleotide sequence flanked by corresponding non-identical recombination sites as those sites contained in the target site of the transformed organism. A recombinase which recognizes thenon-identical recombination sites and catalyzes site-specific recombination is required.

It is recognized that the recombinase can be provided by any means known in the art. That is, it can be provided in the organism or plant cell by transforming the organism with an expression cassette capable of expressing the recombinase in theorganism, by transient expression; or by providing messenger RNA (mRNA) for the recombinase or the recombinase protein.

By "non-identical recombination sites" is intended that the flanking recombination sites are not identical in sequence and will not recombine or recombination between the sites will be minimal. That is, one flanking recombination site may be aFRT site where the second recombination site may be a mutated FRT site. The non-identical recombination sites used in the methods of the invention prevent or greatly suppress recombination between the two flanking recombination sites and excision of thenucleotide sequence contained therein. Accordingly, it is recognized that any suitable non-identical recombination sites may be utilized in the invention, including FRT and mutant FRT sites, FRT and LOX sites, LOX and mutant LOX sites, as well as otherrecombination sites known in the art.

By suitable non-identical recombination site implies that in the presence of active recombinase, excision of sequences between two non-identical recombination sites occurs, if at all, with an efficiency considerably lower than therecombinationally-mediated exchange targeting arrangement of nucleotide sequences into the plant genome. Thus, suitable non-identical sites for use in the invention include those sites where the efficiency of recombination between the sites is low; forexample, where the efficiency is less than about 30 to about 50%, preferably less than about 10 to about 30%, more preferably less than about 5 to about 10%.

As noted above, the recombination sites in the targeting cassette correspond to those in the target site of the transformed plant. That is, if the target site of the transformed plant contains flanking non-identical recombination sites of FRTand a mutant FRT, the targeting cassette will contain the same FRT and mutant FRT non-identical recombination sites.

It is furthermore recognized that the recombinase, which is used in the invention, will depend upon the recombination sites in the target site of the transformed plant and the targeting cassette. That is, if FRT sites are utilized, the FLPrecombinase will be needed. In the same manner, where lox sites are utilized, the Cre recombinase is required. If the non-identical recombination sites comprise both a FRT and a lox site, both the FLP and Cre recombinase will be required in the plantcell.

The FLP recombinase is a protein that catalyzes a site-specific reaction that is involved in amplifying the copy number of the two micron plasmid of S. cerevisiae during DNA replication. FLP protein has been cloned and expressed. See, forexample, Cox (1993) Proc. Natl. Acad. Sci. U.S.A. 80:4223-4227. The FLP recombinase for use in the invention may be that derived from the genus Saccharomyces. It may be preferable to synthesize the recombinase using plant preferred codons foroptimum expression in a plant of interest. See, for example, U.S. application Ser. No. 08/972,258 filed Nov. 18, 1997, entitled "Novel Nucleic Acid Sequence Encoding FLP Recombinase" now U.S. Pat. No. 5,929,301, herein incorporated by reference.

Where appropriate, the nucleotide sequences to be inserted in the plant genome may be optimized for increased expression in the transformed plant. Where mammalian, yeast, or bacterial genes are used in the invention, they can be synthesizedusing plant preferred codons for improved expression. It is recognized that for expression in monocots, dicot genes can also be synthesized using monocot preferred codons. Methods are available in the art for synthesizing plant preferred genes. See,for example, U.S. Pat. Nos. 5,380,831, 5,436,391, and Murray et al. (1989) Nucleic Acids Res. 17:477-498, herein incorporated by reference.

The plant preferred codons may be determined from the codons utilized more frequently in the proteins expressed in the plant of interest. It is recognized that monocot or dicot preferred sequences may be constructed as well as plant preferredsequences for particular plant species. See, for example, EPA 0359472; EPA 0385962; WO 91/16432; Perlak et al. (1991) Proc. Natl. Acad. Sci. USA, 88:3324-3328; and Murray et al. (1989) Nucleic Acids Research, 17: 477-498. U.S. Pat. No. 5,380,831;U.S. Pat. No. 5,436,391; and the like, herein incorporated by reference. It is further recognized that all or any part of the gene sequence may be optimized or synthetic. That is, fully optimized or partially optimized sequences may also be used.

Additional sequence modifications are known to enhance gene expression in a cellular host and can be used in the invention. These include elimination of sequences encoding spurious polyadenylation signals, exon-intron splice site signals,transposon-like repeats, and other such well-characterized sequences, which may be deleterious to gene expression. The G-C content of the sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genesexpressed in the host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.

wherein the nucleotides within the brackets indicate the spacer region. The nucleotides in the spacer region can be replaced with a combination of nucleotides, so long as the two 13-base repeats are separated by eight nucleotides. It appearsthat the actual nucleotide sequence of the spacer is not critical, however for the practice of the invention, some substitutions of nucleotides in the space region may work better than others.

The eight base pair spacer is involved in DNA-DNA pairing during strand exchange. The asymmetry of the region determines the direction of site alignment in the recombination event, which will subsequently lead to either inversion or excision. As indicated above, most of the spacer can be mutated without a loss of function. See, for example, Schlake and Bode (1994) Biochemistry 33:12746-12751, herein incorporated by reference.

Novel FRT mutant sites are provided for use in the practice of the methods of the present invention. Such mutant sites may be constructed by PCR-based mutagenesis. While mutant FRT sites (SEQ ID NOS:2, 3, 4 and 5) are provided herein, it isrecognized that other mutant FRT sites may be used in the practice of the invention. The present invention is not the use of a particular FRT or recombination site, but rather that non-identical recombination sites or FRT sites can be utilized fortargeted insertion and expression of nucleotide sequences in a plant genome. Thus, other mutant FRT sites can be constructed and utilized based upon the present disclosure.

As discussed above, bringing genomic DNA containing a target site with non-identical recombination sites together with a vector containing a transfer cassette with corresponding non-identical recombination sites, in the presence of therecombinase, results in recombination. The nucleotide sequence of the transfer cassette located between the flanking recombination sites is exchanged with the nucleotide sequence of the target site located between the flanking recombination sites. Inthis manner, nucleotide sequences of interest may be precisely incorporated into the genome of the host.

It is recognized that many variations of the invention can be practiced. For example, target sites can be constructed having multiple non-identical recombination sites. Thus, multiple genes or nucleotide sequences can be stacked or ordered atprecise locations in the plant genome. Likewise, once a target site has been established within the genome, additional recombination sites may be introduced by incorporating such sites within the nucleotide sequence of the transfer cassette and thetransfer of the sites to the target sequence. Thus, once a target site has been established, it is possible to subsequently add sites, or alter sites through recombination.

Another variation includes providing a promoter or transcription initiation region operably linked with the target site in an organism. Preferably, the promoter will be 5' to the first recombination site. By transforming the organism with atransfer cassette comprising a coding region, expression of the coding region will occur upon integration of the transfer cassette into the target site. This embodiment provides for a method to select transformed cells, particularly plant cells, byproviding a selectable marker sequence as the coding sequence.

Other advantages of the present system include the ability to reduce the complexity of integration of trans-genes or transferred DNA in an organism by utilizing transfer cassettes as discussed above and selecting organisms with simpleintegration patterns. In the same manner, preferred sites within the genome can be identified by comparing several transformation events. A preferred site within the genome includes one that does not disrupt expression of essential sequences andprovides for adequate expression of the transgene sequence.

The methods of the invention also provide for means to combine multiple cassettes at one location within the genome. See, for example, FIG. 1. Recombination sites may be added or deleted at target sites within the genome.

Any means known in the art for bringing the three components of the system together may be used in the invention. For example, a plant can be stably transformed to harbor the target site in its genome. The recombinase may be transientlyexpressed or provided. Alternatively, a nucleotide sequence capable of expressing the recombinase may be stably integrated into the genome of the plant. In the presence of the corresponding target site and the recombinase, the transfer cassette,flanked by corresponding non-identical recombination sites, is inserted into the transformed plant's genome.

Alternatively, the components of the system may be brought together by sexually crossing transformed plants. In this embodiment, a transformed plant, parent one, containing a target site integrated in its genome can be sexually crossed with asecond plant, parent two, that has been genetically transformed with a transfer cassette containing flanking non-identical recombination sites, which correspond to those in plant one. Either plant one or plant two contains within its genome a nucleotidesequence expressing recombinase. The recombinase may be under the control of a constitutive or inducible promoter.

The compositions and methods of the invention find use in targeting the integration of transferred nucleotide sequences to a specific chromosomal site. The nucleotide sequence may encode any nucleotide sequence of interest. Particular genes ofinterest include those which provide a readily analyzable functional feature to the host cell and/or organism, such as marker genes, as well as other genes that alter the phenotype of the recipient cells, and the like. Thus, genes effecting plantgrowth, height, susceptibility to disease, insects, nutritional value, and the like may be utilized in the invention. The nucleotide sequence also may encode an "antisense" sequence to turn off or modify gene expression.

It is recognized that the nucleotide sequences will be utilized in a functional expression unit or cassette. By functional expression unit or cassette is intended, the nucleotide sequence of interest with a functional promoter, and in mostinstances a termination region. There are various ways to achieve the functional expression unit within the practice of the invention. In one embodiment of the invention, the nucleic acid of interest is transferred or inserted into the genome as afunctional expression unit. Alternatively, the nucleotide sequence may be inserted into a site within the genome which is 3' to a promoter region. In this latter instance, the insertion of the coding sequence 3' to the promoter region is such that afunctional expression unit is achieved upon integration.

For convenience, for expression in plants, the nucleic acid encoding target sites and the transfer cassettes, including the nucleotide sequences of interest, can be contained within expression cassettes. The expression cassette will comprise atranscriptional initiation region, or promoter, operably linked to the nucleic acid encoding the peptide of interest. Such an expression cassette is provided with a plurality of restriction sites for insertion of the gene or genes of interest to beunder the transcriptional regulation of the regulatory regions.

The transcriptional initiation region, the promoter, may be native or homologous or foreign or heterologous to the host, or could be the natural sequence or a synthetic sequence. By foreign is intended that the transcriptional initiation regionis not found in the wild-type host into which the transcriptional initiation region is introduced. Either a native or heterologous promoter may be used with respect to the coding sequence of interest.

The expression cassettes may contain one or more than one gene or nucleic acid sequence to be transferred and expressed in the transformed plant. Thus, each nucleic acid sequence will be operably linked to 5' and 3' regulatory sequences. Alternatively, multiple expression cassettes may be provided.

Generally, the expression cassette will comprise a selectable marker gene for the selection of transformed cells. Selectable marker genes are utilized for the selection of transformed cells or tissues.

The methods of the invention can also be utilized to find optimal integration sites within a plant genome. In this manner, a plant is transformed with an expression cassette comprising a selectable marker gene. The expression cassette is atarget site as the marker gene is flanked by non-identical recombination sites. Transformed protoplast, tissues, or whole plants can be tested to determine the levels of activity of the inserted gene. By comparison of cellular activities of the gene indifferent insertion sites, preferred integration sites may be found wherein the gene is expressed at high or acceptable levels. These plants can then be utilized with subsequent retargeting techniques to replace the marker gene with other genes ornucleotide sequences of interest. In the same manner, multiple genes may be inserted at the optimal site for expression. See, for example, FIG. 2 which sets forth one scheme for gene stacking utilizing site-specific integration using the FRT/FLPsystem.

A variety of genetic manipulations are available using the compositions of the present invention including, for example, comparing promoter activity in a transformed plant. Prior to the present invention, promoter activity could not accuratelybe assessed and compared because the chimeric genes were inserted at different locations within the plant genome. Such chromosomal locations affected activity. By utilizing the methods of the present invention, a direct comparison of promoter activityin a defined chromosomal context is possible. Thus, using the methods, enhanced activity of genes can be achieved by selecting optimal chromosomal sites as well as optimal promoters for expression in the plant cell.

The cells which have been transformed may be grown into plants in accordance with conventional approaches. See, for example, McCormick et al. (1986) Plant Cell Reports, 5:81-84. These regenerated plants may then be pollinated with either thesame transformed strain or different strains, and the resulting hybrid having the desired phenotypic characteristic identified. Two or more generations may be grown to ensure that the subject phenotypic characteristic is stably maintained and inheritedand then seeds harvested to ensure the desired phenotype or other property has been achieved.

It is recognized that any means of transformation may be utilized for the present invention. However, for inserting the target site within the transformed plant, Agrobacterium-mediated transformation may be preferred. Agrobacterium-mediatedtransformation generally tends to insert a lower copy number of transferred DNA than does particle bombardment or other transformation means.

The following examples are offered by way of illustration and not by way of limitation.

EXPERIMENTAL

The general present invention provides a procedure for using existing and novel FRT sites in a new gene targeting system which facilitates directional retargeting of desired genes into FRT sites previously introduced in the target organism'sgenome. The novel FRT sites differ from previously described FRT sites in the sequence of the 8 by spacer regions of the FRT sites. Previous publications also have shown that in the presence of FLP protein, recombination of sequences between two FRTsites occurs efficiently only with two identical FRT sites. See for example Umlauf and Cox (1988) Embo J. 7:1845-1852; Schlake and Bode (1994) Biochem. 33:12746-12751. To use the invention, a gene or DNA sequence is flanked by two non-identical FRTsites and introduced into a target organism's genome. The enclosed gene can be a selectable marker, thereby allowing selection for successfully introduced sequences. Molecular characterization confirms integration of desired sequences includingcomplete FRT sites. Listed below are generic examples of vector constructions useful in practicing the invention:

A. FRTa-P1-G1-T1-FRTb

B. FRTa-P1-G1-T1-FRTa

C. FRTb-P1-G1-T1-FRTb

D. P1-FRTa-G1-T1-FRTb

E. P1-FRTa-G1-T1-FRTa

F. P1-FRTb-G1-T1-FRTb

G. P1-ATG::FRTa::G1(noATG)-T1-P2-G2-T2-FRTb

H. P1-ATG::FRTa::G1(noATG)-T1-P2-G2-T2-FRTb-P3-G3-T3

I. P1-ATG::FRTa::G1(noATG)-T1-FRTa::G2(noATG)-T2-FRTb

J. P1-ATG::FRTa::G1(noATG)-T1-FRTa::G2(noATG)-T2-FRTb-P3-G3-T3

K. P1-FRTa-G1-T1-P2-G2-T2-FRTb

L. P1-FRTa-G1-T1-P2-G2-T2-FRTb-P3-G3-T3

M. P1-FRTa-G1-T1-FRTa-G2-T2-FRTb

N. P1-FRTa-G1-T1-FRTa-G2-T2-FRTb-P3-G3-T3

Variations thereof may be constructed with other promoters, genes, terminators or FRT sites.

FRTa and FRTb are two examples of non-identical FRT sites. P1, P2 and P3 are different promoters, G1, G2, and G3 are different genes, T1, T2 and T3 are different terminators. ATG is the start of translation codon for the subsequent gene. Thedesignation noATG indicates that particular gene is devoid of the ATG translation start codon. The symbol :: implies a fusion between adjacent elements, and where used between ATG, FRT and a gene, implies that the sequences are put together to generatean in frame translation fusion that results in a properly expressed and functional gene product.

A to F are preferred configurations for testing new FRT sites for ability to recombine sequences between them; the desired situation being that when two of the same site are used, recombination is efficient and that when two different sites areused, no recombination between them takes place in the presence of FLP protein. G to J are preferred configurations for general use in developing lines for retargeting. It is understood that any number of genes or other combinations of sequences can beassembled for use as part of this invention. K to N are possible configurations that could be used also.

Once a stable plant or cultured tissue is established with one of the constructs above, a second construct flanked by the same FRT sites used to flank the sequences in the first construct above is introduced into the stably transformed tissuesin conjunction with the expression of FLP protein. The new vector constructs can be, but are not limited to the following:

O. FRTa::G1 (noATG)-T1-FRTb

P. FRTa::G1 (noATG)-T1-P2-G2-T2-FRTb

Q. FRTa-G1-T1-FRTb

R. FRTa-G1-T1-P2-G2-T2-FRTb

The FLP protein can be supplied by a) co-transforming with a plasmid carrying a gene encoding FLP; b) co-introducing FLP mRNA or protein directly; c) using a line for the initial transformation that expresses FLP either constitutively orfollowing induction; or d) growing out the plants carrying the initial targeted vectors, crossing to plants that express active FLP protein and selecting events in the progeny.

As a working example, sequence O above is introduced into a line containing a copy of sequence G stably integrated in the genome, in the presence of functional FLP protein. Recombination takes place between identical FRT sites such that thesequence between FRT sites in O replaces the sequence between the corresponding FRT sites of sequence G, thereby yielding a directionally targeted reintegrated new sequence. The new gene in O is now driven off of the P1 promoter in G. The purpose fordesigning some of the constructs without an ATG start codon on the gene is so that if random integration occurs, there is an extremely low probability of expression of the introduced gene, since in order for this to happen, the fragment would need tointegrate behind an endogenous promoter region and in the correct reading frame. This would occur extremely rarely and our data to date have yielded no examples of this happening using a sequence such as O where the contained gene is the easily scorableGUS gene. One requirement for each gene to be constructed in this way (i.e., no ATG on the gene but with the ATG upstream of the FRT site) is the demonstration that the gene can tolerate a fusion of the FRT sequence between the ATG codon and the secondcodon of the protein. To date this has worked for quite a number but not all genes; in the latter cases the other form of the construct retaining the ATG (for example Q) could be used. All of the sequences listed above are expected to work in thisscheme, some at different frequencies or efficiencies than others.

One problem this strategy addresses is limitations with current transformation approaches, particularly in plants, where delivery of DNA into cells or nuclei and subsequent integration in the genome occurs more or less randomly andunpredictably. This is particularly true with particle bombardment methods; arguments have been made that Agrobacterium-based methods tend to deliver T-DNA border-flanked sequences to more actively transcribed regions of the genome, but beyond that theprocess is still largely random. Therefore, for commercial product development, large numbers (estimates of <200) of events need to be generated in order to identify one event: a) that expresses at the desired level; b) where the gene product isfunctional and efficacious; c) which has a simple integration complexity to facilitate breeding; d) which does not contain extraneous sequences posing possible regulatory concerns; e) which maintains stability in expression over generations; f) mostimportantly, which does not have a negative impact on agronomic performance characteristics when carried through a breeding program involving introgression of the trait into different genetic backgrounds. Resource utilization is very large and soschemes that can markedly reduce the resource demand would be very beneficial to production of larger numbers of desired final products.

Example 1

Creation of Novel Non-Identical FRT Sites

DNA fragments containing novel FRT sequences were constructed either by synthesizing, annealing and ligating complementary oligonucleotides or by creating primers for PCR amplification (Mullis and Faloona, 1987) of a DNA product containing thenew FRT sequence near the 5' end of the PCR product. The newly constructed FRT product includes flanking restriction sites useful for cloning into plant expression units. In general, the 5' end is flanked by an NheI site and a terminal NcoI site. TheNcoI site includes the bases ATG, which are advantageously used in newly developed vector constructs as the recognition sequence to initiate an open reading frame. In sequence-based constructs designated noATG/FRT, the NheI site is used for cloningthereby eliminating the upstream ATG in the process. At the 3' end of the FRT sequence, a restriction site is included enabling unique identification of the individual spacer sequences. As specific examples, the wild type FRT site (designated FRT1here) is cloned with a flanking BglII site, the FRT5 site (spacer TTCAAAAG) (nt 39-46 of SEQ ID NO:3) has a ScaI site, the FRT6 site (spacer TTCAAAAA) (nt 36-49 of SEQ ID NO:4) has an AatII site, and the FRT7 site (spacer TTCAATAA) (nt 36-46 of SEQ IDNO:5) has an SpeI site. The outermost flanking restriction site is an XhoI site and is used to clone a gene of interest into the open reading frame.

The structures and sequences of the FRT sites as designed and/or used in the present invention example are depicted below with positions of restriction sites, repeats and spacer regions indicated.

Based on the design of FRT sites as described above, PCR or standard mutagenesis protocols were used to create an XhoI site overlapping the start of a gene sequence to be used for cloning downstream of the FRT site, thereby converting the ATGstart codon to GTG. Ligation of an FRT to the mutated gene sequence at XhoI creates a new open reading frame initiating 5' to the FRT. A second FRT sequence can be cloned downstream of the terminator using a variety of methods including PCR orligation. The FRT/gene/terminator/FRT unit can then be used to make target or substrate constructs.

Targets are created by inserting a promoter at the NcoI site upstream of the first FRT. This maintains a complete open reading frame of the FRT/gene fusion. These target constructs are for use in transformation experiments to create desirable"target lines". Substrate vectors are constructed by cloning with the NheI site to truncate the start codon of the FRT/gene unit, thereby eliminating the proper open reading frame. These substrate vectors are used in experiments designed to retarget anew gene flanked by FRT sites into the corresponding FRT sites previously introduced in the target lines. In either case, to create multiple gene cassettes, additional promoter/gene/terminator units are inserted between the terminator and the second FRTin either target or substrate molecules.

Example 3

Demonstration of Functionality of Novel FRT Sites and Requirement for Two Identical Sites for Efficient Recombination of DNA Sequences Positioned Between Two FRT Sites

Plasmids containing two identical or two different FRT sequences were assayed for efficiency of recombination of sequences between the FRT sites by transformation into 294-FLP, a version of the E. coli strain MM294 with FLP recombinaseintegrated into the lacZ locus (Buchholz et al. 1996). Strains were grown overnight at 37.degree. C. with shaking, allowing for constitutive expression of FLP recombinase in the cultures. The plasmid DNA was isolated using standard procedures anddigested with restriction enzymes that create novel restriction fragments following FLP mediated recombination. The extent of recombination between FRT sites was estimated by examining banding patterns on an agarose gel. Table 1 summarizes data fromthe gel analysis.

TABLE-US-00003 TABLE 1 Target Site Combination Extent of Recombination FRT1 and FRT1 Complete FRT5 and FRT5 Extensive, but partially incomplete FRT6 and FRT6 Complete FRT7 and FRT7 Complete FRT1 and FRT5 No recombination FRT1 and FRT6 Norecombination FRT1 and FRT7 No recombination FRT5 and FRT6 No recombination FRT5 and FRT7 No recombination FRT6 and FRT7 Very small amount of recombination

The results from these studies indicate that excision of sequences between identical FRT sites occurs with high efficiency in general (FRT5, SEQ ID NO:3, appeared to be less efficient overall than FRT1, SEQ ID NO:2, or the novel FRT6, SEQ IDNO:4, and FRT 7, SEQ ID NO:5, sites). As importantly, recombination with two different FRT sites was absent, or at least undetectable under the conditions of this assay for all combinations but FRT6, SEQ ID NO:4, and FRT7, SEQ ID NO:5, where a smalldegree of recombination was noted. These data provided strong support for the potential utility of non-identical FRT sites in developing a directional gene integration system. A point to note is that because recombination of sequences between twoidentical FRT sites can occur with different efficiencies depending on the specific FRT site used (e.g., FRT5, SEQ ID NO:3, in the present experiment), the design of constructs for directional targeted integration may require judicious selection of pairsof FRT sites to optimize for the desired recombination efficiency or to avoid any unwanted recombination.

Example 4

Introduction of DNA Sequences which Include Novel Non-Identical FRT Sites into Plant Cells, Generation and Recovery of Stable Transgenic Events ("Target Lines"), Preservation of "Target Lines" and Regeneration of Plants

The second expression units have the maize ubiquitin promoter or alternatively either the enhanced or the standard versions of the cauliflower mosaic virus 35S promoter. The standard 35S promoter includes bases -421 to +2 (from Gardner et al.(1981) Nucl. Acids Res. 9:2871-2888), and the enhanced version has a duplication of bases -421 to -90 upstream of this standard 35S promoter. The 79 by tobacco mosaic virus leader O' (Gallie et al. (1987) Nucl. Acids Res. 15:3257-3273) is inserteddownstream of the 35S promoter followed by the first intron of the maize alcohol dehydrogenase ADH1-S gene (Dennis et al. (1984) Nucl. Acids Res. 12:3983-3990). Coding sequences in these second expression units include either mo-PAT, bar (Thompson etal. (1987) EMBO J. 6:2519-2523), or HM1 (Johal and Briggs, Science 258:985-987) genes followed by either the PinII terminator or the 35S terminator (nucleotides 7487-7639 in Gardner et al. (1981) Nucl. Acids Res. 9:2871-2888). Varying FRT sites areligated downstream of the terminators as shown in the table. A third expression unit is present in PHP9643 and has an FRT1/GFPm fusion cloned using the flanking NheI site of FRT1 (SEQ ID NO:2) to remove the ATG start codon of GFPm, thereby making itnon-functional in the existing construct, but where correct excision of sequences between FRT1 (SEQ ID NO:2) sites can bring the GFPm in frame with the ubiquitin promoter and ATG of the first expression unit, thereby making it functional. Downstream ofGFPm is the PinII terminator followed by an FRT5 sequence (SEQ ID NO:3).

PHP9643 was cloned into a pUC derived plasmid backbone. All other vectors were cloned into a pSB11 (See, for example, EPA0672752A1, EPA0604662A1, EPA0687730A1 and U.S. Pat. No. 5,591,616) type plasmid with the expression units containedbetween the TDNA border sequences. All are oriented with expression unit one adjacent to the right border. The pSB11-based plasmids were integrated into the super binary plasmid pSB1 (See, for example, EPA0672752A 1, EPA0604662A1, EPA0687730A1 and U.S. Pat. No. 5,591,616) by homologous recombination between the two plasmids. E. coli strain HB101 containing the pSB11 derivatives was mated with Agrobacterium strain LBA4404 harboring pSB1 to create the cointegrate plasmids PHP10616, PHP11407, PHP11410,PHP11457, PHP11599, PHP11893 and PHP14220 in Agrobacterium (by the method of Ditta et al. (1980) Proc. Natl. Acad. Sci. USA 77:7347-7351). The cointegrates were verified by Agrobacterium resistance to spectinomycin and SalI restriction digests.

Table 2 also includes one example of a vector for creating a target line where the FRT sites are inserted in the maize ubiquitin intron (last entry) as an alternative location for placement of FRT or other target sites.

Following selection of stably transformed events, samples of these target lines were cryopreserved as a supply for future experiments using the approach described by Peterson (see application Ser. No. 08/859,313, now U.S. Pat. No. 6,143,563). For several but not all events, another sample callus from several of the stable transgenic events was grown, transferred onto regeneration medium to induce plantlet formation and plants were subsequently recovered and grown to maturity (Register et al.(1994) Plant Mol. Biol. 25:951-961).

Example 5

Demonstration of Functionality of Novel FRT Sites in Plants

(A) Excision of DNA Sequences Between Two Identical FRT Sites, but not when Flanked by Two Non-Identical FRT Sequences

The extent of intra-plasmid recombination was examined in plants using the FRT excision constructs described in Table 3 below. The vectors PHP10968, PHP10998, PHP10969, PHP11272, PHP11243, PHP11244, PHP12140, PHP12141, PHP12156, and PHP12157were constructed by ligating the maize Ubiquitin promoter upstream of FRT sequences using NcoI or other sites that maintained the ATG start codon. The FRT sequence was fused in frame at the flanking XhoI site to a GFPm sequence containing a serine tothreonine mutation at amino acid residue 65 in the wild type sequence (new sequence termed GFPm-S65T). The pinII terminator was cloned downstream of GFPm. The second expression unit consists of a promoterless FRT, cloned with the 5' flanking NheI siteto remove the ATG start codon, fused in frame to the GUS coding sequence (Jefferson et al. (1986) Proc. Natl. Acad. Sci. USA 83: 8447-8451) and followed by the pinII terminator. The vector backbone is a pUC derived plasmid in all cases. Experimentswere conducted by bombarding the indicated plasmids into maize cells along with construct PHP5096, which carries a functional expression cassette for FLP protein. PHP5096, the FLPm expression vector that was used in experiments with the excision andsubstrate vectors, consists of the maize Ubiquitin promoter cloned upstream of the FLPm coding sequence (U.S. patent application Ser. No. 08/972,258 to "Novel Nucleic Acid Sequence Encoding FLP Recombinase" now U.S. Pat. No. 5,929,301) and the pinIIterminator in a pUC derived plasmid backbone. In each case, successful excision would remove intervening sequences between the indicated FRT sites thereby bringing an inactive uidA (GUS) gene in frame with and in proximity to the ubiquitin promoterresulting in GUS activity. If excision does not occur, no GUS expression is expected. The results for GUS expression from these experiments are indicated in Table 4 below. In these studies efficient excision occurred only where constructs containedtwo identical FRT sites. In the case of the FRT6 (SEQ ID NO:4) and FRT7 (SEQ ID NO:5) combination, a small amount of recombination was observed, again emphasizing the need for testing target site combinations and judiciously selecting appropriatecombinations for the application.

B) Transient Integration of a Second DNA Sequence Flanked by Two Non-Identical FRT Sequences into Plant Cells

Summarized in Table 5 below are data from experiments in which target lines created using the plasmids described in Table 2 were bombarded with a substrate plasmid containing a GUS reporter gene flanked by the corresponding FRT sites used in thetarget constructs. This experiment measured the ability to detect transient GUS expression shortly after introduction of the substrate plasmid. Since there is no promoter in front of the first coding sequence in the substrate plasmids, randomintegration, unless occurring in frame behind an appropriate regulatory sequence elsewhere in the genome, would not result in GUS expression. This assay system then evaluates the ability to target FRT-flanked genes into FRT sites in the genome. Ingeneral, FRT substrate vectors (Table 6) are constructed as promoterless FRT/gene fusions cloned using the 5' flanking NheI site of the FRT to remove the ATG start codon. Genes fused in frame to the FRT with the flanking XhoI site include one of severalscorable or selectable marker genes such as aadA (Svab et al. (1990) Plant Mol. Biol. 14: 197-205), uidA, GFPm, GFPm-C3/intron or bar and are followed by a pinII terminator. In some cases (PHP10259, PHP10603, PHP11561, and PHP11633), plasmids contain asingle expression unit and the second heterologous FRT site is cloned downstream of the pinII terminator. Substrate plasmids PHP10859, PHP10997, PHP11204, PHP11699, and PHP12190 have in addition to the first expression unit described above, a secondunit consisting of the maize ubiquitin promoter, the enhanced 35S promoter or a chimeric promoter consisting of the 35S enhancer region cloned upstream of a synthetic core promoter termed Rsyn7 (U.S. Pat. No. 6,072,050 which is a continuation in partof U.S. patent application Ser. No. 08/661,601 filed Jun. 11, 1996 now abandoned) cloned upstream of either the HM1, aadA, GUS, or bar coding sequences and the pinII terminator. A heterologous FRT is inserted downstream of the second terminator. Finally, PHP11003 and PHP11809 contain three expression units. The first unit is a promoterless noATG/FRT/gene fusion as described above, the second unit contains either the chimeric 35S enhancer/Rsyn7 promoter described above or the ZmdJ1 promoter(Baszczynski et al. (1997) Maydica 42:189-201) cloned upstream of the GUS coding sequence and the pinII terminator. The third expression unit consists of the maize ubiquitin promoter cloned upstream of the HM1 coding sequence, pinII terminator and aheterologous FRT sequence. All FRT substrate vectors are cloned into a pUC derived plasmid backbone. Details of the components of these vectors are described in Table 6. Also listed in Table 6 are two vectors with alternative placement of FRT sites inthe ubiquitin 5' UTR or intron.

Results in Table 5 indicate that the frequency and level of GUS expression varies among different events, as might be predicted for genes inserted in different positions in the genome. The prediction is that once a high frequency, highexpressing line is identified, that the expression of genes subsequently introduced into those same sites will also be higher than in other lower expressing events. C) Stable Integration of a Second DNA Sequence Flanked by Two Non-Identical FRTSequences into Plant Cells

A subset of the stable transgenic "target lines" described in example 4 above was used in experiments aimed at stably retargeting into these primary target lines a new gene flanked by the same FRT sites used in the target lines and cloned in asecond construct "substrate" plasmid. Table 7 lists the constructs contained in the primary target lines (from Table 2), the FRT sites contained in these lines and the substrate plasmids (from Table 6) that were subsequently retargeted into the targetlines.

Table 8 presents data from stable transgenic events which demonstrate successful and reproducible targeting of introduced sequences to previously created genomic target sites. The data shown are for 18 independent target lines, each retargetedwith a promoterless GUS construct. Since the bar gene was concurrently introduced on the same plasmid, the proportion of GUS expressing events from the total events recovered on bialophos selection provides a measure of retargeting frequency relative torandom integration.

Initial evaluation of the impact of the introduced sequences on plant growth and gene expression is conducted in the greenhouse by making regular observations through to pollination and seed set. Plants are both selfed and crossed to othergenotypes to obtain T1 seed for subsequent greenhouse and field evaluation. For gene expression evaluation, both qualitative and quantitative data are collected and analyzed. T1 seeds from transgenic events which give acceptable or desirable levels ofexpression and which show no significant negative impact on plant development (e.g., have normal developmental morphology, are male and female fertile, etc.) are then grown in managed field plots along with non-transgenic control plants, and standardagronomic performance data is collected and evaluated.

Example 7

Conversion of an Introduced Functional FRT Sequence into a Second Non-Identical Functional FRT Sequence

The approach taken here to develop a method for converting between different FRT sites for use in various applications is based on the previously described "chimeraplasty" strategy for making specific targeted nucleotide modifications at aspecified extrachromosomal or genomic target sequence in animal cells (Yoon et al. (1996) Proc. Natl. Acad. Sci. 93:2071-2076; Cole-Strauss et al. (1996) Science 273:1386-1389). This capability in plants, as demonstrated recently in our laboratoriesand described in WO99/25853, published May 27, 1999, is beneficial to extending the potential use of the present invention for broader application. The proposed use of this "chimeraplasty" technology in the present invention would be to target andmodify nucleotides in one FRT site of a pair of non-identical FRT sites flanking a DNA sequence of interest in a way that then makes the two FRT sites identical. Subsequent or concurrent expression of FLP recombinase in cells with these FRT sitemodifications would lead to excision of the sequences between these now identical FRT sites, thereby removing specifically the undesirable DNA sequences from the previously created stable transgenic event containing those sequences. An application ofthis approach would be for example in the case of a selectable marker which is required during initial steps of a breeding or backcrossing program to maintain and select for preferred individual plants, but which is not desired in the final product.

The target vectors for evaluating this FRT site modification strategy are shown generically below, where P1 and P2 represent two different promoters, G1 and G2 represent two genes, and T1 and T2 represent two terminator regions; these regionsare shown as white boxes. Different FRT sites are indicated and shown as dark boxes. One version of the construct incorporates a third unique FRT site downstream of the second gene and is used to evaluate whether the targeted conversion, in this case,of FRT5 to FRT6 (SEQ ID NO:4), also results in conversion of the downstream FRT1 (SEQ ID NO:2) site to an FRT6 (SEQ ID NO:4) site. In the former case, expression of the downstream gene (G1) should be detected, while if the conversion is not specific toFRT5 (SEQ ID NO:3) and the FRT1 (SEQ ID NO:2) site is converted also, then both gene activities will be lost. For the specific examples used here P1 is the maize ubiquitin promoter, P2 is the enhanced CaMV 35S promoter, G1 is the uidA (GUS) gene, G2 isthe bar gene, and T1 and T2 are pinII terminators. It is understood that based on the various descriptions of vector constructs earlier in this application, a variety of different promoters, genes, terminators or DNA sequences or FRT sites could be usedin practicing this component method. The DNA cassettes as shown below could be assembled into either a pUC-based plasmid for direct DNA delivery methods (such as particle bombardment) or into a binary vector for Agrobacterium-based transformation asdescribed previously.

##STR00001## B) Design of Chimeric Oligonucleotide Molecules for Chimeraplasty-Based Targeted Conversion of an FRT Site

Shown below are specific examples of chimeric molecules that would be used to modify a single nucleotide so as to convert the FRT5 (SEQ ID NO:3) site to an FRT6 (SEQ ID NO:4) site in constructs as described above. Both the linear sequence ofthese chimeric molecules as well as the predicted active form of the molecule (based on the Yoon et al. and Cole-Strauss et al. publications above) are shown. DNA residues are represented in upper case, RNA residues in lower case, and the site to bemodified (a single nucleotide difference between FRT5, SEQ ID NO:3, and FRT6, SEQ ID NO:4) is underlined and in bold. Two examples of chimeras are presented below differing in the number of residues downstream of the FRT5 (SEQ ID NO:4) site that wouldbe included in the chimeric molecule design and which would thus determine the specificity to the target sequence. 1. Chimeric oligonucleotide linear sequence (sequence includes six target-specific residues downstream of the FRT site being modified inthe target construct and should convert only this single specific FRT5, SEQ ID NO:3, site to an FRT6, SEQ ID NO:4, site)

2. Chimeric oligonucleotide linear sequence (sequence contains residues specific to only sequences in the FRT site and so should convert any FRT5, SEQ ID NO:3, site in a target molecule to an FRT6, SEQ ID NO:4, site)

Vector constructions and chimeric oligonucleotide molecules as described above were generated and used in experiments. C) Demonstration of Conversion from One FRT Site to Another

Stable transgenic maize lines are generated with the constructs as described above or with other related ones by transforming in the constructs and selecting on bialophos as described before. Tissues to be used for chimera delivery aretransferred onto non-bialophos-containing media and the chimeric oligonucleotides are delivered into cells of these stable events by particle bombardment, together with co-delivery of PHP5096 which carries a functional FLP recombinase expressioncassette. In control experiments, only chimeric molecules or only PHP5096 are delivered. After sufficient time for cells to recover without bialophos selection, samples of the bombarded events are evaluated for GUS expression. For those bombardedevents containing the construct with the downstream FRT1 (SEQ ID NO:2) site which do not show GUS expression, an equivalent sample of cells are plated and grown on medium with or without bialophos selection to assess sensitivity to the chemical. If thechimeric molecules are specific for modifying only the FRT5 (SEQ ID NO:3) site, then no differences in number and growth of cells should be observed between treatments with or without selection. Otherwise, reduced growth and recovery should be noted.

D) Molecular Verification of Stable Conversion of FRT Sites

DNA from those samples that exhibit GUS expression is isolated, amplified by PCR if necessary, and sequenced by standard methods through the region corresponding to the predicted nucleotide conversion. A sufficient stretch of DNA is sequencedto cover the entire originally introduced region of DNA so as to confirm correct and specific conversion. Using standard methods for PCR, Southern analysis and/or sequencing of GUS expressing and non-expressing samples establishes the presence orabsence of specific DNA fragments prior to and following chimeric molecule and FLP recombinase delivery, and thus substantiates the visual and biochemical observations made above.

E) Utility of Chimeraplasty-Based FRT Site Conversion in a Transgene Stacking Strategy for Plants

Described in FIG. 1 is one potential strategy for combining or stacking multiple desired transgenes at one genomic location using the non-identical FRT-based system of the present invention. While stacking of genes can be achieved without theuse of the targeted FRT conversion method described in this example 7, this latter method extends the capabilities of the system by allowing in vivo conversion of FRT sites to create new sites, rather than re-introducing new FRT sites by transformation. In the diagram of FIG. 1, an FRT site with an asterisk beside it indicates that it was initially created to be non-functional with respect to recombination between it and the equivalent FRT site without an asterisk, but which upon conversion with thechimeraplasty-based approach described herein renders it capable of recombination with its equivalent non-asterisk counterpart. In the specific example presented in the figure, this would facilitate for example removal of a selectable marker either tono longer have it present, or to allow one to re-use the selectable marker in future transformations. Thus this method also provides a mechanism to recycle selectable markers, as is possible in using the FRT system of the present invention alone.

Discussion

To date in plants, the major application of the FLP/FRT system has been for DNA excision (Lyznik et al. (1993) Nucleic Acids Res. 21:969-975). For example, a gene such as a selectable marker flanked by FRT sites is first introduced into plantcells by one of several transformation approaches, and stable transgenic events or plants are recovered via appropriate selection. Then in order to eliminate the selectable marker gene, FLP protein is expressed in the cells either transiently byintroducing a plasmid carrying a FLP expression cassette, stably following integration of an introduced FLP expression cassette, or by crossing plants carrying the FRT-flanked selectable marker gene with plants carrying sequences for and expressingactive FLP protein (WO99/25841, published May 27, 1999, to "Novel Nucleic Acid Sequence Encoding FLP Recombinase").

A major problem associated with developing the FLP/FRT system for integrating genes into animals or plants stems from the fact that the recombination reaction catalyzed by yeast FLP recombinase is a reversible process (Sadowski et al. (1995) inProgress in Nucleic Acid Research and Molecular Biology 51:53-91). For example, following introduction of a DNA sequence flanked by similarly oriented FRT sites into plant cells in the presence of actively expressing FLP recombinase, recombinationshould lead to insertion of the new DNA sequences at the endogenous FRT site. However, with continued expression of FLP enzyme, the reverse reaction would lead to re-excision of the introduced sequences because of recombination between the identical FRTsites. Since the reaction is reversible, integration and excision can repeatedly continue towards equilibrium. As cells divide and the DNA substrate concentration per cell decreases, the probability of integration decreases, such that in general, aslong as active FLP protein is expressed the reaction will be driven towards the non-integrated state. To favor integration, a situation must be established which precludes re-excision once integration occurs. A number of strategies have been suggested,including limiting the duration of activity of FLP recombinase through inducible expression or by directly introducing FLP protein or RNA into cells (Sadowski et al. (1995) Progress on Nucleic Acid Research and Molecular Biology 51:53-91), but to date noroutine non-random integration system has been established for plants.

The present invention describes the development of a useful new gene targeting system for plants which utilizes the yeast FLP recombinase or a modified FLP recombinase designed to work more efficiently in certain plant species and novelnon-identical FRT sites which can be used for directional non-reversible DNA integration. Additionally, described herein is a novel use of accessory technologies such as "chimeraplasty" permitting in vivo or in vitro modification of DNA sequences, suchas FRT sites to further extend the utility of the system. Data provided demonstrate the successful stable integration of DNA sequences between two previously introduced non-identical FRT sites in maize. We show also that the DNA sequences between theFRT sites can be subsequently replaced by a second DNA sequence flanked by the same FRT sites as the first. Together these results demonstrate that it is possible to introduce and recover pairs of non-identical FRT sites at certain genomic locations,that one can select desirable or preferred genomic locations for expressing DNA sequences of interest, and that these selected locations can be used to re-target other DNA sequences of interest. Apart from the obvious benefits of being able to integrategenes into the genome of plants, the present invention provides a means for facilitating the introduction of novel genes or DNA sequences into genomic locations previously determined to be particularly beneficial for gene integration from the perspectiveof providing suitable levels of stable expression of the introduced gene(s) and not exhibiting deleterious impacts on agronomic characteristics including yield. In addition the invention provides a system whereby integration of two or more genes can betargeted to the same genomic location, providing a mechanism for "gene stacking". These stacked genes can then be maintained and managed as a closely linked pair of traits in breeding programs. Thus this invention also provides an improved method forintroducing, maintaining and breeding multiple genetic traits of interest, including agronomic traits, commercially important genes or other heterologous gene products.

The invention further proposes to use the non-recombination feature of non-identical FRT sites to allow creation of a set of `parental` lines, which are initially well-characterized for all the desired expression and performance parametersdescribed above. These lines then serve as the basis for introduction of new traits into the same predefined sites in the genome where the initial genes were introduced. Many fewer events would need to be generated, since integration wouldpreferentially occur in sites shown to express well and have minimal negative impact on performance.

All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference tothe same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be practiced within the scope of theappended claims.