WO2000018778A1 - Synthesis of codon randomized nucleic acids - Google Patents

Synthesis of codon randomized nucleic acids Download PDF

Info

Publication number
WO2000018778A1
WO2000018778A1 PCT/US1999/022436 US9922436W WO0018778A1 WO 2000018778 A1 WO2000018778 A1 WO 2000018778A1 US 9922436 W US9922436 W US 9922436W WO 0018778 A1 WO0018778 A1 WO 0018778A1
Authority
WO
WIPO (PCT)
Prior art keywords
protecting group
mononucleosides
codons
products
protected
Prior art date
Application number
PCT/US1999/022436
Other languages
French (fr)
Other versions
WO2000018778A9 (en
Inventor
Peter Lohse
Robert G. Kuimelis
Original Assignee
Phylos, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Phylos, Inc. filed Critical Phylos, Inc.
Priority to AU62704/99A priority Critical patent/AU6270499A/en
Publication of WO2000018778A1 publication Critical patent/WO2000018778A1/en
Publication of WO2000018778A9 publication Critical patent/WO2000018778A9/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07HSUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
    • C07H21/00Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1058Directional evolution of libraries, e.g. evolution of libraries is achieved by mutagenesis and screening or selection of mixed population of organisms
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P20/00Technologies relating to chemical industry
    • Y02P20/50Improvements relating to the production of bulk chemicals
    • Y02P20/55Design of synthesis routes, e.g. reducing the use of auxiliary or protecting groups

Definitions

  • the invention relates to methods for chemically synthesizing DNA or RNA.
  • Proteins with desired functions can be prepared using methods such as oligonucleotide-directed mutagenesis. Proteins with desired functions can also be selected from pools of randomly synthesized proteins, including proteins which are generated from random DNA template libraries.
  • DNA libraries may also be generated using a variety of techniques. Such DNA libraries can be synthesized on a solid support (e.g., a CPG support), in a liquid phase, or in a combination solid-liquid phase (e.g., a PEG support). Most commonly, DNA libraries are prepared using a standard DNA synthesizer and a random mixture of all 4 nucleotides in each coupling step. By this approach, the trinucleotides, or codons, that correspond to the different amino acids, are randomly generated. This codon randomized DNA can then be transcribed into RNA, which is in turn used to synthesize polypeptides; the approach described above thus provides a means for generating a wide variety of DNA sequences and proteins products.
  • a solid support e.g., a CPG support
  • a combination solid-liquid phase e.g., a PEG support
  • DNA libraries are prepared using a standard DNA synthesizer and a random mixture of all 4 nucleotides in each coupling step
  • the invention features a method for generating a selected set of codons; the method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset A of the first set is protected with a protecting group A', and a subset B of the first set is protected with a protecting group B', where A' and B' are orthogonal protecting groups; (b) selectively removing protecting group A' from subset A; (c) coupling the products of step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with protecting group A'; (d) optionally removing protecting group A' from the products of step (c); (e) optionally coupling the products of step (d) with a third set of mononucleosides, where the third set is protected with protecting
  • the selected set includes at least one codon corresponding to each of the 20 naturally-occurring amino acids; preferably, each of these codons corresponds to a highly expressed codon for one of the naturally-occurring amino acids.
  • the selected set may also consist of trinucleotides coding only for a class of amino acids, e.g., hydrophobic amino acids, hydrophilic amino acids, basic amino acids, or acidic amino acids.
  • the selected set may consist of trinucleotides coding for a mixture of amino acids, e.g., acidic and basic amino acids.
  • steps (a) to (i) take place in the same reaction vessel; in addition, protecting groups A' and B' are two different groups and are preferably chosen from an acid-cleavable protecting group (for example a dimethoxytrityl group), a base-cleavable protecting group (for example, a fluorenylmethyloxycarbonyl group), or a fluoride-cleavable protecting group (for example, a silyl group).
  • each of the codons terminates in a cytidine or a guanosine residue.
  • the invention features a method for generating an oligonucleotide from a selected set of codons; the method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset A of the first set is protected with a protecting group A', and a subset B of the first set is protected with a protecting group B', where A' and B' are orthogonal protecting groups; (b) selectively removing protecting group A' from subset A; (c) coupling the products of step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with protecting group A'; (d) optionally removing protecting group A' from the products of step (c); (e) optionally coupling the products of step (d) with a third set of mononucleosides, where
  • steps (a) to (k) take place in the same reaction vessel.
  • the invention features a method for generating a selected set of codons including the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset A of the first set is protected with a protecting group A', a subset B of the first set is protected with a protecting group B', and a subset C of the first set is protected with a protecting group C, where A', B', and C are orthogonal protecting groups; (b) selectively removing the protecting group A' from the subset A;(c) coupling the product formed in step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with the protecting group A'; (d) optionally removing the protecting group A' from the products of
  • steps (a) to (o) take place in the same reaction vessel.
  • one of the protecting groups A', B', and C is preferably an acid-cleavable protecting group (for example a dimethoxytrityl group)
  • another of the protecting groups A', B', and C is preferably a base-cleavable protecting group (for example, a fluorenylmethyloxycarbonyl group)
  • the last of the protecting groups A', B', and C is preferably a fluoride-cleavable protecting group (for example, a silyl group).
  • the invention features a method for generating an oligonucleotide from a selected set of codons including the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset A of the first set is protected with a protecting group A', a subset B of the first set is protected with a protecting group B', and a subset C of the first set is protected with a protecting group C, where A', B', and C are orthogonal protecting groups; (b) selectively removing the protecting group A' from the subset A; (c) coupling the product formed in step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with the protecting group A'; (d) optionally removing the protecting group A' from the products of step (c); (e) optionally coup
  • steps (a) to (q) take place in the same reaction vessel.
  • the invention features a method for generating, in the same reaction vessel, a selected set of codons; the method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, or dinucleotides, or mixture thereof; (b) adding a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; (c) optionally adding a third set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; and (d) optionally repeating step (c) to yield a selected set of codons.
  • the selected set includes at least one codon having A or G at the third codon position; fewer than 3% of the codons in the selected set correspond to a stop codon.
  • the selected set includes at least one codon for each of the 20 naturally-occurring amino acids; preferably, each codon corresponds to a highly-expressed codon for one of the naturally-occurring amino acids.
  • the selected set may consist of one class of codons, e.g., hydrophobic amino acids.
  • the selected set may consist of trinucleotides coding for a mixture of amino acids, e.g., acidic and basic amino acids.
  • fewer than 2% of the codons correspond to a stop codon; more preferably, fewer than 1%, 0.5%, or 0.1%, of the codons correspond to a stop codon.
  • each of the codons terminates in a cytidine or a guanosine residue.
  • any combination of couplings of mononucleosides, mononucleotides, and dinucleotides may be used to generated codons, which are trinucleotides.
  • dinucleotides may be coupled with mononucleosides.
  • Dinucleotides would not be coupled with dinucleotides, as that would generated tetranucleotides.
  • the invention features a method for generating an oligonucleotide from a selected set of codons.
  • the method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; (b) adding a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; (c) optionally adding a third set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; and (d) optionally repeating step (c) to yield a selected set of codons that includes at least one codon having A or G at the third codon position and in which fewer than 3% of the codons correspond to a stop codon.
  • Steps (a), (b), (c), and (d) occur in the same reaction vessel; these steps are repeated until an oligonucleotide of the desired length is achieved.
  • the selected set includes at least one codon for each of the 20 naturally-occurring amino acids, and fewer than 2% of the codons correspond to a stop codon.
  • nucleoside is meant any sugar-base moiety, including sugar- base moieties in which one or more nitrogen atoms of the nitrogenous bases are protected, and or in which the 5'-OH of the sugar is protected.
  • Nucleosides also include nucleoside phosphoramidites and protected phosphoramidites.
  • nucleotide is meant any sugar-phosphate-base moiety, as well as any derivatized sugar-phosphate-base moiety.
  • One or more nitrogen atoms of the nitrogenous bases can be protected, and/or the 5'-OH of the sugar can be protected.
  • Dinucleotides can include dinucleotide phosphoramidites; in addition, the internucleotide linkage may be protected.
  • oligonucleotide is meant either a DNA sequence or an RNA sequence
  • nucleic acid is meant either DNA or RNA.
  • highly-expressed codons are meant the codons present in higher than normal abundance in highly expressed genes.
  • stop codon is meant one of the DNA codons TAA, TGA, and TAG; and the RNA codons UAA, UGA, and UAG.
  • a selected set of codons is meant a set of trinucleotide sequences where each trinucleotide has an assigned representation in the set.
  • a selected set of codons may be a set that contains at least one codon for each of the naturally occurring amino acids, and in which some hydrophobic amino acids (e.g., Val, Leu, lie, Phe) are twice as abundant (e.g., AAC : CAC : GAC : TAC : ACC : CCC : GCC : TCC : AGC : CGC : GGC :TGC : ATC : CTC : GTC : TTC : AAG : CAG : GAG : TGG : ATG - 1:1:1:1:1:1:1:1:1:1:1:1:2:2:2:2:2:1:1:1:1:1).
  • hydrophobic amino acids e.g., Val, Leu, lie, Phe
  • a selected set of codons may also be a set that is biased towards basic amino acids (e.g., His, Lys, Arg).
  • the nucleosides and nucleotides used herein are often referred to with shorthand designations, in which the protecting group of the 5'-OH is superscripted.
  • T C is used to represent N'-benzoyl 5'-O-(4,4'- dimethoxytrityl) 2'-deoxycytidine or N'-benzoyl 5'-0-(4,4'-dimethoxytrityl) 3'- 0-(allyloxy diisopropylamino phosphinyl) 2'-deoxycytidine
  • F G is used to represent N 2 -isobutyryl-5'-0-[(9-fluorenyl)methoxycarbonyl] 2'- deoxyguanosine or N 2 -isobutyryl-5'-0-[(9-fluorenyl)methoxycarbonyl] 3'-0 ⁇ (allyloxy diisopropylamino phosphinyl) 2'-deoxyguanosine.
  • the present invention provides a number of advantages over conventional techniques of nucleic acid synthesis.
  • the methods described herein provide control over codon format (trinucleotide sequences) as well as control over the representation of the codon in a selected set.
  • the invention can therefore generate sets of codons that contain at least one codon for each of the naturally-occurring amino acids and, importantly, can generate libraries that are substantially free of stop codons.
  • the invention also provides DNA consisting essentially of highly-expressed codons and, as noted above, free of stop codons, which can be efficiently translated to polypeptides of desired lengths.
  • control over codon representation allows for the synthesis of DNA templates that can be used to generate proteins rich in selected amino acids, for example, hydrophobic amino acids, which can be instrumental in protein design techniques.
  • FIGURES 1, 2, 3, 4, 5, 6, 7, 8, and 9 are each illustrations of coupling sequences for the synthesis of codon libraries.
  • sequence of bases in DNA and its RNA counterpart determines the sequence of the amino acids in the protein synthesized from this DNA. Sequences of three bases, referred to as codons, correspond to different amino acids. During translation, these codons are read from the 5' end to the 3' end; the resulting protein has an amino acid sequence that corresponds to the sequence of codons.
  • TAA, TAG, and TGA which correspond to the RNA codons UAA, UAG, and UGA
  • these codons signal release factors to terminate protein synthesis.
  • the presence of stop codons therefore leads to termination of protein synthesis before the entire DNA sequence is translated.
  • the invention features convenient methods for the controlled synthesis of codon randomized nucleic acids, such as DNA, in which the presence of stop codons can be avoided.
  • the DNA strand can be used as a template for the synthesis of a complementary DNA strand, which in turn can serve as a template for the synthesis of the corresponding messenger RNA.
  • messenger RNA can be synthesized directly using the methods of the invention.
  • a desired set of codons, as well as the desired frequency of each codon in the set is first chosen.
  • the set can include, for example, the most highly-expressed codons for each of the 20 naturally-occurring amino acids, in equal distribution.
  • Highly expressed DNA codons in eukaryotic translation systems typically exhibit either 2'-deoxycytidine (C) or 2'-deoxyguanosine (G) at the 3' end (that is, at the third codon position).
  • Another desired set can include, for example, at least one codon for each of the 20 naturally-occurring amino acids, and in which hydrophobic amino acids are twice as abundant.
  • a selected set of highly expressed codons for all 20 naturally-occurring amino acids can be prepared in which all of the codons have a C or G at the third position. Two of the three stop codons, TAA and TGA, are therefore readily excluded from this set.
  • stop codons has a C at the third position; codons ending in C can therefore be randomly generated without the introduction of stop codons.
  • the generation of a set of codons ending in C can produce codons for fifteen of the naturally-occurring amino acids.
  • the present invention also features methods for generating libraries of codons by using nucleosides and nucleotides with different 5 '-protecting groups as building blocks.
  • the different 5'-protecting groups can be cleaved under orthogonal conditions.
  • the conditions for cleaving one 5'-protecting group do not cleave the other 5'- protecting groups.
  • An example of one pair of orthogonal protecting groups includes a dimethoxytrityl group (DMT or T), which is cleaved under acidic conditions, and a fluorenylmethyloxycarbonyl group (Fmoc or F), which is cleaved under basic conditions.
  • a set of orthogonal protecting groups is the set including a dimethoxytrityl group (DMT or T), which is cleaved under acidic conditions, a fluorenylmethyloxycarbonyl group (Fmoc or F), which is cleaved under basic conditions, and a silyl group (S), which is cleaved with fluoride.
  • DMT or T dimethoxytrityl group
  • Fmoc or F fluorenylmethyloxycarbonyl group
  • S silyl group
  • a mixture of nucleosides is treated with acid.
  • the DMT-protected nucleosides are deprotected, while the Fmoc protected nucleosides remain protected.
  • nucleotides are added to this mixture, they couple only with the deprotected nucleosides, allowing for coupling specificity.
  • a mixture of -V / -benzoyl-5 , -0-(4,4'-dimethoxytrityl)- 2'-deoxycytidine ( T C) and N 2 -isobutyryl-5'-O-[(9-fluorenyl)methoxycarbonyl]- 2'-deoxyguanosine ( F G) is treated with acid.
  • the DMT group is cleaved from the C mononucleosides, thus leaving them free to couple with nucleotides.
  • the Fmoc of the G mononucleosides remains attached. Since none of the stop codons end in C, trinucleotides may be randomly generated at this step without the introduction of stop codons.
  • nucleoside phosphoramidites are used for the coupling reactions.
  • the internucleotide linkages can be protected with a protecting group, such as an allyl moiety.
  • the allyl protecting group is stable toward both acid and base, but can be cleaved with aqueous ammonia or by palladium (Pd(0)) catalysis.
  • Example 1 A Synthesis of-N -benzoy1-5'-( -(4,4'-dimethoxytrity1V '- 0-(a11y1oxy diisopropylamino phosphinyl) 2'-deoxycy ⁇ idine ( T C)
  • the combined organic layers are dried over Na 2 S0 4 and concentrated.
  • the crude product is purified by chromatography (silica gel, eluting with CH 2 Cl 2 /MeOH/Et 3 N, 94:4:2).
  • the product, - i y-benzoyl-5'-O-(4,4 , -dimethoxytrityl) 3'-0-(allyloxy diisopropylamino phosphinyl) 2'-deoxycytidine ( T C) is precipitated from CH 2 C1 2 into pentane at -60°C.
  • the reaction mixture was then extracted with 5% NaHC0 3 (3 x 50 mL), H 2 0 (2 x 50 mL), and dried with Na j SO ⁇
  • the Na 2 S0 4 was removed by filtration, and the organics were concentrated to 50 mL under reduced pressure before loading onto a silica gel column (200 g).
  • the column was eluted with EtOAc/heptane/TEA (49/50/1 v:v).
  • Fractions containing the desired product were combined and evaporated under reduced pressure to yield an oily residue, which was applied to another silica gel column (200 g) and eluted with a stepwise gradient of EtOAc (25 ⁇ 75%) in heptane containing 1% TEA.
  • the chemical stability of the new DMT-allyl dC phosphoramidite monomer was monitored by preparing a 0.1M solution in CDC1 3 and collecting the 31 P NMR spectrum at 24 hour intervals. The monomer was determined to be stable for at least 8 days (i.e., no change in spectrum between 300 and -50 ppm).
  • the coupling ability of the new monomer was evaluated by solid-phase synthesis of the sequence 5'-d(C 9 T) on an automated DNA synthesizer (Expedite 8909, PerSeptive Biosystems) using a standard coupling protocol provided by the manufacturer, except that the monomer coupling time was increased to 120 seconds.
  • the solid support from the two syntheses was divided into five portions and treated with 1.5 mL of one of the following at the indicated temperature: concentrated ammonium hydroxide at room temperature, concentrated ammonium hydroxide at 55 ° C, a mixture of concentrated ammonium hydroxide in ethanol (3:1 v/v) at 55 °C, a mixture of t-butyl amine/methanol/water (1 :1:2 v/v) at 55 °C, 2M anhydrous ammonia in methanol at 55 °C.
  • Example 2 A Synthesis of /V -isobutyryl-5'-O-[(9- fluorenyl Ymethoxycarbonyl] 3'-O-fa ⁇ yloxy diisopropylamino phosphinyl) 2'-deoxyguanosine ( F G)
  • the product is then converted to the phosphoramidite (also referred to as F G) using the reaction conditions described in Lehmann et al., Nucleic Acids Res., Vol. 17, No. 7, 2379-2390 (1989).
  • N 6 -benzoyl-5'-0-[(9-fluorenyl)methoxycarbonyl] 3'-0-(allyloxy diisopropylamino phosphinyl) 2'-deoxyadenine ( F A) is prepared using the same reaction conditions.
  • Example 2B Alternative synthesis of Fmoc-allyl dG phosphoramidite monomer and its application in D ⁇ A synthesis
  • the monomer was 10% degraded after 1 day and 50% degraded after 3 days, as indicated by the appearance and growth of a new peak at 153 ppm resulting from spontaneous loss of the 5'-Fmoc group. As most syntheses are completed within several hours, the stability of the Fmoc-allyl dG phosphoramidite was deemed suitable.
  • the coupling ability of the new monomer was evaluated by solid-phase synthesis of the sequence 5'-d(G 9 T) on an automated DNA synthesizer (Expedite 8909, PerSeptive Biosystems).
  • the standard synthesis protocol provided by the manufacturer was modified to increase the coupling time (900 sec), increase the capping step (120 sec), increase the oxidation time (60 sec), and deliver the 5'-Fmoc deprotection reagent for 120 seconds from an auxiliary bottle position.
  • Both O.IM DBU in acetonitrile and 0.1 M piperidine in anhydrous DMF were evaluated as 5'-Fmoc deprotection reagents.
  • the completed 5'-d(G 9 T) sequences were deprotected in concentrated ammonium hydroxide for 18 hours at 55 °C, concentrated in a Speed- Vac, analyzed by anion-exchange HPLC under denaturing conditions (Dionex DNAPac PA- 100 column, sodium chloride gradient in 25 mM NaOH, pH 12.4), and compared to a control sequence synthesized with standard DMT-dG cyanoethyl phosphoramidites.
  • 0.1M Piperidine in DMF was the preferred 5'-Fmoc deprotection reagent.
  • Example 3A Synthesis of N -isobutyry1-5'-( -[trimethy1si1y1] V-O- (allyloxy diisopropylamino phosphinyl) 2'-deoxyg ⁇ anosine ( S G)
  • the product is then converted to the phosphoramidite (also referred to as S G) using the reaction conditions described in Example 1.
  • Example 3B Synthesis of silyl-allyl dG phosphoramidite monomer and its application in D ⁇ A synthesis
  • ⁇ 2 -Isobutyryl-2'-deoxyguanosine (6.75 g, 20 mmol) was evaporated from pyridine (3 x 100 mL), dissolved in anhydrous pyridine (75 mL) and cooled to 0°C.
  • Bis(trimethylsiloxy)cyclododecyloxy-silyl chloride (8.5 g, 22 mmol) was added to the stirred solution. After two hours the reaction mixture was concentrated to dryness under reduced pressure and resuspended in CH 2 C1 2 (100 mL).
  • the coupling ability of the new Silyl- allyl dG monomer was evaluated by solid-phase synthesis of the sequence 5'-d(G 9 T) on an automated DNA synthesizer (Expedite 8909, PerSeptive Biosystems) using a polystyrene solid support (PE BioSy stems, Foster City, CA).
  • the standard 0.2 ⁇ mole cyanoethyl phosphoramidite synthesis protocol provided by the manufacturer was modified to accommodate the new chemistries.
  • the modified protocol contained longer monomer coupling steps (240 sec), longer wash times (120 sec), and new cycles to deliver the non-standard Silyl deprotection reagent (HF/TEA, 1.1M: 1.6M in DMF) from an auxiliary bottle position.
  • the oil is co- evaporated from toluene (twice), ethanol, then chloroform, and subjected to a short column chromatography (silica gel) eluting with a gradient of 0-5% methanol in chloroform. Fractions containing the major product are collected, concentrated to a foam, dissolved in chloroform, precipitated with pentane, filtered, and then dried under vacuum.
  • the DMT protecting group is then cleaved as follows.
  • the product is dissolved in 75 ml CH 2 Cl 2 /MeOH (8:1 v/v); Amberlyst® 15 ion exchange resin is then added in portions until the surface of the resin remains orange colored.
  • the suspension is stirred 24 hours, the resin is filtered off, and the solution is concentrated in vacuo.
  • the product is precipitated twice from petroleum ether (500 ml) at 40-60°C.
  • N ⁇ 5 -benzoyl-3'-0-2'ert-butyl-dimethylsilyl 2'-deoxyadenine, ⁇ 2 - isobutyryl-3'-0-tert-butyl-dimethylsilyl 2'-deoxyguanosine, and N -benzoyl-3'- 0-tert-butyl-dimethylsilyl 2'-deoxycytidine are prepared using the same reaction conditions.
  • a solution containing a mixture of 15 mmol 3'-0-tert-butyl- dimethylsilyl 2'-deoxythymidine and 24 mmol tetrazole is dried by repeated coevaporation with acetonitrile/toluene. The mixture is then dissolved in 50 ml dry acetonitrile. 15 mmol 7V 5 -benzoyl-5'-0-(4,4'-dimethoxytrityl) 3'-0- (allyloxy diisopropylamino phosphinyl) 2'-deoxyadenine, which is pre-dried by repeated coevaporation with toluene, in 30 ml dry acetonitrile is added.
  • reaction is followed by TLC. If the reaction does not go to completion, additional phosphoramidite can be added.
  • the TBDMS ether protecting group is cleaved as follows.
  • the product is dissolved in 40 ml THF.
  • 30 mmol tefrabutylammonium fluoride is added, and the reaction mixture is stirred 1 hour at 25 °C.
  • the THF is evaporated in vacuo; water is then added to the concentrated reaction mixture.
  • the resulting mixture is extracted with CH 2 C1 2 (3 xl00 ml).
  • the combined organic layers are dried over ⁇ a 2 S0 4 , filtered, and concentrated.
  • the product is then purified with column chromatography (silica gel, using methanol in
  • Example 6 Synthesis of N -ben7oyl-5'-O-(4 J 4 , -dimethoxytrity1)-2'- deoxycytidine 3'-Q-succ ⁇ nic acid ⁇ -benzoyl-5 , -O-(4,4'-dimethoxytrityl)-2'-deoxycytidine and succinic anhydride (10-fold excess) are dissolved in DMF and stirred at 70 °C for 40 hours. The reaction is monitored by TLC (silica gel, development in ether, then chloroform/methanol 9:1). After completion of the reaction, the reaction mixture is taken up in methylene chloride, then washed with 20% aqueous citric acid solution.
  • TLC sica gel, development in ether, then chloroform/methanol 9:1
  • the aqueous phase is washed twice with methylene chloride.
  • the combined organic layers are washed with water, dried over anhydrous ⁇ a ⁇ O ⁇ and concentrated to dryness.
  • the product is purified by chromatography (silica gel, eluting with chloroform and chloroform/methanol 99:1).
  • N 2 -isobutyryl-5'-O-[(9-fluorenyl)methoxycarbonyl]-2 l - deoxyguanosine 3'-O-succinic acid is prepared using the same reaction conditions.
  • Example 7 Functionalization of support A glass support for use in D ⁇ A synthesis is treated with Fmoc- sarcosine in the presence of dicyclohexylcarbodiimide, followed by removed of the Fmoc group with piperidine/DMF.
  • the support is separated by filtration, washed with methylene chloride and diethyl ether. Unreacted amino groups are capped by treatment of the support with a mixture of THF/luti dine/acetic anhydride (8:1:1) and N-methylimidazole in THF. The support is then washed with methylene chloride and diethyl ether, and dried in vacuo.
  • the codons are built up from the 3'-end, as shown in Figure 1, using solid phase synthesis.
  • a solid phase synthesizer is used, according to the manufacturer's instructions.
  • a 16:5 mixture of N ⁇ enzoyl-S'-O- ⁇ '-dimethoxytrityl) ⁇ '- deoxycytidine ( T C) and N 2 -isobutyryl-5 l -0-[(9-fluorenyl)methoxycarbonyl]-2'- deoxyguanosine ( F G) is attached to a support, as described in Examples 6 and 7.
  • trichloroacetic acid is added to cleave the trityl protecting groups from the T C mononucleosides. Since the Fmoc protecting group is not labile under acidic conditions, the F G mononucleosides remain protected, and therefore unreactive.
  • a 1 :1 1:1 mixture ofN 6 -benzoyl-5'-O-(4,4"-dimethoxytrityl) 3'-0- (allyloxy diisopropylamino phosphinyl) 2'-deoxyadenine ( T A), N -benzoyl-5'- 0-(4,4'-dimethoxytrityl) 3'-O-(allyloxy diisopropylamino phosphinyl) 2'- deoxycytidine ( T C), N -isobutyryl-5 , -O-(4,4'-dimethoxytrityl) 3'-0-(allyloxy diisopropylamino phosphinyl) 2'-deoxyguanosine ( T G), and 5'-0-(4,4'- dimethoxytrityl) 3'-O-(allyloxy diisopropylamino phosphinyl) 2'- deoxythymidine ( T T)
  • the trityl protecting groups of the dinucleotides are then cleaved with acid.
  • the dinucleotides are coupled with a 1 : 1 :1 : 1 mixture of T A, T C, T G, and T T nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the result is a mixture of 16 unique codons, each corresponding to a different amino acid (with the exception of TTC and AGC, which both represent serine), and F G mononucleosides.
  • the Fmoc protecting groups of the G mononucleosides are then cleaved with l,8-diazabicyclo[5.4.0]undec-7-ene (DBU), as described in Lehmann et al., Nucleic Acids Res. 17:2379 (1989).
  • DBU l,8-diazabicyclo[5.4.0]undec-7-ene
  • the trityl protecting groups of the trinucleotides are not labile under basic conditions; the trinucleotides therefore remain unreactive.
  • the deprotected G mononucleosides are coupled with a 3 : 1 : 1 mixture of F A mononucleoside phosphoramidites, T TG dinucleotide phosphoramidite, and T AT dinucleotide phosphoramidite, and the products of the coupling reactions are oxidized.
  • the result is two more trinucleotide codons, and F AG dinucleotides.
  • the Fmoc protecting groups of the dinucleotides are once again cleaved with base, the dinucleotides are coupled with a 1:1 : 1 mixture of T A, T C, and T G mononucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the nucleotide T is omitted, as the inclusion of this nucleotide at this point would result in the synthesis of a TAG codon.
  • the end result of the successive deprotection and coupling reactions is a mixture of 21 codons, each corresponding to one of the 20 naturally occurring amino acids. All 20 amino acids are represented, and only one amino acid is represented twice.
  • the invention therefore provides a synthesis of a codon set in which all of the amino acids are represented approximately equally. Most importantly, the set contains substantially no stop codons.
  • the trityl group can be removed from the trinucleotides, and a mixture of T C and F G nucleoside phosphoramidites can be added. The process for synthesizing the codons can then be repeated until DNA of the desired length is achieved.
  • the standard 0.2 ⁇ mole cyanoethyl phosphoramidite synthesis protocol provided by the manufacturer was modified to accommodate the new chemistries.
  • the modified protocol contained longer monomer coupling steps (240 sec), longer wash times (120 sec), and new cycles to deliver the non-standard Fmoc deprotection reagent (0.1M piperidine in anhydrous DMF).
  • the 5'-DMT was removed with 3% dichloroacetic acid in CH 2 C1 2 and DMT-C monomer was then delivered to the column to extend the CCT sequence to CCCT.
  • a 0.1M piperidine solution in DMF was delivered to the column to remove the 5'-Fmoc protecting group.
  • DMT-C monomer was again added to the column to form CGCT from the remaining GCT sequence.
  • the terminal 5'-DMT was removed with 3% dichloroacetic acid in CH 2 C1 2 and the CPG support was treated with concentrated ammonium hydroxide at 55° C for 16 hours. The solution was finally cooled, concentrated to dryness on a Speed- Vac, and taken up in water.
  • the tetramer standard (prepared with conventional cyanoethyl phosphoramidites and the manufacturer's S coupling cycle at base position three) produced 2.4:0.6: 1.0 for the normalized ratio of C:G:T nucleosides, respectively, compared to a theoretical value of 2.5:0.5: 1.
  • the same tetramer prepared via the acid/base orthogonal deprotection scheme produced 2.2:0.8:1.0 for the normalized ratio of C:G:T nucleosides.
  • the 15-mer 5 * -d(ACGTGGCTGAACSCT), where S is either G or C, was also synthesized on an automated DNA synthesizer using the same acid/base orthogonal deprotection scheme described above.
  • the terminal 5'-DMT was removed with 3% dichloroacetic acid in CH 2 C1 2 , and the CPG support was treated with concentrated ammonium hydroxide at 55° C for 16 hours. The solution was finally cooled, concentrated to dryness on a Speed- Vac, and taken up in water.
  • the ratio of the peak areas was 0.61 :0.39
  • the ratio of the peak areas was 0.51 :0.49.
  • the modified protocol contained longer monomer coupling steps (240 sec), longer wash times (120 sec), and new cycles to deliver the non-standard silyl deprotection reagent (HF/TEA, 1. IM: 1.6M in DMF) over 180 seconds.
  • the 5'-DMT was removed with 3% dichloroacetic acid in CH 2 C1 2 and DMT-C monomer was delivered to the column to extend the CCT sequence to CCCT.
  • an HF/TEA mixture in DMF (1.1M:1.6M) was delivered to the column to remove the 5'-Silyl protecting group.
  • DMT-C monomer was again added to the column to form CGCT from the remaining GCT sequence.
  • the terminal 5'-DMT was removed with 3% dichloroacetic acid in CH 2 C1 2 and the CPG support was treated with concentrated ammonium hydroxide at 55 °C for eight hours. The solution was finally cooled and concentrated on a Speed- Vac.
  • the tetramer standard (prepared with conventional cyanoethyl phosphoramidites and the manufacturer's S coupling cycle at base position three) produced 2.0:1.0:1.0 for the normalized ratio of C:G:T nucleosides, respectively, compared to a theoretical value of 2.5:0.5:1.
  • the same tetramer prepared via the acid/base orthogonal deprotection scheme produced the normalized ratio 1.9: 1.1 : 1.0 for C:G:T nucleosides.
  • the terminal 5'-DMT was removed with 3% dichloroacetic acid in CH 2 C1 2 and the polystyrene support was treated with concenfrated ammonium hydroxide at 55° C for 16 hours.
  • the solution was finally cooled, concentrated to dryness on a Speed- Vac, and taken up in water.
  • the ratio of the peak areas at 260 nm was 0.34:0.66, whereas in the case of the standard (prepared with conventional cyanoethyl phosphoramidites and the manufacturer's S coupling cycle at base position three) the ratio of the peak areas was 0.4:0.6.
  • Example 9 Removal of oligonucleotide from support
  • the support is treated with concentrated ammonia at 70 °C for 2 hours in a tightly closed Eppendorf tube, to cleave the oligonucleotides from the support.
  • the ammonia solution is evaporated on a speed- vac concentrator.
  • the residue is taken up in water and centrifuged (15 minutes, 0°C). DNA is precipitated from the supernatant by the addition of dioxane and THF. After centrifuging (15 minutes, 0°C), the pellet is dissolved in water.
  • the product DNA is purified by reverse-phase HPLC.
  • the support material is treated under argon with
  • examples 10-13 are carried out using the general methods described in Example 8; the successive coupling reactions take place in the same reaction vessel.
  • a 14:6 mixture of T C and F G is attached to a support, as described in Examples 6 and 7.
  • the trityl protecting groups are cleaved with trichloroacetic acid.
  • the C mononucleosides are then coupled with a 1 :1 :1:1: 1 :1:1:1 : 1 :1:1: 1 : 1 mixture of T AA, T CA, T GA, T TA, T AC, T CC, T GC, T AG, T GG, T TG, T AT, T CT, T GT, and T TT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the result of these reactions is a mixture of 14 unique codons, each representing a different amino acid, and F G mononucleoside, as shown in Figure 2.
  • the Fmoc protecting groups of the G mononucleosides are then cleaved with DBU, as described in Example 6.
  • the deprotected mononucleosides are coupled with a 1: 1: 1:1:1:1 mixture of T AA, T CA, T GA, T AG, T TG, and T AT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the end result of these coupling reactions is a mixture of 20 unique trinucleotides, each representing a codon for one of the 20 naturally-occurring amino acids, as shown in Figure 2. Once again, no stop codons are present in the mixture.
  • This process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
  • a 16:5 mixture of T C and F G mononucleosides is attached to a support.
  • the trityl protecting groups are cleaved with trichloroacetic acid.
  • the C mononucleosides are then coupled with a 1 : 1 : 1 :1 mixture of
  • T A, T C, T G, and T T nucleoside phosphoramidites and the products of the coupling reactions are oxidized.
  • the result of these reactions is a 1 : 1: 1 : 1 mixture of T AC, T CC, T GC, and T TC dinucleotides, and F G mononucleoside.
  • the trityl protecting groups of the dinucleotides are then cleaved with acid.
  • the dinucleotides are coupled with a 1 : 1 : 1 : 1 mixture of T A, T C, T G, and T T nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the result is a mixture of 16 unique codons, each representing a different amino acid (with the exception of TTC and AGC, which both correspond to serine), and F G mononucleoside.
  • the protecting groups of the F G mononucleosides are then cleaved with DBU.
  • the G mononucleosides are coupled with a 1 :1:1:1 : 1 mixture of T AA, T CA, T GA, T TG, and T AT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the end result of the successive deprotection and coupling reactions is a mixture of 21 codons.
  • the process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
  • a 16:6 mixture of T C and F G mononucleosides is attached to a support, and the trityl protecting groups are cleaved with trichloroacetic acid.
  • the C mononucleosides are coupled with a 1: 1:1 :1 mixture of T A, T C, T G, and T T nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the result of these reactions is a 1 :1:1:1 mixture of T AC, T CC, T GC, and T TC dinucleotides, and F G mononucleoside.
  • the trityl protecting groups of the dinucleotides are then cleaved with acid.
  • the dinucleotides are coupled with a 1 : 1 :1 : 1 mixture of T A, T C, T G, and T T mononucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the protecting groups of the F G mononucleosides are then cleaved with DBU.
  • the G mononucleosides are coupled with a 3: 1 : 1 : 1 mixture of F A nucleoside phosphoramidite and T TG, T AT, and T CU dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the Fmoc protecting groups are cleaved from the dinucleotides, and a 1 :1 : 1 mixture of T A, T C, and T G mononucleoside phosphoramidites is added; the products of the coupling reactions are then oxidized.
  • the result of these successive deprotection and coupling reactions is a mixture of 22 codons, each corresponding to an amino acid.
  • the synthetic scheme results in the generation of a set of codons in which the amino acids Ser and Leu are twice as abundant as the other naturally occurring amino acids. This distribution is close to the amino acid distribution typically found in biological proteins.
  • the process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
  • Example 1 Synthesis of trinucleotides
  • a 16:6 mixture of T C and F G mononucleotides is attached to a support.
  • the trityl protecting groups are cleaved with trichloroacetic acid.
  • the C mononucleosides are coupled with a 1: 1 :1 :1 mixture of T A, T C, T G, and T T nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the result of these reactions is a 1 : 1:1 : 1 mixture of T AC, T CC, T GC, and T TC dinucleotides, and F G mononucleoside.
  • the trityl protecting groups of the dinucleotides are then cleaved with acid.
  • the dinucleotides are coupled with a 1 : 1 :1 : 1 mixture of T A, T C, T G, and T T nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the protecting groups of the F G mononucleotides are then cleaved with DBU.
  • the G mononucleosides are coupled with a 1 : 1 :1 :1 :1 :1 mixture of T AA, T CA, T GA, T TG, T AT, and T CU dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the result of the successive deprotection and coupling reactions is a mixture of 22 codons.
  • the synthetic scheme results in the generation of a set of codons in which the amino acids Ser and Leu are twice as abundant as the other naturally occurring amino acids. This distribution represents the amino acid distribution found in biological proteins.
  • the process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
  • a 16:3:2 mixture of T C, F G, and S G mononucleosides is attached to a support.
  • the trityl protecting groups are cleaved with trichloroacetic acid.
  • the C mononucleosides are coupled with a 1 : 1 : 1 : 1 mixture of T A,
  • T C, T G, and T T nucleoside phosphoramidites and the products of the coupling reactions are oxidized.
  • the result of these reactions is a 1:1:1:1 mixture of T AC, T CC, T GC, and T TC dinucleotides, F G mononucleosides, and S G mononucleosides .
  • the trityl protecting groups of the dinucleotides are cleaved with acid.
  • the dinucleotides are coupled with a 1 : 1 : 1 : 1 mixture of T A, T C, T G, and T T nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the protecting groups of the F G mononucleotides are cleaved with DBU.
  • the G mononucleosides are coupled with F A mononucleoside phosphoramidite, and the products of the coupling reactions are oxidized.
  • the Fmoc protecting groups are again cleaved.
  • the dinucleotides are coupled with a 1:1 :1 mixture of T A, T C, and T G mononucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the silyl protecting groups are cleaved with anhydrous tefra-n- butylammonium fluoride.
  • the G mononucleosides are coupled with a 1 : 1 mixture of F G and S T mononucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the Fmoc protecting group of the dinucleotide is cleaved, and the dinucleotide is coupled with T T mononucleoside phosphoramidite. The product of the coupling reaction is oxidized. Finally, the silyl group of the S TG dinucleotide is cleaved. The dinucleotide is coupled with T A, and the product is oxidized.
  • the result of the successive deprotection and coupling reactions is a mixture of 21 codons.
  • the process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
  • a 6: 1 mixture of T C and F G mononucleosides is attached to a support, and the trityl protecting groups are cleaved with trichloroacetic acid.
  • the C mononucleosides are coupled with a 1 : 1 : 1 : 1 : 1 : 1 mixture of T CC, T GC, T AT, T CT, T GT, and T TT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the Fmoc protecting group of the F G mononucleoside is then cleaved.
  • the mononucleoside is coupled with a T AT dinucleotide phosphoramidite, and the product of the coupling reaction is oxidized.
  • a 16:5 mixture of T C and F G mononucleosides is attached to a support.
  • the trityl protecting groups are cleaved with trichloroacetic acid.
  • the C mononucleosides are then coupled with a 1 : 1: 1 :2 mixture of T A, T C, T G, and T T nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the result of these reactions is a 1 : 1 : 1 :2 mixture of T AC, T CC, T GC, and T TC dinucleotides, and F G mononucleoside.
  • the trityl protecting groups of the dinucleotides are then cleaved with acid.
  • the dinucleotides are coupled with a 1 :1 : 1 :1 mixture of T A, T C, T G, and T T nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
  • the protecting groups of the F G mononucleosides are then cleaved with DBU.
  • the G mononucleosides are coupled with a 1 : 1 : 1 : 1 : 1 mixture of T AA, T CA, T GA, T TG, and T AT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the end result of the successive deprotection and coupling reactions is a mixture of 20 codons; the codons ATC, CTG, GTC, and TTC, which correspond to the hydrophobic amino acids He, Leu, Val, and Phe, are represented twice.
  • the process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
  • the resulting DNA will code for proteins with a high percentage of hydrophobic amino acids.
  • Example 17 Synthesis of codons with a bias for basic amino acids
  • a 14:6 mixture of T C and F G mononucleosides is attached to a support.
  • the trityl protecting groups are cleaved with trichloroacetic acid.
  • the C mononucleosides are then coupled with a 1:2:1 : 1 : 1 : 1 : 1 : 1:1 : 1 :1: 1 : 1 mixture of T AA, T CA, T GA, T TA, T AC, T CC, T GC, T AG, T GG, T TG, T AT, T CT, T GT, and T TT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the protecting groups of the F G mononucleosides are then cleaved with DBU.
  • the G mononucleosides are coupled with a 2:l:l :2:l:l mixture of T AA, T CA, T GA, T AG, T TG, and T AT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
  • the end result of the successive deprotection and coupling reactions is a mixture of 20 codons; the codons CAC, AAG, and AGG, which correspond to the basic amino acids His, Lys, and Arg, are represented twice.
  • the process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
  • the resulting DNA will code for proteins with a high percentage of basic amino acids.
  • Example 10 For example, after a group of codons is prepared as described in Example 8, the scheme described in Example 10 may be used to generate the next set of codons. This process may be continued until DNA of the desired length is achieved.
  • the trinucleotides generated by any approach may be cleaved from the support using concentrated ammonia at room temperature.
  • the 3'-OH group is then derivatized with allyloxy bis- (diisopropylamino)phosphine to yield the trinucleotide phosphoramidite, and the trinucleotide phosphoramidites are then used as building blocks to synthesize DNA.
  • the methods of the invention may be used for any application in which nucleic acid synthesis is required.
  • these methods can be used in the synthesis of single-stranded DNA.
  • this DNA serve as a template for the synthesis of a complementary DNA strand, which can in turn serves as a template for messenger RNA synthesis.
  • the methods of the invention find use, for example, in techniques of randomized cassette mutagenesis of proteins, phage display techniques, ribosome display techniques, and protein-nucleic acid fusion techniques.
  • Codon-randomized DNA can also be used in cellular cultures (in vivo) for protein expression, or for in vitro applications using, for example, T7 RNA polymerase, and in vitro translation systems.

Abstract

A method for generating a selected set of codons is disclosed; the method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or mixture thereof, where a subset A of the first set is protected with a protecting group A', and a subset B of the first set is protected with a protecting group B', where A' and B' are orthogonal protecting groups; (b) selectively removing the protecting group A' from subset A; (c) coupling the products of step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with protecting group A'; (d) optionally removing protecting group A' from the products of step (c); (e) optionally coupling the products of step (d) with a third set of mononucleosides, where the third set is protected with protecting group A'; (f) selectively removing the protecting group B' from subset B; (g) coupling the products of step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the fourth set is protected with protecting group A' or protecting group B'; (h) optionally selectively removing protecting group B' from the products of step (g); and (i) optionally coupling the products of step (h) with a fifth set of mononucleosides, to yield a selected set of codons.

Description

SYNTHESIS OF CODON RANDOMIZED NI ICEETC ACTDS
Background of the Invention The invention relates to methods for chemically synthesizing DNA or RNA.
Pharmaceutical research relies, in part, on the identification of novel proteins with desired functions and properties. To identify proteins or peptides with improved properties, derivatives of known proteins and peptides can be prepared using methods such as oligonucleotide-directed mutagenesis. Proteins with desired functions can also be selected from pools of randomly synthesized proteins, including proteins which are generated from random DNA template libraries.
DNA libraries, in turn, may also be generated using a variety of techniques. Such DNA libraries can be synthesized on a solid support (e.g., a CPG support), in a liquid phase, or in a combination solid-liquid phase (e.g., a PEG support). Most commonly, DNA libraries are prepared using a standard DNA synthesizer and a random mixture of all 4 nucleotides in each coupling step. By this approach, the trinucleotides, or codons, that correspond to the different amino acids, are randomly generated. This codon randomized DNA can then be transcribed into RNA, which is in turn used to synthesize polypeptides; the approach described above thus provides a means for generating a wide variety of DNA sequences and proteins products.
Although it is commonly utilized, the random generation of DNA by conventional techniques can have disadvantages. For example, methods that rely on completely random generation of codons generally suffer from limited control over the synthesis of polypeptides generated from this DNA. The presence of weakly expressed codons in the random product mixture, for example, lowers the efficiency with which the DNA is translated. Furthermore, a small subset of randomly generated codons (approximately 3 out of 64) corresponds to a stop codon. As the presence of stop codons terminates protein synthesis, protein libraries generated from randomly generated DNA templates can sometimes exhibit low yields of full-length proteins. In addition, methods that rely on the completely random generation of DNA do not allow for a bias for a selected group of amino acids, for example, hydrophobic amino acids.
Summary of the Invention In one aspect, the invention features a method for generating a selected set of codons; the method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset A of the first set is protected with a protecting group A', and a subset B of the first set is protected with a protecting group B', where A' and B' are orthogonal protecting groups; (b) selectively removing protecting group A' from subset A; (c) coupling the products of step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with protecting group A'; (d) optionally removing protecting group A' from the products of step (c); (e) optionally coupling the products of step (d) with a third set of mononucleosides, where the third set is protected with protecting group A'; (f) selectively removing protecting group B' from subset B; (g) coupling the products of step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the fourth set is protected with protecting group A' or protecting group B'; (h) optionally selectively removing protecting group B' from the products of step (g); and (i) optionally coupling the products of step (h) with a fifth set of mononucleosides, to yield a selected set of codons.
In preferred methods, the selected set includes at least one codon corresponding to each of the 20 naturally-occurring amino acids; preferably, each of these codons corresponds to a highly expressed codon for one of the naturally-occurring amino acids. The selected set may also consist of trinucleotides coding only for a class of amino acids, e.g., hydrophobic amino acids, hydrophilic amino acids, basic amino acids, or acidic amino acids. In another preferred method, the selected set may consist of trinucleotides coding for a mixture of amino acids, e.g., acidic and basic amino acids.
Preferably, fewer than 3% of the codons correspond to a stop codon; more preferably, fewer than 2%, 1%, 0.5% or 0.1%, of the codons correspond to a stop codon. In preferred methods, steps (a) to (i) take place in the same reaction vessel; in addition, protecting groups A' and B' are two different groups and are preferably chosen from an acid-cleavable protecting group (for example a dimethoxytrityl group), a base-cleavable protecting group (for example, a fluorenylmethyloxycarbonyl group), or a fluoride-cleavable protecting group (for example, a silyl group). In other preferred methods, each of the codons terminates in a cytidine or a guanosine residue.
In a second aspect, the invention features a method for generating an oligonucleotide from a selected set of codons; the method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset A of the first set is protected with a protecting group A', and a subset B of the first set is protected with a protecting group B', where A' and B' are orthogonal protecting groups; (b) selectively removing protecting group A' from subset A; (c) coupling the products of step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with protecting group A'; (d) optionally removing protecting group A' from the products of step (c); (e) optionally coupling the products of step (d) with a third set of mononucleosides, where the third set is protected with protecting group A'; (f) selectively removing protecting group B' from subset B; (g) coupling the products of step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the fourth set is protected with protecting group A' or protecting group B'; (h) optionally selectively removing protecting group B' from the products of step (g); (i) optionally coupling the products of step (h) with a fifth set of mononucleosides; (j) removing the protecting groups from the products of step (g) or (i); and (k) repeating steps (a) to (j) until an oligonucleotide with the desired length is achieved. Preferably, steps (a) to (k) take place in the same reaction vessel. In a third aspect, the invention features a method for generating a selected set of codons including the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset A of the first set is protected with a protecting group A', a subset B of the first set is protected with a protecting group B', and a subset C of the first set is protected with a protecting group C, where A', B', and C are orthogonal protecting groups; (b) selectively removing the protecting group A' from the subset A;(c) coupling the product formed in step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with the protecting group A'; (d) optionally removing the protecting group A' from the products of step (c); (e) optionally coupling the products of step (d) with a third set of mononucleosides, where the third set of mononucleosides is protected with the protecting group A'; (f) selectively removing the protecting group B' from the subset B; (g) coupling the products formed in step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the fourth set is protected with the protecting group A' or the protecting group B'; (h) optionally selectively removing the protecting group B' from the products of step (g); (i) optionally coupling the products of step (h) with a fifth set of mononucleosides, where the fifth set is protected with protecting group A'; (j) selectively removing the protecting group C from the subset C; (k) coupling the products formed in step (j) with a sixth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset of the sixth set is protected with the protecting group C, and the remainder of the sixth set is protected with protecting group B'; (1) optionally selectively removing the protecting group B' from the products of step (k); (m) optionally coupling the products of step (1) with a seventh set of mononucleosides, where the seventh set is protected with protecting group A' or protecting group B'; (n) selectively removing the protecting group C from the products of step (m); and (o) coupling the products of step (n) with an eighth set of mononucleosides, to yield a selected set of codons.
In preferred methods, steps (a) to (o) take place in the same reaction vessel. In addition, one of the protecting groups A', B', and C is preferably an acid-cleavable protecting group (for example a dimethoxytrityl group), another of the protecting groups A', B', and C is preferably a base-cleavable protecting group (for example, a fluorenylmethyloxycarbonyl group), and the last of the protecting groups A', B', and C is preferably a fluoride-cleavable protecting group (for example, a silyl group). In a fourth aspect, the invention features a method for generating an oligonucleotide from a selected set of codons including the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset A of the first set is protected with a protecting group A', a subset B of the first set is protected with a protecting group B', and a subset C of the first set is protected with a protecting group C, where A', B', and C are orthogonal protecting groups; (b) selectively removing the protecting group A' from the subset A; (c) coupling the product formed in step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the second set is protected with the protecting group A'; (d) optionally removing the protecting group A' from the products of step (c); (e) optionally coupling the products of step (d) with a third set of mononucleosides, where the third set is protected with the protecting group A'; (f) selectively removing the protecting group B' from the subset B; (g) coupling the products formed in step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where the fourth set is protected with the protecting group A' or the protecting group B'; (h) optionally selectively removing the protecting group B' from the products of step (g); (i) optionally coupling the products of step (h) with a fifth set of mononucleosides, where the fifth set is protected with protecting group A'; (j) selectively removing the protecting group C from the subset C; (k) coupling the products formed in step (j) with a sixth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, where a subset of the sixth set is protected with the protecting group C, and the remainder of the sixth set is protected with protecting group B'; (1) optionally selectively removing the protecting group B' from the products of step (k); (m) optionally coupling the products of step (1) with a seventh set of mononucleosides, where the seventh set is protected with protecting group A' or protecting group B'; (n) selectively removing the protecting group C from the products of step (m); (o) coupling the products of step (n) with an eighth set of mononucleosides; (p) removing the protecting groups from the products of step (o); and (q) repeating steps (a) to (p) until an oligonucleotide with the desired length is achieved. Preferably, steps (a) to (q) take place in the same reaction vessel. In a fifth aspect, the invention features a method for generating, in the same reaction vessel, a selected set of codons; the method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, or dinucleotides, or mixture thereof; (b) adding a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; (c) optionally adding a third set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; and (d) optionally repeating step (c) to yield a selected set of codons. The selected set includes at least one codon having A or G at the third codon position; fewer than 3% of the codons in the selected set correspond to a stop codon.
In preferred methods, the selected set includes at least one codon for each of the 20 naturally-occurring amino acids; preferably, each codon corresponds to a highly-expressed codon for one of the naturally-occurring amino acids. In other preferred methods, the selected set may consist of one class of codons, e.g., hydrophobic amino acids. In another preferred method, the selected set may consist of trinucleotides coding for a mixture of amino acids, e.g., acidic and basic amino acids. Preferably, fewer than 2% of the codons correspond to a stop codon; more preferably, fewer than 1%, 0.5%, or 0.1%, of the codons correspond to a stop codon. In still other preferred methods, each of the codons terminates in a cytidine or a guanosine residue.
Any combination of couplings of mononucleosides, mononucleotides, and dinucleotides may be used to generated codons, which are trinucleotides. For example, dinucleotides may be coupled with mononucleosides. Dinucleotides would not be coupled with dinucleotides, as that would generated tetranucleotides.
In a sixth aspect, the invention features a method for generating an oligonucleotide from a selected set of codons. The method includes the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; (b) adding a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; (c) optionally adding a third set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; and (d) optionally repeating step (c) to yield a selected set of codons that includes at least one codon having A or G at the third codon position and in which fewer than 3% of the codons correspond to a stop codon. Steps (a), (b), (c), and (d) occur in the same reaction vessel; these steps are repeated until an oligonucleotide of the desired length is achieved. Preferably, the selected set includes at least one codon for each of the 20 naturally-occurring amino acids, and fewer than 2% of the codons correspond to a stop codon.
By "nucleoside" is meant any sugar-base moiety, including sugar- base moieties in which one or more nitrogen atoms of the nitrogenous bases are protected, and or in which the 5'-OH of the sugar is protected. "Nucleosides" also include nucleoside phosphoramidites and protected phosphoramidites.
By "nucleotide" is meant any sugar-phosphate-base moiety, as well as any derivatized sugar-phosphate-base moiety. One or more nitrogen atoms of the nitrogenous bases can be protected, and/or the 5'-OH of the sugar can be protected. Dinucleotides can include dinucleotide phosphoramidites; in addition, the internucleotide linkage may be protected.
By "oligonucleotide" is meant either a DNA sequence or an RNA sequence; by "nucleic acid" is meant either DNA or RNA.
By "highly-expressed codons" are meant the codons present in higher than normal abundance in highly expressed genes.
By "stop codon" is meant one of the DNA codons TAA, TGA, and TAG; and the RNA codons UAA, UGA, and UAG.
By a "selected set of codons" is meant a set of trinucleotide sequences where each trinucleotide has an assigned representation in the set. For example, a selected set of codons may be a set that contains at least one codon for each of the naturally occuring amino acids (e.g., AAC : CAC : GAC : TAC: ACC : CCC : GCC : TCC : AGC : CGC : GGC : TGC : ATC : CTC : GTC : TTC : AAG : CAG : GAG : TGG : ATG = 1:1:1:1:1:1:1:1:1:1:1:1:1:1:1:1:1:1:1:1:1). Alternatively, a selected set of codons may be a set that contains at least one codon for each of the naturally occurring amino acids, and in which some hydrophobic amino acids (e.g., Val, Leu, lie, Phe) are twice as abundant (e.g., AAC : CAC : GAC : TAC : ACC : CCC : GCC : TCC : AGC : CGC : GGC :TGC : ATC : CTC : GTC : TTC : AAG : CAG : GAG : TGG : ATG - 1:1:1:1:1:1:1:1:1:1:2:2:2:2:2:1:1:1:1:1). A selected set of codons may also be a set that is biased towards basic amino acids (e.g., His, Lys, Arg). The composition of such a set can be, e.g., AAC : CAC : GAC : TAC : ACC : CCC : GCC : AGC : GGC : TGC : ATC : CTC : GTC : TTC : AAG : CAG : GAG : AGG : TGG : ATG=
1:2:1:1:1:1:1:1:1:1:1:1:1:1:2:1:1:2:1:1. Alternatively, a selected set of codons may be a set in which all of the codons code for hydrophobic amino acids (e.g, Pro, Ala, lie, Leu, Val, Phe, Met) in equal distribution (e.g., CCC : GCC : ATC : CTC : GTC : TTC : ATG = 1:1:1:1:1:1:1). The nucleosides and nucleotides used herein are often referred to with shorthand designations, in which the protecting group of the 5'-OH is superscripted. For example, "TC" is used to represent N'-benzoyl 5'-O-(4,4'- dimethoxytrityl) 2'-deoxycytidine or N'-benzoyl 5'-0-(4,4'-dimethoxytrityl) 3'- 0-(allyloxy diisopropylamino phosphinyl) 2'-deoxycytidine, and "FG" is used to represent N2-isobutyryl-5'-0-[(9-fluorenyl)methoxycarbonyl] 2'- deoxyguanosine or N2-isobutyryl-5'-0-[(9-fluorenyl)methoxycarbonyl] 3'-0^ (allyloxy diisopropylamino phosphinyl) 2'-deoxyguanosine.
The present invention provides a number of advantages over conventional techniques of nucleic acid synthesis. For example, the methods described herein provide control over codon format (trinucleotide sequences) as well as control over the representation of the codon in a selected set. The invention can therefore generate sets of codons that contain at least one codon for each of the naturally-occurring amino acids and, importantly, can generate libraries that are substantially free of stop codons. The invention also provides DNA consisting essentially of highly-expressed codons and, as noted above, free of stop codons, which can be efficiently translated to polypeptides of desired lengths. Furthermore, control over codon representation allows for the synthesis of DNA templates that can be used to generate proteins rich in selected amino acids, for example, hydrophobic amino acids, which can be instrumental in protein design techniques.
Brief Description of the Drawings FIGURES 1, 2, 3, 4, 5, 6, 7, 8, and 9 are each illustrations of coupling sequences for the synthesis of codon libraries.
Description of the Preferred Embodiments The sequence of bases in DNA and its RNA counterpart determines the sequence of the amino acids in the protein synthesized from this DNA. Sequences of three bases, referred to as codons, correspond to different amino acids. During translation, these codons are read from the 5' end to the 3' end; the resulting protein has an amino acid sequence that corresponds to the sequence of codons.
Three DNA codons, TAA, TAG, and TGA (which correspond to the RNA codons UAA, UAG, and UGA) do not code for any amino acids. Instead, these codons signal release factors to terminate protein synthesis. The presence of stop codons therefore leads to termination of protein synthesis before the entire DNA sequence is translated.
The invention features convenient methods for the controlled synthesis of codon randomized nucleic acids, such as DNA, in which the presence of stop codons can be avoided. If desired, the DNA strand can be used as a template for the synthesis of a complementary DNA strand, which in turn can serve as a template for the synthesis of the corresponding messenger RNA. Alternatively, messenger RNA can be synthesized directly using the methods of the invention.
As described in more detail below, in the present approach, a desired set of codons, as well as the desired frequency of each codon in the set, is first chosen. The set can include, for example, the most highly-expressed codons for each of the 20 naturally-occurring amino acids, in equal distribution. Highly expressed DNA codons in eukaryotic translation systems typically exhibit either 2'-deoxycytidine (C) or 2'-deoxyguanosine (G) at the 3' end (that is, at the third codon position). Another desired set can include, for example, at least one codon for each of the 20 naturally-occurring amino acids, and in which hydrophobic amino acids are twice as abundant.
In addition, it is generally desirable to omit, from the codon mixture, the known stop codons, TAA, TGA, and TAG, for DNA synthesis, or UAA, UGA, and UAG for RNA synthesis. The omission of these codons from the synthetic library maximizes the likelihood that protein translation is not aborted and that proteins of desired length are generated.
In one particular example, a selected set of highly expressed codons for all 20 naturally-occurring amino acids can be prepared in which all of the codons have a C or G at the third position. Two of the three stop codons, TAA and TGA, are therefore readily excluded from this set.
In addition, none of the stop codons has a C at the third position; codons ending in C can therefore be randomly generated without the introduction of stop codons. The generation of a set of codons ending in C can produce codons for fifteen of the naturally-occurring amino acids.
The exclusion of the stop codon TAG (or, for RNA synthesis, UAG) is more complicated because the best expressed codons for the amino acids Lys, Gin, Glu, Trp, and Met each have a G at position three. The simultaneous generation of codons for these amino acids, and the exclusion of the stop codon TAG, requires a strategic coupling sequence. Several such coupling sequences are described in detail below as part of the present invention.
The present invention also features methods for generating libraries of codons by using nucleosides and nucleotides with different 5 '-protecting groups as building blocks. In preferred methods, the different 5'-protecting groups can be cleaved under orthogonal conditions. In other words, the conditions for cleaving one 5'-protecting group do not cleave the other 5'- protecting groups. An example of one pair of orthogonal protecting groups includes a dimethoxytrityl group (DMT or T), which is cleaved under acidic conditions, and a fluorenylmethyloxycarbonyl group (Fmoc or F), which is cleaved under basic conditions. Another example of a set of orthogonal protecting groups is the set including a dimethoxytrityl group (DMT or T), which is cleaved under acidic conditions, a fluorenylmethyloxycarbonyl group (Fmoc or F), which is cleaved under basic conditions, and a silyl group (S), which is cleaved with fluoride.
In one particular method, a mixture of nucleosides, some of which are protected with a DMT group, and some of which are protected with a Fmoc group, is treated with acid. The DMT-protected nucleosides are deprotected, while the Fmoc protected nucleosides remain protected. When nucleotides are added to this mixture, they couple only with the deprotected nucleosides, allowing for coupling specificity.
In one example, a mixture of -V/-benzoyl-5,-0-(4,4'-dimethoxytrityl)- 2'-deoxycytidine (TC) and N2-isobutyryl-5'-O-[(9-fluorenyl)methoxycarbonyl]- 2'-deoxyguanosine (FG) is treated with acid. The DMT group is cleaved from the C mononucleosides, thus leaving them free to couple with nucleotides. The Fmoc of the G mononucleosides remains attached. Since none of the stop codons end in C, trinucleotides may be randomly generated at this step without the introduction of stop codons. By this technique, successive coupling steps produce a mixture of trityl-protected trinucleotides ending in C. Throughout the coupling steps, the G mononucleosides remain protected and therefore unreactive. Codons with G at the third position are then prepared. The Fmoc groups are cleaved with base. The coupling sequence is designed to avoid synthesis of the codon TAG (or UAG).
In yet further preferred embodiments, similar processes are carried out using silyl protecting groups. Examples of several different coupling schemes are given below. It is to be understood, however, that the invention encompasses additional coupling schemes as well.
Preferably, nucleoside phosphoramidites are used for the coupling reactions. In addition, the internucleotide linkages can be protected with a protecting group, such as an allyl moiety. The allyl protecting group is stable toward both acid and base, but can be cleaved with aqueous ammonia or by palladium (Pd(0)) catalysis.
There now follow particular examples of nucleic acid preparative techniques. These examples are provided for the purpose of illustrating the invention, and should not be construed as limiting.
Example 1 A: Synthesis of-N -benzoy1-5'-( -(4,4'-dimethoxytrity1V '- 0-(a11y1oxy diisopropylamino phosphinyl) 2'-deoxycyτidine (TC)
Allyloxy bis-(diisopropylamino)phosphine is prepared as described in Bannwarth et al., Tetrahedron Lett., 30:4219 (1989), and diisopropylammonium tetrazolide is prepared as described in Barone et al.,
Nucleic Acids Res. 12:4051 (1984). 20 mmol of N*-benzoyl-5'-O-(4,4'- dimethoxytrityl) 2'-deoxycytidine and 10 mmol of diisopropylammonium tetrazolide are taken up in anhydrous acetonitrile and evaporated. 25 mmol of allyloxy bis-(diisopropylamino)phosphine in 200 ml of anhydrous methylene chloride are added to the residue, with stirring. After 10 minutes of stirring at about 25 °C, the reaction mixture is poured into 300 ml saturated NaHC03 solution and extracted with methylene chloride (3 200 ml). The combined organic layers are dried over Na2S04 and concentrated. The crude product is purified by chromatography (silica gel, eluting with CH2Cl2/MeOH/Et3N, 94:4:2). The product, -iy-benzoyl-5'-O-(4,4,-dimethoxytrityl) 3'-0-(allyloxy diisopropylamino phosphinyl) 2'-deoxycytidine (TC), is precipitated from CH2C12 into pentane at -60°C. N<s-benzoyl-5,-O-(4,4'-dimethoxytrityl) 3'-0-(allyloxy diisopropylamino phosphinyl) 2'-deoxyadenine (TA), N sobutyryl-5'-O-(4,4'- dimethoxytrityl) 3'-O-(allyloxy diisopropylamino phosphinyl) 2'- deoxyguanosine (TG), and 5,-O-(4,4'-dimethoxytrityl) 3'-O-(allyloxy diisopropylamino phosphinyl) 2'-deoxy-thymidine (TT) are prepared using the same reaction conditions.
Example 1 : Alternative synthesis of DMT-allyl dC phosphoramidite monomer
5'-0-DMT-Ν4-benzoyl-2'-deoxycytidine (10 g, 15.8 mmol) was dissolved in CH2C12 (100 mL). Diisopropylammonium tetrazolide (1.35 g, 7.88 mmol) was added followed by allyl-N,N,N,N-tetraisopropylphosphoramidite (5g, 17.3 mmol), and the reaction was stirred overnight at room temperature under argon. The reaction mixture was then extracted with 5% NaHC03 (3 x 50 mL), H20 (2 x 50 mL), and dried with NajSO^ The Na2S04 was removed by filtration, and the organics were concentrated to 50 mL under reduced pressure before loading onto a silica gel column (200 g). The column was eluted with EtOAc/heptane/TEA (49/50/1 v:v). Fractions containing the desired product were combined and evaporated under reduced pressure to yield an oily residue, which was applied to another silica gel column (200 g) and eluted with a stepwise gradient of EtOAc (25 → 75%) in heptane containing 1% TEA. Fractions containing the desired product were combined, concentrated under reduced pressure, dissolved in a small amount of CH2C12, and precipitated with heptane to give a white powder (8.4 g): 5,-0-Dimethoxytrityl-3'-2-allyl-N,N-diisopropyl-
N4-benzoyl-2'-deoxycytidine phosphoramidite. Purity was determined to be greater than 95% by both RP-HPLC and 31P NMR.
The chemical stability of the new DMT-allyl dC phosphoramidite monomer was monitored by preparing a 0.1M solution in CDC13 and collecting the 31P NMR spectrum at 24 hour intervals. The monomer was determined to be stable for at least 8 days (i.e., no change in spectrum between 300 and -50 ppm). The coupling ability of the new monomer was evaluated by solid-phase synthesis of the sequence 5'-d(C9T) on an automated DNA synthesizer (Expedite 8909, PerSeptive Biosystems) using a standard coupling protocol provided by the manufacturer, except that the monomer coupling time was increased to 120 seconds. Trityl absorbance data collected from the instrument indicated that the coupling efficiency was comparable to the same sequence prepared with conventional cyanoethyl phosphoramidites, demonstrating that the DMT-allyl dC monomer couples effectively. After completion of the synthesis, the solid support from the two syntheses (synthesized with either DMT-allyl dC monomer or DMT-cyanoethyl dC monomer) was divided into five portions and treated with 1.5 mL of one of the following at the indicated temperature: concentrated ammonium hydroxide at room temperature, concentrated ammonium hydroxide at 55 ° C, a mixture of concentrated ammonium hydroxide in ethanol (3:1 v/v) at 55 °C, a mixture of t-butyl amine/methanol/water (1 :1:2 v/v) at 55 °C, 2M anhydrous ammonia in methanol at 55 °C. After 24 hours, the room temperature ammonium hydroxide sample was decanted and concentrated to dryness in a Speed- Vac. For the other four mixtures incubated at 55 °C, aliquots were removed 8, 17, and 24 hours, and concentrated to dryness in a Speed- Vac. These 26 samples were then analyzed by anion-exchange HPLC under denaturing conditions (Dionex DNAPac PA- 100 column, sodium chloride gradient in 25 mM NaOH, pH 12.4). In the case of 5'-d(C9T) prepared with DMT-allyl dC phosphoramidite, excellent results were obtained with all conditions except the following: concentrated ammonium hydroxide at room temperature for 24 hours, 2M anhydrous ammonia in methanol at 55 °C for 8, 17, or 24 hours. In the case of 5'-d(C9T) prepared with DMT-cyanoethyl dC phosphoramidite, all of the deprotection reagents completely removed the allyl protecting groups except concentrated ammonium hydroxide at room temperature for 24 hours. The preferred deprotection reagent was determined to be concentrated ammonium hydroxide at 55 °C for between 12 and 24 hours.
Example 2 A: Synthesis of /V -isobutyryl-5'-O-[(9- fluorenyl Ymethoxycarbonyl] 3'-O-faπyloxy diisopropylamino phosphinyl) 2'-deoxyguanosine (FG)
3.0 mmol N"-isobutyryl-2'-deoxyguanosine are co-evaporated from
25 ml pyridine twice, then dissolved in 20 ml pyridine and cooled to 0°C. 3.0 mmol 9-fluorenylmethyl chloroformate (Fmoc-chloride) is added to the stirred solution. The reaction is monitored using thin-layer chromatography (eluting with diethyl ether, then chloroform/methanol 9:1). The reaction is terminated by adding ethanediol; the mixture is then concentrated to an oil. The oil is dissolved in chloroform (150 ml) and washed with saturated ΝaHC03 solution. The aqueous phase is extracted twice with chloroform, and the combined chloroform portions are dried over anhydrous NajSO^ filtered, then concentrated to an oil. The oil is co-evaporated from toluene (twice), ethanol, then chloroform, and subjected to a short column chromatography (silica gel) eluting with a gradient of 0-5% methanol in chloroform. Fractions containing the major product are collected, concentrated to a foam, dissolved in chloroform, precipitated with pentane, filtered, and then dried under vacuum to yield N2-isobutyryl-5'-0-[(9-fluorenyl)methoxycarbonyl] 2'-deoxyguanosine (FG).
The product is then converted to the phosphoramidite (also referred to as FG) using the reaction conditions described in Lehmann et al., Nucleic Acids Res., Vol. 17, No. 7, 2379-2390 (1989).
N6-benzoyl-5'-0-[(9-fluorenyl)methoxycarbonyl] 3'-0-(allyloxy diisopropylamino phosphinyl) 2'-deoxyadenine (FA) is prepared using the same reaction conditions.
Example 2B: Alternative synthesis of Fmoc-allyl dG phosphoramidite monomer and its application in DΝA synthesis
Ν2-Isobutyryl-2'-deoxyguanosine (15 g, 44.5 mmol) was evaporated from pyridine (3 x 100 mL) and dissolved in anhydrous pyridine (150 mL).
The solution was cooled to 0°C and Fmoc-Cl (12.6 g, 49 mmol) was added.
After completion of the reaction, as indicated by TLC, the mixture was evaporated to dryness, redissolved in CH2C12 (200 mL) and washed with 5%
NaHC03 (2 x 75 mL) followed by H20 (2 x 75 mL) and brine (1 x 75 mL). The organic layer was dried with NajSO^ filtered, and concentrated to a low volume under reduced pressure. The concentrated organic layer was applied to a silica gel column (700 g) and eluted first with EtOAc, and then with
MeOH/CH2Cl2/EtOAc (5/30/65 v:v). Fractions containing the desired product were combined and concentrated to dryness under reduced pressure to yield a slightly yellow solid (7.1 g).
5'-0-Fmoc-N2-isobutyryldeoxyguanosine (8 g, 14.3 mmol) prepared above was evaporated from pyridine (2 x 100 mL) and then acetonitrile (3 x 100 mL). The resulting solid was dissolved in CH2C12 (100 mL). Allyl-N,N,N,N-tetraisopropylphosphoramidite (5.45 mL, 17.2 mmol) was added to the reaction mixture followed by diisopropylammonium tetrazolide (0.62 g, 3.6 mmol). After two hours of stirring at room temperature, the reaction mixture was washed with 5% NaHC03 (1 x 50 mL), H20 (1 x 100 mL), and then dried with Na^O^ filtered, and concentrated under reduced pressure. The concentrated mixture was applied to a silica gel column (500 g) and eluted with a step wise gradient of EtO Ac (25 - 95%) in heptane containing 1% lutidine, according to the procedure described in Lehmann et al., Nucleic Acids Research 17: 2379 (1989). Fractions containing the desired product were combined and concentrated to dryness under reduced pressure. The residue was dissolved in toluene (15 mL) and precipitated into stirred heptane (1 L). Filtration yielded an off-white powder (4.5 g) which was further purified by silica gel chromatography. The silica gel (400 g) was packed with EtOAc/heptane/lutidine (79/19/2 v:v) and then washed with EtO Ac/heptane (80/20 v:v) prior to applying the partially purified material (4.2 g) from above. The column was eluted with EtOAc/heptane (80/20 v:v), and fractions containing the desired product were combined and evaporated under reduced pressure in the presence of anhydrous toluene (3 x 20 mL) followed by evaporation from anhydrous acetonitrile (2 x 30 mL). Only the last evaporation was taken to dryness, which yielded a white foam (2.6 g): 5,-0-Fmoc-3,-2-allyl-N,N-diisopropyl-N2-isobutyryl-2'-deoxyguanosine phosphoramidite. Purity and identity were established by RP-HPLC, 31P NMR and 1H NMR. The chemical stability of the new Fmoc-allyl dG phosphoramidite monomer was monitored by preparing a 0.1M solution in CDC13 and collecting the 31P NMR spectrum at 24 hour intervals. The monomer was 10% degraded after 1 day and 50% degraded after 3 days, as indicated by the appearance and growth of a new peak at 153 ppm resulting from spontaneous loss of the 5'-Fmoc group. As most syntheses are completed within several hours, the stability of the Fmoc-allyl dG phosphoramidite was deemed suitable. The coupling ability of the new monomer was evaluated by solid-phase synthesis of the sequence 5'-d(G9T) on an automated DNA synthesizer (Expedite 8909, PerSeptive Biosystems). The standard synthesis protocol provided by the manufacturer was modified to increase the coupling time (900 sec), increase the capping step (120 sec), increase the oxidation time (60 sec), and deliver the 5'-Fmoc deprotection reagent for 120 seconds from an auxiliary bottle position. Both O.IM DBU in acetonitrile and 0.1 M piperidine in anhydrous DMF were evaluated as 5'-Fmoc deprotection reagents. The completed 5'-d(G9T) sequences were deprotected in concentrated ammonium hydroxide for 18 hours at 55 °C, concentrated in a Speed- Vac, analyzed by anion-exchange HPLC under denaturing conditions (Dionex DNAPac PA- 100 column, sodium chloride gradient in 25 mM NaOH, pH 12.4), and compared to a control sequence synthesized with standard DMT-dG cyanoethyl phosphoramidites. 0.1M Piperidine in DMF was the preferred 5'-Fmoc deprotection reagent.
Example 3A: Synthesis of N -isobutyry1-5'-( -[trimethy1si1y1] V-O- (allyloxy diisopropylamino phosphinyl) 2'-deoxygυanosine (SG)
2.0 mmol N -isobutyryl-2'-deoxyguanosine are dissolved in 20 ml
DMF and stirred at 25 °C. 3.0 mmol trimethylsilyl chloride and 0.5 mmol imidazole are added to the stirred solution. The reaction is monitored using thin-layer chromatography. When the reaction is complete, the mixture is concentrated to an oil. The oil is dissolved in chloroform and washed with saturated ΝaHC03 solution. The aqueous phase is extracted twice with chloroform, and the combined chloroform portions are dried over anhydrous Na2S04, filtered, then concentrated to an oil. The oil is co-evaporated from toluene (twice), ethanol, then chloroform, and subjected to a short column chromatography (silica gel) eluting with a gradient of 0-5% methanol in chloroform. Fractions containing the major product are collected, concentrated to a foam, dissolved in chloroform, precipitated with pentane, filtered, and then dried under vacuum to yield N -isobutyryl-5'-0-[trimethylsilyl] 2'- deoxyguanosine (SG).
The product is then converted to the phosphoramidite (also referred to as SG) using the reaction conditions described in Example 1.
Example 3B: Synthesis of silyl-allyl dG phosphoramidite monomer and its application in DΝA synthesis
Ν2-Isobutyryl-2'-deoxyguanosine (6.75 g, 20 mmol) was evaporated from pyridine (3 x 100 mL), dissolved in anhydrous pyridine (75 mL) and cooled to 0°C. Bis(trimethylsiloxy)cyclododecyloxy-silyl chloride (8.5 g, 22 mmol) was added to the stirred solution. After two hours the reaction mixture was concentrated to dryness under reduced pressure and resuspended in CH2C12 (100 mL). This solution was washed with 5% NaHC03 (2 x 30 mL), H20 (2 x 30 mL), and then dried with Na^O^ filtered, and concentrated under reduced pressure. The concentrated mixture was applied to a silica gel column (500 g) and eluted with a stepwise gradient of MeOH (0 → 10%) in CH2C12. Fractions containing the desired product were combined and concentrated to dryness under reduced pressure to yield a white solid (9.5 g).
5,-0-Bis(trimethylsiloxy)cyclododecyloxy-silyl-N2-isobutyryl- 2'-deoxyguanosine (9 g, 12.3 mmol) from above was evaporated first from pyridine (2 x 100 mL) and then acetonitrile (3 x 100 mL). The residue was dissolved in anhydrous CH2C12 (100 mL) and allyl-N,N,N,N-tetraisopropylphosphoramidite (4.2 mL, 14.5 mmol) was added to the stirred reaction mixture followed by diisopropylammonium tetrazolide (0.56 g, 3.0 mmol). After two hours the reaction mixture was washed with 5% NaHC03 (1 x 50 mL), H20 (1 x 100 mL) and then dried with Na^O^ filtered, and concentrated under reduced pressure. The concenfrated solution was applied to a silica gel column (500 g) that had been packed with EtOAc/hexane (70:30 v:v) containing 5% TEA. The product was eluted with EtOAc/hexane (70:30 v:v) containing 2% TEA. Fractions containing the desired product were combined and concentrated to dryness under reduced pressure. The residue was taken up in toluene (100 mL) and evaporated to dryness two times, and this process was repeated with anhydrous acetonitrile (3 x 100 mL) to finally give a white foam (8.1 g): 5,-0-Bis(trimethylsiloxy)cyclododecyloxy-silyl-3'- 2-allyl-N,N-diisopropyl-N2-isobutyryl-2'-deoxyguanosine phosphoramidite. Purity was determined to be greater than 98% by both 31P NMR and RP-HPLC. The coupling ability of the new Silyl- allyl dG monomer was evaluated by solid-phase synthesis of the sequence 5'-d(G9T) on an automated DNA synthesizer (Expedite 8909, PerSeptive Biosystems) using a polystyrene solid support (PE BioSy stems, Foster City, CA). The standard 0.2 μmole cyanoethyl phosphoramidite synthesis protocol provided by the manufacturer was modified to accommodate the new chemistries. The modified protocol contained longer monomer coupling steps (240 sec), longer wash times (120 sec), and new cycles to deliver the non-standard Silyl deprotection reagent (HF/TEA, 1.1M: 1.6M in DMF) from an auxiliary bottle position. In addition, the standard trichloroacetic acid reagent was replaced with 3% dichloroacetic acid in CH2C12. The completed 5'-d(G9T) sequences were deprotected in concentrated ammonium hydroxide for 18 hours at 55 °C, concenfrated in a Speed- Vac, analyzed by anion-exchange HPLC under denaturing conditions (Dionex DNAPac PA- 100 column, sodium chloride gradient in 25 mM NaOH, pH 12.4), and compared to a control sequence synthesized with standard DMT-dG cyanoethyl phosphoramidites. These materials were also analyzed by MALDI-TOF mass spectrometry and gave the expected signal at m z 3206.07 for the sequence prepared with the new Silyl-allyl dG phosphoramidite monomer and m z 3206.36 for the sequence prepared with conventional DMT-cyanoethyl dG phosphoramidite monomer (theoretical mass of d(G9T) = 3205.12).
Example 4: Synthesis of 3'-0-fert-huty1-dimethy1si1y1 ?.- deoxythymidine
20 mmol 5'-0-(4,4'-dimethoxytrityl) 2'-deoxythymidine are dissolved in 200 ml DMF and stirred at 25°C. 30 mmol tert-butyldimethylsilyl chloride and 5 mmol imidazole are added to the stirred solution. The reaction is monitored using thin-layer chromatography. When the reaction is complete, the mixture is concentrated to an oil. The oil is dissolved in chloroform (150 ml) and washed with saturated NaHC03 solution. The aqueous phase is extracted twice with chloroform; the combined chloroform portions are dried over anhydrous Na2S04, filtered, then concentrated to an oil. The oil is co- evaporated from toluene (twice), ethanol, then chloroform, and subjected to a short column chromatography (silica gel) eluting with a gradient of 0-5% methanol in chloroform. Fractions containing the major product are collected, concentrated to a foam, dissolved in chloroform, precipitated with pentane, filtered, and then dried under vacuum.
The DMT protecting group is then cleaved as follows. The product is dissolved in 75 ml CH2Cl2/MeOH (8:1 v/v); Amberlyst® 15 ion exchange resin is then added in portions until the surface of the resin remains orange colored. The suspension is stirred 24 hours, the resin is filtered off, and the solution is concentrated in vacuo. The product is precipitated twice from petroleum ether (500 ml) at 40-60°C. N<5-benzoyl-3'-0-2'ert-butyl-dimethylsilyl 2'-deoxyadenine, Ν2- isobutyryl-3'-0-tert-butyl-dimethylsilyl 2'-deoxyguanosine, and N -benzoyl-3'- 0-tert-butyl-dimethylsilyl 2'-deoxycytidine are prepared using the same reaction conditions.
Example 5: Synthesis of TAT dimicleotide phosphoroamiHite
A solution containing a mixture of 15 mmol 3'-0-tert-butyl- dimethylsilyl 2'-deoxythymidine and 24 mmol tetrazole is dried by repeated coevaporation with acetonitrile/toluene. The mixture is then dissolved in 50 ml dry acetonitrile. 15 mmol 7V5-benzoyl-5'-0-(4,4'-dimethoxytrityl) 3'-0- (allyloxy diisopropylamino phosphinyl) 2'-deoxyadenine, which is pre-dried by repeated coevaporation with toluene, in 30 ml dry acetonitrile is added. The reaction is followed by TLC. If the reaction does not go to completion, additional phosphoramidite can be added. When the reaction is complete, the reaction mixture is cooled in an ice bath, and 40 mmol tert-butyl hydroperoxide is added. After about 15 minutes, the solution is concentrated in vacuo. The oil is dissolved in ethyl acetate and washed with a phosphate buffer (pH = 6.8) and water. The solution is dried over anhydrous Νa^-SO^ filtered, and concentrated to dryness.
The TBDMS ether protecting group is cleaved as follows. The product is dissolved in 40 ml THF. 30 mmol tefrabutylammonium fluoride is added, and the reaction mixture is stirred 1 hour at 25 °C. The THF is evaporated in vacuo; water is then added to the concentrated reaction mixture. The resulting mixture is extracted with CH2C12 (3 xl00 ml). The combined organic layers are dried over Νa2S04, filtered, and concentrated. The product is then purified with column chromatography (silica gel, using methanol in
CH2C12 to elute). The product is then converted to the phosphoramidite using the reaction conditions described in Example 1. Other dinucleotide phosphoramidites (e.g., TTG and TAT) are prepared using the same reaction conditions.
Example 6: Synthesis of N -ben7oyl-5'-O-(4J4,-dimethoxytrity1)-2'- deoxycytidine 3'-Q-succιnic acid Λ^-benzoyl-5,-O-(4,4'-dimethoxytrityl)-2'-deoxycytidine and succinic anhydride (10-fold excess) are dissolved in DMF and stirred at 70 °C for 40 hours. The reaction is monitored by TLC (silica gel, development in ether, then chloroform/methanol 9:1). After completion of the reaction, the reaction mixture is taken up in methylene chloride, then washed with 20% aqueous citric acid solution. The aqueous phase is washed twice with methylene chloride. The combined organic layers are washed with water, dried over anhydrous Νa^O^ and concentrated to dryness. The product is purified by chromatography (silica gel, eluting with chloroform and chloroform/methanol 99:1). N2-isobutyryl-5'-O-[(9-fluorenyl)methoxycarbonyl]-2l- deoxyguanosine 3'-O-succinic acid is prepared using the same reaction conditions.
Example 7: Functionalization of support A glass support for use in DΝA synthesis is treated with Fmoc- sarcosine in the presence of dicyclohexylcarbodiimide, followed by removed of the Fmoc group with piperidine/DMF.
Ji -benzoyl-5'-O-(4,4'-dimethoxytrityl)-2'-deoxycytidine 3,-0- succinic acid and N2-isobutyryl-5'-0-[(9-fluorenyl)methoxycarbonyl]-2'- deoxyguanosine 3'-O-succinic acid are dissolved in THF; dicyclohexylcarbodiimide is added to the solution. The reaction is stirred for 0.5 hours at 25 °C, then filtered. The filtrate is evaporated to dryness. The residue is dissolved in DMF, filtered, and shaken with the functionalized glass support for 16 hours. The support is separated by filtration, washed with methylene chloride and diethyl ether. Unreacted amino groups are capped by treatment of the support with a mixture of THF/luti dine/acetic anhydride (8:1:1) and N-methylimidazole in THF. The support is then washed with methylene chloride and diethyl ether, and dried in vacuo.
Example 8A: Synthesis of codons
In one preferred synthetic approach, the codons are built up from the 3'-end, as shown in Figure 1, using solid phase synthesis. A solid phase synthesizer is used, according to the manufacturer's instructions.
A 16:5 mixture of N ^enzoyl-S'-O-^'-dimethoxytrityl)^'- deoxycytidine (TC) and N2-isobutyryl-5l-0-[(9-fluorenyl)methoxycarbonyl]-2'- deoxyguanosine (FG) is attached to a support, as described in Examples 6 and 7.
After the nucleosides have been attached to the glass support, trichloroacetic acid is added to cleave the trityl protecting groups from the TC mononucleosides. Since the Fmoc protecting group is not labile under acidic conditions, the FG mononucleosides remain protected, and therefore unreactive.
A 1 :1 : 1:1 mixture ofN6-benzoyl-5'-O-(4,4"-dimethoxytrityl) 3'-0- (allyloxy diisopropylamino phosphinyl) 2'-deoxyadenine (TA), N -benzoyl-5'- 0-(4,4'-dimethoxytrityl) 3'-O-(allyloxy diisopropylamino phosphinyl) 2'- deoxycytidine (TC), N -isobutyryl-5,-O-(4,4'-dimethoxytrityl) 3'-0-(allyloxy diisopropylamino phosphinyl) 2'-deoxyguanosine (TG), and 5'-0-(4,4'- dimethoxytrityl) 3'-O-(allyloxy diisopropylamino phosphinyl) 2'- deoxythymidine (TT) mononucleoside phosphoramidites is pre-dried by repeated coevaporation with acetonitrile/toluene, then dissolved in dry acetonitrile. The mixture is then added to the mixture of C and FG mononucleosides. When the coupling reaction is complete, a solution of 0.02 iodine in THF/pyridine/water is added to oxidize the products. The result of this series of reactions is a 1 :1 :1 :1 mixture of TAC, TCC, TGC, and TTC dinucleotides, and FG mononucleoside.
The trityl protecting groups of the dinucleotides are then cleaved with acid. The dinucleotides are coupled with a 1 : 1 :1 : 1 mixture of TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized. The result is a mixture of 16 unique codons, each corresponding to a different amino acid (with the exception of TTC and AGC, which both represent serine), and FG mononucleosides. The Fmoc protecting groups of the G mononucleosides are then cleaved with l,8-diazabicyclo[5.4.0]undec-7-ene (DBU), as described in Lehmann et al., Nucleic Acids Res. 17:2379 (1989). The trityl protecting groups of the trinucleotides are not labile under basic conditions; the trinucleotides therefore remain unreactive. The deprotected G mononucleosides are coupled with a 3 : 1 : 1 mixture of FA mononucleoside phosphoramidites, TTG dinucleotide phosphoramidite, and TAT dinucleotide phosphoramidite, and the products of the coupling reactions are oxidized. The result is two more trinucleotide codons, and FAG dinucleotides.
The Fmoc protecting groups of the dinucleotides are once again cleaved with base, the dinucleotides are coupled with a 1:1 : 1 mixture of TA, TC, and TG mononucleoside phosphoramidites, and the products of the coupling reactions are oxidized. The nucleotide T is omitted, as the inclusion of this nucleotide at this point would result in the synthesis of a TAG codon.
As shown in Figure 1, the end result of the successive deprotection and coupling reactions is a mixture of 21 codons, each corresponding to one of the 20 naturally occurring amino acids. All 20 amino acids are represented, and only one amino acid is represented twice. The invention therefore provides a synthesis of a codon set in which all of the amino acids are represented approximately equally. Most importantly, the set contains substantially no stop codons.
The trityl group can be removed from the trinucleotides, and a mixture of TC and FG nucleoside phosphoramidites can be added. The process for synthesizing the codons can then be repeated until DNA of the desired length is achieved.
Figure imgf000029_0001
orthogonal deprotection
The tetramer 5'-d(CSCT), where S is either G or C, was synthesized on an automated DNA synthesizer (Expedite 8909, PerSeptive Biosystems) using the acid/base orthogonal deprotection scheme employing both DMT and Fmoc 5'-hydroxyl protecting groups with allyl-P protection. The standard 0.2 μmole cyanoethyl phosphoramidite synthesis protocol provided by the manufacturer was modified to accommodate the new chemistries. The modified protocol contained longer monomer coupling steps (240 sec), longer wash times (120 sec), and new cycles to deliver the non-standard Fmoc deprotection reagent (0.1M piperidine in anhydrous DMF). In addition, the standard trichloroacetic acid reagent was replaced with 3% dichlorocetic acid in CH2C12. CPG solid support functionalized with 0.2 μmole T monomer was loaded onto the instrument, and DMT-C monomer was added according to the modified protocol. After removal of the 5'-DMT group, equal volumes of Fmoc-G and DMT-C monomer were delivered to the column with the tetrazole coupling agent to form a mixture of two trimers on the solid support: GCT (with 5'-Fmoc protection), and CCT (with 5'-DMT protection). The 5'-DMT was removed with 3% dichloroacetic acid in CH2C12 and DMT-C monomer was then delivered to the column to extend the CCT sequence to CCCT. Next, a 0.1M piperidine solution in DMF was delivered to the column to remove the 5'-Fmoc protecting group. DMT-C monomer was again added to the column to form CGCT from the remaining GCT sequence. The terminal 5'-DMT was removed with 3% dichloroacetic acid in CH2C12 and the CPG support was treated with concentrated ammonium hydroxide at 55° C for 16 hours. The solution was finally cooled, concentrated to dryness on a Speed- Vac, and taken up in water. This material was analyzed by MALDI-TOF mass specfrometry and gave the expected signals at m/z 1110.84 (for dCCCT; theoretical = 1110.80) and m z 1151.23 ( for dCGCT; theoretical = 1150.82 ). A small amount of the crude material was also degraded enzymatically with a mixture of snake venom phosphodiesterase and bacterial alkaline phosphatase to establish the nucleoside ratio. The resulting digest was analyzed quantitatively by RP-HPLC according to the general scheme described in Eadie et al., Anal. Biochem. 165: 442 (1987). The tetramer standard (prepared with conventional cyanoethyl phosphoramidites and the manufacturer's S coupling cycle at base position three) produced 2.4:0.6: 1.0 for the normalized ratio of C:G:T nucleosides, respectively, compared to a theoretical value of 2.5:0.5: 1. The same tetramer prepared via the acid/base orthogonal deprotection scheme produced 2.2:0.8:1.0 for the normalized ratio of C:G:T nucleosides.
The 15-mer 5*-d(ACGTGGCTGAACSCT), where S is either G or C, was also synthesized on an automated DNA synthesizer using the same acid/base orthogonal deprotection scheme described above. The terminal 5'-DMT was removed with 3% dichloroacetic acid in CH2C12, and the CPG support was treated with concentrated ammonium hydroxide at 55° C for 16 hours. The solution was finally cooled, concentrated to dryness on a Speed- Vac, and taken up in water. The mixture was analyzed by anion-exchange HPLC under denaturing conditions (Dionex DNAPac PA- 100 column, sodium chloride gradient in 25 mM NaOH, pH 12.4) and revealed two closely spaced peaks (retention time difference = 30 sec), corresponding to the two 15-mers differing by one base (C or G at position 3). In the case of the sequences prepared via the acid/base orthogonal deprotection scheme, the ratio of the peak areas was 0.61 :0.39, whereas in the case of the standard (prepared with conventional cyanoethyl phosphoramidites and the manufacturer's S coupling cycle at base position three) the ratio of the peak areas was 0.51 :0.49. The sequence prepared by the orthogonal deprotection scheme was also analyzed by MALDI-TOF mass spectrometry and gave the expected signals at m/z 4553.63 (for dACGTGGCTGAACCCT; theoretical = 4553.04) and m z 4592.56 (for dACGTGGCTGAACGCT; theoretical = 4593.07).
Example 8C: Synthesis of CCC/CGC codons (Pro/Arg) via acid/fluoride orthogonal deprotection
The tetramer 5'-d(CSCT), where S is either G or C, was synthesized on an automated DNA synthesizer using the acid/fluoride orthogonal deprotection scheme employing both DMT and Silyl 5'-hydroxyl protecting groups with allyl-P protection. The standard 0.2 μmole cyanoethyl phosphoramidite synthesis protocol provided by the manufacturer was modified to accommodate the new chemistries. The modified protocol contained longer monomer coupling steps (240 sec), longer wash times (120 sec), and new cycles to deliver the non-standard silyl deprotection reagent (HF/TEA, 1. IM: 1.6M in DMF) over 180 seconds. In addition, the standard trichloroacetic acid reagent was replaced with 3% dichlorocetic acid in CH2C12. Polystyrene solid support functionalized with 0.2 μmole T monomer (PE Biosystems, Foster City, CA) was loaded onto the instrument, and DMT-C monomer was added according to the modified protocol. After removal of the 5'-DMT group, equal volumes of Silyl-G and DMT-C were delivered to the column with the tefrazole coupling agent to form a mixture of two trimers: GCT (with 5'-Silyl protection), and CCT (with 5'-DMT protection). The 5'-DMT was removed with 3% dichloroacetic acid in CH2C12 and DMT-C monomer was delivered to the column to extend the CCT sequence to CCCT. Next, an HF/TEA mixture in DMF (1.1M:1.6M) was delivered to the column to remove the 5'-Silyl protecting group. DMT-C monomer was again added to the column to form CGCT from the remaining GCT sequence. The terminal 5'-DMT was removed with 3% dichloroacetic acid in CH2C12 and the CPG support was treated with concentrated ammonium hydroxide at 55 °C for eight hours. The solution was finally cooled and concentrated on a Speed- Vac. A portion of this material was purified by anion-exchange HPLC (Dionex DNAPac PA- 100 column, sodium chloride gradient in 25 mM NaOH, pH 12.4), and degraded enzymatically with a mixture of snake venom phosphodiesterase and bacterial alkaline phosphatase to establish the nucleoside ratio. The resulting digest was analyzed quantitatively by RP-HPLC according to the general scheme described in Eadie et al., Anal. Biochem. 165: 442 (1987). The tetramer standard (prepared with conventional cyanoethyl phosphoramidites and the manufacturer's S coupling cycle at base position three) produced 2.0:1.0:1.0 for the normalized ratio of C:G:T nucleosides, respectively, compared to a theoretical value of 2.5:0.5:1. The same tetramer prepared via the acid/base orthogonal deprotection scheme produced the normalized ratio 1.9: 1.1 : 1.0 for C:G:T nucleosides. The 15-mer 5*-d(ACGTGGCTGAACSCT), where S is either G or C, was synthesized on an automated DNA synthesizer using the acid/fluoride orthogonal deprotection scheme described above. The terminal 5'-DMT was removed with 3% dichloroacetic acid in CH2C12 and the polystyrene support was treated with concenfrated ammonium hydroxide at 55° C for 16 hours. The solution was finally cooled, concentrated to dryness on a Speed- Vac, and taken up in water. The mixture was analyzed by anion-exchange HPLC under denaturing conditions (Dionex DNAPac PA- 100 column, sodium chloride gradient in 25 mM NaOH, pH 12.4) and revealed two closely spaced peaks (retention time difference = 30 sec.) corresponding to the two 15-mers differing by one base (C vs. G) at position three. In the case of the sequences prepared via the acid/fluoride orthogonal deprotection scheme, the ratio of the peak areas at 260 nm was 0.34:0.66, whereas in the case of the standard (prepared with conventional cyanoethyl phosphoramidites and the manufacturer's S coupling cycle at base position three) the ratio of the peak areas was 0.4:0.6. This material was also analyzed by MALDI-TOF mass spectrometry and gave the expected signals at m/z 455267.63 (for dACGTGGCTGAACCCT; theoretical = 4553.04) and m/z 4593.41 (for dACGTGGCTGAACGCT; theoretical - 4593.07).
Example 9: Removal of oligonucleotide from support At the end of the above-described coupling reactions, the support is treated with concentrated ammonia at 70 °C for 2 hours in a tightly closed Eppendorf tube, to cleave the oligonucleotides from the support. After filfration, the ammonia solution is evaporated on a speed- vac concentrator. The residue is taken up in water and centrifuged (15 minutes, 0°C). DNA is precipitated from the supernatant by the addition of dioxane and THF. After centrifuging (15 minutes, 0°C), the pellet is dissolved in water. The product DNA is purified by reverse-phase HPLC. Alternatively, the support material is treated under argon with
Pd(PPh3)4/morpholine in THF/DMSO/0.5 M HC1 (2/2/2/1) at 25 °C. The support is washed with THF and acetone and treated with concenfrated NH3 for 2 hours at 25 °C. After filtration the ammonia solution is evaporated, the residue is dissolved in water, and the DNA is purified by HPLC.
Example 10: Synthesis of random codons
In other preferred synthetic approaches, examples 10-13 are carried out using the general methods described in Example 8; the successive coupling reactions take place in the same reaction vessel.
In a first approach, a 14:6 mixture of TC and FG is attached to a support, as described in Examples 6 and 7. The trityl protecting groups are cleaved with trichloroacetic acid. The C mononucleosides are then coupled with a 1 :1 :1:1: 1 :1:1:1 : 1 :1:1 :1: 1 : 1 mixture of TAA, TCA, TGA, TTA, TAC, TCC, TGC, TAG, TGG, TTG, TAT, TCT, TGT, and TTT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized. The result of these reactions is a mixture of 14 unique codons, each representing a different amino acid, and FG mononucleoside, as shown in Figure 2.
The Fmoc protecting groups of the G mononucleosides are then cleaved with DBU, as described in Example 6. The deprotected mononucleosides are coupled with a 1: 1: 1:1:1:1 mixture of TAA, TCA, TGA, TAG, TTG, and TAT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized. The end result of these coupling reactions is a mixture of 20 unique trinucleotides, each representing a codon for one of the 20 naturally-occurring amino acids, as shown in Figure 2. Once again, no stop codons are present in the mixture.
This process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
Example 11 : Synthesis of codons
In another preferred approach, a 16:5 mixture of TC and FG mononucleosides is attached to a support. The trityl protecting groups are cleaved with trichloroacetic acid. The C mononucleosides are then coupled with a 1 : 1 : 1 :1 mixture of
TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized. The result of these reactions is a 1 : 1: 1 : 1 mixture of TAC, TCC, TGC, and TTC dinucleotides, and FG mononucleoside.
The trityl protecting groups of the dinucleotides are then cleaved with acid. The dinucleotides are coupled with a 1 : 1 : 1 : 1 mixture of TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized. The result is a mixture of 16 unique codons, each representing a different amino acid (with the exception of TTC and AGC, which both correspond to serine), and FG mononucleoside.
The protecting groups of the FG mononucleosides are then cleaved with DBU. The G mononucleosides are coupled with a 1 :1:1:1 : 1 mixture of TAA, TCA, TGA, TTG, and TAT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
As shown in Figure 3, the end result of the successive deprotection and coupling reactions is a mixture of 21 codons. The process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
Example 12: Synthesis of trinucleotides
In yet another preferred synthetic approach, a 16:6 mixture of TC and FG mononucleosides is attached to a support, and the trityl protecting groups are cleaved with trichloroacetic acid. The C mononucleosides are coupled with a 1: 1:1 :1 mixture of TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized. The result of these reactions is a 1 :1:1:1 mixture of TAC, TCC, TGC, and TTC dinucleotides, and FG mononucleoside.
The trityl protecting groups of the dinucleotides are then cleaved with acid. The dinucleotides are coupled with a 1 : 1 :1 : 1 mixture of TA, TC, TG, and TT mononucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
The protecting groups of the FG mononucleosides are then cleaved with DBU. The G mononucleosides are coupled with a 3: 1 : 1 : 1 mixture of FA nucleoside phosphoramidite and TTG, TAT, and TCU dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized. The Fmoc protecting groups are cleaved from the dinucleotides, and a 1 :1 : 1 mixture of TA, TC, and TG mononucleoside phosphoramidites is added; the products of the coupling reactions are then oxidized.
As shown in Figure 4, the result of these successive deprotection and coupling reactions is a mixture of 22 codons, each corresponding to an amino acid. The synthetic scheme results in the generation of a set of codons in which the amino acids Ser and Leu are twice as abundant as the other naturally occurring amino acids. This distribution is close to the amino acid distribution typically found in biological proteins. The process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
Example 1 : Synthesis of trinucleotides In another preferred synthetic approach, a 16:6 mixture of TC and FG mononucleotides is attached to a support. The trityl protecting groups are cleaved with trichloroacetic acid.
The C mononucleosides are coupled with a 1: 1 :1 :1 mixture of TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized. The result of these reactions is a 1 : 1:1 : 1 mixture of TAC, TCC, TGC, and TTC dinucleotides, and FG mononucleoside.
The trityl protecting groups of the dinucleotides are then cleaved with acid. The dinucleotides are coupled with a 1 : 1 :1 : 1 mixture of TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
The protecting groups of the FG mononucleotides are then cleaved with DBU. The G mononucleosides are coupled with a 1 : 1 :1 :1 :1 :1 mixture of TAA, TCA, TGA, TTG, TAT, and TCU dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
As shown in Figure 5, the result of the successive deprotection and coupling reactions is a mixture of 22 codons. The synthetic scheme results in the generation of a set of codons in which the amino acids Ser and Leu are twice as abundant as the other naturally occurring amino acids. This distribution represents the amino acid distribution found in biological proteins. The process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
Example 14: Synthesis of trinucleotides using three protecting
Figure imgf000037_0001
In an additional preferred synthetic approach, a 16:3:2 mixture of TC, FG, and SG mononucleosides is attached to a support. The trityl protecting groups are cleaved with trichloroacetic acid. The C mononucleosides are coupled with a 1 : 1 : 1 : 1 mixture of TA,
TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized. The result of these reactions is a 1:1:1:1 mixture of TAC, TCC, TGC, and TTC dinucleotides, FG mononucleosides, and SG mononucleosides . The trityl protecting groups of the dinucleotides are cleaved with acid. The dinucleotides are coupled with a 1 : 1 : 1 : 1 mixture of TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
The protecting groups of the FG mononucleotides are cleaved with DBU. The G mononucleosides are coupled with FA mononucleoside phosphoramidite, and the products of the coupling reactions are oxidized.
The Fmoc protecting groups are again cleaved. The dinucleotides are coupled with a 1:1 :1 mixture of TA, TC, and TG mononucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
The silyl protecting groups are cleaved with anhydrous tefra-n- butylammonium fluoride. The G mononucleosides are coupled with a 1 : 1 mixture of FG and ST mononucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
The Fmoc protecting group of the dinucleotide is cleaved, and the dinucleotide is coupled with TT mononucleoside phosphoramidite. The product of the coupling reaction is oxidized. Finally, the silyl group of the STG dinucleotide is cleaved. The dinucleotide is coupled with TA, and the product is oxidized.
As shown in Figure 6, the result of the successive deprotection and coupling reactions is a mixture of 21 codons. The process for synthesizing the codons can be repeated until DNA of the desired length is achieved.
Example 15: Synthesis of hydrophobic amino acids
In yet another preferred synthetic approach, a 6: 1 mixture of TC and FG mononucleosides is attached to a support, and the trityl protecting groups are cleaved with trichloroacetic acid. The C mononucleosides are coupled with a 1 : 1 : 1 : 1 : 1 : 1 mixture of TCC, TGC, TAT, TCT, TGT, and TTT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized. The Fmoc protecting group of the FG mononucleoside is then cleaved. The mononucleoside is coupled with a TAT dinucleotide phosphoramidite, and the product of the coupling reaction is oxidized.
As shown in Figure 7, the result of these successive deprotection and coupling reactions is a mixture of 7 codons, each corresponding to a hydrophobic amino acid (Pro, Ala, He, Leu, Val, Phe, or Met). Example 16: Synthesis of codons with a bias for hydrophobic amino acids
In another preferred approach, a 16:5 mixture of TC and FG mononucleosides is attached to a support. The trityl protecting groups are cleaved with trichloroacetic acid.
The C mononucleosides are then coupled with a 1 : 1: 1 :2 mixture of TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized. The result of these reactions is a 1 : 1 : 1 :2 mixture of TAC, TCC, TGC, and TTC dinucleotides, and FG mononucleoside. The trityl protecting groups of the dinucleotides are then cleaved with acid. The dinucleotides are coupled with a 1 :1 : 1 :1 mixture of TA, TC, TG, and TT nucleoside phosphoramidites, and the products of the coupling reactions are oxidized.
The protecting groups of the FG mononucleosides are then cleaved with DBU. The G mononucleosides are coupled with a 1 : 1 : 1 : 1 : 1 mixture of TAA, TCA, TGA, TTG, and TAT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
As shown in Figure 8, the end result of the successive deprotection and coupling reactions is a mixture of 20 codons; the codons ATC, CTG, GTC, and TTC, which correspond to the hydrophobic amino acids He, Leu, Val, and Phe, are represented twice. The process for synthesizing the codons can be repeated until DNA of the desired length is achieved. The resulting DNA will code for proteins with a high percentage of hydrophobic amino acids.
Example 17: Synthesis of codons with a bias for basic amino acids In another preferred approach, a 14:6 mixture of TC and FG mononucleosides is attached to a support. The trityl protecting groups are cleaved with trichloroacetic acid. The C mononucleosides are then coupled with a 1:2:1 : 1 : 1 : 1 : 1 : 1 : 1:1 : 1 :1: 1 : 1 mixture of TAA, TCA, TGA, TTA, TAC, TCC, TGC, TAG, TGG, TTG, TAT, TCT, TGT, and TTT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized. The protecting groups of the FG mononucleosides are then cleaved with DBU. The G mononucleosides are coupled with a 2:l:l :2:l:l mixture of TAA, TCA, TGA, TAG, TTG, and TAT dinucleotide phosphoramidites, and the products of the coupling reactions are oxidized.
As shown in Figure 9, the end result of the successive deprotection and coupling reactions is a mixture of 20 codons; the codons CAC, AAG, and AGG, which correspond to the basic amino acids His, Lys, and Arg, are represented twice. The process for synthesizing the codons can be repeated until DNA of the desired length is achieved. The resulting DNA will code for proteins with a high percentage of basic amino acids.
Example 1 : Combined synthetic approaches
In addition to the above schemes, the coupling approaches described in Examples 8 and 10-17 can be combined, in succession, to synthesize DNA.
For example, after a group of codons is prepared as described in Example 8, the scheme described in Example 10 may be used to generate the next set of codons. This process may be continued until DNA of the desired length is achieved.
Alternatively, the trinucleotides generated by any approach may be cleaved from the support using concentrated ammonia at room temperature.
The 3'-OH group is then derivatized with allyloxy bis- (diisopropylamino)phosphine to yield the trinucleotide phosphoramidite, and the trinucleotide phosphoramidites are then used as building blocks to synthesize DNA. Use
The methods of the invention may be used for any application in which nucleic acid synthesis is required. For example, these methods can be used in the synthesis of single-stranded DNA. Frequently, this DNA serve as a template for the synthesis of a complementary DNA strand, which can in turn serves as a template for messenger RNA synthesis.
Because of this application, the methods of the invention find use, for example, in techniques of randomized cassette mutagenesis of proteins, phage display techniques, ribosome display techniques, and protein-nucleic acid fusion techniques.
Codon-randomized DNA can also be used in cellular cultures (in vivo) for protein expression, or for in vitro applications using, for example, T7 RNA polymerase, and in vitro translation systems.
All publications and patents mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent was specifically and individually indicated to be incorporated by reference.
Other Embodiments From the foregoing description, it will be apparent that variations and modifications may be made to the invention described herein to adopt it to various usages and conditions. Such embodiments are also within the scope of the following claims.
What is claimed is:

Claims

Oaims 1. A method for generating a selected set of codons, said method comprising the steps of:
(a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or mixture thereof, wherein a subset A of said first set is protected with a protecting group A', and a subset B of said first set is protected with a protecting group B', wherein A' and B' are orthogonal protecting groups;
(b) selectively removing said protecting group A' from said subset A;
(c) coupling the products of step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein said second set is protected with said protecting group A';
(d) optionally removing said protecting group A' from the products of step (c);
(e) optionally coupling the products of step (d) with a third set of mononucleosides, wherein said third set is protected with said protecting group A';
(f) selectively removing said protecting group B' from said subset B; (g) coupling the products of step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein said fourth set is protected with said protecting group A' or said protecting group B';
(h) optionally selectively removing said protecting group B' from the products of step (g); and
(i) optionally coupling the products of step (h) with a fifth set of mononucleosides, to yield a selected set of codons.
2. The method of claim 1, wherein said selected set of codons comprises at least one codon corresponding to each of the 20 naturally- occurring amino acids.
3. The method of claim 2, wherein each of said codons corresponds to a highly-expressed codon for one of the 20 naturally-occurring amino acids.
4. The method of claim 1, wherein said selected set of codons consists essentially of codons for hydrophobic amino acids, consists essentially of codons for hydrophilic amino acids, consists essentially of codons for basic amino acids, or consists essentially of codons for acidic amino acids.
5. The method of claim 1, wherein fewer than 3%, fewer than 2%, fewer than 1%, fewer than 0.5%, or fewer than 0.1% of said codons correspond to a stop codon.
6. The method of claim 1, wherein steps (a) to (i) take place in the same reaction vessel.
7. The method of claim 1, wherein said protecting groups A' and B' are two different groups and are chosen from an acid-cleavable protecting group, a base-cleavable protecting group, or a fluoride-cleavable protecting group.
8. The method of claim 7, wherein said protecting groups A' and B' are two different groups and are chosen from a dimethoxytrityl group, a fluorenylmethyloxycarbonyl group, or a silyl group.
9. The method of claim 1, wherein each of said codons terminates in a cytidine or a guanosine residue.
10. A method for generating an oligonucleotide from a selected set of codons, said method comprising the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein a subset A of said first set is protected with a protecting group A', and a subset B of said first set is protected with a protecting group B', wherein A' and B' are orthogonal protecting groups; (b) selectively removing said protecting group A' from said subset
A;
(c) coupling the products of step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein said second set is protected with said protecting group A'; (d) optionally removing said protecting group A' from the products of step (c);
(e) optionally coupling the products of step (d) with a third set of mononucleosides, wherein said third set is protected with said protecting group A'; (f) selectively removing said protecting group B' from said subset B;
(g) coupling the products of step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein said fourth set is protected with said protecting group A' or said protecting group B'; (h) optionally selectively removing said protecting group B' from the products of step (g);
(i) optionally coupling the products of step (h) with a fifth set of mononucleosides;
(j) removing the protecting groups from the products of step (g) or (i); and
(k) repeating steps (a) to (j) until an oligonucleotide with the desired length is achieved.
11. The method of claim 10, wherein steps (a) to (k) take place in the same reaction vessel.
12. A method for generating a selected set of codons, said method comprising the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein a subset A of said first set is protected with a protecting group A', a subset B of said first set is protected with a protecting group B', and a subset C of said first set is protected with a protecting group C, wherein A', B', and C are orthogonal protecting groups; (b) selectively removing said protecting group A' from said subset
A;
(c) coupling the product formed in step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein said second set is protected with said protecting group A'; (d) optionally removing said protecting group A' from the products of step (c);
(e) optionally coupling the products of step (d) with a third set of mononucleosides, wherein said third set of mononucleosides is protected with said protecting group A'; (f) selectively removing said protecting group B' from said subset B;
(g) coupling the products formed in step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein said fourth set is protected with said protecting group A' or said protecting group B';
(h) optionally selectively removing said protecting group B' from the products of step (g);
(i) optionally coupling the products of step (h) with a fifth set of mononucleosides, wherein said fifth set is protected with protecting group A';
(j) selectively removing said protecting group C from said subset C;
(k) coupling the products formed in step (j) with a sixth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein a subset of said sixth set is protected with said protecting group C, and the remainder of said sixth set is protected with protecting group B';
(1) optionally selectively removing said protecting group B' from the products of step (k); (m) optionally coupling the products of step (1) with a seventh set of mononucleosides, wherein said seventh set is protected with protecting group A' or protecting group B';
(n) selectively removing said protecting group C from the products of step (m); and (o) coupling the products of step (n) with an eighth set of mononucleosides, to yield a selected set of codons.
13. The method of claim 12, wherein steps (a) to (o) take place in the same reaction vessel.
14. The method of claim 12, wherein one of said protecting groups A', B', and C is an acid-cleavable protecting group, one of said protecting groups A', B', and C is a base-cleavable protecting group, and one of said protecting groups A', B', and C is a fluoride-cleavable protecting group.
15. The method of claim 14, wherein one of said protecting groups A', B', and C is a dimethoxytrityl group, one of said protecting groups A', B', and C is a fluorenylmethyloxycarbonyl group, and one of said protecting groups A', B', and C is a silyl group.
16. A method for generating an oligonucleotide from a selected set of codons, said method comprising the steps of:
(a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein a subset A of said first set is protected with a protecting group A', a subset B of said first set is protected with a protecting group B', and a subset C of said first set is protected with a protecting group C, wherein A', B', and C are orthogonal protecting groups;
(b) selectively removing said protecting group A' from said subset A; (c) coupling the product formed in step (b) with a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein said second set is protected with said protecting group A';
(d) optionally removing said protecting group A' from the products of step (c); (e) optionally coupling the products of step (d) with a third set of mononucleosides, wherein said third set is protected with said protecting group A';
(f) selectively removing said protecting group B' from said subset B;
(g) coupling the products formed in step (f) with a fourth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein said fourth set is protected with said protecting group A' or said protecting group B';
(h) optionally selectively removing said protecting group B' from the products of step (g);
(i) optionally coupling the products of step (h) with a fifth set of mononucleosides, wherein said fifth set is protected with protecting group A';
(j) selectively removing said protecting group C from said subset C;
(k) coupling the products formed in step (j) with a sixth set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof, wherein a subset of said sixth set is protected with said protecting group C, and the remainder of said sixth set is protected with protecting group B ' ;
(1) optionally selectively removing said protecting group B' from the products of step (k);
(m) optionally coupling the products of step (1) with a seventh set of mononucleosides, wherein said seventh set is protected with protecting group A' or protecting group B ' ;
(n) selectively removing said protecting group C from the products of step (m);
(o) coupling the products of step (n) with an eighth set of mononucleosides; (p) removing the protecting groups from the products of step (o); and
(q) repeating steps (a) to (p) until an oligonucleotide with the desired length is achieved.
17. The method of claim 16, wherein steps (a) to (q) take place in the same reaction vessel.
18. A method for generating, in the same reaction vessel, a selected set of codons, said method comprising the steps of: (a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof;
(b) adding a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; (c) optionally adding a third set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof; and
(d) optionally repeating step (c) to yield a selected set of codons, wherein said selected set includes at least one codon having A or G at the third codon position, and wherein fewer than 3% of the codons in said selected set correspond to a stop codon.
19. The method of claim 18, wherein said selected set of codons includes at least one codon for each of the 20 naturally-occurring amino acids.
20. The method of claim 19, wherein each of said codons corresponds to a highly-expressed codon for one of the 20 naturally-occurring amino acids.
21. The method of claim 18, wherein said selected set of codons consists essentially of codons for basic amino acids or consists essentially of codons for hydrophobic amino acids.
22. The method of claim 18, wherein fewer than 2%, fewer than 1%, fewer than 0.5%, or fewer than 0.1 % of said codons correspond to a stop codon.
23. The method of claim 18, wherein each of said codons terminates in a cytidine or a guanosine residue.
24. A method for generating an oligonucleotide from a selected set of codons, said method comprising the steps of:
(a) providing a first set of mononucleosides, mononucleotides, dinucleotides, or mixture thereof; (b) adding a second set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof;
(c) optionally adding a third set of mononucleosides, mononucleotides, dinucleotides, or a mixture thereof;
(d) optionally repeating step (c) to yield a selected set of codons, wherein said selected set includes at least one codon having A or G at the third codon position, wherein fewer than 3% of the codons in said selected set correspond to a stop codon, and wherein steps (a), (b), (c) and (d) occur in the same reaction vessel; and
(e) repeating steps (a) to (d) until an oligonucleotide of the desired length is achieved.
25. The method of claim 24, wherein said selected set of codons includes at least one codon for each of the 20 naturally-occurring amino acids.
26. The method of claim 24, wherein fewer than 2% of said codons correspond to a stop codon.
PCT/US1999/022436 1998-09-29 1999-09-28 Synthesis of codon randomized nucleic acids WO2000018778A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU62704/99A AU6270499A (en) 1998-09-29 1999-09-28 Synthesis of codon randomized nucleic acids

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10229998P 1998-09-29 1998-09-29
US60/102,299 1998-09-29

Publications (2)

Publication Number Publication Date
WO2000018778A1 true WO2000018778A1 (en) 2000-04-06
WO2000018778A9 WO2000018778A9 (en) 2000-06-08

Family

ID=22289166

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/022436 WO2000018778A1 (en) 1998-09-29 1999-09-28 Synthesis of codon randomized nucleic acids

Country Status (2)

Country Link
AU (1) AU6270499A (en)
WO (1) WO2000018778A1 (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0984021A2 (en) * 1998-08-03 2000-03-08 Hewlett-Packard Company Synthesis of oligonucleotides
US6596539B1 (en) 1997-10-31 2003-07-22 Maxygen, Inc. Modification of virus tropism and host range by viral genome shuffling
US6689568B2 (en) 2001-02-01 2004-02-10 Agilent Technologies, Inc. Capture arrays using polypeptide capture agents
US6841535B2 (en) 2000-07-31 2005-01-11 Active Motif Peptide-mediated transfection agents and methods of use
US6852850B2 (en) 2001-10-31 2005-02-08 Agilent Technologies, Inc. Use of ionic liquids for fabrication of polynucleotide arrays
WO2006093157A1 (en) * 2005-02-28 2006-09-08 Tokyo Institute Of Technology Oligonucleotide derivative, probe for detection of gene, and dna chip
US7135565B2 (en) 2000-07-28 2006-11-14 Agilent Technologies, Inc. Synthesis of polynucleotides using combined oxidation/deprotection chemistry
US7411061B2 (en) 2001-10-31 2008-08-12 Agilent Technologies, Inc. Method of synthesizing polynucleotides using ionic liquids
WO2017127750A1 (en) 2016-01-22 2017-07-27 Modernatx, Inc. Messenger ribonucleic acids for the production of intracellular binding polypeptides and methods of use thereof
WO2017201328A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. POLYNUCLEOTIDES ENCODING α-GALACTOSIDASE A FOR THE TREATMENT OF FABRY DISEASE
WO2017201325A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Combinations of mrnas encoding immune modulating polypeptides and uses thereof
WO2017201348A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding galactose-1-phosphate uridylyltransferase for the treatment of galactosemia type 1
WO2017201332A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding acyl-coa dehydrogenase, very long-chain for the treatment of very long-chain acyl-coa dehydrogenase deficiency
WO2017201342A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding jagged1 for the treatment of alagille syndrome
WO2017201350A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding interleukin-12 (il12) and uses thereof
WO2017201349A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding citrin for the treatment of citrullinemia type 2
WO2017201346A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding porphobilinogen deaminase for the treatment of acute intermittent porphyria
WO2018213731A1 (en) 2017-05-18 2018-11-22 Modernatx, Inc. Polynucleotides encoding tethered interleukin-12 (il12) polypeptides and uses thereof
EP4324473A2 (en) 2014-11-10 2024-02-21 ModernaTX, Inc. Multiparametric nucleic acid optimization

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5703218A (en) * 1990-07-27 1997-12-30 Chiron Corporation Oligonucleotides containing hydroxyl-protecting groups orthogonally removable by reduction
US5889136A (en) * 1995-06-09 1999-03-30 The Regents Of The University Of Colorado Orthoester protecting groups in RNA synthesis

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5703218A (en) * 1990-07-27 1997-12-30 Chiron Corporation Oligonucleotides containing hydroxyl-protecting groups orthogonally removable by reduction
US5889136A (en) * 1995-06-09 1999-03-30 The Regents Of The University Of Colorado Orthoester protecting groups in RNA synthesis

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6596539B1 (en) 1997-10-31 2003-07-22 Maxygen, Inc. Modification of virus tropism and host range by viral genome shuffling
EP1553102A1 (en) * 1998-08-03 2005-07-13 Agilent Technologies, Inc. Synthesis of oligonucleotides
EP0984021A3 (en) * 1998-08-03 2001-08-01 Agilent Technologies Inc. Synthesis of oligonucleotides
EP0984021A2 (en) * 1998-08-03 2000-03-08 Hewlett-Packard Company Synthesis of oligonucleotides
US7135565B2 (en) 2000-07-28 2006-11-14 Agilent Technologies, Inc. Synthesis of polynucleotides using combined oxidation/deprotection chemistry
US6841535B2 (en) 2000-07-31 2005-01-11 Active Motif Peptide-mediated transfection agents and methods of use
US6689568B2 (en) 2001-02-01 2004-02-10 Agilent Technologies, Inc. Capture arrays using polypeptide capture agents
US6852850B2 (en) 2001-10-31 2005-02-08 Agilent Technologies, Inc. Use of ionic liquids for fabrication of polynucleotide arrays
US7411061B2 (en) 2001-10-31 2008-08-12 Agilent Technologies, Inc. Method of synthesizing polynucleotides using ionic liquids
US7435810B2 (en) 2001-10-31 2008-10-14 Agilent Technologies, Inc. Use of ionic liquids for fabrication of polynucleotide arrays
WO2006093157A1 (en) * 2005-02-28 2006-09-08 Tokyo Institute Of Technology Oligonucleotide derivative, probe for detection of gene, and dna chip
US7851157B1 (en) 2005-02-28 2010-12-14 Tokyo Institute Of Technology Oligonucleotide derivative, probe for detection of gene, and DNA chip
EP4324473A2 (en) 2014-11-10 2024-02-21 ModernaTX, Inc. Multiparametric nucleic acid optimization
WO2017127750A1 (en) 2016-01-22 2017-07-27 Modernatx, Inc. Messenger ribonucleic acids for the production of intracellular binding polypeptides and methods of use thereof
WO2017201325A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Combinations of mrnas encoding immune modulating polypeptides and uses thereof
WO2017201348A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding galactose-1-phosphate uridylyltransferase for the treatment of galactosemia type 1
WO2017201332A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding acyl-coa dehydrogenase, very long-chain for the treatment of very long-chain acyl-coa dehydrogenase deficiency
WO2017201342A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding jagged1 for the treatment of alagille syndrome
WO2017201350A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding interleukin-12 (il12) and uses thereof
WO2017201349A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding citrin for the treatment of citrullinemia type 2
WO2017201346A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. Polynucleotides encoding porphobilinogen deaminase for the treatment of acute intermittent porphyria
EP3896164A1 (en) 2016-05-18 2021-10-20 ModernaTX, Inc. Polynucleotides encoding alpha-galactosidase a for the treatment of fabry disease
EP4137509A1 (en) 2016-05-18 2023-02-22 ModernaTX, Inc. Combinations of mrnas encoding immune modulating polypeptides and uses thereof
EP4186518A1 (en) 2016-05-18 2023-05-31 ModernaTX, Inc. Polynucleotides encoding interleukin-12 (il12) and uses thereof
WO2017201328A1 (en) 2016-05-18 2017-11-23 Modernatx, Inc. POLYNUCLEOTIDES ENCODING α-GALACTOSIDASE A FOR THE TREATMENT OF FABRY DISEASE
WO2018213731A1 (en) 2017-05-18 2018-11-22 Modernatx, Inc. Polynucleotides encoding tethered interleukin-12 (il12) polypeptides and uses thereof

Also Published As

Publication number Publication date
WO2000018778A9 (en) 2000-06-08
AU6270499A (en) 2000-04-17

Similar Documents

Publication Publication Date Title
WO2000018778A1 (en) Synthesis of codon randomized nucleic acids
US6590093B1 (en) Orthoester protecting groups
US4959463A (en) Intermediates
EP1343802B1 (en) Process for the preparation of oligonucleotides
AU737213B2 (en) Method for solution phase synthesis of oligonucleotides and peptides
JP4791043B2 (en) Methods and compositions for synthesizing two or more oligonucleotides in tandem on the same solid support
EP0843684B1 (en) Universal solid supports and methods for their use
EP1149109B1 (en) Method for deprotecting oligonucleotides
US5204455A (en) Monomethoxytrityl protected oligonucleotides bound to a solid support
CA1260930A (en) Nucleotide derivative
US6001966A (en) Method for solution phase synthesis of oligonucleotides and peptides
US6605708B1 (en) Building blocks with carbamate internucleoside linkages and oligonucleotides derived therefrom
US5717085A (en) Process for preparing codon amidites
WO1997029116A1 (en) Sulphur containing dinucleotide phosphoramidites
US5864031A (en) Process for preparing 5-dithio-modified oligonucleotides
EP1105404A1 (en) Purification of oligomers using dual-end selection
US20240092823A1 (en) Process for the de-tritylation of oligonucleotides
CA2424716A1 (en) Process for producing multiple oligonucleotides on a solid support
EP0458876B1 (en) Method of separating oligonucleotides from a mixture
US5688940A (en) Linker for immobilization, modification and subsequent release of oligomers with a terminal hydroxyl group
EP2550368A1 (en) A combinatorial library, a method for preparation of that combinatorial library, a method for sequence identification, a method for sequencing the elements of combinatorial libraries of oligonucleotides and/or oligonucleotide analogues, the use of a linker to generate combinatorial libraries and a sequence identification set
US6153742A (en) General process for the preparation of cyclic oligonucleotides
Hirao et al. Synthesis of nonadeca-and octadecaribonucleotides using the solid-phase phosphotriester with tetrahydropyranyl groups as the 2'-hydroxy-protecting group.
AU738032B2 (en) Method for solution phase synthesis of oligonucleotides
Ishikawa et al. Synthesis of fused oligoribonucleotides with trideoxyribonucleotide containing phosphorothioate to stabilize against nuclease activity

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: C2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

COP Corrected version of pamphlet

Free format text: PAGES 1/8-8/8, DRAWINGS, REPLACED BY NEW PAGES 1/10-10/10; DUE TO LATE TRANSMITTAL BY THE RECEIVINGOFFICE

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase