WO1998005765A1 - Double-stranded dna with cohesive end(s), and method of shuffling dna using the same - Google Patents

Double-stranded dna with cohesive end(s), and method of shuffling dna using the same Download PDF

Info

Publication number
WO1998005765A1
WO1998005765A1 PCT/DK1997/000317 DK9700317W WO9805765A1 WO 1998005765 A1 WO1998005765 A1 WO 1998005765A1 DK 9700317 W DK9700317 W DK 9700317W WO 9805765 A1 WO9805765 A1 WO 9805765A1
Authority
WO
WIPO (PCT)
Prior art keywords
dna
gene
sequence
stranded
double
Prior art date
Application number
PCT/DK1997/000317
Other languages
French (fr)
Inventor
Yoshiaki Miyota
Shiro Fukuyama
Original Assignee
Novo Nordisk A/S
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Novo Nordisk A/S filed Critical Novo Nordisk A/S
Priority to AU36917/97A priority Critical patent/AU3691797A/en
Publication of WO1998005765A1 publication Critical patent/WO1998005765A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids
    • C12N15/1027Mutagenizing nucleic acids by DNA shuffling, e.g. RSR, STEP, RPR
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/66General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6811Selection methods for production or design of target specific oligonucleotides or binding molecules

Definitions

  • the present invention relates to a double-stranded DNA with a cohesive end or cohesive ends having a desired sequence and a method for producing it, and also a method for shuffling a DNA using DNA blocks with a cohesive end or cohesive ends, the DNA as shuffled according to the method, a DNA pool to be obtained according to the shuffling method, and also a genetic product to be produced by the use of the DNA pool .
  • 095/22625 disclosed is a method for forming novel genes by dividing a plurality of genes at random and homolo- gously reco bining them to reconstruct novel genes.
  • this is one method for forming chimera genes.
  • the genes to be formed by this method are similar to the original genes, and the former shall have the essential base sequences of the latter.
  • it is difficult to desire the impartation of some additional functions to organisms which they could not gain during the steps of their evolution.
  • One method for this may be to prepare a nucleic acid pool that covers all base combinations.
  • the total number of the base sequences that may code for a relatively small protein with 100 amino acids (300 bp) is an enormous number of 4 300 (about 10 180 ) , and it is in fact impossible to prepare the nucleic acid pool that may cover all of them.
  • restriction enzymes it is possible to prepare a nucleic acid pool including various molecules by blending several kinds of DNA blocks having the same cohesive end or blunt end followed by ligating them at random, and to select from this pool some molecules having desired properties. According to this method, however, the DNAs to be used must have prede- termined restriction enzyme recognizing sites. Even though the DNAs have such restriction enzyme recognizing sites, there is an extremely small probability that the sites are positioned at the desired sites. In this method, in addition, the both ends of the blocks must be of the same type, and there is a high probability that the blocks are therefore self-ligated. A means of forming the restriction enzyme recognizing sites through site-specific mutation may be taken in order to overcome these problems.
  • the matter as to whether or not the blocks can be ligated in accordance with the predetermined frame is likely much governed by chance. That is, the matter as to whether or not a desired protein can be produced without misreading the reading frame of the codon shall be governed by chance. Therefore, the method is extremely inefficient.
  • the subject matter of the present invention is to provide a method for efficiently obtaining base sequences that exist in spaces greatly different from those of naturally-existing base sequences, and also to provide genetic products to be obtained by expressing, as genes, the nucleic acid sequences that are obtained in that manner and that do not exist naturally.
  • the sequence space of a gene includes the full-length sequence thereof to be theoretically constituted by a combination of four bases, A, G, C and T.
  • a base sequen- ce that codes for a protein composed of a number "n" of amino acids shall be constructed by selecting and sequencing any desired one of the four bases for a total of 3n-times, therefore including 4 3n combinations.
  • a protein composed of 100 amino acids shall include different base sequences of about 10 18 types as so mentioned hereinabove.
  • sequencing spaces for proteins shall extend unlimitedly. During the steps of evolution of organisms, only a part of such sequencing spaces have been examined, and there is a great probability that some sequences coding for proteins which may have some extremely excellent functions could exist in the other great sequencing spaces .
  • the protein engineering studies which have been and are being made in many laboratories and institutes at present are essentially directed to the creation of novel proteins having functions superior to those of naturally-existing proteins, and one essential approach made therein to this pur- pose is to substitute amino acids in existing sequences, as so mentioned hereinabove.
  • amino acid substitution is nothing but the essential means that organisms have carried out during the steps of their evolution or, that is, such is the imitation of organisms and is to search only around the sequences that organisms already examined. In addition, there is a probability that the sequences thus obtained will be those that were already weeded out in the past.
  • the present inventors have considered that, in order to be greatly apart from the sequencing spaces that organisms already examined, if we carry out such matters that could not have been carried out by organisms, the purpose will be attained.
  • the present invention provides the following:
  • a DNA with a cohesive end comprising (a) a double- stranded DNA having the same sequence as that of a part of a gene, and (b) a single-stranded DNA having a base sequence that exists on said gene at the site not adjoining the part corresponding to said double-stranded DNA or a base sequence which said gene does not have, wherein the single-stranded DNA is linked to either one end of the double-stranded DNA to form a cohesive end.
  • a DNA with cohesive ends comprising (a) a double- stranded DNA having the same sequence as that of a part of a gene, (b) a first, single-stranded DNA having a base sequence that exists on said gene at the site not adjoining the part corresponding to said double-stranded DNA or a base sequence which said gene does not have, and (c) a second, single- stranded DNA having a base sequence that exists on said gene at the site adjoining the part corresponding to said double- stranded DNA, wherein the second, single-stranded DNA is linked to said double-stranded DNA at one end corresponding to said adjoining site, while the first, single-stranded DNA is linked thereto at the other end of the complementary strand opposite to said end, thereby forming cohesive ends.
  • a method for producing a DNA with a cohesive end or cohesive ends wherein a part of a DNA, as a template, and an oligonucleotide containing at least one ribonucleotide, as a primer, are subjected to DNA polymerase reaction to prepare a double-stranded DNA, then the ribonucleotide (s) is/are removed through enzymatic reaction or chemical reaction, and the nu- cleotide(s) remaining at the 5 ' -terminal (s) of the site(s) at which said ribonucleotide (s) existed are removed.
  • a method for producing the DNA with a cohesive end of the previous 1) comprising the following steps a) to d) : a) a step of linking (i) an oligonucleotide having the same base sequence as that of a part of a gene DNA to (ii) an oligonucleotide having a base sequence that exists on the gene at the site not adjoining the base sequence of (i) or a base sequence which the gene does not have, and containing at least one ribonucleotide, in such a manner that the oligonucleotide (ii) is positioned at the 5 '-terminal of the oligonucleotide
  • a method for producing the DNA with cohesive ends of the previous 2) comprising the following steps a) to d) : a) a step of linking (i) an oligonucleotide having the same base sequence as that of a part of a gene DNA to (ii) an oligonucleotide having a base sequence that exists on the gene at the site not adjoining the base sequence of (i) or a base sequence which the gene does not have, and containing at least one ribonucleotide, in such a manner that the oligonucleotide (ii) is positioned at the 5 '-terminal of the oligonucleotide (i); b) a step of preparing a double-stranded DNA through DNA polymerase reaction between a DNA containing the part corresponding to the oligonucleotide (i) in said a) , as a template, and (i) the linked oligonucleotide as obtained in the previous step a)
  • a method for shuffling a DNA comprising dividing a DNA into a plurality of DNA blocks each having a cohesive end or cohesive ends, followed by ligating them together into a sequence that is different from the sequence of the original, non-divided DNA.
  • a method for shuffling a DNA comprising applying the method of any one of the previous 5) to 7) to various sites of a DNA, thereby dividing the DNA into a plurality of DNA blocks each having a cohesive end or cohesive ends, at least one block of which shall have a cohesive end that is complementary to the cohesive end of another block not having been directly adjacent to said one block on the original DNA, followed by ligating them together into a sequence that is different from the sequence of the original, non-divided DNA.
  • a method for producing a DNA pool comprising applying the method of any one of the previous 5) to 7) to various sites of a template DNA to thereby prepare a mixture of DNA blocks each having a cohesive end or cohesive ends that satisfies the following conditions, followed by ligating these into any desired sequences:
  • Each block has a double-stranded site having the same sequence as that of a part of the template DNA.
  • Condition 2 At least two of the blocks that constitute the block mixture further have, in addition to said double- stranded site, s single-stranded site (cohesive end) that is complementary to the cohesive end of blocks that are not directly adjacent to said blocks on the template DNA.
  • Condition 3 The block mixture contains at least two different blocks which are the same in the double-stranded site but are different only in the single-stranded site and which satisfy the condition 2. 21) The method for producing a DNA pool according to the previous 20) , wherein the template DNA is a gene that codes for an enzymatic function or a control gene DNA for the gene.
  • the template DNA is a gene DNA that codes for any one of proteases, lipases, cellulases, amylases, catalases, xylanases, oxidases, dehydrogenases, oxygenases and reductases.
  • DNA with Cohesive End(s) The present invention provides a DNA with any desired cohesive end or ends (herein referred to as "DNA with cohesive end(s)" unless otherwise specifically indicated).
  • the cohesive end as referred to herein indicates a single-stranded site as protruded from the end of a double-stranded DNA.
  • Such a cohesive end may be formed when a DNA is cleaved with a restriction enzyme such as EcoRI .
  • the base sequence of the thus-formed cohesive end is defined, depending on the restriction enzyme used, and its length is generally composed of several bases or so.
  • the sequence of the resulting double-stranded part of the DNA is also limited to the region as sandwiched between the restriction enzyme recognizing sites.
  • the DNA with cohesive end(s) of the present invention may have a structure in which a cohesive end or cohesive ends having a desired length and a desired sequence is/are added to the end or ends of a double-stranded DNA having a desired sequence.
  • sequence of the double-stranded part of the DNA with cohesive end(s) of the present invention is not specifically defined.
  • the sequence may be the same as that of a part of a gene.
  • its length may be generally composed of 50 base pairs (bp) or more, preferably 45 bp or more.
  • the sequence of the cohesive end is not also specifically defined, but in order to prevent the self-ligation thereof in various reactions, it is preferable that the sequence does not form a stem structure.
  • the "sequence to form a stem structure" as referred to herein includes, for example, AATT, which shall have just the same sequence as that of its complementary strand (TTAA) .
  • the length of the cohesive end may be generally 2 bp or more, preferably from 15 bp to 30 bp. If the cohesive end is too long, it may form a secondary struc- ture whereby the intermolecular annealing will be difficult. However, if it is too short, its melting temperature (Tm) is low and the annealing will be unstable.
  • the cohesive end may be linked to either the 3 '-terminal or the 5 '-terminal of the double-stranded DNA, but is prefe- rably linked to the 3 '-terminal thereof.
  • the cohesive end may be linked to either only one terminal of the double-stranded DNA or the both terminals thereof.
  • the DNA with cohesive end(s) of the present invention can be produced typically according to a method comprising the following steps a) to d) .
  • the method mentioned below is directed to the production of a DNA with a cohesive end, which has a structure to be represented by a formula (2) :
  • oligonucleotide, a which is complementary to its terminal, X
  • an oligonucleotide, c having the same sequence as that of the other terminal, c, are prepared.
  • X and c each may have a sequence having a base length of from 15 to 30 bp or so.
  • oligonucleotide b, which is complementary to the sequence to be prepared by removing one base (this is referred to as X) from the 5'- terminal of the sequence of the intended cohesive end, a " .
  • the base sequence, & may be a part of the above-mentioned DNA or may be any other sequence that the DNA does not have.
  • oligonucleotides may be prepared by any methods. If their sequences are previously known, they can be synthesized, using a known DNA synthesizer.
  • oligonucleotides a and b
  • a ribonucleotide is linked together via a ribonucleotide.
  • This linkage can be attained by ordinary synthesizing methods. For example, it can be attained according to the process mentioned below.
  • a phosphoryl group is added to the 5 ' -terminal of the oligonucleotide, a, according to the reaction of the following formula (3) : (3) wherein (P) is a phosphoryl group.
  • This reaction can be effected by the action of a polynucleotide kinase.
  • ATP is used in an amount of from 2 to 10 times or so, by mol, relative to the oligonucleotide, a.
  • the reaction temperature may be from 30 to 40°C or so.
  • the reaction time may be from 10 minutes to 1 hour or so. Most suitably, the pH is from 7 to 9 or so.
  • the oligonucleotide is represented by a ' .
  • a ribonucleotide is added to the 3'- terminal of the oligonucleotide, b, according to the reaction of the following formula (4) : (4) wherein X is any one of ATP, GTP, CTP and UTP; (rX) is a ribonucleotide.
  • This reaction can be effected by the action of, for example, a terminal deoxynucleotidyl transferase.
  • XTP nucleoside triphosphate
  • a-1 a ribonucleotide that corresponds to the base X in the previous step (a-1) .
  • the nucleoside triphosphate is used in an amount of from 2 to 10 times, by mol, relative to the oligonucleotide, b.
  • the reaction temperature may be from 30 to 40°C or so.
  • the reaction time may be from 30 minutes to 2 hours or so.
  • the oligonucleotide is represented by b'.
  • the sequence of b' is complementary to the sequence, -s ⁇ .
  • This reaction can be effected by the action of an RNA ligase in the presence of ATP and divalent metal ions (Japanese Patent Application Laid-Open No. 5-292967) .
  • Divalent metal ions useful in this reaction include, for example, magnesium ions and manganese ions, but preferred are magnesium ions.
  • the ligase employable is an RNA ligase.
  • the RNA ligase is an enzyme to catalyze the ligation of the hydroxyl group at the 3 ' -terminal and the phosphoryl group at the 5 ' -terminal , and this also efficiently catalyzes the ligation of a polydeoxyribonucleotide having a ribonucleotide only at its 3'- terminal and a polydeoxyribonucleotide with a 5 '-terminal phosphoryl group.
  • a T4 RNA ligase is effected in a buffer, at a pH of from 7 to 9 and at a temperature of from 10 to 40°C, over a period of from 30 to 180 minutes.
  • the oligonucleotides may be reacted in a solution comprising 50 mM Tris-HCl (pH 8.0), 20 mM MgCl2, 0.1 mM ATP, 10 mg/ liter BSA, 1 mM hexaammine cobalt chloride (HCC) and 25 % polyethylene glycol 6000, at 25°C for 60 minutes or longer.
  • a double-stranded DNA containing the sequence, X, as prepared in the previous step (a-1) , as a template, and using the linked oligonucleotide, b'-a', as prepared in the previous step (a-2), as a primer, prepared is a double-stranded DNA through DNA polymerase reaction.
  • a double-stranded DNA containing the sequence, X, and a sequence, 3, on their strands is subjected to thermal or alkaline denaturation to give single-stranded DNAs, which are added to the primer of b'-a" and subjected to PCR with the oligonucleotide, c, as prepared in the previous step (a-1) .
  • the primer annealing condition and the polymerase reaction condition to be employed herein may be the same as those in ordinary polymerase reaction.
  • the DNA polymerase to be employed herein may be any and every enzyme that can catalyze the DNA chain-extending reaction, which includes, for example, Taq polymerase, Klenow fragment, DNA polymerase I, etc. As a result of this reaction, obtained is a double-stranded DNA with blunt ends, which is represented by the a formula (6) :
  • Step c) the ribonucleotide is removed from the double- stranded DNA through enzymatic reaction or chemical reaction.
  • One example of useful enzymes is a ribonuclease.
  • the reaction is generally effected at a pH of from 6 to 8 or so, at from 30 to 70°C or so, over a period of from 10 to 60 minutes or so.
  • non-enzymatic chemicals usable herein mentioned are sodium hydroxide and the like.
  • a partly-discontinuous, double-stranded DNA of the following formula (7) in which the part corresponding to the above- mentioned base, X, has been deleted.
  • the nucleotide existing at the 5'- terminal of the above-mentioned deletion is removed.
  • the double-stranded DNA, from which the ribonucleotide has been removed in the previous step c) is heated at from 50 to 90°C or so.
  • the polynucleotide that has been separated from the strand through this reaction can be removed, using, for example, a span column or the like.
  • a desired sequence, ⁇ t ⁇ t which does not adjoin the sequence, X, in the template DNA was introduced into the DNA to form the cohesive end.
  • an oligonucleotide, c' which is different from the oligonucleotide, c, in that its 3 '-terminal deoxyribonucleotide is substituted with a ribonucleotide, may be used as the primer in place of the oligonucleotide, c, to prepare a double-stranded DNA with two cohesive ends of a formula (8) : (8).
  • the present invention also provides a method of shuffling a DNA, which is characterized by using DNAs with cohesive end(s) .
  • shuffling as referred to herein in- dicates the operation of dividing a DNA into plural blocks followed by re-sequencing them into a desired, different sequence.
  • one DNA has a sequence composed of a number, n, of blocks, as represented by a formula (9) : A - al - a2 - . . . . - a n - B (9) wherein the starting end A and/or the terminal end B may be omitted
  • this may be shuffled according to the present invention to give a different DNA to be represented by a formula (10) : A - al' - a2' - . . . . - a x - B (10) wherein al • , a2 * , . . . , ax are blocks that are independently selected from the group of al, a2, . . .
  • the total number of the blocks al', a2 ' , . . . , a x may not be the same as the total number of the blocks cii f SL f * • • • f £t_n •
  • Fig. 1 The principle of the DNA shuffling of the present invention which utilizes DNAs with cohesive end(s) is graphically illustrated in Fig. 1.
  • the DNA is shuffled at the intermediate part, pi - p2 - p3 (the uppermost row) into p3 - pi - p2 (the lowermost row) , without changing the both ends PA and PB-
  • This shuffling operation is useful as a method for obtaining gene sequences that have not heretofore existed naturally, without changing the sequences of the promoter and the terminator.
  • the above-mentioned method of preparing DNAs with cohesive end(s) is applied first to the parts PA, pi, p2, p3 and p ⁇ constituting the template DNA, to thereby prepare DNA blocks, al, a2 and a3, each having the structure with two cohesive ends (formula (8)), and DNA blocks, A and B, each having the structure with one cohesive end (formula (2)).
  • the cohesive ends, a A , aif, a 2 f and a3 f are formed by removing the corresponding complementary strand from the blocks, PA, pi/ p2 and p3, respectively.
  • the cohesive ends, a ⁇ r , a2r, a 3r an( * a B, are designed according to the desired sequence to be prepared after the shuffling.
  • the end, a ⁇ r is designed to be a complementary strand to the end, a3f, and the block, al is ligated to the block a3 after the shuffling.
  • the ligation is conducted, using a DNA ligase in the presence of ATP.
  • the type of the DNA ligase to be employed herein is not specifically defined. In this embodiment, since the single- stranded part of each cohesive end is long, it is unnecessary to employ the ordinary reaction at 16°C, but a thermophilic DNA ligase is advantageously employed.
  • a2r; a 3r an d a B are designed to be the complementary strands to a ⁇ _f, aA and a2f, respectively, in the same manner as above.
  • a sequence having a structure of A - a3 - al - a2 - B is finally obtained.
  • This is seemingly the same as the re- sequenced order of P - p3 - pi - p2 - PB to be obtained by dividing the original DNA into the constitutive blocks pi, p2, p3 , PA and PB, followed by re-sequencing these into a different sequence.
  • any other desired sequences can be realized in the same manner as above. If the block, A or B, is made to have two cohesive ends, while the other blocks are made to have one cohesive end, it is possible to obtain still different sequences through shuffling where the latter blocks with one cohesive end are positioned at the terminals.
  • the shuffling of the invention it is also possible to introduce foreign DNA block(s) with cohesive end(s), which are not in the original gene, into the gene DNA.
  • the terminal of one gene for example, the block A in the above-mentioned embodiment, may be processed into a DNA block with two cohesive ends, if desired.
  • the blocks which are the units to be shuffled, are oligonucleotides or polynucleotides composed of 2 or more nucleotides (hereinafter referred to as "oligonucleotides") . In general, these are preferably composed of 30 or more nucleotide units, more preferably 45 or more nucleotide units.
  • the uppermost limit of the block length is not specifically defined, provided that the block length is shorter than the length of one gene. If, however, the block length is too large, the re-sequenced DNA to be obtained by the shuffling shall have many non-mutated base sequence parts. Therefore, in general, the block length is preferably within the range of from 10 to 35 % of the length of a gene.
  • the gene to be shuffled is a gene that codes for a protein
  • the gene blocks, oligonucleotides have the same reading frame before and after the division.
  • the gene blocks to be shuffled are desirably so designed that they are translated to always give the corresponding amino acid sequences, irrespective of their relative positions in the shuffled sequence.
  • the double-stranded parts and the cohesive ends shall be selected for their codon units in accordance with the reading frame of the gene DNA to be shuffled. Needless-to-say, it is unnecessary to conduct the division into segment blocks with genetic meanings.
  • the kind of the gene to be shuffled according to the present invention is not specifically defined.
  • Employable herein is any and every gene that is composed of polynucleotide chains and contains a coding region necessary for expressing a protein or RNA.
  • the nucleotide unit may contain any molecule of deoxy- ribonucleotides or ribonucleotides.
  • genes coding for proteins especially enzymes, or control genes for enzymatic functions.
  • enzymes include proteases, lipa- ses, cellulases, amylases, catalases, xylanases, oxidases, dehydrogenases, oxygenases and reductases.
  • the kind of the gene to which the present invention is directed is not specifically defined but shall be such that, when it is introduced into a suitable host, the host can produce the genetic product through expression of the gene.
  • genes as cloned from living organisms include artificially synthesized genes, and even genes as cloned from living organisms and artificially mutated.
  • prokaryotes with definite enzyme producibility .
  • prokaryotes mentioned are bacillus bacteria.
  • One example of the genes derived from such bacteria is a protease API21 gene derived from Bacillus NKS-21 (FERM BP-93-1) (Japanese Patent Application Laid-Open No. 5-91876, Sequence Number 1).
  • the present invention also provides a DNA pool to be obtained according to the above-mentioned shuffling method.
  • the "DNA pool” as referred to herein means a high-density mixture of two or more DNAs.
  • the DNA pool of the present invention can contain a particular number or more, for example, 10 or more different DNA molecules having different structures. It is desirable that, when the mixture, DNA pool is directly used in biochemical operation or reaction, it is in such a form that all the plural nucleic acid components constituting it can be reacted. However, the form of the mixture, DNA pool is not specifically defined, and the DNA pool may be either in solution or dry mixture.
  • a plurality of cohesive ends for each block are prepared in the above-men- tioned shuffling process.
  • complementary strands to the other cohesive ends, aA and a2f are prepared in addition to the complementary strand to a3f, then DNAs of A - al - a2 - B and A - al -a2 - al can be ob- tained.
  • a complementary strand to the other cohesive end alf of al is added, it is also possible to produce other DNAs comprising a series of the same blocks, such as A - al - al - al.
  • a DNA is divided into blocks of al, a2 , a3 , . . . , an- Then, each block is processed to have a cohesive end or cohesive ends according to the above-mentioned process.
  • the cohesive ends are designed to be oligonucleotides that are complementary to the cohesive ends of the other blocks or are complementary to the other cohesive end of themselves. All or a part of the thus-obtained DNA blocks are mixed and ligated to each other, thereby producing a nucleic acid pool containing different nucleic acids composed of the blocks as differently sequenced at random.
  • the thus-shuffled, single or mixed, double-stranded DNAs are blunted.
  • the blunting may be omitted, if DNA blocks with one cohesive end are positioned at the ends of the shuffled, double-stranded DNA.
  • the 5 '-terminal of the sequence containing a DNA block with a predetermined promoter sequence which is based on the direction of the promoter, is not made to have a cohesive end but is made to have a blunt end
  • the 3 '-terminal of the sequence containing a DNA block with a predetermined terminator sequence which is based on the direction of the terminator, is not made to have a cohesive end but is also made to have a blunt end.
  • the thus-shuffled DNA is inserted into a desired vector, preferably an expression vector such as pKK223-3, using a DNA ligase.
  • a desired vector preferably an expression vector such as pKK223-3
  • the promoter sequence and the terminator sequence to be in the shuffled DNA are not limited to only one each, but a plurality of promoter sequences and terminator sequences may be therein.
  • the polynucleotide blocks positioned at the both ends of the shuffled DNA may be designed to have suitable restriction enzyme recognizing sites.
  • the DNA may be ligated to a suitable vector, using the defined re- striction enzymes.
  • the vector library thus produced in the manner mentioned above is introduced into a suitable host, in which the genetic information is expressed.
  • the intended gene- tic product with favorable properties and also the gene coding for it can be obtained.
  • Any and every ordinary host can be used herein.
  • Preferred examples of the host include cells of E. coli , bacillus bacteria, yeasts, and lactic acid bacteria. If desired, in-vitro transcription systems and translation systems are also employable herein. In those cases, the genetic information can be expressed even when the gene is not ligated to a vector.
  • the "genetic information” as referred to herein indicates the information on a gene which is carried by a DNA and which is translated into a protein or is transcribed into RNA in a suitable living body by the DNA for itself or after having been ligated to any other DNA or RNA.
  • the genetic information that is expected to be expressed according to the method of the present invention is not specifically defined, but includes, for example, those on various genetic products, such as enzymes, antibodies, hormones receptor proteins and ribozymes, and those on various control functions of, for example, operators, promoters and attenua- tors.
  • a nucleic acid pool was produced in accordance with the process mentioned below, based on the wild-type alkali protease
  • Step a) Preparation of Oligonucleotide Blocks for Primer (1-1) Synthesis of Oligonucleotide Blocks:
  • oligonucleotides containing a part of the base sequence are parts of the base sequence of API21 (Japanese Patent Application Laid-Open No. 5-91876) (including complementary strands) , or oligonucleotides containing a part of the base sequence.
  • the sequence of oligo 4a is to follow glutamine of Sequence Number 1 and, and this contains a termination codon of the gene.
  • These oligonucleotides were so designed that they might be the best when the oligo A was overhung on the 3 '-terminal of the amplified DNA in the experiment to follow hereinunder, using a Tag polymerase.
  • oligonucleotides were synthesized in a DM trityl-on condition (that is, while the 5' -hydroxyl group was protected with dimethoxytrityl group) , and purified through an OPC column.
  • the reagents used herein were obtained from Perkin Elmer Co.
  • HCC hexaammine cobalt chloride
  • BSA bovine serum albumin
  • Oligo 2r, oligo 3r, oligo 4r, oligo lb, oligo 2b, oligo 3b and oligo 4b were processed in the same manner as above. These four polynucleotides thus formed are referred to as oligo lr ' , oligo 2r', oligo 3r', oligo 4r', oligo lb', oligo 2b', oligo 3b' and oligo 4b' . (1-3) Phosphorylation:
  • Oligo 2a, oligo 3a and oligo 4a were processed in the same manner as above. These polynucleotides thus formed are referred to as oligo la', oligo 2a', oligo 3a' and oligo 4a'. (1-4) Ligation of Oligonucleotide Blocks:
  • oligo la' 100 pmols of oligo lb', 100 pmols of oligo 2b', 100 pmols of oligo 3b', 100 pmols of oligo 4b', which had been obtained in the above, as well as 1 nmol of ATP and 50 units of T4 RNA ligase were added to the same standard solution as that mentioned above to make 10 ⁇ l in total, and these were reacted at 25°C for 4 hours.
  • oligo 2a' with oligo lb', oligo 2b', oligo 3b' and oligo 4b' were also reacted in the same manner as above.
  • oligo 1M A mixture of the four polynucleotides thus formed as a result of this reaction, oligo la' ligated to oligo lb', oligo 2b', oligo 3b' and oligo 4b', is referred to as oligo 1M; a mixture of the four polynucleotides, oligo 2a' ligated to oligo lb', oligo 2b', oligo 3b' and oligo 4b', is referred to as oligo 2M; a mixture of the four polynucleotides, oligo 3a' ligated to oligo lb', oligo 2b', oligo 3b' and oligo 4b', is referred to as oligo 3M; and a mixture of the four polynucleotides, oligo 4a' ligated to oligo lb', oligo 2b', oligo 3b' and oligo
  • Steps b) to d) Formation of Gene Blocks
  • plasmid pSDT812 Japanese Patent Application Laid-Open No. 1-141596
  • the gene of the wild- type alkali protease as cloned from Bacillus NKS-21 was subjected to PCR with primers, oligo 1M and oligo 2r'.
  • the gene fragment as amplified through this reaction was treated with a ribonuclease, and then heated at 80_C for 5 minutes, whereby the polynucleotide (s) positioned at the 5 '-terminal of the ribonucleotide existing in the both strands or one strand was/were removed.
  • prepared was a gene block with cohesive end(s) .
  • This gene block is referred to as block 1M.
  • Block 1M, block 2M, block 3M, block B and block F of the same amount were blended and ligated together, using Pfu DNA ligase.
  • the reaction mixture was subjected to agarose gel electrophoresis, through which was collected the DNA fragment of about 1.5 kbp.
  • the thus-collected DNA of about 1.5 kbp was digested with restriction enzymes, EcoRI and BamHI, then mixed with a plasmid, pHY300PLK (manufactured by Yakulto Honsha Co.), which had been digested with restriction enzymes, EcoRI and BamHI and processed with an alkali phosphatase, and thus ligated together, using a ligation kit (manufactured by Takara Shuzo Co.).
  • a ligation kit manufactured by Takara Shuzo Co.
  • nucleic acid pool is obtained that covers all combinations of clones each containing the same or different three of these blocks.
  • the DNAs as produced in Example 3 were mixed. Using the resulting DNA mixture, cells of Bacillus subtili ⁇ UOT0999 were transformed. Tetracycline-resistant transformants were selec- ted. 300 transformants were replicated on a skim milk-containing medium plate, on which were found clear zones around the colonies of 12 transformants. Accordingly, it is under- stood that the enzyme which the shuffled gene codes for can be selected depending on its activity. The base sequences of these 12 clones that formed the clear zones were analyzed, from which it was found that these were sequenced in the same order of blocks as in the wild-type enzyme.
  • RNAs From 10 clones (one clone forms halo, while 9 clones do not) as selected from the transformants that had obtained in Example 3, and also from the host, Bacillus subtilis U0T0999, full-length RNAs were prepared. These were proces ⁇ ed with a ribonuclease-free deoxyribonuclease, in order to remove the influence of the plasmids on the hybridization to be effected later on. Next, using oligo lr as the probe, these were subj- ected to Northern hybridization. As a result, all lanes corresponding to the RNA of the transformants gave detectable bands, but no band was detected on the lanes corresponding to the RNA of the host.
  • a double- stranded DNA molecule with any desired cohesive end or ends.
  • various DNAs with various base sequences which are substantially apart from the naturally-existing base seguence spaces, and also a DNA pool of a mixture of such DNAs, through simple processes. Therefore, it is possible to obtain excellent genetic products, such as proteins and enzymes, which could not be obtained in conventional methods and which were not examined by organisms in the past.
  • nucleic acid pool it is possible to obtain a mixture of nucleic acids while optionally shuffling the constitutive blocks at random in the intermediate parts but fixing the terminal sequences to be predetermined, desired ones, and it is also possible to shuffle the constitutive blocks without changing the amino acid sequence which each block codes for. Therefore, as compared with a me- thod of producing a completely-randomized nucleic acid pool, there is a high possibility that useful genetic products can be produced according to the method of the present invention.
  • Length of Sequence 16 Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology : Linear
  • Fig. 1 is a graphical view showing one embodiment of trie method of the present invention for shuffling a DNA.

Abstract

To provide a method of mutation of DNAs, which is substantially different from the conventional methods applicable to naturally-existing DNAs, and also to provide useful genetic products to be produced by the use of thus-mutated DNAs. A DNA with a cohesive end comprising (a) a double-stranded DNA having the same sequence as that of a part of a gene, and (b) a single-stranded DNA having a base sequence that exists on said gene at the site not adjoining the part corresponding to said double-stranded DNA or a base sequence which said gene does not have, wherein the single-stranded DNA is linked to either one end of the double-stranded DNA to form a cohesive end; a method for producing it; a method of shuffling a DNA using it; a DNA and a DNA pool to be obtained by the shuffling method; a method for producing the DNA pool; and a genetic product to be obtained by expressing the genetic information existing in the DNA pool.

Description

Title: Double-stranded DNA with cohesive end(s), and method of shuffling DNA using the same
INUSTRIAL FIELD
The present invention relates to a double-stranded DNA with a cohesive end or cohesive ends having a desired sequence and a method for producing it, and also a method for shuffling a DNA using DNA blocks with a cohesive end or cohesive ends, the DNA as shuffled according to the method, a DNA pool to be obtained according to the shuffling method, and also a genetic product to be produced by the use of the DNA pool .
BACKGROUND ART
One approach to protein engineering for improving naturally-existing proteins to modified ones which are more useful to human beings is to improve proteins through site-specific mutation, which has produced some results (Japanese Patent Application Laid-Open No. 5-91876) . However, this requires the clarification or identification of the stereostructure of the targeted protein, and much labor is needed for the analysis of the stereostructure. In addition, even though the stereostructure could be clarified or identified, there are still many unknown matters for the relationship between the structure and the function with proteins. Therefore, it is still difficult to surely impart an intended function to the targeted protein. In order to overcome these difficulties, a process comprising random mutation and screening and also evolutional molecular engineering that utilizes the evolution of organisms have been being highlighted and said to be extremely useful (Proc. Natl. Acad. Sci., USA, .83., 576 (1986)). However, the current methods are directed to the substitution of at most several a ino acids.
In 095/22625, disclosed is a method for forming novel genes by dividing a plurality of genes at random and homolo- gously reco bining them to reconstruct novel genes. However, this is one method for forming chimera genes. The genes to be formed by this method are similar to the original genes, and the former shall have the essential base sequences of the latter. Using such known methods, it is difficult to desire the impartation of some additional functions to organisms which they could not gain during the steps of their evolution. In order to obtain genetic products, of which the functions are greatly different from those of naturally-existing substances such as proteins, it is believed effective to prepare a pool of nucleic acids having significantly different base sequence spaces from those existing naturally, and to produce from them genetic products having the intended functions.
One method for this may be to prepare a nucleic acid pool that covers all base combinations. However, even the total number of the base sequences that may code for a relatively small protein with 100 amino acids (300 bp) is an enormous number of 4300 (about 10180) , and it is in fact impossible to prepare the nucleic acid pool that may cover all of them.
For proteins of some kinds, their sub-structures which are referred to as modules were specifically noted, and an attempt was made to change the sequencing of the base sequence blocks corresponding to the individual modules to thereby produce mutants having different module sequences (Viva Origino, Vol. 23. No. 1 (1995) 86-87) . In this attempt, however, the base sequences were re-sequenced merely individually for the individual mutants. No one has heretofore attempted the formation of a nucleic acid pool covering all re-sequenced molecules and the collection of genes capable of expressing products having intended properties from the pool.
Utilizing restriction enzymes, it is possible to prepare a nucleic acid pool including various molecules by blending several kinds of DNA blocks having the same cohesive end or blunt end followed by ligating them at random, and to select from this pool some molecules having desired properties. According to this method, however, the DNAs to be used must have prede- termined restriction enzyme recognizing sites. Even though the DNAs have such restriction enzyme recognizing sites, there is an extremely small probability that the sites are positioned at the desired sites. In this method, in addition, the both ends of the blocks must be of the same type, and there is a high probability that the blocks are therefore self-ligated. A means of forming the restriction enzyme recognizing sites through site-specific mutation may be taken in order to overcome these problems. However, the matter as to whether or not the blocks can be ligated in accordance with the predetermined frame is likely much governed by chance. That is, the matter as to whether or not a desired protein can be produced without misreading the reading frame of the codon shall be governed by chance. Therefore, the method is extremely inefficient.
The subject matter of the present invention is to provide a method for efficiently obtaining base sequences that exist in spaces greatly different from those of naturally-existing base sequences, and also to provide genetic products to be obtained by expressing, as genes, the nucleic acid sequences that are obtained in that manner and that do not exist naturally.
The sequence space of a gene includes the full-length sequence thereof to be theoretically constituted by a combination of four bases, A, G, C and T. For example, a base sequen- ce that codes for a protein composed of a number "n" of amino acids shall be constructed by selecting and sequencing any desired one of the four bases for a total of 3n-times, therefore including 43n combinations. Accordingly, a protein composed of 100 amino acids shall include different base sequences of about 1018 types as so mentioned hereinabove.
In fact, there is no limitation for the number of amino acids that constitute proteins. Therefore, the sequencing spaces for proteins shall extend unlimitedly. During the steps of evolution of organisms, only a part of such sequencing spaces have been examined, and there is a great probability that some sequences coding for proteins which may have some extremely excellent functions could exist in the other great sequencing spaces . The protein engineering studies which have been and are being made in many laboratories and institutes at present are essentially directed to the creation of novel proteins having functions superior to those of naturally-existing proteins, and one essential approach made therein to this pur- pose is to substitute amino acids in existing sequences, as so mentioned hereinabove. However, the amino acid substitution is nothing but the essential means that organisms have carried out during the steps of their evolution or, that is, such is the imitation of organisms and is to search only around the sequences that organisms already examined. In addition, there is a probability that the sequences thus obtained will be those that were already weeded out in the past.
We, the present inventors have considered that, in order to be greatly apart from the sequencing spaces that organisms already examined, if we carry out such matters that could not have been carried out by organisms, the purpose will be attained. We know that the division of a gene into several blocks followed by the change in the sequencing of the thus-divided blocks, if occurred in organisms, shall kill the organisms. Therefore, we have concluded that this method is suitable for our purpose. Having thus concluded, we, the present inventors have assiduously studied various matters relating to this method and, as a result, have found a method of forming a desired cohesive end or ends on a desired DNA. Uti- lizing this method, we have succeeded in a method of dividing a gene into several blocks and re-sequencing them into different sequences and also in a method of producing a molecule pool including such different base sequences existing in different sequencing spaces, and thus have completed the present invention.
Accordingly, the present invention provides the following:
1) A DNA with a cohesive end comprising (a) a double- stranded DNA having the same sequence as that of a part of a gene, and (b) a single-stranded DNA having a base sequence that exists on said gene at the site not adjoining the part corresponding to said double-stranded DNA or a base sequence which said gene does not have, wherein the single-stranded DNA is linked to either one end of the double-stranded DNA to form a cohesive end.
2) A DNA with cohesive ends comprising (a) a double- stranded DNA having the same sequence as that of a part of a gene, (b) a first, single-stranded DNA having a base sequence that exists on said gene at the site not adjoining the part corresponding to said double-stranded DNA or a base sequence which said gene does not have, and (c) a second, single- stranded DNA having a base sequence that exists on said gene at the site adjoining the part corresponding to said double- stranded DNA, wherein the second, single-stranded DNA is linked to said double-stranded DNA at one end corresponding to said adjoining site, while the first, single-stranded DNA is linked thereto at the other end of the complementary strand opposite to said end, thereby forming cohesive ends. 3) The DNA with a cohesive end or cohesive ends according to the previous 1) or 2) , wherein the single-stranded DNA has a length of 2 bases or more. 4) The DNA with a cohesive end or cohesive ends according to any one of the previous 1) to 3) , wherein the cohesive end/ends is/are positioned at the 3 '-terminal/terminals.
5) A method for producing a DNA with a cohesive end or cohesive ends, wherein a part of a DNA, as a template, and an oligonucleotide containing at least one ribonucleotide, as a primer, are subjected to DNA polymerase reaction to prepare a double-stranded DNA, then the ribonucleotide (s) is/are removed through enzymatic reaction or chemical reaction, and the nu- cleotide(s) remaining at the 5 ' -terminal (s) of the site(s) at which said ribonucleotide (s) existed are removed.
6) A method for producing the DNA with a cohesive end of the previous 1) , comprising the following steps a) to d) : a) a step of linking (i) an oligonucleotide having the same base sequence as that of a part of a gene DNA to (ii) an oligonucleotide having a base sequence that exists on the gene at the site not adjoining the base sequence of (i) or a base sequence which the gene does not have, and containing at least one ribonucleotide, in such a manner that the oligonucleotide (ii) is positioned at the 5 '-terminal of the oligonucleotide
(D; b) a step of preparing a double-stranded DNA through DNA polymerase reaction between a DNA containing the part corresponding to the oligonucleotide (i) in said a) , as a tem- plate, and the linked oligonucleotide as obtained in the previous step a) , as a primer; c) a step of removing the ribonucleotide from said double- stranded DNA through enzymatic reaction or chemical reaction; and d) a step of removing the nucleotide remaining at the 5'- terminal of the site at which said ribonucleotide existed.
7) A method for producing the DNA with cohesive ends of the previous 2) , comprising the following steps a) to d) : a) a step of linking (i) an oligonucleotide having the same base sequence as that of a part of a gene DNA to (ii) an oligonucleotide having a base sequence that exists on the gene at the site not adjoining the base sequence of (i) or a base sequence which the gene does not have, and containing at least one ribonucleotide, in such a manner that the oligonucleotide (ii) is positioned at the 5 '-terminal of the oligonucleotide (i); b) a step of preparing a double-stranded DNA through DNA polymerase reaction between a DNA containing the part corresponding to the oligonucleotide (i) in said a) , as a template, and (i) the linked oligonucleotide as obtained in the previous step a) and (ii) an oligonucleotide which is a complementary strand of an oligonucleotide existing on the gene at the site separated from said oligonucleotide-corresponding part by at least 3 bases or more toward the 3' -terminal and which contains at least one ribonucleotide, as primers; c) a step of removing the ribonucleotides from said double-stranded DNA through enzymatic reaction or chemical reaction; and d) a step of removing the nucleotides remaining at the 5'- terminals of the sites at which said ribonucleotides existed.
8) A method for shuffling a DNA, comprising dividing a DNA into a plurality of DNA blocks each having a cohesive end or cohesive ends, followed by ligating them together into a sequence that is different from the sequence of the original, non-divided DNA.
9) A method for shuffling a DNA, comprising applying the method of any one of the previous 5) to 7) to various sites of a DNA, thereby dividing the DNA into a plurality of DNA blocks each having a cohesive end or cohesive ends, at least one block of which shall have a cohesive end that is complementary to the cohesive end of another block not having been directly adjacent to said one block on the original DNA, followed by ligating them together into a sequence that is different from the sequence of the original, non-divided DNA.
10) The shuffling method according to the previous 8) or 9) , wherein the DNA is divided into 3 or more blocks.
11) The shuffling method according to any one of the previous 8) to 10) , wherein the blocks are ligated together using a DNA ligase.
12) A DNA as shuffled according to the method of any one of the previous 8) to 11) .
[0016] 13) The DNA according to the previous 12) , wherein a gene coding for an enzymatic function or a control gene for the gene is shuffled. 14) The DNA according to the previous 13) , wherein the gene is a gene that codes for any one of proteases, lipases, cellulases, a ylases, catalases, xylanases, oxidases, dehydro- genases, oxygenases and reductases. 15) The DNA according to the previous 13) or 14) , wherein the gene is one derived from prokaryotes.
16) The DNA according to the previous 15) , wherein the gene is one derived from bacillus bacteria.
17) The DNA according to the previous 16) , wherein the gene is a protease API21 gene.
18) A DNA pool containing plural kinds of DNAs having different structures that are obtained according to the shuffling method of any one of the previous 8 to 11) .
19) The DNA pool according to the previous 18) , which contains 10 or more kinds of DNAs.
20) A method for producing a DNA pool, comprising applying the method of any one of the previous 5) to 7) to various sites of a template DNA to thereby prepare a mixture of DNA blocks each having a cohesive end or cohesive ends that satisfies the following conditions, followed by ligating these into any desired sequences:
Condition 1: Each block has a double-stranded site having the same sequence as that of a part of the template DNA.
Condition 2 : At least two of the blocks that constitute the block mixture further have, in addition to said double- stranded site, s single-stranded site (cohesive end) that is complementary to the cohesive end of blocks that are not directly adjacent to said blocks on the template DNA. Condition 3: The block mixture contains at least two different blocks which are the same in the double-stranded site but are different only in the single-stranded site and which satisfy the condition 2. 21) The method for producing a DNA pool according to the previous 20) , wherein the template DNA is a gene that codes for an enzymatic function or a control gene DNA for the gene.
22) The method for producing a DNA pool according to the previous 21) , wherein the template DNA is a gene DNA that codes for any one of proteases, lipases, cellulases, amylases, catalases, xylanases, oxidases, dehydrogenases, oxygenases and reductases.
23) The method for producing a DNA pool according to the previous 22) , wherein the template DNA is one derived from prokaryotes.
24) The method for producing a DNA pool according to the previous 23), wherein the template DNA is one derived from bacillus bacteria.
25) The method for producing a DNA pool according to the previous 24) , wherein the template DNA is a protease API21 gene.
26) The method for producing a DNA pool according to any one of the previous 20) to 25) , wherein the DNA blocks are ligated together using a DNA ligase. 27) A genetic product to be obtained by expressing the genetic information on DNA molecules that exist in the DNA pool of any one of the previous 18) to 26) . Now the present invention is described in detail hereinunder .
DNA with Cohesive End(s) The present invention provides a DNA with any desired cohesive end or ends (herein referred to as "DNA with cohesive end(s)" unless otherwise specifically indicated). The cohesive end as referred to herein indicates a single-stranded site as protruded from the end of a double-stranded DNA. Such a cohesive end may be formed when a DNA is cleaved with a restriction enzyme such as EcoRI . In this case, however, the base sequence of the thus-formed cohesive end is defined, depending on the restriction enzyme used, and its length is generally composed of several bases or so. If a naturally- existing DNA is cleaved with a restriction enzyme, the sequence of the resulting double-stranded part of the DNA is also limited to the region as sandwiched between the restriction enzyme recognizing sites. As opposed to this, the DNA with cohesive end(s) of the present invention may have a structure in which a cohesive end or cohesive ends having a desired length and a desired sequence is/are added to the end or ends of a double-stranded DNA having a desired sequence.
As has been mentioned hereinabove, the sequence of the double-stranded part of the DNA with cohesive end(s) of the present invention is not specifically defined. For example, the sequence may be the same as that of a part of a gene. Though not also specifically defined, its length may be generally composed of 50 base pairs (bp) or more, preferably 45 bp or more. The sequence of the cohesive end is not also specifically defined, but in order to prevent the self-ligation thereof in various reactions, it is preferable that the sequence does not form a stem structure. The "sequence to form a stem structure" as referred to herein includes, for example, AATT, which shall have just the same sequence as that of its complementary strand (TTAA) . The length of the cohesive end may be generally 2 bp or more, preferably from 15 bp to 30 bp. If the cohesive end is too long, it may form a secondary struc- ture whereby the intermolecular annealing will be difficult. However, if it is too short, its melting temperature (Tm) is low and the annealing will be unstable.
The cohesive end may be linked to either the 3 '-terminal or the 5 '-terminal of the double-stranded DNA, but is prefe- rably linked to the 3 '-terminal thereof. The cohesive end may be linked to either only one terminal of the double-stranded DNA or the both terminals thereof.
Method for Producing DNA with Cohesive End(s) The DNA with cohesive end(s) of the present invention can be produced typically according to a method comprising the following steps a) to d) . The method mentioned below is directed to the production of a DNA with a cohesive end, which has a structure to be represented by a formula (2) :
(2) wherein X and c are desired sequences; a and S are sequences that are complementary to and c, respectively; and _f is a sequence of a cohesive end, and which is based on a double-stranded DNA (template DNA) having a structure to be represented by a formula (1) :
(1) wherein X, c, a, S, and &s> have the same meanings as above. However, the present invention is not limited to only the production illustrated herein, but other DNAs with cohesive end(s) having other structures can also be produced in the same manner as below according to the present invention.
Step a)
(a-1) Preparation of Oligonucleotide:
First, the part that shall be selected as the double- stranded part of the intended DNA with a cohesive end is defined on a template DNA. An oligonucleotide, a, which is complementary to its terminal, X, and an oligonucleotide, c, having the same sequence as that of the other terminal, c, are prepared. X and c each may have a sequence having a base length of from 15 to 30 bp or so.
On the other hand, prepared is an oligonucleotide, b, which is complementary to the sequence to be prepared by removing one base (this is referred to as X) from the 5'- terminal of the sequence of the intended cohesive end, a " . The base sequence, & may be a part of the above-mentioned DNA or may be any other sequence that the DNA does not have.
These oligonucleotides, a, b and c, may be prepared by any methods. If their sequences are previously known, they can be synthesized, using a known DNA synthesizer.
(a-2) Preparation of Ribonucleotide-Containing Fragments:
Next, the oligonucleotides, a and b, are linked together via a ribonucleotide. This linkage can be attained by ordinary synthesizing methods. For example, it can be attained according to the process mentioned below.
First, a phosphoryl group is added to the 5 ' -terminal of the oligonucleotide, a, according to the reaction of the following formula (3) : (3) wherein (P) is a phosphoryl group.
This reaction can be effected by the action of a polynucleotide kinase. ATP is used in an amount of from 2 to 10 times or so, by mol, relative to the oligonucleotide, a. The reaction temperature may be from 30 to 40°C or so. The reaction time may be from 10 minutes to 1 hour or so. Most suitably, the pH is from 7 to 9 or so. After the addition of the phosphoryl group thereto, the oligonucleotide is represented by a ' .
[0024] On the other hand, a ribonucleotide is added to the 3'- terminal of the oligonucleotide, b, according to the reaction of the following formula (4) : (4) wherein X is any one of ATP, GTP, CTP and UTP; (rX) is a ribonucleotide.
This reaction can be effected by the action of, for example, a terminal deoxynucleotidyl transferase. For the nucleoside triphosphate (XTP) to be used herein, is selected a ribonucleotide that corresponds to the base X in the previous step (a-1) . The nucleoside triphosphate is used in an amount of from 2 to 10 times, by mol, relative to the oligonucleotide, b. The reaction temperature may be from 30 to 40°C or so. The reaction time may be from 30 minutes to 2 hours or so. After the addition thereto, the oligonucleotide is represented by b'. The sequence of b' is complementary to the sequence, -s^.
The thus-obtained oligonucleotides, a' and b', are mixed, whereby the 5' -terminal (phosphoryl group) of a ' is bonded to the 3 '-terminal (hydroxyl group) of the ribonucleotide of b' , according to the reaction of the following formula (5) :
(5). This reaction can be effected by the action of an RNA ligase in the presence of ATP and divalent metal ions (Japanese Patent Application Laid-Open No. 5-292967) . Divalent metal ions useful in this reaction include, for example, magnesium ions and manganese ions, but preferred are magnesium ions. As the ligase, employable is an RNA ligase. The RNA ligase is an enzyme to catalyze the ligation of the hydroxyl group at the 3 ' -terminal and the phosphoryl group at the 5 ' -terminal , and this also efficiently catalyzes the ligation of a polydeoxyribonucleotide having a ribonucleotide only at its 3'- terminal and a polydeoxyribonucleotide with a 5 '-terminal phosphoryl group. Preferably used is a T4 RNA ligase. The reaction is generally effected in a buffer, at a pH of from 7 to 9 and at a temperature of from 10 to 40°C, over a period of from 30 to 180 minutes. For example, the oligonucleotides may be reacted in a solution comprising 50 mM Tris-HCl (pH 8.0), 20 mM MgCl2, 0.1 mM ATP, 10 mg/ liter BSA, 1 mM hexaammine cobalt chloride (HCC) and 25 % polyethylene glycol 6000, at 25°C for 60 minutes or longer. Step b)
Using the DNA containing the sequence, X, as prepared in the previous step (a-1) , as a template, and using the linked oligonucleotide, b'-a', as prepared in the previous step (a-2), as a primer, prepared is a double-stranded DNA through DNA polymerase reaction. In general, a double-stranded DNA containing the sequence, X, and a sequence, 3, on their strands is subjected to thermal or alkaline denaturation to give single-stranded DNAs, which are added to the primer of b'-a" and subjected to PCR with the oligonucleotide, c, as prepared in the previous step (a-1) . The primer annealing condition and the polymerase reaction condition to be employed herein may be the same as those in ordinary polymerase reaction. The DNA polymerase to be employed herein may be any and every enzyme that can catalyze the DNA chain-extending reaction, which includes, for example, Taq polymerase, Klenow fragment, DNA polymerase I, etc. As a result of this reaction, obtained is a double-stranded DNA with blunt ends, which is represented by the a formula (6) :
(6). Step c) Next, the ribonucleotide is removed from the double- stranded DNA through enzymatic reaction or chemical reaction. One example of useful enzymes is a ribonuclease. The reaction is generally effected at a pH of from 6 to 8 or so, at from 30 to 70°C or so, over a period of from 10 to 60 minutes or so. As non-enzymatic chemicals usable herein, mentioned are sodium hydroxide and the like. As a result of this reaction, obtained is a partly-discontinuous, double-stranded DNA of the following formula (7) , in which the part corresponding to the above- mentioned base, X, has been deleted.
(7). Step d)
After the above step, the nucleotide existing at the 5'- terminal of the above-mentioned deletion is removed. To remove this nucleotide, for example, the double-stranded DNA, from which the ribonucleotide has been removed in the previous step c) , is heated at from 50 to 90°C or so. The polynucleotide that has been separated from the strand through this reaction can be removed, using, for example, a span column or the like. Thus is obtained the double-stranded DNA with a cohesive end of the above-mentioned formula (2) .
In the process mentioned above, obtained is a double- stranded DNA with a cohesive end only at its one 3 '-terminal. In the same manner as this, also obtainable is a double- stranded DNA with cohesive ends at the both 3 ' -terminals.
In the above-mentioned process, a desired sequence, άt^t which does not adjoin the sequence, X, in the template DNA was introduced into the DNA to form the cohesive end. Apart from this, it is also possible to introduce thereinto an additional oligonucleotide that adjoin the sequence in the template DNA to form another cohesive end. For example, in the embodiment mentioned above, an oligonucleotide, c', which is different from the oligonucleotide, c, in that its 3 '-terminal deoxyribonucleotide is substituted with a ribonucleotide, may be used as the primer in place of the oligonucleotide, c, to prepare a double-stranded DNA with two cohesive ends of a formula (8) : (8).
Method of Shuffling DNA
The present invention also provides a method of shuffling a DNA, which is characterized by using DNAs with cohesive end(s) . The terminology "shuffling" as referred to herein in- dicates the operation of dividing a DNA into plural blocks followed by re-sequencing them into a desired, different sequence.
For example, where one DNA has a sequence composed of a number, n, of blocks, as represented by a formula (9) : A - al - a2 - . . . . - an - B (9) wherein the starting end A and/or the terminal end B may be omitted, this may be shuffled according to the present invention to give a different DNA to be represented by a formula (10) : A - al' - a2' - . . . . - ax - B (10) wherein al • , a2 * , . . . , ax are blocks that are independently selected from the group of al, a2, . . . , an; and the total number of the blocks al', a2 ' , . . . , ax may not be the same as the total number of the blocks ciif SL f * • • f £t_n •
The principle of the DNA shuffling of the present invention which utilizes DNAs with cohesive end(s) is graphically illustrated in Fig. 1. In Fig. 1, the DNA is shuffled at the intermediate part, pi - p2 - p3 (the uppermost row) into p3 - pi - p2 (the lowermost row) , without changing the both ends PA and PB- This shuffling operation is useful as a method for obtaining gene sequences that have not heretofore existed naturally, without changing the sequences of the promoter and the terminator.
Concretely, the above-mentioned method of preparing DNAs with cohesive end(s) is applied first to the parts PA, pi, p2, p3 and pβ constituting the template DNA, to thereby prepare DNA blocks, al, a2 and a3, each having the structure with two cohesive ends (formula (8)), and DNA blocks, A and B, each having the structure with one cohesive end (formula (2)). The cohesive ends, aA, aif, a2f and a3f, are formed by removing the corresponding complementary strand from the blocks, PA, pi/ p2 and p3, respectively. The cohesive ends, aιr, a2r, a3r an(* aB, are designed according to the desired sequence to be prepared after the shuffling. In the embodiment of Fig. 1, the end, aιr is designed to be a complementary strand to the end, a3f, and the block, al is ligated to the block a3 after the shuffling. The ligation is conducted, using a DNA ligase in the presence of ATP. The type of the DNA ligase to be employed herein is not specifically defined. In this embodiment, since the single- stranded part of each cohesive end is long, it is unnecessary to employ the ordinary reaction at 16°C, but a thermophilic DNA ligase is advantageously employed.
In the embodiment of Fig. 1, a2r; a3r and aB are designed to be the complementary strands to aι_f, aA and a2f, respectively, in the same manner as above. As a result of the shuff- ling, a sequence having a structure of A - a3 - al - a2 - B is finally obtained. This is seemingly the same as the re- sequenced order of P - p3 - pi - p2 - PB to be obtained by dividing the original DNA into the constitutive blocks pi, p2, p3 , PA and PB, followed by re-sequencing these into a different sequence.
Any other desired sequences can be realized in the same manner as above. If the block, A or B, is made to have two cohesive ends, while the other blocks are made to have one cohesive end, it is possible to obtain still different sequences through shuffling where the latter blocks with one cohesive end are positioned at the terminals.
In the shuffling of the invention, it is also possible to introduce foreign DNA block(s) with cohesive end(s), which are not in the original gene, into the gene DNA. For example, it is possible to shuffle two or more gene DNAs. In this case, the terminal of one gene, for example, the block A in the above-mentioned embodiment, may be processed into a DNA block with two cohesive ends, if desired.
The blocks, which are the units to be shuffled, are oligonucleotides or polynucleotides composed of 2 or more nucleotides (hereinafter referred to as "oligonucleotides") . In general, these are preferably composed of 30 or more nucleotide units, more preferably 45 or more nucleotide units. The uppermost limit of the block length is not specifically defined, provided that the block length is shorter than the length of one gene. If, however, the block length is too large, the re-sequenced DNA to be obtained by the shuffling shall have many non-mutated base sequence parts. Therefore, in general, the block length is preferably within the range of from 10 to 35 % of the length of a gene.
Where the gene to be shuffled is a gene that codes for a protein, it is desirable that the gene blocks, oligonucleotides have the same reading frame before and after the division. Namely, the gene blocks to be shuffled are desirably so designed that they are translated to always give the corresponding amino acid sequences, irrespective of their relative positions in the shuffled sequence. For this, the double-stranded parts and the cohesive ends shall be selected for their codon units in accordance with the reading frame of the gene DNA to be shuffled. Needless-to-say, it is unnecessary to conduct the division into segment blocks with genetic meanings. Namely, it is unnecessary to conduct the division of the gene DNA into the constitutive exons or segment blocks that correspond to the domains or modules of the protein which the gene DNA codes for. There is a probability that the shuffling at such sites would have been examined in the natural world in the past. In order to obtain base sequences that have not heretofore been examined in the natural world, it is desirable that the division of the gene DNA is effected inside the constitutive exons or at the sites corresponding to the inside of the domains or modules of the protein which the gene DNA codes for.
Employing such means, therefore, it is possible to obtain proteins which have different structures as a whole from those of natural proteins but which partly contain amino acid sequences that have been confirmed to be useful in the natural world. Accordingly, the probability of obtaining useful proteins by such means is enlarged, as compared with the means of synthesizing proteins totally at random. The kind of the gene to be shuffled according to the present invention is not specifically defined. Employable herein is any and every gene that is composed of polynucleotide chains and contains a coding region necessary for expressing a protein or RNA. The nucleotide unit may contain any molecule of deoxy- ribonucleotides or ribonucleotides. For the purpose of finding out useful base sequences, preferred are genes coding for proteins, especially enzymes, or control genes for enzymatic functions. Examples of such enzymes include proteases, lipa- ses, cellulases, amylases, catalases, xylanases, oxidases, dehydrogenases, oxygenases and reductases.
The kind of the gene to which the present invention is directed is not specifically defined but shall be such that, when it is introduced into a suitable host, the host can produce the genetic product through expression of the gene. As examples, referred to are genes as cloned from living organisms, artificially synthesized genes, and even genes as cloned from living organisms and artificially mutated. For the genes derived from living organisms, employable are prokaryotes with definite enzyme producibility . As examples of such prokaryotes, mentioned are bacillus bacteria. One example of the genes derived from such bacteria is a protease API21 gene derived from Bacillus NKS-21 (FERM BP-93-1) (Japanese Patent Application Laid-Open No. 5-91876, Sequence Number 1).
DNA Pool
The present invention also provides a DNA pool to be obtained according to the above-mentioned shuffling method. The "DNA pool" as referred to herein means a high-density mixture of two or more DNAs. The DNA pool of the present invention can contain a particular number or more, for example, 10 or more different DNA molecules having different structures. It is desirable that, when the mixture, DNA pool is directly used in biochemical operation or reaction, it is in such a form that all the plural nucleic acid components constituting it can be reacted. However, the form of the mixture, DNA pool is not specifically defined, and the DNA pool may be either in solution or dry mixture.
To produce the DNA pool, for example, a plurality of cohesive ends for each block are prepared in the above-men- tioned shuffling process. Referring to the embodiment of Fig. 1, for example, when, for the cohesive end aιr of the block al, complementary strands to the other cohesive ends, aA and a2f, are prepared in addition to the complementary strand to a3f, then DNAs of A - al - a2 - B and A - al -a2 - al can be ob- tained. If a complementary strand to the other cohesive end alf of al is added, it is also possible to produce other DNAs comprising a series of the same blocks, such as A - al - al - al.
In the same manner, for the cohesive ends of a2 and a3 , if oligonucleotides that are complementary to the cohesive ends of the other blocks or complementary to the other cohesive end of themselves are added, other sequences comprising these can be produced.
In general, a DNA is divided into blocks of al, a2 , a3 , . . . , an- Then, each block is processed to have a cohesive end or cohesive ends according to the above-mentioned process. The cohesive ends are designed to be oligonucleotides that are complementary to the cohesive ends of the other blocks or are complementary to the other cohesive end of themselves. All or a part of the thus-obtained DNA blocks are mixed and ligated to each other, thereby producing a nucleic acid pool containing different nucleic acids composed of the blocks as differently sequenced at random. Expression of Genetic Information in Shuffled DNA or DNA Pool
The thus-shuffled, single or mixed, double-stranded DNAs are blunted. The blunting may be omitted, if DNA blocks with one cohesive end are positioned at the ends of the shuffled, double-stranded DNA. For example, the 5 '-terminal of the sequence containing a DNA block with a predetermined promoter sequence, which is based on the direction of the promoter, is not made to have a cohesive end but is made to have a blunt end, while the 3 '-terminal of the sequence containing a DNA block with a predetermined terminator sequence, which is based on the direction of the terminator, is not made to have a cohesive end but is also made to have a blunt end. In that manner, it is possible to directly obtain a gene in which the blocks of the intended gene have been shuffled between the promoter and the terminator, without blunting it. After this, the thus-shuffled DNA is inserted into a desired vector, preferably an expression vector such as pKK223-3, using a DNA ligase. the promoter sequence and the terminator sequence to be in the shuffled DNA are not limited to only one each, but a plurality of promoter sequences and terminator sequences may be therein.
If desired, the polynucleotide blocks positioned at the both ends of the shuffled DNA may be designed to have suitable restriction enzyme recognizing sites. In this case, the DNA may be ligated to a suitable vector, using the defined re- striction enzymes.
Next, the vector library thus produced in the manner mentioned above is introduced into a suitable host, in which the genetic information is expressed. Thus, the intended gene- tic product with favorable properties and also the gene coding for it can be obtained. Any and every ordinary host can be used herein. Preferred examples of the host include cells of E. coli , bacillus bacteria, yeasts, and lactic acid bacteria. If desired, in-vitro transcription systems and translation systems are also employable herein. In those cases, the genetic information can be expressed even when the gene is not ligated to a vector.
The "genetic information" as referred to herein indicates the information on a gene which is carried by a DNA and which is translated into a protein or is transcribed into RNA in a suitable living body by the DNA for itself or after having been ligated to any other DNA or RNA.
The genetic information that is expected to be expressed according to the method of the present invention is not specifically defined, but includes, for example, those on various genetic products, such as enzymes, antibodies, hormones receptor proteins and ribozymes, and those on various control functions of, for example, operators, promoters and attenua- tors.
Examples
Now, the present invention is described in detail hereinunder with reference to the following examples, which, however, are not intended to restrict the scope of the present invention. Example 1: Production of DNA Pool
A nucleic acid pool was produced in accordance with the process mentioned below, based on the wild-type alkali protease
(Japanese Patent Application-Laid Open No. 5-91876) as cloned from a protease API21 (Bacillus NKS-21; FERM BP-93-1) having a sequence of Sequence Number 1.
(1) Step a) : Preparation of Oligonucleotide Blocks for Primer (1-1) Synthesis of Oligonucleotide Blocks:
Using an automatic DNA synthesizer, Model 392 (manufactured by Perkin Elmer Co.), synthesized were 14 oligonucleotides; oligo FW (Sequence Number 2) , oligo RV (Sequence Number 3) , oligo lr (Sequence Number 4) , oligo lb (Sequence Number 5) , oligo la (Sequence Number 6) , oligo 2r (Sequence Number 7) , oligo 2b (Sequence Number 8) , oligo 2a (Sequence Number 9) , oligo 3r (Sequence Number 10) , oligo 3b (Sequence Number 11) , oligo 3a (Sequence Number 12) , oligo 4r (Sequence Number 13), oligo 4b (Sequence Number 14), oligo 4a (Sequence Number 15) and oligo A (Sequence Number 16) . These are parts of the base sequence of API21 (Japanese Patent Application Laid-Open No. 5-91876) (including complementary strands) , or oligonucleotides containing a part of the base sequence. However, the sequence of oligo 4a is to follow glutamine of Sequence Number 1 and, and this contains a termination codon of the gene. These oligonucleotides were so designed that they might be the best when the oligo A was overhung on the 3 '-terminal of the amplified DNA in the experiment to follow hereinunder, using a Tag polymerase. These oligonucleotides were synthesized in a DM trityl-on condition (that is, while the 5' -hydroxyl group was protected with dimethoxytrityl group) , and purified through an OPC column. The reagents used herein were obtained from Perkin Elmer Co.
(1-2) Addition of Ribonucleotide to Blocks:
Next, 500 pmols of oligo lr, 1 nmol of ATP and 10 units of terminal deoxynucleotidyl transferase were added to a standard solution comprising: 50 mM Tris-HCl buffer (pH 8.0)
10 mM MgCl2
5 mM DTT (dithiothreitol)
25 % PEG 6000
1 mM HCC (hexaammine cobalt chloride) 10 μg/ml BSA (bovine serum albumin) , to thereby make 10 μl in total. The resulting solution was left at 37 °C for 1 hour.
Oligo 2r, oligo 3r, oligo 4r, oligo lb, oligo 2b, oligo 3b and oligo 4b were processed in the same manner as above. These four polynucleotides thus formed are referred to as oligo lr ' , oligo 2r', oligo 3r', oligo 4r', oligo lb', oligo 2b', oligo 3b' and oligo 4b' . (1-3) Phosphorylation:
500 pmols of oligo la, 1 nmol of ATP and 10 units of polynucleotide kinase were dissolved in the standard solution having the same composition as above to make 10 μl in total.
The resulting solution was left at 37_C for 1 hour. Oligo 2a, oligo 3a and oligo 4a were processed in the same manner as above. These polynucleotides thus formed are referred to as oligo la', oligo 2a', oligo 3a' and oligo 4a'. (1-4) Ligation of Oligonucleotide Blocks:
500 pmols of oligo la', 100 pmols of oligo lb', 100 pmols of oligo 2b', 100 pmols of oligo 3b', 100 pmols of oligo 4b', which had been obtained in the above, as well as 1 nmol of ATP and 50 units of T4 RNA ligase were added to the same standard solution as that mentioned above to make 10 μl in total, and these were reacted at 25°C for 4 hours. The other combinations, oligo 2a' with oligo lb', oligo 2b', oligo 3b' and oligo 4b'; oligo 3a' with oligo lb', oligo 2b', oligo 3b' and oligo 4b'; and oligo 4a' with oligo lb', oligo 2b', oligo 3b' and oligo 4b', were also reacted in the same manner as above. A mixture of the four polynucleotides thus formed as a result of this reaction, oligo la' ligated to oligo lb', oligo 2b', oligo 3b' and oligo 4b', is referred to as oligo 1M; a mixture of the four polynucleotides, oligo 2a' ligated to oligo lb', oligo 2b', oligo 3b' and oligo 4b', is referred to as oligo 2M; a mixture of the four polynucleotides, oligo 3a' ligated to oligo lb', oligo 2b', oligo 3b' and oligo 4b', is referred to as oligo 3M; and a mixture of the four polynucleotides, oligo 4a' ligated to oligo lb', oligo 2b', oligo 3b' and oligo 4b' , is referred to as oligo 4M. (2) Steps b) to d) : Formation of Gene Blocks A template, plasmid pSDT812 (Japanese Patent Application Laid-Open No. 1-141596) , which had been prepared by inserting, into the Clal cleaving site of pHSG396, the gene of the wild- type alkali protease as cloned from Bacillus NKS-21, was subjected to PCR with primers, oligo 1M and oligo 2r'. The gene fragment as amplified through this reaction was treated with a ribonuclease, and then heated at 80_C for 5 minutes, whereby the polynucleotide (s) positioned at the 5 '-terminal of the ribonucleotide existing in the both strands or one strand was/were removed. As a result of this, prepared was a gene block with cohesive end(s) . This gene block is referred to as block 1M.
The other four combinations, oligo 2M and oligo 3r', oligo 3M and oligo 4r', oligo 4M and oligo RV, and oligo FW and oligo lr', were processed in the same manner as above. These blocks thus prepared are referred to as block 2M, block 3M, block B, and block F, respectively.
Example 2 : Shuffling
Block 1M, block 2M, block 3M, block B and block F of the same amount were blended and ligated together, using Pfu DNA ligase.
After the ligation, the reaction mixture was subjected to agarose gel electrophoresis, through which was collected the DNA fragment of about 1.5 kbp.
Example 3: Identification of Nucleic Acid Pool
The thus-collected DNA of about 1.5 kbp was digested with restriction enzymes, EcoRI and BamHI, then mixed with a plasmid, pHY300PLK (manufactured by Yakulto Honsha Co.), which had been digested with restriction enzymes, EcoRI and BamHI and processed with an alkali phosphatase, and thus ligated together, using a ligation kit (manufactured by Takara Shuzo Co.). Using the resulting DNA, cells of E. coli JM105 were transformed, from which were selected tetracycline-resistant transformants. From these transformants, plasmid DNAs were extracted, purified and analyzed according to ordinary methods. Thus were obtained 97 clones with a DNA of 1.5 kbp as inserted between the EcoRI and BamHI recognizing sites of pHY300PLK.
The base sequences of these DNAs thus obtained in the manner mentioned above were sequenced to analyze how block 1M, block 2M, block 3M, block F and block B were ligated in what order or, that is, how these were shuffled. As in the principle, block F was positioned at the first site while block B at the fifth site, and block 1M, block 2M and block 3M were shuffled between the two. Table 1 shows different, types of shuffling, and the number of clones with each type.
Table 1
Figure imgf000035_0001
As in the above, it has been confirmed that, if three blocks of one gene are shuffled according to the method of the present invention, a nucleic acid pool is obtained that covers all combinations of clones each containing the same or different three of these blocks.
Example 4: Screening of Genetic Products Obtained from Nucleic Acid Pool
The DNAs as produced in Example 3 were mixed. Using the resulting DNA mixture, cells of Bacillus subtiliε UOT0999 were transformed. Tetracycline-resistant transformants were selec- ted. 300 transformants were replicated on a skim milk-containing medium plate, on which were found clear zones around the colonies of 12 transformants. Accordingly, it is under- stood that the enzyme which the shuffled gene codes for can be selected depending on its activity. The base sequences of these 12 clones that formed the clear zones were analyzed, from which it was found that these were sequenced in the same order of blocks as in the wild-type enzyme.
Example 5 : Detection of Genetic Products
From 10 clones (one clone forms halo, while 9 clones do not) as selected from the transformants that had obtained in Example 3, and also from the host, Bacillus subtilis U0T0999, full-length RNAs were prepared. These were procesεed with a ribonuclease-free deoxyribonuclease, in order to remove the influence of the plasmids on the hybridization to be effected later on. Next, using oligo lr as the probe, these were subj- ected to Northern hybridization. As a result, all lanes corresponding to the RNA of the transformants gave detectable bands, but no band was detected on the lanes corresponding to the RNA of the host.
Advantages of the Invention
According to the present invention, provided is a double- stranded DNA molecule with any desired cohesive end or ends. Using this, it is possible to obtain various DNAs with various base sequences which are substantially apart from the naturally-existing base seguence spaces, and also a DNA pool of a mixture of such DNAs, through simple processes. Therefore, it is possible to obtain excellent genetic products, such as proteins and enzymes, which could not be obtained in conventional methods and which were not examined by organisms in the past. In addition, according to the method of the present invention for producing a nucleic acid pool, it is possible to obtain a mixture of nucleic acids while optionally shuffling the constitutive blocks at random in the intermediate parts but fixing the terminal sequences to be predetermined, desired ones, and it is also possible to shuffle the constitutive blocks without changing the amino acid sequence which each block codes for. Therefore, as compared with a me- thod of producing a completely-randomized nucleic acid pool, there is a high possibility that useful genetic products can be produced according to the method of the present invention.
Sequence Listing
Sequence Number: 1
Length of Sequence: 1122 Type of Sequence: Nucleic Acid
Number of Strands: Double-stranded
Topology: Linear
Kind of Sequence: Genomic DNA
Source: Bacillus NKS-21 (FERM BP-93-1) Characteristics of Sequence:
Code Indicating Characteristics: Sig Peptide
Existing Site: 1 . . . 93
Method of Determining Characteristics: S
Code Indicating Characteristics: Mat Peptide Existing Site: 104 . . . 1112
Method of Determining Characteristic: S
Sequence:
ATG AAT CTT CAA AAA ATA GCC TCA GCG TTG AAG GTT AAG CAA TCG GCA48 Met Asn Leu Gin Lys lie Ala Ser Ala Leu Lys Val Lys Gin Ser Ala -100 -95 -90
TTG GTC AGC AGT TTA ACT ATT TTG TTT CTA ATC ATG CTA GTA GGT ACG96 Leu Val Ser Ser Leu Thr lie Leu Phe Leu lie Met Leu Val Gly Thr
-85 -80 -75
ACT AGT GCA AAT GGT GCG AAG CAA GAG TAC TTA ATT GGT TTC AAC TCA 144 Thr Ser Ala Asn Gly Ala Lys Gin Glu Tyr Leu lie Gly Phe Asn Ser -70 -65 -60 -55
GAC AAG GCA AAA GGA CTT ATC CAA AAT GCA GGT GGA GAA ATT CAT CAT 192 Asp Lys Ala Lys Gly Leu lie Gin Asn Ala Gly Gly Glu lie His His -50 -45 -40 GAA TAT ACA GAG TTT CCA GTT ATC TAT GCA GAG CTT CCA GAA GCA GCG 240 Glu Tyr Thr Glu Phe Pro Val lie Tyr Ala Glu Leu Pro Glu Ala Ala
-35 -30 -25
GTA AGT GGA TTG AAA AAT AAT CCT CAT ATT GAT TTT ATT GAG GAA AAC 288 Val Ser Gly Leu Lys Asn Asn Pro His lie Asp Phe lie Glu Glu Asn -20 -15 -10
GAA GAA GTT GAA ATT GCA CAG ACT GTT CCT TGG GGA ATC CCT TAT ATT 336 Glu Glu Val Glu lie Ala Gin Thr Val Pro Trp Gly He Pro Tyr He -5 1 5 10 TAC TCG GAT GTT GTT CAT CGT CAA GGT TAC TTT GGG AAC GGA GTA AAA 384 Tyr Ser Asp Val Val His Arg Gin Gly Tyr Phe Gly Asn Gly Val Lys
15 20 25
GTA GCA GTA CTT GAT ACA GGA GTG GCT CCT CAT CCT GAT TTA CAT ATT 432 Val Ala Val Leu Asp Thr Gly Val Ala Pro His Pro Asp Leu His He 30 35 40
AGA GGA GGA GTA AGC TTT ATC TCT ACA GAA AAC ACT TAT GTG GAT TAT 480 Arg Gly Gly Val Ser Phe He Ser Thr Glu Asn Thr Tyr Val Asp Tyr 45 50 55 AAT GGT CAC GGT ACT CAC GTA GCT GGT ACT GTA GCT GCC CTA AAC AAT 528 Asn Gly His Gly Thr His Val Ala Gly Thr Val Ala Ala Leu Asn Asn
60 65 70
TCA TAT GGC GTA TTG GGA GTG GCT CCT GGA GCT GAA CTA TAT GCT GTT 576 Ser Tyr Gly Val Leu Gly Val Ala Pro Gly Ala Glu Leu Tyr Ala Val 75 80 85 90
AAA GTT CTT GAT CGT AAC GGA AGC GGT TCG CAT GCA TCC ATT GCT CAA 624 Lys Val Leu Asp Arg Asn Gly Ser Gly Ser His Ala Ser He Ala Gin
95 100 105
GGA ATT GAA TGG GCG ATG AAT AAT GGG ATG GAT ATT GCC AAC ATG AGT 672 Gly He Glu Trp Ala Met Asn Asn Gly Met Asp He Ala Asn Met Ser 110 115 120
TTA GGA AGT CCT TCT GGG TCT ACA ACC CTG CAA TTA GCA GCA GAC CGC 720 Leu Gly Ser Pro Ser Gly Ser Thr Thr Leu Gin Leu Ala Ala Asp Arg 125 130 135 GCT AGG AAT GCA GGT GTC TTA TTA ATT GGG GCG GCT GGA AAC TCA GGA 768 Ala Arg Asn Ala Gly Val Leu Leu He Gly Ala Ala Gly Asn Ser Gly
140 145 150
CAA CAA GGC GGC TCG AAT AAC ATG GGC TAC CCA GCG CGC TAT GCA TCT 816 Gin Gin Gly Gly Ser Asn Asn Met Gly Tyr Pro Ala Arg Tyr Ala Ser 155 160 165 170
GTC ATG GCT GTT GGA GCG GTG GAC CAA AAT GGA AAT AGA GCG AAC TTT 864 Val Met Ala Val Gly Ala Val Asp Gin Asn Gly Asn Arg Ala Asn Phe
175 180 185
TCA AGC TAT GGA TCA GAA CTT GAG ATT ATG GCG CCT GGT GTC AAT ATT 912 Ser Ser Tyr Gly Ser Glu Leu Glu He Met Ala Pro Gly Val Asn He 190 195 200
AAC AGT ACG TAT TTA AAT AAC GGA TAT CGC AGT TTA AAT GGT ACG TCA 960 Asn Ser Thr Tyr Leu Asn Asn Gly Tyr Arg Ser Leu Asn Gly Thr Ser 205 210 215 ATG GCA TCT CCA CAT GTT GCT GGG GTA GCT GCA TTA GTT AAA CAA AAA1008 Met Ala Ser Pro His Val Ala Gly Val Ala Ala Leu Val Lys Gin Lys
220 225 230
CAC CCT CAC TTA ACG GCG GCA CAA ATT CGT AAT CGT ATG AAT CAA ACA1056 His Pro His Leu Thr Ala Ala Gin He Arg Asn Arg Met Asn Gin Thr 235 240 245 250
GCA ATT CCG CTT GGT AAC AGC ACG TAT TAT GGA AAT GGC TTA GTG GAT1104 Ala He Pro Leu Gly Asn Ser Thr Tyr Tyr Gly Asn Gly Leu Val Asp
255 260 265
GCT GAG TAT GCG GCT CAA 1122 Ala Glu Tyr Ala Ala Gin 270 272 Sequence Number: 2
Length of Sequence: 20 Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
GATTTTAGAA TTCGCAGCGG
Sequence Number: 3
Length of Sequence: 25
Type of Sequence: Nucleic Acid
Number of Strand: Single-stranded
Topology: Linear Kind of Sequence: Other Nucleic Acid, Synthetic DNA
Sequence:
CCGGATTCCT TAAAGCCCTG AATAA
Sequence Number: 4 Length of Sequence: 17
Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
ACAGTCTGTG CAATTTC
Sequence Number: 5 Length of Sequence: 17 Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
GAAATTGCAC AGACTGT
Sequence Number: 6 Length of Sequence: 20
Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
CCTTGGGGAA TCCCTTATAT
Sequence Number: 7
Length of Sequence: 17 Type of Sequence: Nucleic Acid
Number of Strand: Single-stranded
Topology: Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA
Sequence: CCCAATACGC CATATGA Sequence Number: 8
Length of Sequence: 17 Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
TCATATGGCG TATTGGG
Sequence Number: 9
Length of Sequence: 20 Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
GTGGCTCCTG GAGCTGAACT
Sequence Number: 10
Length of Sequence: 16
Type of Sequence: Nucleic Acid
Number of Strand: Single-stranded
Topology: Linear Kind of Sequence: Other Nucleic Acid, Synthetic DNA
Sequence:
TCTGATCCAT AGCTTG Sequence Number: 11
Length of Sequence: 16 Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology : Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
CAAGCTATGG ATCAGA
Sequence Number: 12
Length of Sequence: 20
Type of Sequence: Nucleic Acid
Number of strand: Single-stranded
Topology: Linear Kind of Sequence: Other Nucleic Acid, Synthetic DNA
Sequence:
CTTGAGATTA TGGCGCCTGG
Sequence Number: 13 Length of Seguence: 17
Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
TGAGCCGCAT ACTCAGC
Sequence Number: 14 Length of Sequence: 17 Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
GCTGAGTATG CGGCTCA
Sequence Number: 15 Length of Sequence: 20
Type of Sequence: Nucleic Acid Number of Strand: Single-stranded Topology: Linear
Kind of Sequence: Other Nucleic Acid, Synthetic DNA Sequence:
TAATCCCTAA GGATGTACTG
Brief Description of the Drawing
Fig. 1 is a graphical view showing one embodiment of trie method of the present invention for shuffling a DNA.
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM
Figure imgf000045_0001
For receiving Office use only For International Bureau use only 2J Th_s sneet was received with the international This sheet was received bv the International Bureau application
Authorized officer Authorized officer
Figure imgf000045_0002
Form PCT/RO/134 (July 1992)

Claims

1. A DNA with a cohesive end comprising (a) a double- stranded DNA having the same sequence as that of a part of a gene, and (b) a single-stranded DNA having a base sequence that exists on said gene at the site not adjoining the part corresponding to said double-stranded DNA or a base sequence which said gene does not have, wherein the single-stranded DNA is linked to either one end of the double-stranded DNA to form a cohesive end.
2. A DNA with cohesive ends comprising (a) a double- stranded DNA having the same sequence as that of a part of a gene, (b) a first, single-stranded DNA having a base sequence that exists on said gene at the site not adjoining the part corresponding to said double-stranded DNA or a base sequence which said gene does not have, and (c) a second, single- stranded DNA having a base sequence that exists on said gene at the site adjoining the part corresponding to said double- stranded DNA, wherein the second, single-stranded DNA is linked to said double-stranded DNA at one end corresponding to said adjoining site, while the first, single-stranded DNA is linked thereto at the other end of the complementary strand opposite to said end, thereby forming cohesive ends.
3. The DNA with a cohesive end or cohesive ends as claimed in claim 1 or 2 , wherein the single-stranded DNA has a length of 2 bases or more.
4. The DNA with a cohesive end or cohesive ends as claimed in any one of claims 1 to 3, wherein the cohesive end/ends is/are positioned at the 3 '-terminal/terminals.
5. A method for producing a DNA with a cohesive end or cohesive ends, wherein a part of a DNA, as a template, and an oligonucleotide containing at least one ribonucleotide, as a primer, are subjected to DNA polymerase reaction to prepare a double-stranded DNA, then the ribonucleotide (s) is/are removed through enzymatic reaction or chemical reaction, and the nucleotide (s) remaining at the 5 ' -terminal (s) of the site(s) at which said ribonucleotide (s) existed are removed.
6. A method for producing the DNA with a cohesive end as set forth in claim 1, comprising the following steps a) to d) : a) a step of linking (i) an oligonucleotide having the same base sequence as that of a part of a gene DNA to (ii) an oligonucleotide having a base sequence that exists on the gene at the site not adjoining the base sequence of (i) or a base sequence which the gene does not have, and containing at least one ribonucleotide, in such a manner that the oligonucleotide (ii) is positioned at the 5' -terminal of the oligonucleotide
(i); b) a step of preparing a double-stranded DNA through DNA polymerase reaction between a DNA containing the part corresponding to the oligonucleotide (i) in said a) , as a template, and the linked oligonucleotide as obtained in the previous step a) , as a primer; c) a step of removing the ribonucleotide from said double- stranded DNA through enzymatic reaction or chemical reaction; and d) a step of removing the nucleotide remaining at the 5'- terminal of the site at which said ribonucleotide existed.
7. A method for producing the DNA with cohesive ends as set forth in claim 2, comprising the following steps a) to d) : a) a step of linking (i) an oligonucleotide having the same base sequence as that of a part of a gene DNA to (ii) an oligonucleotide having a base sequence that exists on the gene at the site not adjoining the base sequence of (i) or a base sequence which the gene does not have, and containing at least one ribonucleotide, in such a manner that the oligonucleotide (ii) is positioned at the 5' -terminal of the oligonucleotide (i) ; b) a step of preparing a double-stranded DNA through DNA polymerase reaction between a DNA containing the part corresponding to the oligonucleotide (i) in said a), as a template, and (i) the linked oligonucleotide as obtained in the previous step a) and (ii) an oligonucleotide which is a complementary strand of an oligonucleotide existing on the gene at the site separated from said oligonucleotide-corresponding part by at least 3 bases or more toward the 3" -terminal and which contains at least one ribonucleotide, as primers; c) a step of removing the ribonucleotides from said double-stranded DNA through enzymatic reaction or chemical reaction; and d) a step of removing the nucleotides remaining at the 5'- ter inals of the sites at which said ribonucleotides existed.
8. A method for shuffling a DNA, comprising dividing a DNA into a plurality of DNA blocks each having a cohesive end or cohesive ends, followed by ligating them together into a sequence that is different from the sequence of the original, non-divided DNA.
9. A method for shuffling a DNA, comprising applying the method as set forth in any one of claims 5 to 7 to various sites of a DNA, thereby dividing the DNA into a plurality of DNA blocks each having a cohesive end or cohesive ends, at least one block of which shall have a cohesive end that is complementary to the cohesive end of another block not having been directly adjacent to said one block on the original DNA, followed by ligating them together into a sequence that is different from the sequence of the original, non-divided DNA.
10. The shuffling method as claimed in claim 8 or 9 , wherein the DNA is divided into 3 or more blocks .
11. The shuffling method as claimed in any one of claims 8 to 10, wherein the blocks are ligated together using a DNA ligase.
12. A DNA as shuffled according to the method as set forth in any one of claims 8 to 11.
13. The DNA as claimed in claim 12, wherein a gene coding for an enzymatic function or a control gene for the gene is shuffled.
14. The DNA as claimed in claim 13, wherein the gene is a gene that codes for any one of proteases, Upases, cellulases, amylases, catalases, xylanases, oxidases, dehydrogenases, oxygenases and reductases.
15. The DNA as claimed in claim 13 or 14, wherein the gene is one derived from prokaryotes.
16. The DNA as claimed in claim 15, wherein the gene is one derived from bacillus bacteria.
17. The DNA as claimed in claim 16, wherein the gene is a protease API21 gene.
18. A DNA pool containing plural kinds of DNAs having different structures that are obtained according to the shuffling method as set forth in any one of claims 8 to 11.
19. The DNA pool as claimed in claim 18, which contains 10 or more kinds of DNAs.
20. A method for producing a DNA pool, comprising applying the method as set forth in any one of claims 5 to 7 to various sites of a template DNA to thereby prepare a mixture of DNA blocks each having a cohesive end or cohesive ends that satisfies the following conditions, followed by ligating these into any desired sequences:
Condition 1: Each block has a double-stranded site having the same sequence as that of a part of the template DNA.
Condition 2: At least two of the blocks that constitute the block mixture further have, in addition to said double- stranded site, a single-stranded site (cohesive end) that is complementary to the cohesive end of blocks that are not directly adjacent to said blocks on the template DNA.
Condition 3: The block mixture contains at least two different blocks which are the same in the double-strcinded site but are different only in the single-stranded site and which satisfy the condition 2.
21. The method for producing a DNA pool as claimed in claim 20, wherein the template DNA is a gene that codes for an enzymatic function or a control gene DNA for the gene.
22. The method for producing a DNA pool as claimed in claim 21, wherein the template DNA is a gene DNA that codes for any one of proteases, lipases, cellulases, a ylases, catalases, xylanases, oxidases, dehydrogenases, oxygenases and reductases.
23. The method for producing a DNA pool as claimed in claim 22, wherein the template DNA is one derived from prokaryotes.
24. The method for producing a DNA pool as claimed in claim 23, wherein the template DNA is one derived from bacillus bacteria.
25. The method for producing a DNA pool as claimed in claim 24, wherein the template DNA is a protease API21 gene.
26. The method for producing a DNA pool as claimed in any one of claims 20 to 25, wherein the DNA blocks are ligated together using a DNA ligase.
27. A genetic product to be obtained by expressing the genetic information on DNA molecules that exist in the DNA pool as set forth in any one of claims 18 to 26.
PCT/DK1997/000317 1996-08-07 1997-07-23 Double-stranded dna with cohesive end(s), and method of shuffling dna using the same WO1998005765A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU36917/97A AU3691797A (en) 1996-08-07 1997-07-23 Double-stranded dna with cohesive end(s), and method of shuffling dna using the same

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP8208422A JPH1066576A (en) 1996-08-07 1996-08-07 Double-stranded dna having protruding terminal and shuffling method using the same
JP8/208422 1996-08-07

Publications (1)

Publication Number Publication Date
WO1998005765A1 true WO1998005765A1 (en) 1998-02-12

Family

ID=16555966

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/DK1997/000317 WO1998005765A1 (en) 1996-08-07 1997-07-23 Double-stranded dna with cohesive end(s), and method of shuffling dna using the same

Country Status (3)

Country Link
JP (1) JPH1066576A (en)
AU (1) AU3691797A (en)
WO (1) WO1998005765A1 (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6248541B1 (en) 2000-04-21 2001-06-19 Genencor International, Inc. Screening under nutrient limited conditions
WO2001081568A1 (en) * 2000-04-21 2001-11-01 Genencor International, Inc. Non-pcr based recombination of nucleic acids
WO2002018629A1 (en) * 2000-08-29 2002-03-07 Macquarie University Degenerate oligonucleotide gene shuffling
EP1192280A1 (en) * 1999-06-14 2002-04-03 Diversa Corporation Synthetic ligation reassembly in directed evolution
US6534292B1 (en) 2000-05-08 2003-03-18 Genencor International, Inc. Methods for forming recombined nucleic acids
US6582914B1 (en) 2000-10-26 2003-06-24 Genencor International, Inc. Method for generating a library of oligonucleotides comprising a controlled distribution of mutations
US6808904B2 (en) 1994-06-16 2004-10-26 Syngenta Participations Ag Herbicide-tolerant protox genes produced by DNA shuffling
EP1482433A2 (en) * 2001-08-10 2004-12-01 Xencor Protein design automation for protein libraries
US6994963B1 (en) 2000-07-10 2006-02-07 Ambion, Inc. Methods for recombinatorial nucleic acid synthesis
WO2006047669A2 (en) * 2004-10-27 2006-05-04 Monsanto Technology Llc Non-random method of gene shuffling
US7315786B2 (en) 1998-10-16 2008-01-01 Xencor Protein design automation for protein libraries
US7379822B2 (en) 2000-02-10 2008-05-27 Xencor Protein design automation for protein libraries
WO2013032850A3 (en) * 2011-08-26 2013-04-25 Gen9, Inc. Compositions and methods for high fidelity assembly of nucleic acids
AU2009212959B2 (en) * 1999-06-14 2013-08-15 Bp Corporation North America Inc. Synthetic ligation reassembly in directed evolution
US9216414B2 (en) 2009-11-25 2015-12-22 Gen9, Inc. Microfluidic devices and methods for gene synthesis
US9217144B2 (en) 2010-01-07 2015-12-22 Gen9, Inc. Assembly of high fidelity polynucleotides
US10081807B2 (en) 2012-04-24 2018-09-25 Gen9, Inc. Methods for sorting nucleic acids and multiplexed preparative in vitro cloning
US10202608B2 (en) 2006-08-31 2019-02-12 Gen9, Inc. Iterative nucleic acid assembly using activation of vector-encoded traits
US10207240B2 (en) 2009-11-03 2019-02-19 Gen9, Inc. Methods and microfluidic devices for the manipulation of droplets in high fidelity polynucleotide assembly
US10308931B2 (en) 2012-03-21 2019-06-04 Gen9, Inc. Methods for screening proteins using DNA encoded chemical libraries as templates for enzyme catalysis
US10457935B2 (en) 2010-11-12 2019-10-29 Gen9, Inc. Protein arrays and methods of using and making the same
CN111019938A (en) * 2019-12-06 2020-04-17 天津大学 Long-chain DNA with sticky end and preparation method thereof
US11072789B2 (en) 2012-06-25 2021-07-27 Gen9, Inc. Methods for nucleic acid assembly and high throughput sequencing
US11084014B2 (en) 2010-11-12 2021-08-10 Gen9, Inc. Methods and devices for nucleic acids synthesis

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6958213B2 (en) 2000-12-12 2005-10-25 Alligator Bioscience Ab Method for in vitro molecular evolution of protein function
US20020086292A1 (en) 2000-12-22 2002-07-04 Shigeaki Harayama Synthesis of hybrid polynucleotide molecules using single-stranded polynucleotide molecules

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1991007506A1 (en) * 1989-11-08 1991-05-30 The United States Of America, Represented By The Secretary, United States Department Of Commerce A method of synthesizing double-stranded dna molecules
WO1995017413A1 (en) * 1993-12-21 1995-06-29 Evotec Biosystems Gmbh Process for the evolutive design and synthesis of functional polymers based on designer elements and codes
WO1995022625A1 (en) * 1994-02-17 1995-08-24 Affymax Technologies N.V. Dna mutagenesis by random fragmentation and reassembly

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1991007506A1 (en) * 1989-11-08 1991-05-30 The United States Of America, Represented By The Secretary, United States Department Of Commerce A method of synthesizing double-stranded dna molecules
WO1995017413A1 (en) * 1993-12-21 1995-06-29 Evotec Biosystems Gmbh Process for the evolutive design and synthesis of functional polymers based on designer elements and codes
WO1995022625A1 (en) * 1994-02-17 1995-08-24 Affymax Technologies N.V. Dna mutagenesis by random fragmentation and reassembly

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
CHEMISTRY LETTERS, Volume 2, 1995, KOICHI NISHIGAKI et al., "Restriction-Enzyme-Nondependent Recombination and Rearrangement of DNA (RRR)", page 131. *

Cited By (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6808904B2 (en) 1994-06-16 2004-10-26 Syngenta Participations Ag Herbicide-tolerant protox genes produced by DNA shuffling
US7315786B2 (en) 1998-10-16 2008-01-01 Xencor Protein design automation for protein libraries
EP1192280A1 (en) * 1999-06-14 2002-04-03 Diversa Corporation Synthetic ligation reassembly in directed evolution
AU2009212959B2 (en) * 1999-06-14 2013-08-15 Bp Corporation North America Inc. Synthetic ligation reassembly in directed evolution
EP2865754A1 (en) * 1999-06-14 2015-04-29 BP Corporation North America Inc. Synthetic ligation reassembly in directed evolution
USRE45349E1 (en) 1999-06-14 2015-01-20 Bp Corporation North America Inc. Synthetic ligation reassembly in directed evolution
EP1192280A4 (en) * 1999-06-14 2006-04-05 Diversa Corp Synthetic ligation reassembly in directed evolution
US7379822B2 (en) 2000-02-10 2008-05-27 Xencor Protein design automation for protein libraries
US6248541B1 (en) 2000-04-21 2001-06-19 Genencor International, Inc. Screening under nutrient limited conditions
WO2001081568A1 (en) * 2000-04-21 2001-11-01 Genencor International, Inc. Non-pcr based recombination of nucleic acids
US6534292B1 (en) 2000-05-08 2003-03-18 Genencor International, Inc. Methods for forming recombined nucleic acids
US7037726B2 (en) 2000-05-08 2006-05-02 Genencor International, Inc. Methods for forming recombined nucleic acids
US6994963B1 (en) 2000-07-10 2006-02-07 Ambion, Inc. Methods for recombinatorial nucleic acid synthesis
US7771974B2 (en) 2000-08-29 2010-08-10 Macquarie University Degenerate oligonucleotide gene shuffling
WO2002018629A1 (en) * 2000-08-29 2002-03-07 Macquarie University Degenerate oligonucleotide gene shuffling
US6582914B1 (en) 2000-10-26 2003-06-24 Genencor International, Inc. Method for generating a library of oligonucleotides comprising a controlled distribution of mutations
EP1482433A3 (en) * 2001-08-10 2006-04-12 Xencor, Inc. Protein design automation for protein libraries
EP1482433A2 (en) * 2001-08-10 2004-12-01 Xencor Protein design automation for protein libraries
WO2006047669A3 (en) * 2004-10-27 2006-08-24 Monsanto Technology Llc Non-random method of gene shuffling
WO2006047669A2 (en) * 2004-10-27 2006-05-04 Monsanto Technology Llc Non-random method of gene shuffling
US10202608B2 (en) 2006-08-31 2019-02-12 Gen9, Inc. Iterative nucleic acid assembly using activation of vector-encoded traits
US10207240B2 (en) 2009-11-03 2019-02-19 Gen9, Inc. Methods and microfluidic devices for the manipulation of droplets in high fidelity polynucleotide assembly
US9968902B2 (en) 2009-11-25 2018-05-15 Gen9, Inc. Microfluidic devices and methods for gene synthesis
US9216414B2 (en) 2009-11-25 2015-12-22 Gen9, Inc. Microfluidic devices and methods for gene synthesis
US11071963B2 (en) 2010-01-07 2021-07-27 Gen9, Inc. Assembly of high fidelity polynucleotides
US9217144B2 (en) 2010-01-07 2015-12-22 Gen9, Inc. Assembly of high fidelity polynucleotides
US9925510B2 (en) 2010-01-07 2018-03-27 Gen9, Inc. Assembly of high fidelity polynucleotides
US11084014B2 (en) 2010-11-12 2021-08-10 Gen9, Inc. Methods and devices for nucleic acids synthesis
US11845054B2 (en) 2010-11-12 2023-12-19 Gen9, Inc. Methods and devices for nucleic acids synthesis
US10457935B2 (en) 2010-11-12 2019-10-29 Gen9, Inc. Protein arrays and methods of using and making the same
US10982208B2 (en) 2010-11-12 2021-04-20 Gen9, Inc. Protein arrays and methods of using and making the same
EP2944693A1 (en) * 2011-08-26 2015-11-18 Gen9, Inc. Compositions and methods for high fidelity assembly of nucleic acids
EP3594340A1 (en) * 2011-08-26 2020-01-15 Gen9, Inc. Compositions and methods for high fidelity assembly of nucleic acids
US11702662B2 (en) 2011-08-26 2023-07-18 Gen9, Inc. Compositions and methods for high fidelity assembly of nucleic acids
EP3954770A1 (en) * 2011-08-26 2022-02-16 Gen9, Inc. Compositions and methods for high fidelity assembly of nucleic acids
WO2013032850A3 (en) * 2011-08-26 2013-04-25 Gen9, Inc. Compositions and methods for high fidelity assembly of nucleic acids
US10308931B2 (en) 2012-03-21 2019-06-04 Gen9, Inc. Methods for screening proteins using DNA encoded chemical libraries as templates for enzyme catalysis
US10927369B2 (en) 2012-04-24 2021-02-23 Gen9, Inc. Methods for sorting nucleic acids and multiplexed preparative in vitro cloning
US10081807B2 (en) 2012-04-24 2018-09-25 Gen9, Inc. Methods for sorting nucleic acids and multiplexed preparative in vitro cloning
US11072789B2 (en) 2012-06-25 2021-07-27 Gen9, Inc. Methods for nucleic acid assembly and high throughput sequencing
CN111019938A (en) * 2019-12-06 2020-04-17 天津大学 Long-chain DNA with sticky end and preparation method thereof
CN111019938B (en) * 2019-12-06 2023-07-18 天津大学 Long-chain DNA with sticky end and preparation method thereof

Also Published As

Publication number Publication date
AU3691797A (en) 1998-02-25
JPH1066576A (en) 1998-03-10

Similar Documents

Publication Publication Date Title
WO1998005765A1 (en) Double-stranded dna with cohesive end(s), and method of shuffling dna using the same
US5508169A (en) Indexing linkers
EP1092039B1 (en) Methods for generating highly diverse libraries
US5514568A (en) Enzymatic inverse polymerase chain reaction
US5556747A (en) Method for site-directed mutagenesis
US5834252A (en) End-complementary polymerase reaction
US5426039A (en) Direct molecular cloning of primer extended DNA containing an alkane diol
US20010049125A1 (en) End-complementary polymerase reaction
JP2000512852A (en) Methods and kits for preparing multi-component nucleic acid constructs
US5891637A (en) Construction of full length cDNA libraries
AU2003267008B2 (en) Method for the selective combinatorial randomization of polynucleotides
Booth et al. Assembly and cloning of coding sequences for neurotrophic factors directly from genomic DNA using polymerase chain reaction and uracil DNA glycosylase
EP0812911B1 (en) A method of forming a macromolecular microgene polymer
US5952201A (en) Method of preparing oligonucleotide probes or primers, vector therefor and use thereof
JPH08140685A (en) Site-specific variation transduction
WO1998005764A1 (en) Nucleic acid pool and method for producing the same
EP0245130A1 (en) Vectors comprising appropriate means for an enzymatical synthesis of nucleotide sequences, particularly dna fragments
US6929914B2 (en) Method for accelerated genome walking and DNA fingerprinting
EP1548113B1 (en) A method for obtaining circular mutated and/or chimaeric polynucleotides
KR100482530B1 (en) Method for effecting site-directed mutagenesis
US20040248131A1 (en) Methods for dna mutagenesis and dna cloning
CN117802071A (en) Protein yCas12a and related biological material and application thereof
US20040132054A1 (en) Method for the selective combinatorial randomization of polynucleotides
JPH05276946A (en) Creation of dna

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE HU IL IS KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN ZW AM AZ BY KG KZ MD RU TJ TM

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH KE LS MW SD SZ UG ZW AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: CA