WO1999031964A1 - Nucleotide polymorphisms in soybean - Google Patents
Nucleotide polymorphisms in soybean Download PDFInfo
- Publication number
- WO1999031964A1 WO1999031964A1 PCT/US1998/026935 US9826935W WO9931964A1 WO 1999031964 A1 WO1999031964 A1 WO 1999031964A1 US 9826935 W US9826935 W US 9826935W WO 9931964 A1 WO9931964 A1 WO 9931964A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- phal
- locus
- loci
- probe
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/172—Haplotypes
Definitions
- NUCLEOTIDE POLYMORPHISMS IN SOYBEAN FIELD OF THE INVENTION The invention is in the field of agricultural technology, particularly marker assisted selection of soybean.
- a mutation in a base in one DNA strand typically requires a complementary base change in the opposite strand.
- the polymorphism, or change, between a wildtype and a mutant includes both strands of the DNA molecule. If a change allows the organism to be viable and fecund, any base can substitute for any other base at one or more nucleotide positions in a DNA sequence, or any length of DNA bases can be inserted or deleted. Mutations that are selectively neutral or provide an advantage in a particular environment may then proliferate within a population. Genetic markers represent (mark the location of) specific loci in the genome of a species or closely related species, and sampling of different genotypes at these marker loci reveals genetic variation.
- the genetic variation at marker loci can then be described and applied to genetic studies, commercial breeding, diagnostics, cladistic analysis of variance, or genotyping of samples.
- Genetic markers have the greatest utility when they are highly heritable, multi-allelic, and numerous. Most genetic markers are highly heritable because their alleles are determined by the nucleotide sequence of DNA, which is highly conserved from one generation to the next, and the detection of their alleles is unaffected by the natural environment. Markers have multiple alleles because, in the evolutionary process, rare, genetically-stable mutations in DNA sequences defining marker loci arose and were disseminated through the generations along with other existing alleles.
- Genomics 5:874-879 amplified fragment-length polymo ⁇ hism (AFLP) Vos et al. (1995) Nucleic Acids Res 23:4407-4414; microsatellite or simple-sequence repeat (SSR) Weber JL and May PE (1989) Am J Hum Genet 44:388-396; random-amplified polymo ⁇ hic DNA (RAPD)
- QTL include genes that control, to some degree, numerically representable phenotypic traits (disease resistance, crop yield, resistance to environmental extremes, etc.), that are distributed within a family of individuals as well as within a population of families of individuals.
- An experimental paradigm has been developed to identify and analyze QTL. This paradigm involves crossing two inbred lines and genotyping multiple marker loci and evaluating one to several quantitative phenotypic traits among the progeny of the cross. QTL are then identified and ultimately selected for based on significant statistical associations between the genotypic values determined by genetic marker technology and the phenotypic variability among the segregating progeny. Unfortunately, complete sets of genetic markers are not available for a variety of important crops, making it difficult to quickly assess the genotype of any particular individual.
- soybeans are a major cash crop which provide most of the world's protein and vegetable oils, complete sets of genetic markers which span the soybean genome are not available. Accordingly, there exists a need to develop genetic markers for genotyping, marker assisted selection, positional cloning of nucleic acids and the like, e.g., in soybean. This invention provides these and many other features.
- Example technologies which may be used to detect the loci include allele-specific hybridization (ASH), the polymerase chain reaction (PCR), random-amplified polymo ⁇ hic DNA (RAPD), restriction- fragment-length polymo ⁇ hism (RFLP), single strand conformation polymo ⁇ hism (SSCP), allele-specific polymerase chain reaction (ASPCR), genetic-bit analysis (GBA), nick-translation PCR (TaqMan ® ), hybridization to solid phase arrays (e.g., very large scale immobilized polymer arrays (VLSIPS arrays)), and the like.
- ASH allele-specific hybridization
- PCR polymerase chain reaction
- RAPD random-amplified polymo ⁇ hic DNA
- RFLP restriction- fragment-length polymo ⁇ hism
- SSCP single strand conformation polymo ⁇ hism
- ASPCR allele-specific polymerase chain reaction
- GAA genetic-bit analysis
- TaqMan ® nick-translation PCR
- methods of detecting one or more genetic nucleotide polymo ⁇ hism in a biological sample from a soybean plant are provided by hybridizing a probe nucleic acid to one of the loci described herein.
- a biological sample derived from a soybean plant is provided, and a probe nucleic acid is hybridized to a target nucleic acid including a nucleotide polymo ⁇ hism from the locus.
- Preferred loci include pA060A, pA077A, pA086A, pA169A, pA280A, pA378A, pA505A, pA519A, pA588A, pA947B, pB032A, pB032B, pB039A, pBLT24A, pBLT65A, php02265A, php02301A, php02361A, php02370C, php02387A, ⁇ hp02388A, php02393A, php02396A, php02636A, php03522A, php05219A, php05233A, php05278A, php05342A, php07659A, php08584A, phpl2105A, php02340B, php05264A, phpl0355B, pK069A, p
- Particularly preferred "php" loci include php02265A, php02301A, php02361A, php02370C, php02387A, php02388A, php02393A, php02396A, php02636A, php03522A, php05219A, php05233A, php05278A, php05342A, php07659A, php08584A, phpl2105A, php02340B, php05264A, phpl0355B php02329A, php02371A, php05290A, php02376A, and phpl0078A.
- At least one of the detected loci will typically be a locus with a "php" designation.
- One newly discovered advantage for all of the loci noted above is that probes which specifically hybridize to the selected locus do not specifically hybridize to additional loci in the soybean genome because the loci are all unique in the soybean genome.
- the loci and included polymo ⁇ hic nucleotides are in linkage disequilibrium with a Quantitative Trait Locus (QTL) such as resistance to soybean cyst nematode, brown stem rot, phytopthora rot or the like. Accordingly, the presence or absence of the detected locus corresponds to the presence or absence of a particular QTL.
- QTL Quantitative Trait Locus
- a variety of probe nucleic acids which hybridize to the loci are provided by the invention.
- the probes include the amplicons, PCR primers, and the like, described herein, which are used to identify and detect the loci.
- Hybridization of a probe to a locus is detected to confirm the presence of the locus and typically to determine whether a particular polymo ⁇ hic nucleotide is present at the locus. This detection is performed directly or indirectly.
- Direct detection, methods of detecting hybridization include Southern analysis, northern analysis, array-dependent nucleic acid hybridization on a nucleic acid polymer array, in situ hybridization, or other methods which directly monitor the hybridization.
- Indirect detection includes, e.g., detection of an amplification product which is dependent on hybridization of the probe to the target nucleic acid.
- the polymerase chain reaction (PCR) and/or the ligase chain reaction (LCR) are used to monitor hybridization, e.g., by detecting formation of an amplicon which is synthesized only if a probe (e.g., a PCR primer) hybridizes to the target.
- a probe e.g., a PCR primer
- the probe or target is optionally amplified prior to detection.
- Preferred amplification methods include PCR, LCR, and cloning of the target nucleic acid.
- a second probe nucleic acid is hybridized to a second target nucleic acid linked to a second nucleotide polymo ⁇ hism in a second locus selected from the second group of loci consisting of the loci noted above.
- a plurality of probes (a third, fourth, fifth... nth probe) are hybridized to a plurality (a third, fourth, fifth... nth) polymo ⁇ hic nucleotide in one of the loci noted above. In one embodiment, a majority of the noted loci are detected.
- all of the loci noted above are detected, thereby providing a comprehensive genotype.
- 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, or any intermediate percentage thereof of the loci are detected in alternate embodiments.
- the methods are applicable to detection of loci and genotyping in a variety of biological samples, including a soybean plant, a soybean plant extract, an isolated soybean plant tissue, an isolated plant tissue extract, a soybean plant cell culture, a soybean plant cell culture extract, a recombinant cell comprising a nucleic acid derived from a soybean plant, a soybean plant seed, and an extract of a recombinant cell comprising a nucleic acid derived from a soybean plant.
- the target nucleic acid which is detected can include the first polymo ⁇ hic nucleotide, or it may be proximal to the polymo ⁇ hic nucleotide.
- the target nucleic acid includes polymo ⁇ hic nucleotide to be detected.
- it can also be convenient to detect nucleotides proximal to the polymo ⁇ hic nucleotide. For example, when LCR is used, the presence or absence of a polymo ⁇ hic nucleotide is detected by amplifying nucleotide regions flanking the polymo ⁇ hic nucleotide.
- the present invention includes marker-assisted selection of soybean plants, e.g., by detecting any of the loci noted above and selecting a plant based upon the presence or absence of one or more desired polymo ⁇ hic nucleotide.
- nucleic acids corresponding to nucleotides proximal to or including the marker nucleic acids are cloned.
- a nucleic acid flanked by two nucleic acid loci is cloned.
- this cloned nucleic acid includes a coding sequence.
- the cloned nucleic acid is optionally transduced into cells or plants, e.g., to make transgenic plants (e.g., soybean) expressing the coding sequence.
- nucleotide polymo ⁇ hisms proximal to the selected loci are identified and mapped, e.g., by genetic mapping or nucleotide sequencing of nucleic acid regions genetically linked to the selected locus from genetically diverse strains of soybean.
- the identification of these additional polymo ⁇ hisms provides additional marker regions which are used to identify the source of a soybean nucleic acid.
- nucleotide polymo ⁇ hisms are also detected by separating nucleic acids having the polymo ⁇ hisms by size and or charge, thereby separating the nucleic acids.
- single-strand conformation polymo ⁇ hism can be performed on two or more nucleic acids on a polyacrylamide gel.
- Amplification methods and compositions for detecting nucleic acids linked to loci are also provided. Typical amplification methods of the present invention include PCR, asymmetric PCR, and LCR.
- amplification primer lengths are less than 100 nucleotides, although they may be longer or shorter, e.g., between about 10 and 50 nucleotides, typically between about 15 and 25 nucleotides, or as long as or longer than 100-200 nucleotides, or the like.
- the primer is a PCR primer
- the primer provides a polymerase extendible substrate and the primer-dependent polymerase extends the primer.
- the primer is an allele-specific primer.
- the first primer hybridizes adjacent to a second primer on the template nucleic acid and the first and second primers are ligated with a ligase enzyme, thereby amplifying the portion of the template hybridized to the first and second primers.
- the method includes hybridizing a second primer to the template, wherein the first and second primer hybridize to complementary strands of the template nucleic acid.
- Amplification mixtures for practicing the amplification methods are also provided.
- a PCR reaction mixture having, e.g., a polymerase enzyme, deoxy nucleotides, a template nucleic acid comprising a polymo ⁇ hic nucleotide which hybridizes under stringent conditions to a locus as above, and primers which specifically hybridize to the template nucleic acid are also provided.
- Primers include the PCR primers described herein, and additional primers selected to amplify portions of the amplicons described herein.
- the primers are optionally allele-specific primers to facilitate quantitative PCR.
- PCR amplicons are also provided, including nucleic acids having a polymo ⁇ hic nucleotide.
- the amplicon hybridizes under stringent conditions to a locus selected from a group of loci consisting of those set forth above. Exemplar amplicons are described herein. Particularly preferred amplicons include phpl l l38 and php 11627.
- a variety of additional compositions are also provided by the present invention.
- One class of compositions has a first recombinant nucleic acid which differentially hybridizes under allele-specific hybridization conditions to a first allele from a locus in the soybean genome selected from the above loci, where the first recombinant nucleic acid shows decreased hybridization affinity for a second allele from the selected locus.
- the composition optionally includes one or more additional recombinant nucleic acids (i.e., additional probes) which differentially hybridize under allele-specific hybridization conditions to a second allele from a selected locus, wherein the second nucleic acid shows decreased hybridization affinity for the first allele from the selected locus.
- additional recombinant nucleic acids i.e., additional probes
- multi-color hybridization nucleic acid probe hybridization techniques such as comparative genomic hybridization (CGH) or fluorescence in situ hybridization (FISH) can be used to detect different alleles on different chromosomes.
- a composition including a recombinant nucleic acid which specifically hybridizes to a first allele-specific probe and a second allele-specific probe.
- the recombinant nucleic acid can be a probe, target nucleic acid, chromosomal nucleic acid, recombinant nucleic acid or the like.
- the first and second allele-specific probes hybridize under allele-specific hybridization conditions to a first haplotype of a locus in the soybean genome noted above.
- the composition optionally comprises additional materials such as allele-specific probes for the detection of the nucleic acid, or the like.
- Sets of nucleic acid probes are also provided, including sets of nucleic acid probes having a plurality of probe nucleic acids which specifically hybridize to a plurality of target nucleic acids which hybridize under stringent conditions to a plurality of the loci noted above.
- the sets may be in any of a variety of physical arrangements, including arrays, containers, or the like.
- the set is in kit form, i.e., having the set of nucleic acids, and optionally comprising one or more additional component such as a container, instructional materials, one or more control target nucleic acids, and recombinant cells comprising one or more target nucleic acids.
- Transgenic plants are provided.
- a transgenic plant having a recombinant nucleic acid which hybridizes under stringent conditions to a target nucleic acid is provided.
- the target nucleic acid is genetically linked to (and preferably comprises) a nucleotide polymo ⁇ hism from a locus selected from the group of loci noted above.
- the recombinant nucleic acid comprises a coding sequence encoded by a gene in linkage disequilibrium with a Quantitative Trait Locus (QTL).
- QTL include a QTL for resistance to soybean cyst nematode, a QTL for resistance to brown stem rot, and a QTL for resistance to phytopthora rot.
- nucleotide polymo ⁇ hism is a change or difference between two related nucleic acids.
- nucleotide polymo ⁇ hism refers to a nucleotide which is different in one sequence when compared to a related sequence when the two nucleic acids are aligned for maximal correspondence.
- a “genetic nucleotide polymo ⁇ hism” refers to a nucleotide which is different in one sequence when compared to a related sequence when the two nucleic acids are aligned for maximal correspondence, where the two nucleic acids are genetically related, i.e., homologous, e.g., where the nucleic acids are isolated from different strains of a soybean plant, or from different alleles of a single strain, or the like.
- a “biological sample” is a portion of material isolated from a biological source such as a plant, isolated plant tissue, or plant cell, or a portion of material made from such a source such as a cell extract or the like.
- a “probe nucleic acid” is an RNA or DNA or analogue thereof.
- the probe may be of any length. Typical probes include PCR primers, PCR amplicons, cloned genomic nucleic acids encoding a genetic locus of interest, and the like.
- Marker assisted selection refers to the process of selecting a desired trait or desired traits in a plant or plants by detecting one or more nucleic acids from the plant, where the nucleic acid is associated with the desired trait.
- a "locus” is a nucleic acid region where a polymo ⁇ hic nucleic acid resides.
- a “genetic marker” is a region on a genomic nucleic acid mapped by a marker nucleic acid.
- a “marker nucleic acid” is a nucleic acid which is an indicator for the presence of a marker locus.
- the marker can be either a probe nucleic acid which identifies a target nucleic acid genetically linked to the locus, or a sequence hybridized by the probe, i.e., a genomic nucleic acid linked to the locus.
- a probe will be used to hybridize to or amplify the locus.
- Example markers include isolated nucleic acids from the locus, cloned nucleic acids comprising the locus, PCR primers for amplifying the locus, and the like.
- Two nucleic acid sequences are "genetically linked" when the sequences are in linkage disequilibrium.
- a “vector” is a carrier composition which assists in transducing, transforming or infecting a cell with a nucleic acid, thereby causing the cell to express vector associated nucleic acids and, optionally, proteins other than those native to the cell, or in a manner not native to the cell.
- the term vector includes nucleic acid (ordinarily RNA or DNA) to be expressed by the cell (a “vector nucleic acid”).
- a vector optionally includes materials to aid in achieving entry of the nucleic acid into the cell, such as a retroviral particle, liposome, protein coating or the like.
- a “promoter” is an array of nucleic acid control sequences which directs transcription of a nucleic acid.
- a promoter includes necessary nucleic acid sequences near the start site of transcription, such as, in the case of a polymerase II type promoter, a TATA element.
- a promoter also optionally includes distal enhancer or repressor elements which can be located as much as several thousand base pairs from the start site of transcription.
- a "constitutive” promoter is a promoter which is active in a selected organism under most environmental and developmental conditions.
- An “inducible” promoter is a promoter which is under environmental or developmental regulation in a selected organism.
- the terms “isolated” or “biologically pure” refer to material which is substantially or essentially free from components which normally accompany it as found in its native state.
- the loci include polymo ⁇ hic nucelotides which vary depending . on the particular strain of soybean considered. These loci are used in plant breeding projects, e.g., for marker-assisted selection, for positional cloning of linked nucleic acid regions of soybean chromosomal nucleic acids, and the like.
- polymo ⁇ hic nucleotides are detected directly, or by detecting nucleic acids in linkage disequilibrium with the loci.
- loci which are prefixed by "php” were previously completely unknown, with no information being publicly available regarding the loci. Some of the other loci (those not preface by “php”) were previously identified by binding to RFLP probes; however, no sequence information regarding the loci was available and it was, therefore, not possible to design probes which hybridize specifically to polymo ⁇ hic nucleotides.
- the locus SOYBPSP was previously sequenced in an intron of a gene, but no polymo ⁇ hism was previously identified at the locus.
- Clones comprising loci not prefaced by "php” i.e., comprising publicly available RFLP probes are publicly available from Biogenetic Services, Inc. (Brookings SD) and PE AgGen, Inc. (formerly Linkage Genetics) (Salt Lake City, UT). Uses for Nucleotide Polvmo ⁇ hisms
- nucleotide polymo ⁇ hisms described here are used, e.g., for DNA-finge ⁇ rinting soybean varieties, genetic-linkage mapping of the soybean genome, marker association with specific genes or quantitative-trait loci (QTL) affecting phenotypic traits, marker-assisted selection for preferred genotypes in soybean breeding, positional cloning of genes from the soybean genome and other pu ⁇ oses that will be apparent upon complete review of this disclosure.
- QTL quantitative-trait loci
- DNA fmge ⁇ rinting is the application of multiple genetic markers to DNA extracted from an individual or pooled group of related individuals, such as an inbred line or variety, so that the cumulative marker allele profile provides a description of the variety's overall genotype. Comparisons of these marker allele profiles can be made among samples of different varieties, or among different samples of the same variety, and estimates can be made regarding the genetic relationships among these samples. These estimates can be obtained without knowing the pedigrees of the finge ⁇ rinted varieties.
- DNA finge ⁇ rints are used to determine whether a soybean variety is described and used within guidelines established, e.g., by Plant Variety Protection and patent laws. Seed of one variety may be sold by two different vendors using different variety names, or a variety may have been genetically bred from another variety by repeated cycles of backcrossing and selection to the extent that the new variety was essentially derived from the original variety. In these situations, questions about ownership may need to be answered. DNA-finge ⁇ rint profiles can be obtained from each variety or seed source and compared. If done properly, the data will show with a high probability whether or not the different samples are genetically alike.
- the polymo ⁇ hisms at the 63 polymo ⁇ hic loci described herein invention provide a basis for quickly and reliably comparing DNA-finge ⁇ rint profiles in soybean. These 63 loci are well distributed around the soybean genome and provide an adequate number of loci to make reasonable conclusions regarding variety or seed-source identities and relationships.
- DNA finge ⁇ rinting is used by soybean breeders to identify diverse breeding parents, to create novel recombinant populations, to better understand the structure of the soybean genome, and to better understand the history of pedigree breeding.
- the 63 example loci herein provide sufficient polymo ⁇ hisms to estimate genetic relationships among soybean varieties.
- DNA finge ⁇ rinting is used by soybean seed companies to estimate genetic purity of a seedlot for quality control and labeling of their product, and to identify the variety source of any contamination found among finge ⁇ rinted samples. These 63 loci provide sufficient polymo ⁇ hisms to estimate genetic purity within a seedlot. Genetic Mapping of the Soybean Genome
- Genetic mapping is done by finding polymo ⁇ hic markers that are genetically linked to each other (in linkage groups) or linked to genes or QTL affecting phenotypic traits of interest within a segregating population.
- the alignment of markers into linkage groups is useful as a reference for future use of the markers and for accurately positioning genes or QTL relative to the markers.
- the nucleotide polymo ⁇ hisms described here for 63 exemplar loci provide a means to utilize these loci in genetic mapping studies in soybean. Many of these loci have multiple sub-loci and haplotypes across the sub-loci. Each haplotype provides a different allele composition within a locus, thereby expanding the utility of these marker loci to more soybean mapping studies than possible with only two alleles per locus.
- loci php05219A, php07659A, phpl0355B, and pK069A were found to cluster around a resistance QTL on group G; the loci pT155A, pBLT24A, and pBLT65A were clustered around a resistance QTL on group A; and the locus php02301A was near a resistance QTL on group M.
- These three QTL provide resistance to soybean cyst nematode (Heterodera glycines lchinohe) (See also, Webb et al.
- Theor Appl Genet 91:574-581 The loci php02636A on group C, php08584A on group S, and pK079A on group L26 were all linked to additional QTL for resistance to soybean cyst nematode.
- the locus pB032B on group J was near Rbs 3 for resistance to brown stem rot.
- the loci pK418A and pA280A on group N were near Rps,
- the locus pR045A on group F was linked to Rps 3
- the loci pA378A and pL183A on group G were near Rps 4
- the locus pT005A on group G was near Rps 5 , all providing resistance to phytophthora rot.
- MAS marker-assisted selection
- a nucleic acid corresponding to the marker nucleic acid is detected in a biological sample from a plant to be selected. This detection can take the form of hybridization of a probe nucleic acid to a marker, e.g., using allele-specific hybridization, Southern analysis, northern analysis, in situ hybridization, hybridization of primers followed by PCR amplification of a region of the marker or the like. A variety of procedures for detecting markers are described herein. After the presence (or absence) of a particular marker in the biological sample is verified, the plant is selected, i.e., used to make progeny plants by selective breeding.
- Nucleotide polymo ⁇ hisms were developed at markers near numerous resistance loci in soybean that are effective against soybean cyst nematode, Phytophthora sojae (phytophthora rot), and Phialophora gregata (brown stem rot). These are among the most damaging pathogens to soybeans in North America.
- Use of the nucleotide polymo ⁇ hisms described here and genetically-linked nucleotides as genetic markers for disease resistance loci is an effective method of selecting resistant varieties in breeding programs. When a population is segregating for multiple loci affecting multiple diseases, the efficiency of MAS compared to phenotypic screening becomes even greater because all the loci can be processed in the lab together from a single sample of DNA. Another advantage over field evaluations for disease reaction is that MAS can be done at any time of year regardless of the growing season. Moreover, environmental effects are irrelevant to marker-assisted selection.
- Backcross breeding is the process of crossing a progeny back to one of its parents. Backcrossing is usually done for the pu ⁇ ose of introgressing one or a. few loci from a donor parent into an otherwise desirable genetic background from the recurrent parent. The more cycles of backcrossing that is done, the greater the genetic contribution of the recurrent parent to the resulting variety. This is often necessary, because resistant plants may be otherwise undesirable, i.e., due to low yield, low fecundity, or the like.
- strains which are the result of intensive breeding programs may have excellent yield, fecundity or the like, merely being deficient in one desired trait such as resistance to a particular pathogen.
- the 63 marker loci described in the Examples below are distributed around the soybean genome and are used to select for the recurrent-parent genotype.
- MAS for the recurrent-parent genotype can be combined with MAS for the disease resistance loci using these markers. Accordingly, it is possible to use the markers to introduce disease resistance QTL into plant varieties having an otherwise desirable genetic background using the markers of the invention for selection of the QTL and for selection of the otherwise desirable background.
- Positional Cloning in Soybean Positional gene cloning uses the proximity of a mapped gene and its linked markers to physically define a cloned chromosomal fragment that contains a desired gene. If two or more markers flanking the gene are physically close to each other, they may hybridize to the same DNA fragment, thereby identifying a clone on which the gene is located. If flanking markers are more distant from each other, a fragment containing the gene may be identified by constructing a contig of overlapping clones.
- a marker is ideally locus-specific to reliably identify a clone from the targeted chromosomal region.
- the soybean genome is highly duplicated (Shoemaker et al (1996) (Glycine subgenus soja) Genetics 144:329-338, but each nucleotide polymo ⁇ hism and its PCR primers described here is specific to a single locus in the soybean genome and therefore correctly identifies soybean clones that hybridize to a corresponding probe DNA sequence corresponding to a particular target genomic location.
- Some of these marker loci are closely linked to agronomically important genes, such as genes for resistance to soybean cyst nematode and fungal pathogens, and are used as locus-specific reference points in positional cloning efforts for these genes.
- DNA sequences which code for necessary proteins are well conserved across a species, there are regions of DNA which are non-coding or code for portions of proteins which do not have critical functions and therefore, absolute conservation of nucleic acid sequence is not strongly selected for.
- variable regions are identified by genetic markers.
- genetic markers are bound by probes such as oligonucleotides or amplicons which bind to variable regions of the genome.
- probes such as oligonucleotides or amplicons which bind to variable regions of the genome.
- the presence or absence of binding to a genetic marker identifies individuals by their unique nucleic acid sequence.
- a marker binds to nucleic acid sequences of all individuals but the individual is identified by the position in the genome bound by a marker probe.
- the major causes of genetic variability are addition, deletion, or point mutations, recombination and transposable elements within the genome of individuals in a plant population.
- Point mutations are typically the result of inaccuracy in DNA replication.
- DNA polymerase "switches" bases, either transitionally (i.e. , a purine for a purine and a pyrimidine for a pyrimidine) or transversionally (i.e., purine to pyrimidine and vice versa).
- the base switch is maintained if the exonuclease function of DNA polymerase does not correct the mismatch.
- the DNA strand with the point mutation becomes the template for a complementary strand and the base switch is inco ⁇ orated into the genome.
- Transposable elements are sequences of DNA which have the ability to move or to jump to new locations within a genome and several examples of transposons are known in the art.
- probe nucleic acids for detecting markers, including probes which are PCR primers, allele-specific probes, PCR amplicons and the like for the detection of polymo ⁇ hic nucleotides at the loci disclosed herein, as well as genetically-linked sequences.
- nucleic acid compositions of this invention whether DNA, RNA, cDNA, genomic DNA, or analogues thereof, or a hybrid of these molecules, are isolated from biological sources or synthesized in vitro.
- the nucleic acids of the invention are present in transfected whole cells, in transfected cell lysates, in transgenic plants (especially soybean) or in partially purified or substantially pure form.
- RNA polymerase mediated techniques e.g., NASBA
- PCR polymerase chain reaction
- LCR ligase chain reaction
- NASBA RNA polymerase mediated techniques
- RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a polymerase. See, Ausbel, Sambrook and Berger, all supra.
- Oligonucleotides for use as probes, e.g., in in vitro amplification methods, for use as gene probes, or as inhibitor components (e.g., ribozymes) are typically synthesized chemically according to the solid phase phosphoramidite triester method described by Beaucage and Caruthers (1981), Tetrahedron Letts.
- Oligonucleotides can also be custom made and ordered from a variety of commercial sources known to persons of skill. Purification of oligonucleotides, where necessary, is typically performed by either native acrylamide gel electrophoresis or by anion-exchange HPLC as described in Pearson and Regnier (1983) J. Chrom. 255: 137-149. The sequence of the synthetic oligonucleotides can be verified using the chemical degradation method of Maxam and Gilbert (1980) in Grossman and Moldave (eds.) Academic Press, New York, Methods in Enzymology 65:499-560.
- nucleic acids which encompass multiple loci, or to detect, clone, or isolate nucleic acids linked to polymo ⁇ hic nucleotides.
- positional cloning is used to isolate nucleic acids proximal to polymo ⁇ hic nucleotides, e.g., at more than one locus.
- nucleic acids are in linkage disequilibrium with the polymo ⁇ hic nucleotides, i.e., they are genetically linked to the polymo ⁇ hic nucleotides on a chromosomal nucleic acid.
- a nucleic acid genetically linked to a polymo ⁇ hic nucleotide optionally resides up to about 50 centimorgans from the polymo ⁇ hic nucleic acid, although the precise physical distance will vary depending on the cross-over frequency of the particular chromosomal region. Typical distances from a polymo ⁇ hic nucleotide are in the range of 1-50 centimorgans, for example, less than 1-5, about 1-5, 1, 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 centimorgans, etc.
- RNA and DNA nucleic acids including recombinant plasmids, recombinant lambda phage, cosmids, yeast artificial chromosomes (YACs), PI artificial chromosomes, Bacterial Artificial Chromosomes (BACs), and the like are known.
- YACs yeast artificial chromosomes
- BACs Bacterial Artificial Chromosomes
- a general introduction to YACs, BACs, PACs and MACs as artificial chromosomes is described in Monaco and Larin (1994) Trends Biotechnol 12(7): 280-286. Examples of appropriate cloning techniques for making large nucleic acids, and instructions sufficient to direct persons of skill through many cloning exercises are found in Berger and Kimmel, Sambrook, and Ausubel, all supra.
- nucleic acids hybridizing to the polymo ⁇ hic nucleic acids disclosed herein are cloned into large nucleic acids such as YACs, or are detected in YAC genomic libraries cloned from soybean.
- YACs and YAC libraries The construction of YACs and YAC libraries is known. See, Berger, supra, and Burke et al. (1987) Science 236:806-812.
- Gridded libraries of YACs are described in Anand et al (1989) Nucleic Acids Res. 17, 3425-3433, and Anand et al. (1990) Nucleic Acids Res. Riley (1990) 18: 1951-1956 Nucleic Acids Res.
- cosmids or other molecular vectors such as BAC and PI constructs are also useful for isolating or cloning nucleic acids linked to polymo ⁇ hic nucleic acids.
- Cosmid cloning is also known. See, e.g., Ausubel, chapter 1.10.11 (supplement 13) and the references therein. See also, Ish-Horowitz and Burke (1981) Nucleic Acids Res. 9:2989-2998; Murray (1983) Phage Lambda and Molecular Cloning in Lambda II (Hendrix et al , eds) 395-432 Cold Spring Harbor Laboratory, NY; Frischholz et al. (1983) J.Mol Biol.
- any of the cloning or amplification strategies described above are useful for creating contigs of overlapping clones, thereby providing overlapping nucleic acids which show the physical relationship at the molecular level for genetically linked nucleic acids.
- a common example of this strategy is found in whole organism sequencing projects, in which overlapping clones are sequenced to provide the entire sequence of a chromosome.
- a library of the organism's cDNA or genomic DNA is made according to standard procedures described, e.g., in the references above. Individual clones are isolated and sequenced, and overlapping sequence information is ordered to provide the sequence of the organism. See also, Tomb et al.
- a labeled probe nucleic acid is specifically hybridized to a marker nucleic acid from a biological sample and the label is detected, thereby determining that the marker nucleic acid is present in the sample.
- a marker comprising a polymo ⁇ hic nucleic acid can be detected by allele-specific hybridization of a probe to the region of the marker comprising the polymo ⁇ hic nucleic acid.
- a marker can be detected by Southern analysis, northern analysis, in situ analysis, or the like.
- Two single-stranded nucleic acids "hybridize" when they form a double- stranded duplex.
- the region of double-strandedness can include the full-length of one or both of the single-stranded nucleic acids, or all of one single stranded nucleic acid and a subsequence of the other single stranded nucleic acid, or the region of double- strandedness can include a subsequence of each nucleic acid.
- “Stringent hybridization conditions” in the context of nucleic acid hybridization are sequence dependent and are different under different environmental parameters. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993), id. Generally, stringent conditions are selected to be about 5° C lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength and pH. The T m is the temperature
- T d is used to define the temperature at which at least half of the probe dissociates from a perfectly matched target nucleic acid.
- G-C base pairs in a duplex are estimated to contribute about 3°C to the T m
- A-T base pairs are estimated to contribute about 2°C, up to a theoretical maximum of about 80-100°C.
- An example of stringent hybridization conditions for hybridization of complementary nucleic acids that have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formalin with 1 mg of heparin at 42°C, with the hybridization being carried out overnight.
- An example of stringent wash conditions for a Southern blot of such nucleic acids is a 0.2x SSC wash at 65 °C for 15 minutes (see, Sambrook, supra for a description of SSC buffer). Often the high stringency wash is preceded by a low stringency wash to remove background probe signal.
- An example low stringency wash is 2x SSC at 40°C for 15 minutes.
- a signal to noise ratio of 2x (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization.
- an allele-specific probe is usually hybridized to a marker nucleic acid (e.g., a genomic nucleic acid, an amplicon, or the like) comprising a polymo ⁇ hic nucleotide under highly stringent conditions.
- a marker nucleic acid e.g., a genomic nucleic acid, an amplicon, or the like
- hybridization technology for detecting marker nucleic acids is allele-specific hybridization, or "ASH. " This technology is based on the stable annealing of a short, single-stranded oligonucleotide probe to a single-stranded target nucleic acid only when base pairing is completely complementary. The hybridization can then be detected from a radioactive or non-radioactive label on the probe (methods of labeling probes and other nucleic acids are set forth in detail below).
- ASH markers are polymo ⁇ hic when their base composition at one or a few nucleotide positions in a segment of DNA is different among different genotypes.
- two or more different ASH probes are designed to have identical DNA sequences except at the polymo ⁇ hic nucleotide(s).
- Each probe will have exact homology with one allele sequence so that the complement of probes can distinguish all the alternative allele sequences.
- Each probe is hybridized against the target DNA. With appropriate probe design and stringency conditions, a single-base mismatch between the probe and target DNA will prevent hybridization and the unbound probe will wash away.
- ASH markers are used as dominant markers where the presence or absence of only one allele is determined from hybridization or lack of hybridization by only one probe. The alternative allele may be inferred from the lack of hybridization.
- Heterogeneous target nucleic acids i.e., chromosomal DNA from a multiallelic plant
- ASH probes designed from the possible degenerate DNA sequences coding for a known amino acid sequence, could be used to identify clones containing the rabbit ⁇ -globin DNA that coded for that protein (Wallace et al. (1981) Nuclei Acids Res 9:879-894). They also showed that the only probe that hybridized to the clones had exact homology to the clone, whereas three probes that did not hybridize to the clones had a single base-pair mismatch with the target DNA. ASH markers have been developed to diagnose susceptibility to human diseases caused by point mutations in DNA sequence.
- Examples are for the /3 s -globin allele that can cause sickle-cell anemia (Conner et al. (1983) Proc Natl Acad Sci USA 80:278-282), the /3°-thalassemia allele that can cause /3-thalassemia (Pirastu et al. (1983) New England J Med 309:284-287), the /3,-antitrypsin allele that can cause liver cirrhosis and pulmonary emphysema (Kidd (1983) Nature 304:230-234), the HLA-DR haplotypes associated with immune response (Angelini et al.
- ASH markers have also been developed to identify strains of fungi resistant to the fungicide benzimidazole because of specific point mutations in the j8-tubulin gene in Venturia inaequalis (Koenraadt and Jones (1992) Phytopathology 82: 1354-1358 and Rhynchosporium secalis (Wheeler et al. (1995) Pestic Sci 43:201-209).
- An ASH probe is designed to form a stable duplex with a nucleic acid target only when base pairing is completely complementary. One or more base-pair mismatches between the probe and target prevents stable hybridization. This holds true for numerous variations of the process.
- the probe and target molecules are optionally either RNA or denatured DNA; the target molecule(s) is/are any length of nucleotides beyond the sequence that is complementary to the probe; the probe is designed to hybridize with either strand of a DNA target; the probe ranges in size to conform to variously stringent hybridization conditions, etc.
- PCR polymerase chain reaction
- ASH data were obtained by amplifying nucleic acid fragments (amplicons) from genomic DNA using PCR, transferring the amplicon target DNA to a membrane in a dot-blot format, hybridizing a labeled oligonucleotide probe to the amplicon target, and observing the hybridization dots by autoradiography.
- amplicons nucleic acid fragments
- ASH technologies are adapted to solid phase arrays for the rapid and specific detection of multiple polymo ⁇ hic nucleotides.
- a target nucleic acid e.g., a genomic nucleic acid, or an amplicon
- Either the probe, or the target, or both, can be labeled, typically with a fluorophore. Where the target is labeled, hybridization is detected by detecting bound fluorescence. Where the probe is labeled, hybridization is typically detected by quenching of the label. Where both the probe and the target are labeled, detection of hybridization is typically performed by monitoring a color shift resulting from proximity of the two bound labels.
- a variety of labeling strategies, labels, and the like, particularly for fluorescent based applications are described, supra.
- an array of ash probes are synthesized on a solid support.
- chip masking technologies and photoprotective chemistry it is possible to generate ordered arrays of nucleic acid probes.
- These arrays which are known, e.g., as "DNA chips,” or as very large scale immobilized polymer arrays (“VLSIPSTM” arrays) can include millions of defined probe regions on a substrate having an area of about 1cm 2 to several cm 2 .
- VLSIPSTM procedures provide a method of producing 4 n different oligonucleotide probes on an array using only 4n synthetic steps.
- oligonucleotide arrays on a glass surface is performed with automated phosphoramidite chemistry and chip masking techniques similar to photoresist technologies in the computer chip industry.
- a glass surface is derivatized with a silane reagent containing a functional group, e.g., a hydroxy 1 or amine group blocked by a photolabile protecting group.
- Photolysis through a photolithogaphic mask is used selectively to expose functional groups which are then ready to react with incoming 5'-photoprotected nucleoside phosphoramidites.
- the phosphoramidites react only with those sites which are illuminated (and thus exposed by removal of the photolabile blocking group).
- the phosphoramidites only add to those areas selectively exposed from the preceding step. These steps are repeated until the desired array of sequences have been synthesized on the solid surface.
- Combinatorial synthesis of different oligonucleotide analogues at different locations on the array is determined by the pattern of illumination during synthesis and the order of addition of coupling reagents.
- Monitoring of hybridization of target nucleic acids to the array is typically performed with fluorescence microscopes or laser scanning microscopes.
- one of skill is also able to order custom-made arrays and array- reading devices from manufacturers specializing in array manufacture. For example, Affymetrix Co ⁇ . in Santa Clara CA manufactures DNA VLSIPTM arrays.
- probe design is influenced by the intended application. For example, where several allele-specific probe-target interactions are to be detected in a single assay, e.g., on a single DNA chip, it is desirable to have similar melting temperatures for all of the probes. Accordingly, the length of the probes are adjusted so that the melting temperatures for all of the probes on the array are closely similar (it will be appreciated that different lengths for different probes may be needed to achieve a particular T m where different probes have different GC contents). Although melting temperature is a primary consideration in probe design, other factors are optionally used to further adjust probe construction.
- a marker is used as a chromosome probe to cytogenetically detect the presence of a polymo ⁇ hic nucleic acid or region linked to the nucleic acid.
- cytogenetic identification of a chromosomal region provides a way of determining the physical location of the region hybridized by the probe, i.e., in reference to other known markers.
- a probe which hybridizes to a polymo ⁇ hic nucleotide or a linked nucleic acid is chemically linked to a colorometric label, or fluorophore.
- the probe is used to paint the chromosome with the color label, thereby identifying regions which are hybridized by the label.
- Chromosome painting refers to the staining of specific metaphase or prophase chromosomes or regions of chromosomes with probe mixtures, e.g., probes hybridizing to the polymo ⁇ hic nucleic acids of the invention, and optionally, additional probes hybridizing to additional regions.
- the painting signal is preferably obtained by fluorescence in situ hybridization (FISH) of such mixtures with the target genome.
- Comparative genomic hybridization is also a known approach for identifying the presence and localization of sequences in a genome compared to a reference genome. See, Kallioniemi, et al. (1992) Science 258:818. CGH can provide a quantitative estimate of copy number and also provides information regarding the localization of amplified or deleted sequences in a normal chromosome. Many in situ detection techniques are known and can be adapted to the present invention.
- Fluorescent in situ hybridization FISH
- reverse chromosome painting FISH on DAPI stained chromosomes
- generation of Alphoid DNA probes for FISH using PCR PRINS labeling of DNA
- free chromatin mapping e.g., in Tijssen (1993) Laboratory Techniques in biochemistry and molecular biology—hybridization with nucleic acid probes parts I and II. Elsevier, New York, and, Choo (ed) (1994) Methods In Molecular Biology Volume 33- In Situ Hvbridization Protocols Humana Press Inc. , New Jersey (see also, other books in the Methods in Molecular Biology series).
- color-labeling strategies are useful for distinguishing the presence or absence of a chromosomal nucleic acid. They are also useful for the detection of multiple probes with multiple labels.
- chromosomes are optionally stained with multiple probes, optionally having multiple color labels. In this way, it is possible to quickly provide a genetic map of a sample at the molecular level. Furthermore, it is possible to determine whether two polymo ⁇ hic nucleotides from the same locus are present. For example, if two allele-specific probes with different color labels are hybridized to a chromosomal sample under allele-specific hybridization conditions,- it is possible specifically to detect both polymo ⁇ hic nucleotides.
- a sample which is homozygous for the polymo ⁇ hic nucleotide specifically bound by the first probe will look “blue” to an observer
- a sample which is homozygous for the polymo ⁇ hic nucleotide specifically bound by the second probe will look “yellow” to an observer
- a sample which is heterozygous and binds both probes will appear “green” to an observer. It will be appreciated that many color combinations are possible.
- the effect to the observer is that a "green” signal is observed.
- emission characteristics can be monitored; indeed, even when the fluorophores emit a non-visible wavelength of light, a combination color can be assigned to a ratio between any two (or more, e.g., where more than two probes are used in an assay) wavelengths of light.
- a polymo ⁇ hic nucleotide is detected by amplifying the polymo ⁇ hic nucleotide and detecting the resulting amplicon.
- This strategy are used to detect polymo ⁇ hic nucleic acids, depending on the materials available, and the like.
- nucleic acids primers which hybridize to regions of a genomic nucleic acid that flank a polymo ⁇ hic nucleotide to be detected are used in PCR or LCR reactions to generate an amplicon comprising the polymo ⁇ hic nucleotide.
- PCR and LCR strategies are known in the art and are found in Berger, Sambrook, Ausubel, and Innis, all supra. See also, as Mullis et al. , (1987) U.S. Patent No. 4,683,202.
- a nucleic acid having a polymo ⁇ hic nucleic acid to be detected (a genomic DNA, a genomic clone, a genomic amplicon or the like) is hybridized to primers which flank the polymo ⁇ hic nucleotide to be detected (e.g., nucleotide polymo ⁇ hisms at a locus such as pA060A, pA077A, pA086A, pA169A, pA280A, pA378A, pA505A, pA519A, pA588A, pA947B, pB032A, pB032B, pB039A, pBLT24A, pBLT65A, php02265A, php02301A, php02361A, php02370C, php02387A, php02388A, php02393A, php02396A, php
- Example primers which amplify the polymo ⁇ hic nucleic acids are provided in the examples section below.
- the primers are extended in a PCR reaction (typically including a thermostable polymerase enzyme such as Taq, deoxynucleotides, Mg + + and the like; See, Ausubel, Innis, Berger or Sambrook for typical PCR conditions).
- the resulting PCR amplicons comprise the polymo ⁇ hic nucleic acid to be detected.
- Exemplar amplicons include phal2105, phal2390, phal2391, phal2392, phal2393, phal2394, phal2394, phal2395, phal2396, phal0634, phal0623, phal0624, phal0649, phal l l35, phal0792, phal0635, phal0638, phal0648, phal0621, phal l071, phal l073, phal0640, phal l076, phal0653, phal0598, phal0615, phal0646, phal0618, phal0620, phal0782, phalll31, phalll32, phal0650, phal0651, phal ll38, phal0637, phal l078, phal l079, phal l l39, phal0655, phal l701, phal l627, phal0633, phal l074, phal l075, phal0632, phal
- Detection is typically performed by running PCR reaction products out on an acrylamide or agarose gel and detecting the size of the reaction products; alternatively, the products can be detected by allele-specific hybridization, by allele-specific hybridization to a polymer array as described supra, or by sequencing the PCR amplicons (using standard Sanger dideoxy or Maxam-Gilbert methods).
- the polymo ⁇ hic nucleotides in amplicons are optionally detected by cleaving the amplicon with a restriction enzyme that recognizes the polymo ⁇ hic nucleic acid, in an adaptation of standard RFLP analysis.
- primer nucleic acids in the examples section below, one of skill is easily able to select a variety of other primers which can be used in PCR amplification of nucleic acids comprising or proximal to polymo ⁇ hic nucleotides.
- methods of amplifying large nucleic acids by PCR are summarized in Cheng et al. (1994) Nature 369: 684-685 and the references therein, in which PCR amplicons of up to 40kb are generated. More typically, standard PCR is used to create amplicons of between about 100 and about 5,000 nucleotides in length, e.g., using the techniques described in Ausubel and Innis, supra.
- primers that hybridize to essentially any region of an amplicon made using the primers of the invention are designed by reference to the sequence of the amplicon. The sequence of the primers are selected to hybridize to regions of the amplicon.
- Amplicons are sequenced by any of a variety of protocols. Most DNA sequencing today is carried out by chain termination methods of DNA sequencing. The most popular chain termination methods of DNA sequencing are variants of the dideoxynucleotide mediated chain termination method of Sanger. See, Sanger et al. (1977) Proc. Nat. Acad. Sci, USA 74:5463-5467. For a simple introduction to dideoxy sequencing, see, Current Protocols in Molecular Biology, F.M. Ausubel et al , eds. , Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. , (Supplement 37, current through 1997) (Ausubel), Chapter 7. Thousands of laboratories employ dideoxynucleotide chain termination techniques. Commercial kits containing the reagents most typically used for these methods of DNA sequencing are available and widely used.
- the boronated nucleotide is stocastically inco ⁇ orated into PCR products at varying positions along the PCR amplicon.
- An exonuclease which is blocked by inco ⁇ orated boronated nucleotides is used to cleave the PCR amplicons.
- the cleaved amplicons are then separated by size using polyacrylamide gel electrophoresis, providing the sequence of the amplicon.
- the sequence is optionally used to select primers complementary to the amplicon, i.e., primers which will hybridize to the amplicon. It is expected that one of skill is thoroughly familiar with the theory and practice of nucleic acid hybridization and primer selection. Gait, ed. Oligonucleotide Synthesis: A Practical Approach, IRL Press, Oxford (1984); W.H.A. Kuijpers Nucleic Acids Research 18(17), 5197 (1994); K.L. Dueholm J. Org. Chem. 59, 5767-5773 (1994); S.
- nucleotides at the 5' end of a primer can inco ⁇ orate structural features unrelated to the target nucleic acid; for instance, in one embodiment, a sequencing primer hybridization site (or a complement to such a primer, depending on the application) is inco ⁇ orated into the amplification primer, where the sequencing primer is derived from a primer used in a standard sequencing kit, such as one using a biotinylated or dye-labeled universal M13 or SP6 primer.
- the primers are typically selected so that there is no complementarity between any known target sequence and any constant primer region.
- constant regions in primer sequences are optional.
- all primer sequences are selected to hybridize only to a perfectly complementary DNA, with the nearest mismatch hybridization possibility from known DNA sequence typical having at least about 50 to 70% hybridization mismatches, and preferably 100% mismatches for the terminal 5 nucleotides at the 3' end of the primer.
- the primers are selected so that no secondary structure forms within the primer.
- Self-complementary primers have poor hybridization properties, because the complementary portions of the primers self hybridize (i.e., form hai ⁇ in structures).
- Primers are selected to have minimal cross-hybridization, thereby preventing competition between individual primers and a template nucleic acid and preventing duplex formation of the primers in solution, and possible concatenation of the primers during PCR. If there is more than one constant region in the primer, the constant regions of the primer are selected so that they do not self-hybridize or form hai ⁇ in structures.
- selection steps are performed using simple computer programs to perform the selection as outlined above; however, all of the steps are optionally performed manually.
- One available computer program for primer selection is the MacVectorTM program from Kodak. In addition to programs for primer selection, one of skill can easily design simple programs for any or all of the preferred selection steps.
- amplicons are generated with the primers described herein.
- the amplicons can be generated by exponential amplification as described in the examples herein, or by linear amplification using a single specific primer, or by using one of the example primers below in conjunction with a set of random primers.
- the amplicons are characterized by a variety of physicochemical properties, including, but not limited to the following.
- the amplicons of the invention are produced in an amplification reaction using the primers as described above, with genomic soybean nucleic acid as a template (or a derivative thereof, such as a cloned or in vitro amplified genomic nucleic acid).
- genomic soybean nucleic acid as a template (or a derivative thereof, such as a cloned or in vitro amplified genomic nucleic acid).
- single stranded forms of the amplicons hybridize under stringent conditions to the template nucleic acid. Conditions for specific hybridization of nucleic acids, including amplicon nucleic acids are described above.
- a third physicochemical property of amplicons of the invention is that they specifically hybridize to one or more of the primers in the examples section below.
- the primers used to make the amplicon will hybridize to the amplicon; indeed, in PCR amplification strategies, hybridization of the primers to the amplicon is usually required for amplification. Additional physicochemical properties of the amplicons are described in the examples section, where example amplicons are described with reference, e.g., to size and hybridization to particular primers.
- LCR is used to amplify specifically a polymo ⁇ hic nucleic acid.
- amplification product By detecting the amplification product, presence of the polymo ⁇ hic nucleotide is confirmed.
- Detection is typically performed by running LCR reaction products out on an acrylamide or agarose gel and detecting the size of the reaction products; alternatively, the products can be detected by allele-specific hybridization, by allele-specific hybridization to a polymer array as described supra, or by sequencing the LCR amplicons (using standard Sanger dideoxy or Maxam-Gilbert methods). Detection techniques such as PCR amplification or other in vitro amplification methods are also used to detect LCR products.
- LCR The ligation chain reaction
- LAR ligation amplification reaction
- LCR provides a mechanism for linear or exponential amplification of a target nucleic acid via ligation of complementary oligonucleotides hybridized to a target. This amplification is performed to distinguish target nucleic acids that differ by a single nucleotide, providing a powerful tool for the analysis of genetic variation in the present invention, i.e., for distinguishing polymo ⁇ hic nucleotides.
- LCR The principle underlying LCR is straightforward: Oligonucleotides which are complementary to adjacent segments of a target nucleic acid are brought into proximity by hybridization to the target, and ligated using a ligase. To achieve linear amplification of the nucleic acid, a single pair of oligonucleotides which hybridize to adjoining areas of the target sequence are employed: the oligonucleotides are ligated, denatured from the template and the reaction is repeated. To achieve exponential amplification of the target nucleic acid two pairs of oligonucleotides (or more) are used, each pair hybridizing to complementary sequences on e.g. , a double-stranded target polynucleotide.
- the target and each of the ligated oligonucleotide pairs serves as a template for hybridization of the complementary oligonucleotides to achieve ligation.
- the ligase enzyme used in performing LCR is typically thermostable, allowing for repeated denaturation of the template and ligated oligonucleotide complex by heating the ligation reaction. LCR is useful as a diagnostic tool in the detection of genetic variation.
- LCR methods it is possible to distinguish between target polynucleotides which differ by a single nucleotide at the site of ligation. Ligation occurs only between oligonucleotides hybridized to a target polynucleotide where the complementarity between the oligonucleotides and the target is perfect, enabling differentiation between allelic variants of a gene or other chromosomal sequence. The specificity of ligation during
- LCR can be increased by substituting the more specific NAD + -dependant ligases such as E. coli ligase and (thermostable) Taq ligase for the less specific T4 DNA ligase.
- NAD analogues in the ligation reaction further increases specificity of the ligation reaction. See, U.S. Pat. No. 5,508, 179 to Wallace et al.
- Nucleotide polymo ⁇ hisms are also detected using other in vitro detection methods, including TAS, 3SR and Q/3 amplification.
- TAS the self-sustained sequence replication system (3SR) and the Q ⁇ replicase amplification system (QB) are reviewed in The Journal Of NIH Research (1991) 3, 81-94.
- the present invention may be practiced in conjunction with TAS (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86, 1173 or the related 3SR (Guatelli et al. (1990) Proc. Natl. Acad. Sci.
- a probe for use in an in situ detection procedure, an in vitro amplification procedure (PCR, LCR, NASBA, etc.), hybridization techniques (allele-specific hybridization, in situ analysis, Southern analysis, northern analysis, etc.) or any other detection procedure herein can be labeled with any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means.
- Useful labels in the present invention include spectral labels such as fluorescent dyes (e.g., fluorescent isothiocyanate, Texas red, rhodamine, digoxigenin, biotin, and the like), radiolabels (e.g. , 3 H, 125 I, 35 S, 14 C, 32 P, 33 P, etc.), enzymes (e.g.
- spectral colorimetric labels such as colloidal gold or colored glass or plastic (e.g. polystyrene, polypropylene, latex, etc.) beads.
- the label may be coupled directly or indirectly to a component of the detection assay (e.g. , a probe, primer, amplicon, YAC, BAC or the like) according to methods well known in the art.
- a component of the detection assay e.g. , a probe, primer, amplicon, YAC, BAC or the like
- a wide variety of labels may be used, with the choice of label depending on sensitivity required, ease of conjugation with the compound, stability requirements, available instrumentation, and disposal provisions.
- a detector which monitors a probe- target nucleic acid hybridization is adapted to the particular label which is used.
- Typical detectors include spectrophotometers, phototubes and photodiodes, microscopes, scintillation counters, cameras, film and the like, as well as combinations thereof. Examples of suitable detectors are widely available from a variety of commercial sources known to persons of skill.
- an optical image of a substrate comprising a nucleic acid array with particular set of probes bound to the array is digitized for subsequent computer analysis.
- inco ⁇ oration of radiolabeled nucleotides into nucleic acids is straightforward, this detection represents a preferred labeling strategy.
- Exemplar technologies for inco ⁇ orating radiolabels include end-labeling with a kinase or phoshpatase enzyme, nick translation, inco ⁇ oration of radio-active nucleotides with a polymerase and many other well known strategies.
- Fluorescent labels are also preferred labels, having the advantage of requiring fewer precautions in handling.
- Preferred labels are typically characterized by one or more of the following: high sensitivity, high stability, low background, low environmental sensitivity and high specificity in labeling.
- Fluorescent moieties, which are inco ⁇ orated into the labels of the invention are generally are known, including Texas red, digoxigenin, biotin, 1- and 2-aminonaphthalene, p,p'- dia inostilbenes, pyrenes, quaternary phenanthridine salts, 9-aminoacridines, p,p'-diaminobenzophenone imines, anthracenes, oxacarbocyanine, merocyanine, 3-aminoequilenin, perylene, bis-benzoxazole, bis-p-oxazolyl benzene, 1,2-benzophenazin, retinol, bis-3-aminopyridinium salt
- Individual fluorescent compounds which have functionalities for linking to an element desirably detected in an apparatus or assay of the invention, or which can be modified to inco ⁇ orate such functionalities include, e.g. , dansyl chloride; fluoresceins such as 3,6-dihydroxy-9-phenylxanthydrol; rhodamineisothiocyanate; N-phenyl l-amino-8- sulfonatonaphthalene; N-phenyl 2-amino-6-sulfonatonaphthalene; 4-acetamido-4-isothiocyanato-stilbene-2,2'-disulfonic acid; pyrene-3-sulfonic acid;
- 2,2'(vinylene-p-phenylene)bisbenzoxazole p-bis(2-(4-methyl-5-phenyl-oxazolyl))benzene; 6-dimethylamino-l,2-benzophenazin; retinol; bis(3'-aminopyridinium) 1, 10-decandiyl diiodide; sulfonaphthylhydrazone of hellibrienin; chlorotetracycline; N-(7-dimethylamino-4-methyl-2-oxo-3-chromenyl)maleimide; N-(p-(2- benzimidazolyl)-phenyl)maleimide; N-(4-fluoranthyl)maleimide; bis(homovanillic acid); resazarin; 4-chloro-7-nitro-2, l,3- benzooxadiazole; merocyanine 540; resorufin; rose bengal;
- fluorescent tags are commercially available from SIGMA chemical company (Saint Louis, MO), Molecular Probes, R&D systems (Minneapolis, MN), Pharmacia LKB Biotechnology (Piscataway, NJ), CLONTECH Laboratories, Inc. (Palo Alto, CA), Chem Genes Co ⁇ . , Aldrich Chemical Company (Milwaukee, WI), Glen Research, Inc. , GIBCO BRL Life Technologies, Inc. (Gaithersberg, MD), Fluka Chemica- Biochemika Analytika (Fluka Chemie AG, Buchs, Switzerland), and Applied Biosystems (Foster City, CA) as well as other commercial sources known to one of skill.
- nucleic acids are labeled by culturing recombinant cells which encode the nucleic acid in a medium which inco ⁇ orates fluorescent or radioactive nucleotide analogues in the growth medium, resulting in the production of fluorescently labeled nucleic acids.
- nucleic acids are synthesized in vitro using a primer and a DNA polymerase such as taq.
- a primer and a DNA polymerase such as taq.
- Hawkins et al. U.S. Pat. No. 5,525,711 describes pteridine nucleotide analogs for use in fluorescent DNA probes, including PCR amplicons.
- the label is coupled directly or indirectly to a molecule to be detected (a product, substrate, enzyme, or the like) according to methods well known in the art.
- a molecule to be detected a product, substrate, enzyme, or the like
- Non radioactive labels are often attached by indirect means.
- a ligand molecule e.g. , biotin
- a nucleic acid such as a probe, primer, amplicon, YAC, BAC or the like.
- the ligand then binds to an anti-ligand (e.g.
- streptavidin molecule which is either inherently detectable or covalently bound to a signal system, such as a detectable enzyme, a fluorescent compound, or a chemiluminescent compound.
- a signal system such as a detectable enzyme, a fluorescent compound, or a chemiluminescent compound.
- ligands and anti-ligands can be used. Where a ligand has a natural anti-ligand, for example, biotin, thyroxine, and cortisol, it can be used in conjunction with labeled anti-ligands. Alternatively, any haptenic or antigenic compound can be used in combination with an antibody. Labels can also be conjugated directly to signal generating compounds, e.g. , by conjugation with an enzyme or fluorophore or chromophore.
- Enzymes of interest as labels will primarily be hydrolases, particularly phosphatases, esterases and glycosidases, or oxidoreductases, particularly peroxidases.
- Fluorescent compounds include fluorescein and its derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc.
- Chemiluminescent compounds include luciferin and 2,3-dihydrophthalazinediones, e.g. , luminol.
- Means of detecting labels are well known to those of skill in the art. Thus, for example, where the label is a radioactive label, means for detection include a scintillation counter or photographic film as in autoradiography.
- Nucleic acids which are genetically linked to the loci described herein are optionally cloned and transduced into cells, especially to make transgenic plants.
- the cloned sequences are useful as molecular tags for selected plant strains, and are further useful for encoding polypeptides. Often, these polypeptides are encoded by a QTL and are responsible for the phenotypic effects of the QTL.
- nucleic acids linked to a selected locus or selected loci are introduced into plant cells, either in culture or in organs of a plant, e.g., leaves, stems, fruit, seed, etc.
- the expression of natural or synthetic nucleic acids encoded by nucleic acids linked to polymo ⁇ hic nucleic acids can be achieved by operably linking a nucleic acid of interest to a promoter, inco ⁇ orating the construct into an expression vector, and introducing the vector into a suitable host cell.
- an endogenous promoter linked to the nucleic acids can be used.
- Typical vectors contain transcription and translation terminators, transcription and translation initiation sequences, and promoters useful for regulation of the expression of the particular nucleic acid.
- the vectors optionally comprise generic expression cassettes containing at least one independent terminator sequence ⁇ sequences permitting replication of the cassette in eukaryotes, prokaryotes, or both (e.g. , shuttle vectors) and selection markers for both prokaryotic and eukaryotic systems.
- Vectors are suitable for replication and integration in prokaryotes, eukaryotes, or preferably both. See, Giliman & Smith, Gene 8:81 (1979); Roberts, et al , Nature, 328:731 (1987);
- nucleic acids there are several well-known methods of introducing nucleic acids into bacterial cells, any of which may be used in the present invention. These include: fusion of the recipient cells with bacterial protoplasts containing the DNA, electroporation, projectile bombardment, and infection with viral vectors, etc.
- Bacterial cells are often used to amplify increase the number of plasmids containing DNA constructs of this invention. The bacteria are grown to log phase and the plasmids within the bacteria can be isolated by a variety of methods known in the art (see, for instance, Sambrook). In addition, a plethora of kits are commercially available for the purification of plasmids from bacteria.
- the isolated and purified plasmids are then further manipulated to produce other plasmids, used to transfect plant cells, or inco ⁇ orated into Agrobacterium tumefaciens to infect plants.
- the in vitro delivery of nucleic acids into bacterial hosts can be to any cell grown in culture. Contact between the cells and the genetically engineered nucleic acid constructs, when carried out in vitro, takes place in a biologically compatible medium.
- concentration of nucleic acid varies widely depending on the particular application, but is generally between about 1 ⁇ M and about 10 mM. Treatment of the cells with the nucleic acid is generally carried out at physiological temperatures (about 37 °C) for periods of time of from about 1 to 48 hours.
- a nucleic acid operably linked to a promoter to form a fusion gene is expressed in bacteria such as E. coli and its gene product isolated and purified.
- Promoters in nucleic acids linked to the above loci are identified, e.g. , by analyzing the 5' sequences upstream of a coding sequence in linkage disequilibrium with the loci.
- nucleic acids will be associated with a QTL.
- Sequences characteristic of promoter sequences can be used to identify the promoter. Sequences controlling eukaryotic gene expression have been extensively studied. For instance, promoter sequence elements include the TATA box consensus sequence (TATA AT), which is usually 20 to 30 base pairs upstream of a transcription start site. In most instances the TATA box aids in accurate transcription initiation.
- TATA AT TATA box consensus sequence
- a promoter element In plants, further upstream from the TATA box, at positions -80 to -100, there is typically a promoter element with a series of adenines surrounding the trinucleotide G (or T) N G. See, e.g. , J. Messing, et al, in GENETIC ENGINEERING IN PLANTS, pp. 221-227 (Kosage, Meredith and Hollaender, eds. (1983)).
- a plant promoter fragment is optionally employed which directs expression of the gene in all tissues of a regenerated plant.
- Such promoters are referred to herein as "constitutive" promoters and are active under most environmental conditions and states of development or cell differentiation.
- constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1 '- or 2'- promoter derived from T-DNA of Agrobacterium tumafaciens, and other transcription initiation regions from various plant genes known to those of skill.
- the plant promoter may direct expression of the polynucleotide of the invention in a specific tissue (tissue-specific promoters) or may be otherwise under more precise environmental control (inducible promoters).
- tissue-specific promoters under developmental control include promoters that initiate transcription only in certain tissues, such as fruit, seeds, or flowers.
- polyadenylation region at the 3 '-end of the coding region is typically included.
- the polyadenylation region can be derived from the natural gene, from a variety of other plant genes, or from T-DNA.
- the vector comprising the sequences (e.g. , promoters or coding regions) from genes of the invention will typically comprise a marker gene which confers a selectable phenotype on plant cells.
- the marker can encode biocide resistance, particularly antibiotic resistance, such as resistance to kanamycin, G418, bleomycin, hygromycin, or herbicide resistance, such as resistance to chlorosluforon or Basta.
- the DNA constructs of the invention are introduced into plant cells, either in culture or in the organs of a plant by a variety of conventional techniques.
- the DNA construct can be introduced directly into the genomic DNA of the plant cell using techniques such as electroporation and microinjection of plant cell protoplasts, or the DNA constructs can be introduced directly to plant cells using ballistic methods, such as DNA particle bombardment.
- the DNA constructs are combined with suitable T-DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector.
- the virulence functions of the Agrobacterium tumefaciens host directs the insertion of the construct and adjacent markers into the plant cell DNA when the cell is infected by the bacteria.
- Microinjection techniques are known in the art and well described in the scientific and patent literature.
- the introduction of DNA constructs using polyethylene glycol precipitation is described in Paszkowski, et al, EMBO J. 3:2717 (1984).
- Electroporation techniques are described in Fromm, et al, Proc. Nat 'I. Acad. Sci. USA 82:5824 (1985).
- Ballistic transformation techniques are described in Klein, et al, Nature 327:70-73 (1987).
- Agrobacterium tumefaciens-mediated transformation techniques including disarming and use of binary vectors, are also well described in the scientific literature. See, for example Horsch, et al, Science 233:496-498 (1984), and Fraley, et al, Proc. Nat 'I. Acad. Sci. USA 80:4803 (1983).
- Agrobacterium-mediated transformation is a preferred method of transformation of dicots.
- Generation of Transgenic Plants Transformed plant cells which are derived by any of the above transformation techniques can be cultured to regenerate a whole plant which possesses the transformed genotype and thus the desired phenotype.
- Such regeneration techniques rely on manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or herbicide marker which has been introduced together with the desired nucleotide sequences.
- Plant regeneration from cultured protoplasts is described in Evans, et al., Protoplasts Isolation and Culture. Handbook of Plant Cell Culture, pp. 124-176, Macmillian Publishing Company, New York, (1983); and Binding, REGENERATION OF PLANTS, PLANT PROTOPLASTS, pp. 21-73, CRC Press, Boca Raton, (1985).
- Regeneration can also be obtained from plant callus, explants, somatic embryos (Dandekar, et al, J. Tissue Cult. Meth.
- the expression cassette is stably inco ⁇ orated in transgenic plants and confirmed to be operable, it can be introduced into other plants by sexual crossing. Any of a number of standard breeding techniques can be used, depending upon the species to be crossed.
- sequence listing provides complete or partial sequences for a number of amplicons comprising the marker loci herein and for various primers and probe sequences useful in allele-specific hybridization, PCR and the like.
- the information is presented in DNA sequences.
- One of skill will readily understand that the sequence also fully describes the complementary strand of the provided DNA, i.e., by using standard base-pairing rules, the sequence of complementary DNA is provided and can be written out by any competent practitioner in the art.
- RNAs having the same sequence are provided by substituting "T" residues with "U” residues, and RNAs corresponding to the complementary strand are similarly provided.
- a variety of conservatively modified variations of the sequences are also fully provided.
- coding regions denoted by open reading frames, beginning with the start codon "ATG” coding for methionine and optionally ending with a stop codon can generally be modified by substituting codons which equivalently code for the same amino acid.
- ATG start codon
- TAA stop codon
- TGA stop codon
- any coding sequence can be equivalently represented by any sequence having equivalent codons, and that recitation of a single sequence provides all of these coding sequences.
- all possible coding sequences are not written out separately.
- Simple computer programs can also be used to list any or all such nucleic acids, given the provided sequence.
- coding regions where the nucleotides TTT are optionally substituted with TTC, and vice-versa.
- the codons TTA, TTG, CTT, CTC, CTA, CTG are optionally substituted for one another, in any combination.
- Coding regions where ATT, ATC or ATA appear are optionally substituted for one another.
- the codons GTT, GTC, GTA and GTG are all optionally substituted for one another.
- the codons TCT, TCC, TCA, AGT, TCG and AGC are optionally substituted for one another.
- the codons CCT, CCC, CCA and CCG are optionally substitued for one another.
- ACT, ACC, ACA, and ACG are optionally substitued for one another.
- the codons GCT, GCC, GCA and GCG are optionally substitued for one another.
- the codons TAT and TAC are optionally substitued for one another.
- the codons TAA, TAG and TGA are optionally substitued for one another.
- the codons CAT and CAC are optionally substitued for one another.
- the codons CAA and CAG are optionally substitued for one another.
- the codons AAT and AAC are optionally substitued for one another.
- the codons AAA and AAG are optionally substitued for one another.
- the codons GAA and GAG are optionally substitued for glutamic acid) are optionally substitued for one another.
- the codons TGT and TGC are optionally substitued for one another.
- the codons CGT, CGC, CGA and CGG are optionally substitued for one another.
- the codons GGT, GGC, GGA and CCC are optionally substitued for one another.
- conservative substitutions are also provided by the given sequences.
- conservatively modified variants are those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences.
- amino acid sequences one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatively modified variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art.
- the following six groups each contain amino acids that are conservative substitutions for one another: 1) Alanine (A), Serine (S), Threonine (T); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 6) Phenylalanine (F), Tyrosine (Y), and Tryptophan (W).
- nucleic acids the amplicons and probes in the sequence listing are typically used in hybridization experiments, e.g., for marker-assisted selection. Accordingly, amplicons and probes which are substantially similar or identical, which can be used in the methods herein (e.g., intra-specific alleles, genetically engineered nucleic acids made, e.g., by modification of the provided sequences, and the like) are provided by the accompanying sequences.
- bases may be added, deleted or changed without substantially altering the hybridization properties of the nucleic acid. For example, bases which do not hybridize to a given probe can be modified without altering the hybridization properties of the probe to the given sequence.
- non-hybridizing regions e.g., flanking regions
- a nucleic acid which is essentially the same (has the same desired phsiochemical properties, i.e., hybridizes to the same probe) as the written sequence.
- the amplicons are optionally larger than probes which hybridize to them. Accordingly, the regions which are not involved in hybridization are not essential for hybridization to a probe.
- the regions of an amplicon flanking the polymo ⁇ hic nucleotide which do not hybridize to a probe are not critical for hybridization to the probe.
- a probe e.g., an allele-specific probe, as described, supra
- sequences in the sequence listing are optionally part of larger sequences, e.g., the nucleic acids can be cloned into vectors known in the art. See, Sambrook, Ausubel, Berger and Innis, all supra. Furthermore, subsequences of the given sequences are easily constructed, either by synthetically or recombinantly joining nucleotides to yield the subsequence.
- Typical subsequences are at least about 10 nucleotides, often at least about 20 nucleotides, generally often at least about 30 nucletoides, and optionally any length, e.g., 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900 or the like and can, of course, be full-length.
- a subsequence can include, e.g.
- nucleic acid 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 100% or any percentage between those listed of a particular full-length nucleic acid.
- the subsequences are characterized by the ability to specifically hybridize to the complement of the full-length sequence, and by sequence identity with a sequence in the sequence listing over a selected comparison window.
- a "comparison window”, as used herein, includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 10 to 1000, usually about 20 to about 200, more usually about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned.
- Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith & Waterman, Adv. Appl Math.
- the amplicons of the sequence listing are most preferably obtained by amplifying a genomic nucleic acid (e.g., a genomic clone from a library or genomic nucleic acid isolated from a plant) using the primers described for the particular amplicon.
- a genomic nucleic acid e.g., a genomic clone from a library or genomic nucleic acid isolated from a plant
- the amplicons are also obtained by other means, including synthetic creation of a nucleic acid having the indicated sequence, or any other method described herein. Modifying Nucleic Acids in the Sequence Listing
- nucleic acids in the sequence listing can be made using common techniques. For example, due to the degeneracy of the genetic code, “silent substitutions” (i.e., substitutions of a nucleic acid sequence which do not result in an alteration in an encoded polypeptide) are an implied feature of every nucleic acid sequence which encodes an amino acid. Similarly, “conservative amino acid substitutions,” in one or a few amino acids in an amino acid sequence of a packaging or packageable construct, are substituted with different amino acids with highly similar properties (see, above) are also readily identified as being highly similar to a disclosed construct. Such conservatively substituted variations of each explicitly disclosed sequence are a feature of the present invention. Substantially identical nucleotides such as allelic variants or recombinantly engineered nucleic acids comprising all or a portion of a given sequence are also made using these standard techniques.
- nucleic acid of the invention can be selected based upon the sequences provided and upon knowledge in the art regarding retroviruses generally.
- the specific effects of many mutations on hybridization, for instance, can be determined with virtual certainty, even in the absence of experimental information on a hybridization.
- general knowledge regarding the nature of proteins and nucleic acids allows one of skill to select appropriate sequences with activity similar or equivalent to the nucleic acids in the sequence listings herein.
- most modifications to nucleic acids are evaluated by routine screening techniques in suitable assays for the desired characteristic. For instance, changes in the immunological character of encoded polypeptides can be detected by an appropriate immunological assay.
- Marker loci must have multiple alleles to be used for genetic studies, genetic selection, and genetic identification within a species.
- ASH allele-specific hybridization
- locus-specific oligonucleotide primers to amplify each locus by PCR
- allele-specific oligonucleotide probes to hybridize with and distinguish each allele.
- genomic DNA fragments were ligated into the Lambda ZAP II vector, packaged, plated on E. coli strain XLl-Blue MRF', selected based on homology to a DNA probe and excised in pBluescript SK (-) phagemid in E. coli strain SOLRTM using ⁇ xAssistTM helper phage (Stratagene, La Jolla, California 92037 U.S.A.).
- selected DNA fragments were ligated into the pCRTM vector and transformed into E.
- Minipreps were precipitated in 0.1 vol 7.5 M NH 4 Ac and 2.0 vols ⁇ tOH at -20° C for 20 min, spun in a microcentrifuge for 20 min, washed in 70% ⁇ tOH, dried, and dissolved in 10 mM Tris/HCl pH 8.5. Insert DNA was sequenced using the Taq DyeDeoxy terminator cycle sequencing reaction and a Perkin Elmer ABI 373 or 377 DNA sequencer. Sequencing primers were designed from the vector.
- T d dissociation temperature
- T d (((((3 x #GC) + (2 x #AT)) x 37) - 562) / #bp) - 5, where #GC, #AT, and #bp are the number of guanine-cytosine base pairs, the number of adenine-thymine base pairs, and the number of total base pairs, respectively, involved in the annealing of the primer to the template DNA.
- #GC, #AT, and #bp are the number of guanine-cytosine base pairs, the number of adenine-thymine base pairs, and the number of total base pairs, respectively, involved in the annealing of the primer to the template DNA.
- #GC, #AT, and #bp are the number of guanine-cytosine base pairs, the number of adenine-thymine base pairs, and the number of total base pairs, respectively, involved in the annealing of the primer to the template DNA.
- #GC, #AT, and #bp are the number of gu
- the PCR reaction mixture to amplify each desired locus consisted of 1.0 ⁇ M of the forward primer and 1.0 ⁇ M of the reverse primer, IX of buffer (20 mM Tripotassium citrate, 20 mM MgSO 4 , 40 mM Tris base, 10 mM Glycine, 5 mM L-Histidine, and 0.01 % Triton X-100), 240 ⁇ M of each dNTP, 0.4 U ⁇ h-- of Hot Tub DNA polymerase (Amersham Life Science, Arlington Heights, IL 60005 U.S. A), and 1.0 ng ⁇ 1 of template DNA, all diluted in HPLC-grade H 2 O.
- the PCR reaction was done in 33 cycles with an initial denaturing for 2 min at 94° C, subsequent denaturing for 30 seconds at 94° C , annealing for 2 min at 58° C, and extension for 2.5 min at 70° C. A final extension was done for 3 min at 70° C. An aliquot of each PCR product was run on a 1-2% agarose gel and stained with EtBr for viewing under ultraviolet light.
- Each amplicon was purified for sequencing using a QIAquick-spin PCR Purification Kit using the manufacturer's protocol (Qiagen, Chatsworth, California 91311 U.S.A.). If multiple bands were amplified from one PCR, all were individually extracted from the gel and purified using the QIAquick Gel Extraction Kit (Qiagen, Chatsworth, California 91311 U.S.A.). These products were then sequenced using the Taq DyeDeoxy terminator cycle sequencing reaction on a Perkin Elmer ABI 373 or 377 DNA sequencer. Each oligonucleotide primer used to amplify a DNA fragment by PCR was also used to prime the sequencing of that fragment from one end.
- ASH oligonucleotide probes were designed, synthesized, and tested for each allele.
- the ASH probes were designed to have a dissociation temperature of about 37° C. They were synthesized with a Perkin Elmer ABI 394 DNA/RNA Synthesizer using (-cyanoethyl phosphoramidite chemistry with the dimethoxytrityl protecting group removed, and were purified by desalting over a Pharmacia NAP 10 column.
- probes were tested against the 10 sequenced soybean varieties, 12 additional North American ancestor varieties, and 72 recombinant-inbred lines of the BSRIOI X PI437.654 mapping population. Probes were also tested for signal (hybridization of the correct probe) to noise (hybridization of the incorrect probe) ratios for each locus using the amplified target DNA from one variety in a dilution series. The dilution series were made using about 100 ng, 10 ng, and 1 ng of PCR-product (target) DNA.
- Target DNA was attached to a Hybond N+ nylon membrane (Amersham Life Science, Arlington Heights, IL 60005 U.S. A) in the following manner.
- the membrane was soaked briefly in water and the excess water was removed from the membrane by blotting.
- Two microliters of the PCR product or diluted PCR product was pipetted onto the moist membrane.
- the membrane was placed on a blotter paper samrated with a DNA denaturing solution (0.6 M NaCI, 0.4 M NaOH), DNA side up, for two minutes.
- the membrane was then transferred to a blotter paper samrated with a neutralizing solution (0.5 M Tris pH7.5, 1.5 M NaCI) for 10 minutes.
- each membrane was then baked at 85-90° C for 1-2 hours and UV-crosslinked at 20,000 ⁇ J. Prior to hybridization, each membrane was soaked in a hybridization- washing buffer (0.75 M Na, 0.5 M PO 4 , 1.0 mM disodium EDTA, and 1 % sarkosyl) for 30 min at 65° C, then in fresh buffer for 30 min to overnight at room temperature.
- a hybridization- washing buffer (0.75 M Na, 0.5 M PO 4 , 1.0 mM disodium EDTA, and 1 % sarkosyl
- Each ASH probe was end-labeled with 32 P transferred from the ⁇ position of ATP by the T4 polynucleotide kinase reaction according to the kinase manufacturer's protocol (New England Biolabs, Beverly, Massachusetts 01915 U.S.A.). Hybridization of the probe and target DNA was done for at least one hour while shaking at 60 rpm at room temperature. Afterwards, the hybridization solution was discarded and the membrane was sequentially washed, each time in fresh solution, once for 2 min, once for 15 min, and twice for 30 min while shaking at 60 ⁇ m. The hybridized membrane was placed against X-ray film for 30 minutes to
- the X-ray film was developed and the probe was evaluated for its hybridization characteristics. If necessary, the probe was redesigned and tested to increase the signal-to-noise ratio or signal strength.
- each amplicon uses the prefix 'pha' as an acronym for Pioneer Hi-Bred amplicon, followed by a unique identification number (Tables 2 and 3). Locus or amplicon names can both be used to refer to a DNA region containing specific polymo ⁇ hic sub-loci.
- the particular primer pairs described here produced amplicons ranging in length between 86 and 1880 base pairs. About half the amplicon sizes were estimated from bands viewed on agarose gels and about half were obtained by sequencing the entire amplicon region.
- loci php05219A (amplicon phalll38) and phpl0355B (amplicon phall627) each produced two fragment sizes because each locus had an insertion-deletion event that varied among different soybean genotypes (Table 4).
- pK079A One locus, pK079A, had two non-overlapping amplicons, pha 11074 and pha 11075. Two amplicons were needed for this locus because the DNA sequences flanking its two sub-loci were conserved at a second pK079 locus. A region of DNA between the two sub-loci was found to be different from the second pK079 locus so PCR primers specific to pK079A were designed for two amplicons, one for each sub-locus.
- polymo ⁇ hic nucleotides for each allele of each sub-locus were designated by upper-case letters within the probe sequences shown in Table 2. Each oligonucleotide probe distinguishes one allele of a sub-locus and locus. A public soybean variety representative of each allele is listed in Table 2 for the pu ⁇ ose of example.
- ASH allele-specific hybridization
- Other genetic marker technologies are equally effective as a means to exploit these polymo ⁇ hisms in soybean improvement programs.
- other DNA sequences than specifically shown here are used for primers and probes to detect these polymo ⁇ hisms, e.g., as described supra.
- Any DNA sequences flanking these polymo ⁇ hic nucleotides and within the size range amplifiable by PCR are used as PCR primers to amplify the polymo ⁇ hic DNA, and the forward and reverse primers are designed from either DNA strand.
- Any DNA sequence containing the polymo ⁇ hic nucleotides and within the discriminatory capability of DNA hybridization conditions are used as hybridization probes to detect the polymo ⁇ hism. The probes are designed from either DNA strand.
- the genetic map locations by linkage group for 43 of these 63 loci were established by mapping them as ASH markers in segregating populations (Table 2). Mapping showed the alleles of each locus to be heritable and to segregate normally in a recombinant-inbred population.
- the linkage-group names used here were the same as those used as a reference map for soybean (Shoemaker and Olson, 1993, supra.). Thirty eight of these 63 loci mapped as ASH markers to 17 linkage groups that correspond to the public reference map and 5 loci mapped to linkage groups on three other genetic maps that have not yet been identified with linkage groups on the public reference map. These later groups were named using a combination of letter (B, L or Z) and number.
- markers Twenty markers, pA059A, pA064A, pA077A, pA593A, pBLT15A, pK401A, pR153A, php02340B, php02396A, php05264A pA343A, pA748B, pA858A, pB132A, pG17.3A, php02371A, php05290A, php02329A, php02376A, and php08320E, were not mapped as ASH markers, but sixteen of their loci have been mapped as either RFLP or AFLP markers.
- loci php02396A, php05264A, php02329A and php02376A have not been mapped as markers of any kind. Regardless of their map status, the 63 loci described here reside on most, and possibly all, of the 20 chromosome pairs in soybean. Table 2. Soybean marker loci: their genetic linkage group, sub-loci, and polymo ⁇ hic nucleotides, probe sequences for allele-specific hybridizations, and a representative soybean variety for each allele.
- CTGCAGCTCC TTCTGCATCT TCCACCTCAA TAGTGATGGN TAACTCTATG GAAGATGCAG TGGTAGTGTG GTGACCAGAT GTTCCTCCAC AATAATGCTC CATTTCTTTG ACCTCAGGGA TTCCAGTGAT CTCCATGTCT TGACCTCTGC CCACATTCCA TTGTCTTGAC CTCCTCCCAC ATTCCATTGT CTTGACCTCC
- CTGCAGCACT CGAAAGTGAC CTGCATCAGC CTTCAACCTT CACCCTTATG CAGCAACAAC CAAAACCATA ATTTAAAACC TATTTTGAAA CATATCCATG TACATTTTCC CGCTATCAAG CCTGTTCTTT TAACAATAAT CATTATTCTC TTGTCCAAAA ACTGGAAGCA ACTGAGATCA AGACAAATTG AGTGCAAAAT CTACACAATC TATGTTCAAA ATATTGAGTG CAGGTCTCTT TCTTGCAATT CCAAGTCGTA TCTATTGGGG TCACACACAA ATGGAATATC ACTAAAGCTA TCGAAGAA GATTCCACCC AGATCATCAT CAGTAGCTA TTTGTGTCTG TTGATCCTTT GTGGCTTCCC CATCA[T/G] CTTCTTCTCC [A/CJGAAAC ATTTGAATTC TTCAACT AATTCCCTAA ATAACGA ( ⁇ 410 bp) TGATGGTATG TGGAGATTTA
- CTATTCTTGC TTCCCATCTA CCAGTGTGG TGGTGCCTGC AAAACATTCA CCATGTGTTA GTGATTCATC TTCGCAAGCA ACAATTCAGA AGCCAATTCG AGTCGTATTC ATCAGAAATA TCCACCTCGC AACACCCCTA TACTTTGATA CACCTCTTGA AAAGCCACTG CTCTTTCTGA TCAATTTCAA AATAACCAAA GTCAAAAAAT CATATATAAT ATCAAATTCT GTACAACTTT ATATGCTGCA
- CTGCAGGTAC TTTTTATGAA TCAACTGCTT CTATTTCACA GCTTGTTTTC ATTTATCTCT TCTAAAAGGG GAAGAGGGAA AAATTGTATT ATTTGCTAAA
- GTTAGATTGT TTTCTCCTCT CCCGTTTCAG CTCACGCTCA TTCTAAGGAA TTAAAAATAC ACACAGGAAT TAATTTACTG TCAGTAAAAG ATGGACGGTA TTGCATAATT GTTCTAATGT GTTTCACAA GCATCTCTAA TGTACTAGAC ATGATCCGA GACATTAGAA CAAACCATTA TACCTGTAAC CAAGTTTCAT TACGCACAAC TGCACAAGGT TGTGCGGCAC TTGTGGAATT TGCCTTAGAA
- ACTTTACGGA TCATACCAAN CCAAAGAACA TTGAGGTTTA ATATATATAT ATATATAGCT TATGTCACTT TAATCAAAGC AGCAATTGCT GAAATTGCTG GAGACAAGAA GTTGTTTAAT GTGGTGGCTT TTTGAGAAAT AACAGACAAC CCCAATCTAA AAAAGTCCAG GAAGATATTG TTTACGTGTT GGGATTGAGA
- CTGCAGTTAC GAGTAATTGG GTAGGTTACT GCACTAATA CTTTAGTTGA CTTTTGAGGT GGTTGAGAT ACTAGCTATC TATTAATCTG TTCTATAAAA TACAGGCATA GCTGTTCTCT TGAACTATAG AATTTACATA GCCTTTT[C/ A]CCTATTAT TAATAACACC AAGACACACA TATACTGCTA GACACTTCTG
- TCTACCACAT AGC phal0638 (SEQ ID NO:730, 731)
- CTGCAGCAAC AACCCCCAAA CCCCGTTGCC AGAAACAAGA CAATTTGTCA AATGCAGAGC ATATATAAAA AATAGAGAGT AGGATCATCA CCTTTGCTGG
- ..CCAACC ACATATAGAC TGCTTATTAA NACCCATCCA AGATTTCTAC TGAACCAAAA CGAATATGAT CCTAAATATN CATTGAATTA ATCTTACCCA GATGGGGGGT CATTGATTGT AGTCCCCTTT CCAAAAACCG GGGGGTTCCT AAATAAATTG AAAATAAAAA AGATTAGGAA GCTAACAGAA ATTTGAATAT AACGCTCCAA AGTTCACACC ATATCAGTCC
- CTGCAGTCAA TCATAATATT CAACCATGAA ATCGAAGTAG TATCTAACCT GTCTGCTCTC TTTTGCATTT CTGATGAGTC CATCAAGACA AGTATTGATG ACCTTTAGGT CATCCTGAAA CTTTCTTTGC CTTGGGATTA TCCACCTTGC CAATGGAATT TTCCAATAT GGAATGTAGA AAGTGGATCT ATGTTCAGC TTCAAAAAGA GTGCCATAGA CTGC[C/T]T ATAACAGACT TAAAAGCAGA
- TAGGTTACAT TCATAAAACA CTTGACATTA TTAGCACAGG AAAGACAAGC TATTGAAACC ACTGATTATT CTTTAAAAGT TATTTAGACT CCTAAGTTGT AACAGAATGC TGGTATTAAT TTAGTTAATT ACTGCAG pha 10646 (SEQ ID NO : 616 , 735)
- TCTCTTACTC AATTTTCATT GGCACCCTCG ACATTCGATC TTATTTCTTC CCTCGCCTAA AGTTACCCGC GGCTGCACCT GCTCCCTGTG CACCCGAGCC TCCTCTTCGG GTTTTCATGT ACGATCTCCC TCGCCGATTC AACGTCGGCA TGATTGACCG CCGGAGCGCG TCGGAGACGC CGTCACGTTG AGGACTGGCC GGCGTGG... ( ⁇ 255 bp). ..AACTCGCC CAAGCGTTCT TCGTGCCGTT
- CTTCTCGNCG CTCAGCTTCA ACACGCACGG CCACACCATG AAGGATCCCG CCACGCAGAT TGATCGCCAA TTGCAGGTGC GTCGCTAGGT TTCTCGATTG TTCCGAATTG ATTGGTGAAT CANTCAAATG CTATGCTATG AGAGTTATTT TCACAGCAGG TTTTGCATGT TTAGCCTTAG AACGCGATTA GTATGATCAC TTGAAGATTT A[C]GATACA GGATGCTTGT TGAACGACAT TAGATGCTNA
- TGATTTTCTT TTGGATCCTG AGAACAAGCC
- CCCCAGAATT TCACTCATG CATGTGCCAG AGTTCAGGAA TAGTCTTCCA AGGGAAACAA
- AGAAGCCCT CATCTTGTTT CTCAAATTCA TCTTCATTTG GAATCAAAGG AGGCCTGAC TGACTTTGTA GTAGGAGGAG GCCTTTGCAT TGCTGTGCTA CAAATGGTG
- CTTAATAAT phal0653 (SEQ ID NO:744, 745)
- ACAGCCTCAC TAACTGCATA [C/A]GATAT TATTGTTTAT ATTTAATCCA AACATCATGT TATGGCAGTC AAACACAACA CAAAAGAATC ATCAATATGT CAGAGCTAGA ACTCCCTCTG GTTACTAAAA ACCAATTCTC ATGATCCAGT CCACTCATTG TTTAACTCA GAGACAGAGT ACGAAGCATA ACAAACCTTT TCTAGGATC TGCACTATAC ACACCATCAA CATCTGTCCA AATTGTGACC TGACGAGCCT TAAATAGAG CACCCATAAT TGCTGCCGAG AAGTCACTTC CATCTCTT CAGTGTGGTA GGAATGTTTT GAGGTGTGCT TGCAATGAAT CCAGTGGCAA TGATTACCTT ACATGGATTC AAAGAGTACC ATTTCTCAAG TCTCTTCTCA GATTCCA phal0655 (SEQ ID NO:746)
- TCATCTTAAT TAATGTCTTC TACACCGTCT GCCTGCAACA GAGTATGTAT GTATATATAA ACAACATTCC AGGGCCAGTG TCATACAAGT TACAGCCAAT TTTCAAAGTA ATTAAGTGAT GCTAAACTAA ATCAAGACTG AGTTTAAATT TTCAAAACTA ACTCATTTCA CATATATTTT GAGTTATTTA GGCCAACTCT TGCTAACCAA AGTTGGGTGA ATATTCTAAC TGGTCCCCAA AACTGTGAAG
- CAAACACTTC CTGCACTCAT CACTTGGAAG ATCCCTATCA CACTGCACCC ATCCATACAA TGTCTCATTG TCACCCCAAT CGAACTCCTC CACTGCCCAA AACTGCTTGG TCTCTATAGT TGCTTTTGTA ATCAAACCLC/T]TC[C/ A]ATAGCATC CTCAATTfC/ TjTCTTAGTT TCTGTTGA[A /GjTTCTGTG TTTGGACCAC GATCTTTGTGTG AGACAAATTC AATTTTTAAG TGACTTTAAA
- CTGCAGCAAC CGGTCTTCTA TCAAAACTAA CACATTTGAA TCTGTCTTGG AATGATCTTC ATTCACAGAT GCCTCCAGAA TTTGGCCTCC TTCAGAACCT
- GAGTTTGTCT CACAATAATT TGACTGGTTC AATTCCAAAG TCCATGTCAA AGCTAAACAA GCTCAAAATC CTCAAGCTGG AATTCAATGA ACTAAGTGG AGAGATACCA ATGGAGCTTG GAATGCTTCA GAGTCTTCTT GCTGTAAACA TATCATACAA CAGGCTCACA GGAAGGCTTC CTACAAGTAG CATATTTCA GAACTTGGAC AAAAGTTCCT TGGAAGGAAA CCTGGGTCTT TGTTCACCCT TGTTGAAGGG TCCATGTAAG ATGAATGTCC CCAAACCACT [A/TJGTGCT TGACCCAAA TGCCTATAAC AACCAAATAA GTCCTCAAAG GCAAACAAA CGAATCATCT GAGTCTGGCC CAGTCCATCG CCACAGGTTC CTTAGTGTAT
- TTAGTGACTG AGTTTGCACC AAATGGTAGC TTGCAAGCCA AGCTACATGA AAGGCTTCCT TCAAGTCCTC CTCTTTCTTG GGCTATAAGG TTCAAAATCT TGCTTGGAAC AGCAAAGGGG CTTGCTCATT TGCACCACTC TTTCCGTCCA CCGATCATCC ACTACAACAT AAAGCCAAGT AACATTTTGC TTGACGAAAA TTACAACGCC AAGATCTCAG ATTTTGGGTT GGCTCGGCTT CTGACAAAGC
- JCCCCTTTCC CAAAGGGGAA ACACCAAAGA TCTTGGGTTT TGAACACTTC TCCACTCGAA GATGGGGTTC AAGACTAGGA TCTGGGTCGT TGACGCCAGA CGGTGCATGG CAAGGTTCAA GACTAGGCTC GGGATCGTTG ACTCCTGATG GTATTGGGCT TGCTTCACGA TTAGGCTCTG GGTGTGTGAC ACCTGATGGT CTGGGGCfA/ T]GGAATCCA GGTTAGGTTC TGGCTGTTTG ACACCTGACA
- AAAACTCTT TATAGTGT CGGTGCGTAG ACTCTGTTAC TCTTCCC CTATGTATCT TGTATCATCA TACCAAAAAT AAAAGCAAAA TTTAAGTTGA ATTAGTAATT TTGCAGAGGG GTGTTTATGA TGGTCTACTA GATGCATTCC TGAAAATAGT TAGAGAAGAG GGTGCAGGAG AACTTTACAG AGGTCTTACT CCGAGTCTGA TTGGAGTAAT TCCATATTCT GCCACCAATT ACTTTGCCTA
- GTACAAGTCC GTATGGGATT AATCACTTTG TTTTACTTGT GGGTTATGGT TCAGCGGATG GTGTAGATTA CTGGATAGCG AAAAATTCAT GGGGAGAAGA TTGGGGAGAA GATGGTTACA TTTGGATCCA AAGAAACACG GGTAATTTAT TAGGAGTGTG TGGGATGAAT TATTTCGCTT CATACCCAAC CAAAGAGGAA TCAGAAACAC TGGTGTCTG CTCGCGTTAA AGGTCATCGA AGAGTTGATC
- CTGCAGAAAC TGTCGGTTTT ATTAGTAATT CAGCACAAAT TCTTCGGTGG AGTTTTTT TTNTTCTGTT TTTGGCTGCG TTTACGTGTG TTTGTTTAGT TTCCTGGAAA TTGTTCAAAT TGGTTAAAGG ATCTTTATAA GTGGACATTT CAGTTGGAGT TAATTGCGCA ATTGAAAGTT AAGGCATAAA TGTCTCTATT
- CTGCAGGTAA TTTCTTAATG AATGGGTTGT TCTTTTGCTC
- GTGTTTGTCA TTGATGGTTC TCAGCTATTG GCATGAGGAA ATTATCAACT TCACAACAAA AAGCAGTTAG TCATGAAAGC AGGCTAAGGC CAAATGATAT [ATT/G]TTC TTCAAATTTG AAACTGAAAA ATGGGCCAAA TTGTTTCTTC TGCATATGAT CATATTTAT CACTATATAA GTTAAC[A/G ]TGAAGCAGT .
- CTGCAGAGAC CATTCAAGAT CGACTGAAGA ATTTGATTGC CTCGTCCCCT
- ATCATATATT TCAGATTTAA ATCATCCJAC TCGATA[G/A ]CTTATTT[G T/AT/GC]AT GTATCATGTA TGCATGCTGT ATCTT[T/G] C[A/G]T[CC TCATAGAATA ]ATTAACCTA ATCTT[C/T] TATAATCTTT TATTT[C/T] TTT[A/G]TT ATTAGAAAAA GACTAAATAG GAAAGGATAG ATCATATATA C[TCTAG/TC TGG/TC/GTT AG]AAGACTC AAA[T/C]TT GTAACTTAA[
- AACCGACGCT GAGACTCGCC ATGGCACACA CGTCCGTTCG GGATGCCGGA GTTTCCACC GTCGTTGCAT CTCAATATA ATACCACACG TTTACGATC
- CTGCAGCTG ATGGGAAGAT TTACTTCTGG TAGACTATT TTAAAATCTT
- GAGCCTTGCC CAATTAGAAA AAGAATAATG GAAAGGAAAC ACAAGGAAAA ATTTACGGAA GTAGACTCTC CCATAACAAA TAGAAAGATA ATGGAAGCTC TCAATATGTA AGAGAAGAAA TGATAGCACA TATCATTCAA GAGATTTATA AATCTAATCT TATCCTGCAC CCTGTTCTTA CTTGTCTCTC CTCTCCACCC TGTCTTTTTT ATTGCTCCCA AGTTACTGAG TCTGCCCTTC TATTCCTCCT
- TTTTTCA [TC /GTJGCACTT TTTGTGATAT GGTTCTGATT TTGTTGCCAT TAACAATTTG CTTGGTACTT ATTCTGTGAC AGGTGATTTT ATTCTCAGAC ATTCTTACCC CACTTTCTGG AATGAATATA CCCTTTGATA TTGTGAAGGG TAAGGGTCCT GTTATATTTG ATCCTATTCA CACAGCTGCC CAGGTTGATC AAGTGAGGGA GTTTATTCCT GAAGAATCAG TTCCATATGT TGGTGAAGCA
- TTTTTGTTTT TCCCTAATTA TTTTCCACCT GCGGGACCTC TCACTATTCC TTGTCAGTCC AGCTCTTGGG TTATGCAGAA AACTTAACAA TGTGGATATT CCCTTTATTT TTTATTCGGT CCTCCTCACC CATGTGTGTT TACGCTTTTT ACTTCCCATT CCCTTCCTTG TAGCAATTAC CTTATTGGCC ATCCAATTTT CTTAATATTC C
- TAAATTGTGC TTGGTTTTGG TATCAGATTG ACTACAT[A/ GjATGAAGCC ATTCTGACAT CATTCTGAAA TAATGAAATT TGGAATTGGA AATCTT[G/A JCTGTGATCT TTTTTCCCCC TATTTGAGGG AAGCAAATAC TAGTTTGGCT TAATTACACT TTTAGTTCCT TTATTTTAGC CTATGCACGA TTTTGGTTCC CCTAGTTCTA ATTGCTTGCA TTTAGTCTTT GTAGTTACTC AATTGTTAAA
- ATTAAGTCCC TCACCTCATA TTTTGTCCAA TATTTAGTGG AAATCTCACA AGGTGTGATG TAAGTCATGT G[G/A]CATT GATTTGTGTA GATTGTTGTA GGCAAAATAT ATACAA[T/C JATCTAAATA AAATATTTCA TATATTTGTT TCCTTTGTTA ATCTATACAA AAGTATAAAT CAATG[A/T] TGGATAAATT CTGGCTGTCT TTTTCAATGA ATAATGTCTG GCCTTGCAGC TTTCTCAAAC
- CTGCAGGAC ATTCACAGTC ATTGCCGCA GTGGAAGGAA TTAACAAAGA TAGTAACAG GCAATCTAAC TGCACTGTCA GACTGTGAT A GAACAGTC AA TGCAGGA TG TAGTGGT GGACTTGCGG GACTATGCCT TAGAGTTCAT TATCAACAAT GGTGGCATTG ACACCGAAGA GGATTACCCC TTTCAAGGTG CTGTTGGTAT TTGTGA[T/C ]CAATATAAG GTTAGTTTTA CCTTTGATTC
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU19276/99A AU1927699A (en) | 1997-12-19 | 1998-12-18 | Nucleotide polymorphisms in soybean |
CA002314992A CA2314992A1 (en) | 1997-12-19 | 1998-12-18 | Nucleotide polymorphisms in soybean |
BR9813805-7A BR9813805A (en) | 1997-12-19 | 1998-12-18 | "nucleotide polymorphisms in soy" |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US6818597P | 1997-12-19 | 1997-12-19 | |
US60/068,185 | 1997-12-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1999031964A1 true WO1999031964A1 (en) | 1999-07-01 |
Family
ID=22080955
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1998/026935 WO1999031964A1 (en) | 1997-12-19 | 1998-12-18 | Nucleotide polymorphisms in soybean |
Country Status (5)
Country | Link |
---|---|
AR (1) | AR017917A1 (en) |
AU (1) | AU1927699A (en) |
BR (1) | BR9813805A (en) |
CA (1) | CA2314992A1 (en) |
WO (1) | WO1999031964A1 (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006130494A2 (en) * | 2005-05-27 | 2006-12-07 | Monsanto Technology Llc | Methods and compositions to enhance plant breeding |
US7973212B2 (en) | 2003-08-01 | 2011-07-05 | Pioneer Hi-Bred International, Inc. | Soybean plants having superior agronomic performance and methods for their production |
US8090541B2 (en) | 2000-07-17 | 2012-01-03 | Dna Landmarks Inc. | Map-based genome mining method for identifying regulatory loci controlling the level of gene transcripts and products |
WO2013096818A1 (en) | 2011-12-21 | 2013-06-27 | The Curators Of The University Of Missouri | Soybean variety s05-11268 |
WO2013096810A1 (en) | 2011-12-21 | 2013-06-27 | The Curators Of The University Of Missouri | Soybean variety s05-11482 |
WO2014100229A1 (en) * | 2012-12-21 | 2014-06-26 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with soybean cyst nematode resistance and methods of use |
WO2014100346A1 (en) * | 2012-12-21 | 2014-06-26 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with soybean cyst nematode resistance and methods of use |
WO2015061548A1 (en) | 2013-10-25 | 2015-04-30 | Pioneer Hi-Bred International, Inc. | Stem canker tolerant soybeans and methods of use |
US9493843B2 (en) | 2012-12-20 | 2016-11-15 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with Phytophthora tolerance in soybean and methods of use |
WO2017136204A1 (en) | 2016-02-05 | 2017-08-10 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with brown stem rot resistance in soybean and methods of use |
US10555527B2 (en) | 2009-05-18 | 2020-02-11 | Monsanto Technology Llc | Use of glyphosate for disease suppression and yield enhancement in soybean |
CN113795597A (en) * | 2021-02-08 | 2021-12-14 | 中国农业科学院作物科学研究所 | Soybean SNP typing detection chip and application thereof in molecular breeding and basic research |
CN114107554A (en) * | 2022-01-24 | 2022-03-01 | 华智生物技术有限公司 | Primer group for detecting purity of soybean variety and application thereof |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5015580A (en) * | 1987-07-29 | 1991-05-14 | Agracetus | Particle-mediated transformation of soybean plants and lines |
US5491081A (en) * | 1994-01-26 | 1996-02-13 | Pioneer Hi-Bred International, Inc. | Soybean cyst nematode resistant soybeans and methods of breeding and identifying resistant plants |
US5689035A (en) * | 1995-09-26 | 1997-11-18 | Pioneer Hi-Bred International, Inc. | Brown stem rot resistance in soybeans |
-
1998
- 1998-12-18 AR ARP980106514A patent/AR017917A1/en not_active Application Discontinuation
- 1998-12-18 AU AU19276/99A patent/AU1927699A/en not_active Abandoned
- 1998-12-18 CA CA002314992A patent/CA2314992A1/en not_active Abandoned
- 1998-12-18 BR BR9813805-7A patent/BR9813805A/en not_active IP Right Cessation
- 1998-12-18 WO PCT/US1998/026935 patent/WO1999031964A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5015580A (en) * | 1987-07-29 | 1991-05-14 | Agracetus | Particle-mediated transformation of soybean plants and lines |
US5491081A (en) * | 1994-01-26 | 1996-02-13 | Pioneer Hi-Bred International, Inc. | Soybean cyst nematode resistant soybeans and methods of breeding and identifying resistant plants |
US5689035A (en) * | 1995-09-26 | 1997-11-18 | Pioneer Hi-Bred International, Inc. | Brown stem rot resistance in soybeans |
Non-Patent Citations (6)
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8090541B2 (en) | 2000-07-17 | 2012-01-03 | Dna Landmarks Inc. | Map-based genome mining method for identifying regulatory loci controlling the level of gene transcripts and products |
US7973212B2 (en) | 2003-08-01 | 2011-07-05 | Pioneer Hi-Bred International, Inc. | Soybean plants having superior agronomic performance and methods for their production |
US8987548B2 (en) | 2003-08-01 | 2015-03-24 | Pioneer Hi-Bred International, Inc. | Soybean plants having superior agronomic performance and method for their production |
WO2006130494A2 (en) * | 2005-05-27 | 2006-12-07 | Monsanto Technology Llc | Methods and compositions to enhance plant breeding |
WO2006130494A3 (en) * | 2005-05-27 | 2007-06-07 | Monsanto Technology Llc | Methods and compositions to enhance plant breeding |
US7608761B2 (en) | 2005-05-27 | 2009-10-27 | Monsanto Technology Llc | Method for disease control in MON89788 soybean |
US10273498B2 (en) | 2005-05-27 | 2019-04-30 | Monsanto Technology Llc | Methods and compositions to enhance plant breeding |
US9605272B2 (en) | 2005-05-27 | 2017-03-28 | Monsanto Technology Llc | Methods and compositions to enhance plant breeding |
US11479787B2 (en) | 2009-05-18 | 2022-10-25 | Monsanto Technology, Llc | Use of glyphosate for disease suppression and yield enhancement in soybean |
US10555527B2 (en) | 2009-05-18 | 2020-02-11 | Monsanto Technology Llc | Use of glyphosate for disease suppression and yield enhancement in soybean |
WO2013096818A1 (en) | 2011-12-21 | 2013-06-27 | The Curators Of The University Of Missouri | Soybean variety s05-11268 |
WO2013096810A1 (en) | 2011-12-21 | 2013-06-27 | The Curators Of The University Of Missouri | Soybean variety s05-11482 |
US9493843B2 (en) | 2012-12-20 | 2016-11-15 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with Phytophthora tolerance in soybean and methods of use |
US9464330B2 (en) | 2012-12-21 | 2016-10-11 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with soybean cyst nematode resistance and methods of use |
US9994920B2 (en) | 2012-12-21 | 2018-06-12 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with soybean cyst nematode resistance and methods of use |
WO2014100346A1 (en) * | 2012-12-21 | 2014-06-26 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with soybean cyst nematode resistance and methods of use |
WO2014100229A1 (en) * | 2012-12-21 | 2014-06-26 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with soybean cyst nematode resistance and methods of use |
WO2015061548A1 (en) | 2013-10-25 | 2015-04-30 | Pioneer Hi-Bred International, Inc. | Stem canker tolerant soybeans and methods of use |
WO2017136204A1 (en) | 2016-02-05 | 2017-08-10 | Pioneer Hi-Bred International, Inc. | Genetic loci associated with brown stem rot resistance in soybean and methods of use |
CN113795597A (en) * | 2021-02-08 | 2021-12-14 | 中国农业科学院作物科学研究所 | Soybean SNP typing detection chip and application thereof in molecular breeding and basic research |
WO2022165853A1 (en) * | 2021-02-08 | 2022-08-11 | 中国农业科学院作物科学研究所 | Soybean snp typing detection chip and use thereof in molecular breeding and basic research |
CN113795597B (en) * | 2021-02-08 | 2023-11-17 | 中国农业科学院作物科学研究所 | Soybean SNP (Single nucleotide polymorphism) typing detection chip and application thereof in molecular breeding and basic research |
CN114107554A (en) * | 2022-01-24 | 2022-03-01 | 华智生物技术有限公司 | Primer group for detecting purity of soybean variety and application thereof |
CN114107554B (en) * | 2022-01-24 | 2022-04-15 | 华智生物技术有限公司 | Primer group for detecting purity of soybean variety and application thereof |
Also Published As
Publication number | Publication date |
---|---|
BR9813805A (en) | 2000-10-03 |
CA2314992A1 (en) | 1999-07-01 |
AU1927699A (en) | 1999-07-12 |
AR017917A1 (en) | 2001-10-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Morgante et al. | PCR‐amplified microsatellites as markers in plant genetics | |
KR101303110B1 (en) | Corn Event MIR604 | |
EP1869187B1 (en) | Elite event a2704-12 and methods and kits for identifying such event in biological samples | |
US9012613B2 (en) | QTL controlling Sclerotinia stem rot resistance in soybean | |
US6733965B2 (en) | Microsatellite DNA markers and uses thereof | |
EP1995320A2 (en) | Polynucleotide markers | |
JP2013055956A (en) | Corn event tc1507 and method for detection thereof | |
AU2019246847B2 (en) | Qtls associated with and methods for identifying whole plant field resistance to sclerotinia | |
WO1999031964A1 (en) | Nucleotide polymorphisms in soybean | |
BRPI0610520A2 (en) | polynucleotide, vector, recombinant DNA construct, processes for transforming a host cell, for conferring or enhancing colletotrichum resistance and stem rot, to alter the expression level of a protein capable of conferring colletotrichum resistance and stem rot of colletotrichum-resistant plants to determine the presence or absence of the locus of rcg1, computer system for identifying a colletotrichum-resistant plant, genetic marker, methods of producing a corn product and plant, and uses | |
AU2016369320A1 (en) | Brassicaceae plants resistant to plasmodiophora brassicae (clubroot) | |
KR102254956B1 (en) | A molecular marker for selecting onion white bulb color and the use thereof | |
CN111295447A (en) | Maize elite event MZIR098 | |
US8946514B2 (en) | Sorghum fertility restorer genotypes and methods of marker-assisted selection | |
Horvath et al. | A Co‐Dominant PCR‐Based Marker for Assisted Selection of Durable Stem Rust Resistance in Barley | |
CN109182342B (en) | Rice blast resistance gene Pisj of rice and application thereof | |
KR102175444B1 (en) | Molecular marker for identification of oriental melon and identification method using the same marker | |
AU2014386227B2 (en) | Markers linked to reniform nematode resistance | |
WO1999041415A9 (en) | Transposable element-anchored, amplification method for isolation and identification of tagged genes | |
KR101439449B1 (en) | A High-density Genetic linkage map of Capsicum sp. | |
KR101439450B1 (en) | A High-density Genetic linkage map of Capsicum sp. | |
KR101439457B1 (en) | A High-density Genetic linkage map of Capsicum sp. | |
KR101439451B1 (en) | A High-density Genetic linkage map of Capsicum sp. | |
Wang et al. | Expressed sequence tag-PCR markers for identification of alien barley chromosome 2H in wheat | |
KR101439453B1 (en) | A High-density Genetic linkage map of Capsicum sp. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
ENP | Entry into the national phase |
Ref document number: 2314992 Country of ref document: CA Ref country code: CA Ref document number: 2314992 Kind code of ref document: A Format of ref document f/p: F |
|
NENP | Non-entry into the national phase |
Ref country code: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 09581970 Country of ref document: US |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase |