WO2002099129A2 - Designing degenerate pcr primers - Google Patents

Designing degenerate pcr primers Download PDF

Info

Publication number
WO2002099129A2
WO2002099129A2 PCT/GB2002/002640 GB0202640W WO02099129A2 WO 2002099129 A2 WO2002099129 A2 WO 2002099129A2 GB 0202640 W GB0202640 W GB 0202640W WO 02099129 A2 WO02099129 A2 WO 02099129A2
Authority
WO
WIPO (PCT)
Prior art keywords
primer
pcr
primer pair
sequences
sequence
Prior art date
Application number
PCT/GB2002/002640
Other languages
French (fr)
Other versions
WO2002099129A3 (en
Inventor
David John Griffiths
Paul Kellam
Robert Anthony Weiss
Original Assignee
University College London
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University College London filed Critical University College London
Priority to AU2002311446A priority Critical patent/AU2002311446A1/en
Publication of WO2002099129A2 publication Critical patent/WO2002099129A2/en
Publication of WO2002099129A3 publication Critical patent/WO2002099129A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/70Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage

Definitions

  • the invention relates to a method of designing a panel of primers for detecting viruses in a high throughput polymerase chain reaction assay.
  • Viruses require the use of the cellular- translation and transcription machinery to replicate. In the process of replication they often have deleterious effects on the host cell and thus on the host organism. Viruses constitute an important class of pathogens causing many diseases, leading to loss of life in humans and economic loss in the agricultural industries.
  • the polymerase chain reaction allows the amplification of a specific region of a polynucleotide.
  • the specificity of the reaction is due to the primers which during the course of PCR bind to the region to be amplified in a sequence specific manner.
  • the invention provides a method of designing primers which can be used in high throughput screening to detect viruses. The method may be used to detect unknown viruses which have not yet been sequenced.
  • the invention provides a method of designing a panel of degenerate primer pairs for screening for new members of multiple known virus families in a biological sample, wherein each primer pair in the panel binds a sequence that is conserved across members of a said virus family and selectively directs amplification of sequence of said family by PCR, which method comprises
  • each primer in the pair binds a nucleotide sequence that encodes a conserved region identified in (b) and wherein the primer pair is designed to amplify by PCR the nucleotide sequence between the nucleotide sequences that encode conserved regions in members of the first virus family, and
  • the invention also provides a method of designing a panel of degenerate primer pairs for screening for new members of multiple known virus families in a biological sample, wherein each primer pair in the panel binds a sequence that is conserved across members of a said virus family and selectively directs amplification of sequence of said family by PCR, which method comprises (a) providing a plurality of nucleotide sequences from members of a first virus family,
  • each primer in the pair binds a conserved region identified in (b) and wherein the primer pair is designed to amplify by PCR the nucleotide sequence between the conserved regions in members of the first virus family, and
  • the invention additionally provides a panel of primers which has been designed by the method of the invention.
  • the invention provides a method of designing a panel of primer pairs which can be used in high throughput virus screening.
  • the method comprises initial steps which deduce the sequences of the primers using computer based calculations, and optional later steps in which the primers are synthesised and tested empirically, for example to determine optimal PCR conditions and/or to select primer pairs with desired further properties.
  • the panel of primers provided by the method are designed to be capable of detecting unknown viruses based on nucleotide and/or amino acid sequences in the unknown virus which are similar (homologous) to nucleotide and/or amino acid sequences in a known virus. These conserved sequences typically have a role in providing a necessary or advantageous activity or property to the virus. conserveed nucleotide sequences may be coding or non-coding sequences.
  • the panel of primers is designed to detect viruses which are single stranded or double stranded DNA or single stranded or double stranded RNA viruses.
  • the viruses are generally capable of infecting prokaryotic or eukaryotic cells, such as bacterial, animal, plant, yeast or fungal cells.
  • the viruses are mammalian (preferably primate) or avian viruses, such as human, pig, horse, sheep, goat, cow, chicken, turkey or duck viruses.
  • the viruses are typically from any combination of the following families: Adenoviridae, Arenaviridae, Arteriviridae, Astroviridae, Bimaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Deltavirus, Filoviridae, Flaviviridae, Hepadnaviridae, Herpesviridae, Orthomyxoviridae, Papovaviridae, Paramyxoviridae, Parvoviridae, Picornaviridae, Polydnaviridae, Poxviridae, Reoviridae, Retroviridae, Rhabdoviridae, Togaviridae or Bornavirus.
  • the primers of the panel are capable of detecting unknown viruses in a biological sample. Such a sample either originates from a single individual or is a pooled sample from individuals of the same species. Thus the panel of primers detects viruses which infect the same species (from which the sample originates).
  • primer pairs designed in the method bind sequence which is conserved across members of a virus family.
  • the panel which is designed in the method may comprise primer pairs that bind sequence which is conserved across substantially all members of the family or across a subset of the members of the family, for example across all members of a subfamily or of a genus.
  • the primer pairs bind at least 70%, at least 80%, or at least 90% of the known viruses of the family, subfamily or genus. Preferably less than 10, such as less than 5, primer pairs will be used for the detection of any given family, subfamily or genus in the panel.
  • the panel of primer pairs is generally capable of detecting viruses from at least 10, 15, 20, 30 or more families, typically up to a maximum of 35 families.
  • the panel of primer pairs may comprise sets of primer pairs which perform a nested PCR reaction.
  • a set of primer pairs comprises a first and second primer pair.
  • the first primer pair is able to amplify a template nucleotide sequence from a virus to form a PCR product.
  • the second primer pair is able to amplify a nucleotide sequence using the PCR product generated by the first primer pair as a template.
  • the use of nested sets of primer pairs allows increased sensitivity.
  • each primer pair is specific for a particular virus family, so that it does not detect viruses of other families .
  • the plurality of amino acid sequences or nucleotide sequences are provided from different known viruses of the same family.
  • the amino acid sequences or nucleotide sequences will be for the same protein of the different viruses. Typically at least 5, 10, 20, 50, 100 or more sequences are provided. The maximum number of sequences provided will, for example, be 300 sequences.
  • Each of the sequences which is provided is typically at least 20, 50, 100, 200 or more amino acids or nucleotides in length.
  • the maximum length of the nucleotide sequences is 1000 nucleotides and the maximum length of the amino acid sequences is 300 amino acids.
  • the sequences may be obtained from a database of sequences, such as GenBank.
  • the sequences may be obtained from a database comprising virus sequences which are organised into homologous protein families (based on sequence similarity relationships).
  • sequences are obtained from the VTDA database (described in Alba et al (2001) Nucleic Acids Research 29, 133-136) or the Virus Division of GenBank.
  • the sequences may be provided in the form of a database, preferably in computer-readable form.
  • the sequences are preferably provided in the form of a computer-readable database constructed using programs which identify homologous protein families, such as GeneTableMaker, MKDOM or PSCBuilder.
  • conserved regions typically such conserved regions will have a length of at least 12 nucleotides, such as at least 15, 21, 27, 36, 99 or more nucleotides (generally up to a maximum length of 200 nucleotides) or at least 4, 5, 7, 10, 25 or more amino acids (generally up to a maximum length of 50 amino acids).
  • the virus sequences which are being provided will of course share identity or similarity.
  • the amino acids or nucleotides in at least 50% of the positions in the region will be the same in at least 50 %, 60%, 70%, or 80% of the viruses of the group (i.e. in the family, genus or subfamily).
  • the algorithm which identifies conserved regions generally uses a multiple sequence alignment method.
  • the method may comprise (a) aligning all pairs of sequences separately to calculate a distance matrix giving the divergence of each pair of sequences, (b) calculating a guide tree from the distance matrix, and (c) aligning the sequences progressively according to the branching order in the guide tree.
  • a preferred algorithm for the aligning the conserved sequences is CLUSTALW as described in Thompson et al (1994) Nucleic Acids Research 22, 4673-80.
  • Other algorithms that can be used for aligning sequences are MultAlin (Corpet (1988) Nucleic Acids Research 16, 10881-90) or Jalview (Clamp et al (1998) http://barton.ebi.co.uk).
  • BLOCKS of conserved regions of amino acids may be extracted from the multiple alignments, typically using the program Blocks Multiple Alignment Processor. Alternatively the entire process of performing multiple alignments and extracting BLOCKS can be performed using BLOCKMAKER (Henikoff and Henikoff (1994) Genomics 19, 97-107).
  • the output from the alignment and BLOCK extraction set i.e. the information describing the identified conserved regions
  • Such output is typically in the form of partial sequences which correspond to the conserved regions (BLOCKS).
  • BLOCKS are input into a primer design algorithm. In one embodiment such an algorithm is CODEHOP.
  • the conserved regions which are chosen as targets for primers preferably comprise few codohs with degenerate counterparts, i.e. preferably the sequence has a low redundancy, such as a redundancy of less than 512 fold, 256 fold or 128 fold.
  • Each primer binds in accordance with Watson-Crick base pairing and thus the binding is sequence specific.
  • Each primer will thus be designed to be wholly or partially complementary to the sequence to which it binds.
  • Each of the primers typically has a length of at least 8 nucleotides, such as at least 10, 12, 15, 20, 30, 40 or more nucleotides (up to a maximum of 50 nucleotides for example).
  • the primer may. comprises at least 2, 4 or 6, up to a maximum of 10 for example, inosine bases. Inosine is able to bind to any of the four nucleotides and therefore use of inosine causes a reduction in effective redundancy.
  • Each primer pair will be designed so that the PCR product generally has a length of at least 20, such as at least 50, 100, 200, 500, 1000 or more nucleotides (and typically up to a maximum of 5xl0 3 nucleotides long).
  • Each primer is preferably designed so that it anneals to a single site, i.e. the primer will not bind to any other site in the genome of the relevant virus.
  • Each primer is preferably designed so that it does not exhibit secondary structure, i.e. the nucleotides in the primer will not bind substantially to any other nucleotide in the primer apart from those to which it is covalently linked.
  • each primer is designed so that it does not bind other primers with the same sequence.
  • the 3' region, and preferably the 3' terminal nucleotide of the primer binds to the target sequence with high affinity, thus preferably this region or nucleotide comprises a G or C.
  • each primer is designed to have an annealing temperature of from 30 to 65 ° C, such as 50 to 60 ° C or 35 to 45 ° C.
  • each primer pair may be designed to ensure that the two primers do not bind to each other.
  • such an algorithm designs primers according to the following rules:
  • a set of blocks is input, where a block is an aligned array of amino acid sequence segments without gaps that represents a highly conserved region of homologous proteins.
  • a weight is provided for each sequence segment, which can be increased to favour the contribution of selected sequences in designing the primer.
  • a codon usage table is chosen for the target genome.
  • PSSM amino acid position-specific scoring matrix
  • step 4 For each position of the block, the most common codon corresponding to the amino acid chosen in step 3 is selected utilizing the user-selected codon usage table. This selection is used for the default 5' consensus clamp in step 8. 5)
  • a DNA PSSM is calculated from the amino acid matrix (step 2) and the codon usage table.
  • the DNA matrix has three positions for each position of the amino acid matrix.
  • the score for each amino acid is divided among its codons in proportion to their relative weights from the codon usage table, and the scores for each of the four different nucleotides are combined in each DNA matrix position. Nucleotide positions are treated independently when the scores are combined. As an option, the highest scoring nucleotide residue from each position can replace the most common codons from step 4 that are used in the consensus clamp.
  • the degeneracy is determined at each position of the DNA matrix based on the number of bases found there.
  • a weight threshold can be specified such that bases that contribute less than a minimum weight are ignored in determining degeneracy.
  • Possible degenerate core regions are identified by scanning the DNA matrix in the 3' to 5' direction.
  • a core region must start on an invariant 3' nucleotide position, have length of 11 or 12 positions ending on a codon boundary, and have a maximum degeneracy of 128 (this is the default setting of CODEHOP).
  • the degeneracy of a region is the product of the number of possible bases in each position.
  • Candidate degenerate core regions are extended by addition of a 5' consensus clamp from step 4 or 5.
  • the length of the clamp is controlled by a melting point temperature calculation (the CODEHOP default is 60 °C) and is usually about 20 nucleotides. 9) Steps 7 and 8 are repeated on the reverse complement of the DNA matrix from step 5 for primers corresponding to the opposite DNA strand.
  • CODEHOP Rose et al (1998) Nucleic Acids Research 26, 1628- 1635) is used to design the primer pairs. This program uses the above rules.
  • the primers designed by the algorithm may then be mapped back to the original sequence to choose primer pairs which provide the desired length of PCR product.
  • primer pairs can then be synthesised and tested. They are typically tested to determine the optimal conditions for using the primers in a PCR reaction.
  • the PCR reaction is carried out in a PCR mixture that generally comprises the following: the template polynucleotide (which will be amplified in the event of virus detection), one or more primer pairs designed as described above, a polymerase enzyme (typically a DNA polymerase, such as Taq polymerase), deoxynucleotide triphosphates (dATP, dTTP, dCTP and dGTP) and a suitable buffer.
  • a polymerase enzyme typically a DNA polymerase, such as Taq polymerase
  • deoxynucleotide triphosphates dATP, dTTP, dCTP and dGTP
  • the PCR reaction generally comprises cycles of the following steps: a denaturation step, a primer annealing step and a polynucleotide synthesis step.
  • the PCR reaction comprises at least 25 cycles, such as 30, 35, 40 or more cycles, up to a maximum of 60 cycles for example.
  • the PCR mixture is heated to a temperature at which the polynucleotides in the PCR mixture (in particular the polynucleotide region to be amplified) denature to single stranded form.
  • the denaturing temperature is generally from 85 to 98 "C.
  • the primers bind to template nucleotide sequence in a sequence specific manner.
  • This step is generally carried out at a temperature of from 30 to 65 °C.
  • the polymerase replicates/synthesises nucleotide sequence based on template sequence by addition of nucleotides to the 3' end of the bound primers. This step is generally carried out at about 72 ⁇ 0
  • the primers are tested for their ability to amplify one or more of the plurality of nucleotide sequences from known viruses which were used to design the primers, or in the case of amino acid sequences from known viruses being used to design the primers the primers may be tested for their ability to amplify the nucleotide sequence from the virus which encodes the amino acid sequence.
  • the primers may be tested in a range of buffer conditions to determine optimal buffer conditions for PCR using the primers.
  • the buffer conditions which may be tested include pH (typically between 7 and 10), magnesium concentration (typically from 0.5 M to 5 mM), potassium chloride (typically from 0 to 100 mM), ammonium chloride (typically 0 to 100 mM), glycerol (typically 0 to 20%), dimethysulphoxide (typically 0 to 20%), ethanol (typically 0 to 20%), sorbitol (typically 0 to 20%) or betaine (typically 1M betaine).
  • pH typically between 7 and 10
  • magnesium concentration typically from 0.5 M to 5 mM
  • potassium chloride typically from 0 to 100 mM
  • ammonium chloride typically 0 to 100 mM
  • glycerol typically 0 to 20%
  • dimethysulphoxide typically 0 to 20%
  • ethanol typically 0 to 20%
  • sorbitol typically 0 to 20%
  • betaine typically 1M betaine
  • the primers may be tested at a range of different temperatures to determine the optimal temperatures in the PCR reaction.
  • the primers are tested in PCR reaction in which a range of primer annealing temperatures are tested.
  • the range of temperatures is from 30 to 65 ° C .
  • the panel of primer pairs or a group of primers within the panel may be designed to be used together on the same plate (i.e. using the same thermal cycles). Thus such primer pairs will be designed to work at the same annealing temperature.
  • a group of primer pairs within the panel are designed to have similar optimal conditions for use in PCR so that they can be used optimally in the same well or reaction vessel, i.e. that they can be used in multiplex PCR.
  • Such a group typically comprises at least 2, 3, 4, 5, 6 or more primer pairs (up to a maximum of 8 primer pairs for example).
  • the computer based method steps may be used to design primer pairs which are calculated to have similar annealing temperatures and/or the primers are tested to select primer pairs which can be used optimally together. Such testing typically determines whether the primers work optimally with the same buffers and/or whether the primers have similar annealing temperatures.
  • At least one or both primers of each primer pair in the group carries a label. Typically at least one of the primers in each primer pair will carry a different label from that used for the other primer pairs.
  • the PCR product generated by labelled primers carries the labels present on the primers. Thus after the group of primers have been used for PCR in the same well detection of the labels in the PCR products can be used to deduce which PCR product was formed from each primer pair.
  • all forward primers of the group are labelled with one colour and the reverse primers are labelled with a different colour.
  • the primers are labelled with a fluorescent label, such as fluorescein based labels (e.g. fluorescein isothiocyanate).
  • a fluorescent label such as fluorescein based labels (e.g. fluorescein isothiocyanate).
  • fluorescein based labels e.g. fluorescein isothiocyanate
  • Different primer pairs may be labelled with fluorescent labels of different colours.
  • the fluorescent labels which are used may be capable of detection by a Beckman CEQ2000TM or Applied Biosystems A3700TM fluorescent DNA analyser.
  • the fluorescent labels may obtained from Beckman Coulter or Applied Biosystems.
  • each PCR product which is generated by the group of primers differs in size from all the other PCR products by at least 20, such as at least 50, 100, 200, 500, 1000 or more nucleotides.
  • Each PCR product may for example differ in size from all other PCR products by up to a maximum of 3000 nucleotides.
  • Example 1 illustrates the invention.
  • a panel of primers was designed for detecting unknown viruses from the family Herpesviridae according to the strategy shown in Figure 1.
  • the amino acid sequences of herpes virus DNA packaging protein UL15 were obtained from the VTDA database (Alba et al, see above). These sequences are shown in Table 1. The sequences obtained from the VTDA database were then imported into
  • CLUSTALW This compares the protein sequences to identify conserved regions and then aligns the sequences according to the conserved regions.
  • the alignment produced by CLUSTALW is shown in Table 2.
  • the BLOCKMAKER program was then used to extract blocks of conserved aligned sequences which do not contain gaps from CLUSTALW and enter them into CODEHOP.
  • the primer sequences were then designed by CODEHOP using the conserved sequences.
  • the output from the CODEHOP program is shown in Table 3.
  • the 'Complement of Block' sequences shown in Table 3 shows the sequence of the other strand allowing primers to be designed for amplification in the opposite direction.
  • RVKAHTNATSCSCYIIJNKPVFITI DGAMRNTAELF PDSFMQEI IGGGNISGAHRDEPVFTKTAQDRF Ii

Abstract

A method of designing PCR primers for screening for new members of known virus families.

Description

DESIGNING DEGENERATE PCR PRIMERS
Field of the invention
The invention relates to a method of designing a panel of primers for detecting viruses in a high throughput polymerase chain reaction assay.
Background of the invention All organisms appear to be capable of infection by viruses, including bacteria, animals and plants. Viruses require the use of the cellular- translation and transcription machinery to replicate. In the process of replication they often have deleterious effects on the host cell and thus on the host organism. Viruses constitute an important class of pathogens causing many diseases, leading to loss of life in humans and economic loss in the agricultural industries.
Summary of the invention
The polymerase chain reaction (PCR) allows the amplification of a specific region of a polynucleotide. The specificity of the reaction is due to the primers which during the course of PCR bind to the region to be amplified in a sequence specific manner. The invention provides a method of designing primers which can be used in high throughput screening to detect viruses. The method may be used to detect unknown viruses which have not yet been sequenced.
In particular the invention provides a method of designing a panel of degenerate primer pairs for screening for new members of multiple known virus families in a biological sample, wherein each primer pair in the panel binds a sequence that is conserved across members of a said virus family and selectively directs amplification of sequence of said family by PCR, which method comprises
(a) providing a plurality of amino acid sequences from members of a first virus family,
(b) comparing the sequences to identify conserved regions,
(c) designing a first primer pair using a computer based method, wherein each primer in the pair binds a nucleotide sequence that encodes a conserved region identified in (b) and wherein the primer pair is designed to amplify by PCR the nucleotide sequence between the nucleotide sequences that encode conserved regions in members of the first virus family, and
(d) repeating steps (a) to (c) for each virus family. The invention also provides a method of designing a panel of degenerate primer pairs for screening for new members of multiple known virus families in a biological sample, wherein each primer pair in the panel binds a sequence that is conserved across members of a said virus family and selectively directs amplification of sequence of said family by PCR, which method comprises (a) providing a plurality of nucleotide sequences from members of a first virus family,
(b) comparing the sequences to identify conserved regions,
(c) designing a first primer pair using a computer based method, wherein each primer in the pair binds a conserved region identified in (b) and wherein the primer pair is designed to amplify by PCR the nucleotide sequence between the conserved regions in members of the first virus family, and
(d) repeating steps (a) to (c) for each virus family.
The invention additionally provides a panel of primers which has been designed by the method of the invention.
Detailed description of the invention
The invention provides a method of designing a panel of primer pairs which can be used in high throughput virus screening. The method comprises initial steps which deduce the sequences of the primers using computer based calculations, and optional later steps in which the primers are synthesised and tested empirically, for example to determine optimal PCR conditions and/or to select primer pairs with desired further properties.
The panel of primers provided by the method are designed to be capable of detecting unknown viruses based on nucleotide and/or amino acid sequences in the unknown virus which are similar (homologous) to nucleotide and/or amino acid sequences in a known virus. These conserved sequences typically have a role in providing a necessary or advantageous activity or property to the virus. Conserved nucleotide sequences may be coding or non-coding sequences.
In one embodiment the conserved sequences code for or are from virus proteins which have the following activities: DNA or R A polymerase (replicase), topoisomerase (helicase/gyrase), endonuclease (integrase), nucleic acid binding protein, protease, transcription factors, envelope glycoproteins, structural protein (e.g. capsid or nucleocapsid protein).
The panel of primers is designed to detect viruses which are single stranded or double stranded DNA or single stranded or double stranded RNA viruses. The viruses are generally capable of infecting prokaryotic or eukaryotic cells, such as bacterial, animal, plant, yeast or fungal cells. Preferably the viruses .are mammalian (preferably primate) or avian viruses, such as human, pig, horse, sheep, goat, cow, chicken, turkey or duck viruses.
The viruses are typically from any combination of the following families: Adenoviridae, Arenaviridae, Arteriviridae, Astroviridae, Bimaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Deltavirus, Filoviridae, Flaviviridae, Hepadnaviridae, Herpesviridae, Orthomyxoviridae, Papovaviridae, Paramyxoviridae, Parvoviridae, Picornaviridae, Polydnaviridae, Poxviridae, Reoviridae, Retroviridae, Rhabdoviridae, Togaviridae or Bornavirus. The primers of the panel are capable of detecting unknown viruses in a biological sample. Such a sample either originates from a single individual or is a pooled sample from individuals of the same species. Thus the panel of primers detects viruses which infect the same species (from which the sample originates).
Generally in the method at least 15, 30, 50, 100, 200 or more, typically up to a maximum of 300 different primer pairs are designed. The primer pairs designed in the method bind sequence which is conserved across members of a virus family. The panel which is designed in the method may comprise primer pairs that bind sequence which is conserved across substantially all members of the family or across a subset of the members of the family, for example across all members of a subfamily or of a genus. Generally, the primer pairs bind at least 70%, at least 80%, or at least 90% of the known viruses of the family, subfamily or genus. Preferably less than 10, such as less than 5, primer pairs will be used for the detection of any given family, subfamily or genus in the panel.
The panel of primer pairs is generally capable of detecting viruses from at least 10, 15, 20, 30 or more families, typically up to a maximum of 35 families. The panel of primer pairs may comprise sets of primer pairs which perform a nested PCR reaction. Generally such a set of primer pairs comprises a first and second primer pair. The first primer pair is able to amplify a template nucleotide sequence from a virus to form a PCR product. The second primer pair is able to amplify a nucleotide sequence using the PCR product generated by the first primer pair as a template. The use of nested sets of primer pairs allows increased sensitivity. In a preferred embodiment each primer pair is specific for a particular virus family, so that it does not detect viruses of other families .
In the method of the invention the plurality of amino acid sequences or nucleotide sequences are provided from different known viruses of the same family. The amino acid sequences or nucleotide sequences will be for the same protein of the different viruses. Typically at least 5, 10, 20, 50, 100 or more sequences are provided. The maximum number of sequences provided will, for example, be 300 sequences.
Each of the sequences which is provided is typically at least 20, 50, 100, 200 or more amino acids or nucleotides in length. In general the maximum length of the nucleotide sequences is 1000 nucleotides and the maximum length of the amino acid sequences is 300 amino acids. The sequences may be obtained from a database of sequences, such as GenBank. The sequences may be obtained from a database comprising virus sequences which are organised into homologous protein families (based on sequence similarity relationships).
In a preferred embodiment the sequences are obtained from the VTDA database (described in Alba et al (2001) Nucleic Acids Research 29, 133-136) or the Virus Division of GenBank. The sequences may be provided in the form of a database, preferably in computer-readable form. The sequences are preferably provided in the form of a computer-readable database constructed using programs which identify homologous protein families, such as GeneTableMaker, MKDOM or PSCBuilder.
The sequences which have been provided are compared to identify conserved regions. Typically such conserved regions will have a length of at least 12 nucleotides, such as at least 15, 21, 27, 36, 99 or more nucleotides (generally up to a maximum length of 200 nucleotides) or at least 4, 5, 7, 10, 25 or more amino acids (generally up to a maximum length of 50 amino acids).
Across the conserved region the virus sequences which are being provided will of course share identity or similarity. Typically the amino acids or nucleotides in at least 50% of the positions in the region will be the same in at least 50 %, 60%, 70%, or 80% of the viruses of the group (i.e. in the family, genus or subfamily).
The algorithm which identifies conserved regions generally uses a multiple sequence alignment method. The method may comprise (a) aligning all pairs of sequences separately to calculate a distance matrix giving the divergence of each pair of sequences, (b) calculating a guide tree from the distance matrix, and (c) aligning the sequences progressively according to the branching order in the guide tree. A preferred algorithm for the aligning the conserved sequences is CLUSTALW as described in Thompson et al (1994) Nucleic Acids Research 22, 4673-80. Other algorithms that can be used for aligning sequences are MultAlin (Corpet (1988) Nucleic Acids Research 16, 10881-90) or Jalview (Clamp et al (1998) http://barton.ebi.co.uk). BLOCKS of conserved regions of amino acids may be extracted from the multiple alignments, typically using the program Blocks Multiple Alignment Processor. Alternatively the entire process of performing multiple alignments and extracting BLOCKS can be performed using BLOCKMAKER (Henikoff and Henikoff (1994) Genomics 19, 97-107). The output from the alignment and BLOCK extraction set (i.e. the information describing the identified conserved regions) is then entered into the algorithm which designs the primers. Such output is typically in the form of partial sequences which correspond to the conserved regions (BLOCKS). These BLOCKS are input into a primer design algorithm. In one embodiment such an algorithm is CODEHOP.
In the primer design step the conserved regions which are chosen as targets for primers preferably comprise few codohs with degenerate counterparts, i.e. preferably the sequence has a low redundancy, such as a redundancy of less than 512 fold, 256 fold or 128 fold. Each primer binds in accordance with Watson-Crick base pairing and thus the binding is sequence specific. Each primer will thus be designed to be wholly or partially complementary to the sequence to which it binds.
Each of the primers typically has a length of at least 8 nucleotides, such as at least 10, 12, 15, 20, 30, 40 or more nucleotides (up to a maximum of 50 nucleotides for example). In one embodiment the primer may. comprises at least 2, 4 or 6, up to a maximum of 10 for example, inosine bases. Inosine is able to bind to any of the four nucleotides and therefore use of inosine causes a reduction in effective redundancy.
Each primer pair will be designed so that the PCR product generally has a length of at least 20, such as at least 50, 100, 200, 500, 1000 or more nucleotides (and typically up to a maximum of 5xl03 nucleotides long).
Each primer is preferably designed so that it anneals to a single site, i.e. the primer will not bind to any other site in the genome of the relevant virus.
Each primer is preferably designed so that it does not exhibit secondary structure, i.e. the nucleotides in the primer will not bind substantially to any other nucleotide in the primer apart from those to which it is covalently linked. In addition preferably each primer is designed so that it does not bind other primers with the same sequence.
In one embodiment the 3' region, and preferably the 3' terminal nucleotide of the primer binds to the target sequence with high affinity, thus preferably this region or nucleotide comprises a G or C.
Generally each primer is designed to have an annealing temperature of from 30 to 65 ° C, such as 50 to 60 ° C or 35 to 45 ° C. In addition each primer pair may be designed to ensure that the two primers do not bind to each other.
The primers .are designed by a computer based algorithm. In one embodiment such an algorithm designs primers according to the following rules:
1) A set of blocks is input, where a block is an aligned array of amino acid sequence segments without gaps that represents a highly conserved region of homologous proteins. A weight is provided for each sequence segment, which can be increased to favour the contribution of selected sequences in designing the primer. A codon usage table is chosen for the target genome.
2) An amino acid position-specific scoring matrix (PSSM) is computed for each block using the odds ratio method. 3) A consensus amino acid residue is selected for each position of the block as the highest scoring amino acid in the matrix.
4) For each position of the block, the most common codon corresponding to the amino acid chosen in step 3 is selected utilizing the user-selected codon usage table. This selection is used for the default 5' consensus clamp in step 8. 5) A DNA PSSM is calculated from the amino acid matrix (step 2) and the codon usage table. The DNA matrix has three positions for each position of the amino acid matrix. The score for each amino acid is divided among its codons in proportion to their relative weights from the codon usage table, and the scores for each of the four different nucleotides are combined in each DNA matrix position. Nucleotide positions are treated independently when the scores are combined. As an option, the highest scoring nucleotide residue from each position can replace the most common codons from step 4 that are used in the consensus clamp.
6) The degeneracy is determined at each position of the DNA matrix based on the number of bases found there. As an option, a weight threshold can be specified such that bases that contribute less than a minimum weight are ignored in determining degeneracy.
7) Possible degenerate core regions are identified by scanning the DNA matrix in the 3' to 5' direction. A core region must start on an invariant 3' nucleotide position, have length of 11 or 12 positions ending on a codon boundary, and have a maximum degeneracy of 128 (this is the default setting of CODEHOP). The degeneracy of a region is the product of the number of possible bases in each position.
8) Candidate degenerate core regions are extended by addition of a 5' consensus clamp from step 4 or 5. The length of the clamp is controlled by a melting point temperature calculation (the CODEHOP default is 60 °C) and is usually about 20 nucleotides. 9) Steps 7 and 8 are repeated on the reverse complement of the DNA matrix from step 5 for primers corresponding to the opposite DNA strand.
In one embodiment CODEHOP (Rose et al (1998) Nucleic Acids Research 26, 1628- 1635) is used to design the primer pairs. This program uses the above rules.
The primers designed by the algorithm may then be mapped back to the original sequence to choose primer pairs which provide the desired length of PCR product.
The above-described computer based method is repeated until the desired number of primer pairs have been designed. Optionally the primer pairs can then be synthesised and tested. They are typically tested to determine the optimal conditions for using the primers in a PCR reaction.
The PCR reaction is carried out in a PCR mixture that generally comprises the following: the template polynucleotide (which will be amplified in the event of virus detection), one or more primer pairs designed as described above, a polymerase enzyme (typically a DNA polymerase, such as Taq polymerase), deoxynucleotide triphosphates (dATP, dTTP, dCTP and dGTP) and a suitable buffer.
The PCR reaction generally comprises cycles of the following steps: a denaturation step, a primer annealing step and a polynucleotide synthesis step. Typically the PCR reaction comprises at least 25 cycles, such as 30, 35, 40 or more cycles, up to a maximum of 60 cycles for example. Generally in the denaturation step the PCR mixture is heated to a temperature at which the polynucleotides in the PCR mixture (in particular the polynucleotide region to be amplified) denature to single stranded form. The denaturing temperature is generally from 85 to 98 "C. In the primer annealing step the primers bind to template nucleotide sequence in a sequence specific manner. This step is generally carried out at a temperature of from 30 to 65 °C. In the polynucleotide synthesis step the polymerase replicates/synthesises nucleotide sequence based on template sequence by addition of nucleotides to the 3' end of the bound primers. This step is generally carried out at about 72^0
In one embodiment the primers are tested for their ability to amplify one or more of the plurality of nucleotide sequences from known viruses which were used to design the primers, or in the case of amino acid sequences from known viruses being used to design the primers the primers may be tested for their ability to amplify the nucleotide sequence from the virus which encodes the amino acid sequence. The primers may be tested in a range of buffer conditions to determine optimal buffer conditions for PCR using the primers. The buffer conditions which may be tested include pH (typically between 7 and 10), magnesium concentration (typically from 0.5 M to 5 mM), potassium chloride (typically from 0 to 100 mM), ammonium chloride (typically 0 to 100 mM), glycerol (typically 0 to 20%), dimethysulphoxide (typically 0 to 20%), ethanol (typically 0 to 20%), sorbitol (typically 0 to 20%) or betaine (typically 1M betaine).
The primers may be tested at a range of different temperatures to determine the optimal temperatures in the PCR reaction. Preferably the primers are tested in PCR reaction in which a range of primer annealing temperatures are tested. Typically the range of temperatures is from 30 to 65 ° C .
The panel of primer pairs or a group of primers within the panel may be designed to be used together on the same plate (i.e. using the same thermal cycles). Thus such primer pairs will be designed to work at the same annealing temperature.
In one embodiment a group of primer pairs within the panel are designed to have similar optimal conditions for use in PCR so that they can be used optimally in the same well or reaction vessel, i.e. that they can be used in multiplex PCR. Such a group typically comprises at least 2, 3, 4, 5, 6 or more primer pairs (up to a maximum of 8 primer pairs for example).
To provide such primer pairs the computer based method steps may be used to design primer pairs which are calculated to have similar annealing temperatures and/or the primers are tested to select primer pairs which can be used optimally together. Such testing typically determines whether the primers work optimally with the same buffers and/or whether the primers have similar annealing temperatures.
In one embodiment at least one or both primers of each primer pair in the group carries a label. Typically at least one of the primers in each primer pair will carry a different label from that used for the other primer pairs. The PCR product generated by labelled primers carries the labels present on the primers. Thus after the group of primers have been used for PCR in the same well detection of the labels in the PCR products can be used to deduce which PCR product was formed from each primer pair. In one embodiment all forward primers of the group are labelled with one colour and the reverse primers are labelled with a different colour.
In a preferred embodiment the primers are labelled with a fluorescent label, such as fluorescein based labels (e.g. fluorescein isothiocyanate). Different primer pairs may be labelled with fluorescent labels of different colours. The fluorescent labels which are used may be capable of detection by a Beckman CEQ2000™ or Applied Biosystems A3700™ fluorescent DNA analyser. The fluorescent labels may obtained from Beckman Coulter or Applied Biosystems.
Another way of being able to determine which PCR products are generated by which primer pair is for each primer pair in the group to generate a PCR product of different size to the PCR products generated by the other primer pairs of the group. Typically each PCR product which is generated by the group of primers differs in size from all the other PCR products by at least 20, such as at least 50, 100, 200, 500, 1000 or more nucleotides. Each PCR product may for example differ in size from all other PCR products by up to a maximum of 3000 nucleotides.
The following Example illustrates the invention: Example
The Example below refers to Figure 1 which shows how primers were designed using a database known as 'VTDA', and computer programs know as 'CLUSTALW', 'BLOCKMAKER' (or 'BLOCKS') and 'CODEHOP'.
Designing a panel of primers
A panel of primers was designed for detecting unknown viruses from the family Herpesviridae according to the strategy shown in Figure 1. The amino acid sequences of herpes virus DNA packaging protein UL15 were obtained from the VTDA database (Alba et al, see above). These sequences are shown in Table 1. The sequences obtained from the VTDA database were then imported into
CLUSTALW. This compares the protein sequences to identify conserved regions and then aligns the sequences according to the conserved regions. The alignment produced by CLUSTALW is shown in Table 2.
The BLOCKMAKER program was then used to extract blocks of conserved aligned sequences which do not contain gaps from CLUSTALW and enter them into CODEHOP. The primer sequences were then designed by CODEHOP using the conserved sequences. The output from the CODEHOP program is shown in Table 3. The 'Complement of Block' sequences shown in Table 3 shows the sequence of the other strand allowing primers to be designed for amplification in the opposite direction.
Table 1. All protein sequences of DNA packaging protein UL 15 extracted from VTDA. Here written as a list and unaligned.
>gi_10180719 MFGG^GEETKRHFERIiMKTKHDRLGASHRϊrERS IRDGDMVDAPFIiNFAI PVPRRHQTVMPAIGILHNCC
ΩSLGIYS&ITTBMLYSSIACSEFOEIBmiSVPRaPRITNAQAFIjSP mRVΑNSIIFQEYOEmiΑΑSR
NAYYSTrøtfSFISimTSDAF QLTOTISRFSKIΛIASFI
KMILMHATYFVTSVIiGDHAERAER ^VAF^
MSSFEGIRIGYTSHIRKAIEPVFEDIGDRl-RRWFGAimVDHV GETITFSFPSGT.KSTV FASSHtfrNSI RGQDF∞ΛFVDEA^IRPDAVQTIIGFLNQATCKIIFVSSTN^
EHMKHVTDYTNATSCSCrraiNKPVFITtmGAIIRRTAEM
IDRF IYRPS VNNQDPFSQDLYVYVDPAFTA1OT ASGTGVAVIGKYGTDYIVFG EHYFLRA TGESSD
SIGY /AQCa-IQICaiHR RFGVI IAIEGNSNQDSAVAIATRIAIEMISYM AAVAPTPHNVSFYHS S
NG DVEYPYFIJ-QRQKTTAFDFFIAQFNSGRVIJ^QD VS VS TTDPVEYLT QLTNISEVV GPTCT RTFSGKKGGNDDTVVA TMAVYISAHIPDMAFAPIRV
>gi_7673189
MFGG LGEETlOlHFEPJ^KTi DRLGASHR-reRSIRDGDi VDAPF]^
DSLGI YSAITTRMIiYSS lACSEFDELRRDS VPRCTPRITNAQAFLS PMMM ANS I IFQEYDEMECAAHR
NAYYSTMNSFISMRTSDAFKQLTVFISRFSKLLIASFRDV-ΩLD^ K IFDACmFCOTCFTTOSRRASER LRVAFDTPHFSDIV RHFRQRATVF^
MSSFEGIRIGYISΞIRKAIEPVFΕDIGDRl-RR FGAIffiVDWreGETITFSFPSGLKS V FASSH-TrNSI
RGQDFlttLFVDEAliTIRPDAVQTIIGFLNQATCKIIFVSS NSGKASTSF YG GSADDLIiNVVTYICD
EHM HVTDYTNATSCSCYVIiHKPVFITMDGA RRTAEMF PDSFMQEIIGGGVVDRTIC GDRSIFTASA
IDRFljIYRPSTVtraQDPFSQD YVYVDPAFTAN KASGTGVAVIGKYGTDYIVFGLEHYF RALTGESSG SIGYCVAQI-lilQICAIHRKRFGVIKIAIEG SNQDSAVAIATRIAIEMISYMKAAVAPTPHNVSFYHS S
NG DVEYPYFL QRQKTTAFDFFIAQFNSGRVIASQDLVS VSLT DPVEYLTKQL'rNISEVV GPTCT
RTFSGK GG DDTWALT AVYISAHIPDiMAFAPIRV
>gi_Ξ68928Ξ
MFGGALGESAKKHFERL RDRlffiRLGASRΪ^CLARGGSIiTOAPFT.NF^ DGTGIYSAIA RL YAGIVSSEFGEVRRESLSNGHISKRNREAL APTLTRVA S ITFHEYDDAQCAAHR NAYYSTOJNTFGSMRTSDAFQQ1-ASFIDRFSK IAAASF DVNILDR-OTAPKR.RITAPSYDKPHGTLE FQ KMIL4 HATYF TSVI4T-EDHAERAER1. RVIFDIPDFSDAATRHFRQRATVF VPRRHG T F VP IALA MSSFEGIRIGYTSHIRKAIEPVFΈEIGDRLRR FGTQC^ΠDHVKGETITFSFPSGSRSTVTFASSHNT SI RGQDFHLIFVDEA-ITIRPDAVQTIIGFL^QANCKIIFVSSTNSGKASTSFLYGLKGSADDLI.OTVTYICD EHMKHVT-TCTNATSCSCTVtNKPVFiramGAM^
VERLX-LYRPSTVRKQDILSRDLYVYVDPAF A.TRASGTGIAVIGRYGADYIIFGLEHFF RALTGESAD AIGECΪiAQCIAQIt.AIHCΕRFGTIRVAVEG SKQDSAVAIATRISID ASYVQSGVAPAPHDVCFYHS P AGSNVEYPFFL QRQKTAAFDFFIARF SGRVLASQDLVSTTISLSTDPVEYLTKQLTNLSEVVTGATGT RTFSGKKGGYDDTVVALVMAVYISAHASDATFAPIRGVEATCKGPTEA >gi_1869837
MFGQQJ^SDVQQYLERI^KQRQQKVGVDEASAG T GGDAIΛVPFLDFATATPKRHQTVVPGVGTLHDCC EHSP FSAVARRLLF S VPAQ RGRDFGGDHTAK EFLAPE VRAVARLRFRECAPEDAVPQRNAYYSV -4TCQALHRSEAFRQIJVHFVRDFAQI-LKTSFRASSI^ HATYFl-AAVIΛGDHAEQVOTFI^RLVFEIPLFSDTAVRHFRQRAT^^ GIKIGYTAHIRKATEPVFDEIDACI.RG FGSSRVDHVKGETISFSFPDGSRSTIVFASSHN NGIRGQDF miLFVDEANFIRPDAVQTIMGFI-NQANCKIIFVSSTOTGKAS^
V TH ATACSCTIIl- P FITroG R AD FIJPDSFMQEIIιGGQ E GDD PV TKSAGERFLLYR
PSTTTNSGLi PELYVYVDPAFTAm^RASGTGIAVVGRYRDDFIIFAIjEHFFIiRA TGSAPADIARiVVH
SLAQVIJUiHPGAFRSVRVAVEGNSSQDSAVAIATΞVHTEιHRIl4ASAGANGPGPEL FYHCEPPGGAVLY PFFLL-πCQKTPAFEYFIϊOXFNSGGVMASQELVSVTVRLQTDPVEYL^
NGAADDLMVAVIMAIYLAAPTGI PPAFFPITRTS
>gi_59501
MFGQQ ASDVQQYLERI^ QRQ VGADEASAG TMGGDALRVPFLDFATATPKRHQTVVPGVGTLHDCC
EHSP FSAVARRLLFNS VPAQ KGRDFGGDHTAKLEFLAPELVRAVARLRFKECAPADVVPQR AYYSV r-IWFQA HRSEAFRQLVHFVRDFAQLLKTSFRASSLTETTGPPKKRAKVDVATHGRTYGT ELFQKMILM
HATYFLAAVIiGDHAEQWOTFLR VFEIPLFSDAAVRHFRQRATVFLVPRRHG T F VPLIALSliASFR
GIKIGYTAHIRKATEPVFEEIDACLRGWFGSARVDHVKGETISFSFPDGSRSTIVFASSHNTNGIRGQDF
NTj FVDEAWi IRPDAVQTIMGFI- QANCKIIFVSS
VVTHTNATACSCYILNKPVFITiDGAVRRTADLF ADSFMQEIIGGQARETGDDRPVIiTKAGERFLLYR PSTTTNSGLMAPD YVYVDPAFTA.4 RASGTGVAVVGRYRDDYIIFA-EHFFLRA TGSAPADIARCVVH
SLTQV]_AI■HPGAFRGWV EGNSSQDSAV IAT-- HTE^l3^
YPFF L-JTCQ TPAFEIIFIKKFNSGGVMASQEIVSATV^
RNGASDDLMVAVIMAIYIAAQAGPPHTFAPITRVS
>gi_2605992 MFGKALSRETIQYFET RKEVQSRSGAiaraU-AEAQTGGEDDVKrAFT.l^AIPTPQRHQTVVPGVGT HDC
CETAQIFASVARRL FRS S RGGESKER DPSSVEAYVDPiσπQALKTISFVEYNDAEARSCRNAYYS
IMNTFDSLRSSDAFHQVANFVARFSRLVDTSFNGADLDGDGQQTSKRIKVDVPTYGKQRGT ELFQKMI
^πIATYFIAAVI GDHAD IGA KM FOTPEFSD TIRH RQR FIl PRRHG}^^WFL PLIAI- I^
KGIKIGYTAHIRKATEPVFDEIGARl-RQWFGNSPVDHV GENISFSFPDGSKSTIVFASSH TNGIRGQD FNIiFVDFJUreiRPEAVQTIIGFl-NQTNCKIIF^
RVKAHTNATSCSCYIIJNKPVFITI DGAMRNTAELF PDSFMQEI IGGGNISGAHRDEPVFTKTAQDRF Ii
YRPSTVANQDIMSmmYVYVDPAFTTNAMASGTGVAVVGRYRSiπϊIVFGLEHFFLSALTGSSAE IARCV
AQO-AKVFAIHSRPFOSVRIAVEGNSSQDAAVAIATNIQ EUW RQADVVHMPGTV FYHCTPPGSSVA
YPFFLLQKQKTGAFDHFIKAFNSGLVLASQELISOTVRLQ^ RNGASDDMLVALVMAVYMASLPPTTNAFSSLSTQ
?gi_330792
MFGRV GRETVQYFEAIΛREVQARRGAKNRAAEAQNGGEDDAKTAFlrø
CETAQIFASVARRL FRS SK QSGEARER DPASVEAYVDPKVRQALKTISFVEYSDDEARSCRNAYYS
I ffiWFD IJ SSDAFHQ SFV FS L DTSFNGADωGDGQQ S RA ^ PTYG QRGTLELFQ MI MHATYFIAAVILGDHADRIGAFLKMVFHTPEFSDATIRHFRQRAT F'LVPRRHGKTWFLVP IALALATF GIKIGYTAHIRKATEPVFDEIGARLRQWFGNSP\TOHv7KGENISFSFPDGSKSTIVFASSHNTNGIRGQD
FN LFVDEANFIRPEAVQΪT IGFLNQTNCKI IFVSSTlrøSKASTSF YTOKGAADD NVVTYICDEHrø
RVKAHTNATACSCTIl-OTPVFITmDGAMR.TrAELFIjP^
YRPS QDIMSSDLY YVDPAF NA^ASG G AVVGRYRSN VVFG!EHFFLSALTGSSAELIA CV AQCLAQVFAIH RPFDSVRVAVEGNSSQDAAVAIATNIQLEI-^W RRADVVPMPGAV FYHCTPHGSSVA
YPFFIi QKQ TGAFDHFIKAF SGSV ASQE VSOTVRLQTDPrø^
RNGASDDMLVALVMAVYLSS PPTSDAFSSIiPAQ
>gi_97131
IWGGAVGEQSARYFQRL RERQRRAAERGARPDGGGGARGEDDARVPFtJDFAVAAPKRHQTVVPGVGTLH GYCEIAPLFAATASRLUliTSMARAEAGTJsrrGTGEAΞVSREIΛGV SALRFAAHPPAEAAAHCNAYHS
AIiESJIRASGAFAQVAAFVARFSRLVGTSFSHLGGGDDADPPRAKRARVEPPSGQTRGA ELFQKMILMPA TYFVAATliGEHAERIGAF RVAFlWPDFSDAAVAHFRQRATVFLVPRRHGKTv^LVPLIA ALATFKGI KIGYTAHIRKATEPVFEEIVAPO-RQΪWiGGERVDlOT GEVISFSFPDGARSTIVFASSHNT GIRCSQDFNL Fl/DEAHFIRPEAVQTIVGFIMQASCKIIFVSSTNTGKASTSFLY.^
AHGGATACSCYVI_JKPVFITJroAAARITrAETFLPNSFMQEIIG43GEVARRAEPAAVFTRAAGEQFL YRP STAAARGPWPERLY YIDPAFTSNARASGSGIAVVGRHRGSWLVLGLEHFFLPALTGSSAAEIARCAVRC FAQmAVHRRRLDG FVAVEG SSQDSAVAIALGVRRELDSI_AASGAVPMPAETRFYHC3iPPGSAVAYPF FLLQKQ TAAFDI-FIP-DFNSGRVVASQDIASLTVRLQTDPVEYLFEQLQHLTESTAGPGGARAFSG RRG AADDLMVA VMAVFVGSLPPTDGAFCPLAPRPPAD >gi_58S9808 MS IMFGRTLGEESVRYFERLKRRRDERFGTLESPTPCSTRQGSLGNATQIPFLNFAIDVTRRHQAVIPG IGTLHHCCEYIPLFSATARRAMFGAF SSTGYNCTPMΠ/ P RYSVTSRALWSPELKKAVSSVQFYEYSPE EAAPHRNAYSGVIΩTOFRAFS SDSFCQLSTFTQRFSY ΕTSFESIEECGSΞGRAIA VDVPIYGRYKGT-. ELFQKMIIJMHΓΓHFISSVI,LGDHADRVDCFLRTVFNTPSVSDSVLEHF QKSTVFLVPRRHGKTWFLVPE IA VMATFRGIKVGYTAHIRKATEPVFEGIKSR EQWFGANYVDHVKGESITFSFTDGSYSTAVFASSHN TNGIRGQDF-JT- FVDEAHFIRPDAVQTIVGF NQTNCKI IFVSSTNTGKASTSFLY-TTRGSSDQLIJNVVT YVCDDHMPRVLAHSDVTACSCTYVIMPWITIYMGAMRR
T TARERFILYRPSTVANCAILSSVLYVYVDPAFTSTTITLASGTGVAIVGRYKSD IIFGLEHFF RALTG TSSSEIGRCVTQC GHILALHPLWF NVHVSIEGNSSQDSAVAISIAIAQQFAVLIEKGATVLSSAPV LFY HSIPPGCSVAYPFFLILQKQKTPAVDYFV RFNSGNIIASQELVSLTVKLGVDPVEY C QIJDNLTEVI G GMGRØ-DTKTYTG GTTGTMSDD MVALIMSVYIGSSCIPDSVFMPI
>gi_S708110
M GKESVEIV RYRDALRKRTMERGPDDVDGQEMSDSl^ITTASICDRrroSARDTMNSPASRFQFAIDVP
QRHQACIAPIGSFHNCC^ISPJiFSYMASEIIYENtiASYST YTDTDAALNDLQVSPKRQLFTGAAEDSIL
PAIiRQKLAN NFARFAPSDSLIHDKAFDGIMNGYRGFVKSDEFSQT-NfTFIYRFHTIiK^ RAK EKTTSEQRDGTIiELFQKMIIJlHA YFASSICTiGEGSTERSimYLSTVF-rrS FSENIIQHFRQRTT
VF VPRRHGKT FLVPLISLLVSSFEGIRIGYTAHl^KATEPVFIEIFTRLYKWFGAKQVEQVKGETITF
TFRNGNKSAIVFASSQNTNGIiRGQDFimjFVDEAOTIKPAALH^
LLY LKG TNSIi VVTYICDEH PEIQ RTDVTTCSC^rVV HKPVFVSlTO
GGRAGKYDSDRT VPVRA DQFLIYRPSTSSKPNISGLGKI TVYVDPAFTTNRSASGTGIALVTALRDS MvTiMGAEHFYLDALTGEAALEIAQCVY CIAYCCO-IHAGAFREIRIAVEGNSSQDSAAAIAGjaTE LDS RRRLGFS TFAHSRQPGTAMAHPFY I.NKQKSRAFDLFVSLFNSGRFMASQELVSNT VLS DPCEYLV
DQIRNITVTHGQGPDSFRTFSGKQGRVPDDMLVAAVMSTY ALEGSPTAGYHPIAPIGRRQRPA
>gi_1813970
MLRGDSAAKIQERYAELQKRKSHPTSCISTAFTIWATLC^KRYQMMHPELGLAHSαTE^ RDYNS PEESQRELLFHERLKSALDK TFRPCSEEQRASYQKLDALTE YRDPQFQQIOTiTMTDFKK LDG
GFSTAVEGDAKAIP-EPFQKlttLIHVIFFIAVT IPVLANRV QYLIHAFQIDFLSQTSIDIFKQKATVF
LVPRRHGKTvWIIPIISF -OMIGISIGYVAHQKHVSQFVLKEVEFRCRHTFARDYVΛ/ENKDNVISIDH
RGAKSTA FASCY-røTSIRGQOTHLI-.jVDEAHFIKKEAFOT^
RlαrøAPFDiIiN SYVCTEHLHSFTEkGDATACPCYRLH^ NKISQNTVLITDQSREEFDILRYSTl-N NAYDYFGKT YVYLDPAFTTNRKASGTGVAAVGAY^
G EHFF RDLSESSEVAIAECAAHMIISVLSLHPY DELRIAVEGNTNQAAAVRIAC IRQSVQSSTLIR
VLFYHTPDQiraiEQPFYIJ4GRDKALAVEQFISRFNSGYI-aSQELVSYTI SHDPIEYLLEQIQ-irJHRV
TLAEGTTARYSAKRQNRISDD I IAVIMATY CDDIHAIRFRVS
>gi_274S29S MLRSCDIDAIQKAYQS I IWKHEQDVKISSTFPNSAIFCQKRFI I TPΞLGFTHAYCRHV PLYLFCDRQR
HVKS IAICDP]-NC^VI-S3XLKFTAIIEKCWEVQYQKH E QTSFYRNP FLQIEKFIQDFQRWICGDFENT NKKERIKLEPFQKS ILIHI IFFISVTKLPT AIffiVIDY KYKFDIEFINESSVNI KQKASVFLVPRRHG KCTFMIPVICFI&KmEGISIGYVAHQKHVSHFVMKDVEFKCRRFFPQrø^
FASCΥlJTHSIRGQSFNIilVDESHFI KDAFSTILGFLPQSSTKIIFISSTNSGimSTSFLT LSN-^ MLTWSYVCEDHVHIII NDRGNATTCACYRI^PKFISINADVT 'ADLF^ LITEQGLIEFDLFRYSTISKQIIPFLGKELYIYIDPAYTINRRASGTGVAAIGTYGDQYIIYGMEHYFLE SLLSNSDAS IAECASHMI AVLE HPFFTELKI I IEGNSNQSSAVKIACILKQTISVIRYKHITFFHTLD QSQIAQPFYLLGREKRIAVEYFIS-TFNSGYIKASQELISFTIKITYDPIEYVIΞQIKNLHQININ^ NAK QTCSDD LIS I IMAIYMCHEGKQTSFKEI >gi 325496 MLRTCDITHIKNNYEAI IWKGERDCSTISTKYPNSAIFYKKRFIMLTPE GFAHSYNQQVTKP YTFCEKQ RHLJMR PLTI PS SH LQEMKFLPASDKSFESQYTEF ESFKILYREPLF QIDGFIKDFRKWIKGEF MSFGDTRKIQLEPFQKNILI^IFFIAVTKLPALAIWVINYLTHVFDIEFVNES RHGKT FIVPIISFIIKNIEGISIGYVAHQKHVSHFVM EVEFKOLRMFPEKTITCN^DNVITIDHQNIKS TALFASCYLOTHQSIRGQSFI&LIVDESHFI KDAFSTI GFLPQASTKILFISSTNSGIΠISTSF]^KR-SN SPFEMLSVVSYVCEDHAHMLI^RGNATACSCYRI_HKPKFISINAEVI TAMIF EGAFIHEIMGGATCLIV
INDVLITOQGQTEFEFFRYSTIN-an-IPF GKDLYVYI^PAYTGiraRASGTGIAAIGTY DQYIVYGriEH YF ESI-IΩ'SSDTAIAEαuiHMI SILDrHPFFTEV IIIEGNS-TQASAV rACII3SENITAN SIQVTFF HTPDQNQIAQPFYI.LGKEKKIiAVEFFISNFNSGNI ASQELISFTI ITYDPVEYAI^QIRNIHQISVim YITYSAKKQACSDDIiI IAI IMAIYVCSGNSSASFREI >gi_8S4039
MKLNNSPFEMLSWSYVCEDHAH flLIffiRGNATACSCYRLHKPKFISINAEV^
ATCNVINDVLITEQGQTEFEFFRYSTIIrørLIPFLGKDLYVYIΛPAYTGNRRASGTGIAAIGTYLDQYIV YGMEΞYFLESLMTSSDTAIAEC-AAHMILS ILDLHPFFTEVKI I IEGNSNQASAVKIACI IKENITANKS I QVTFFlCTPDQNQIAQPFYLLGIS iα-AVEFFIS FNSGNI .^QE ISFTIKITYDPVEYAIiEQIR IHQ ISVIrøYITYSAKKQACSDDLIIAIIMAIYVCSGNSSASFREI >gi_5733SS4
^π.RTCDITHIKro^E IIv>reGERNCS IS PNSAIFYKKRFrMLTPELGF HSYNQQ KP YTFCE Q RHLKLmKPLTILPSLTRKLQEMKFLPASDKSFESQYTEFLESFKILYREPLFLQIDGFIKDFRKWΪKGEF .TOFGDTRKIQI-EPFQKNILI-mFFIAVTKLPALANRVINYLTHVFDIEFVIrøST^^ RHGKT FIVPI ISF LKNIEGIS IGYVAHQKHVSHFVMKEVEFKCRRMFPEKTITCU3NVITIDHQNIKS TALFASCYNTHS IRGQSFNLLIVDESjHFIK.03AFSTILGFLPQASTKILFISSTNSGjraSTSFI-MKL.TOTS PFEMLSWSYVCEDHAHI^LNERGNATACSCYRLHKPKFISINAEVKKTANLF^^
NDVLITEQGQTEFEFFRYSTINK_aiPFLGKDLYVYLDPAYTGNRRASGTGIAAIGTYLDQYIVYGr«IEHY FLESLMTSSDTAIAECAAHMILS ILDLHPFFTEVKI I IEGNSNQASAVKIACI IKENITANKS IQVTFFH TPDQNQIAQPFYI.LGKEKKLAVEFFISNFNSGNIKASQELISFTIKITYDPVEYALEQIRNIHQISVNNY ITYSAKKQACSDDLI IAI IMAIYVCSGNSSASFREI >gi_499S048
MKI-IOTSPFEMLSVVSYVCEDHAHMLNERGNATACSCYRLHKPKFISINAEVKKTANLFLEGAFIHEIMGG ATOiVINDVLITEQGQTEFEFFRYSTINKNLIPFLGKDLYVYLDPAYTGNRRASGTGIAAIGTYLDQYIV ' YGMEHYFLESLMTSSDTAIAECAAHMILS ILDLHPFFTEVKI I IEGNSNQASAVKIACI IKENITANKS I QVTFFHTPDQNQIAQPFYI DGKEKKLAVEFFISNFNSGNIKASQELISFTIKITYDPVEYALEQIRNIHQ ISVNNYITYSAKKQACSDDLI IAI IMAIYVCSGNSSASFREI
>gi_1136808 ' r LSRHRERIlAANI4EET D GERWELSAPTF RHCP T R AHPFIGVVH INSYSS LET C RHHPA TPTSANPDVGTPRPSEDNVPAKPRIαLESLSTYLQMRCT/REDAHVSTADQLVEYQAGRKTHDSLHACSVYR ELQAFLVNLSSFIJNGCYVPGVH LEPFQQQLVMHTFFFLVSIKAPQKTHQLFGLFKQYFGLFETPNSVLQ TFKQKASVFLIPRRHGKT IVVAI ISMLLASVENINIGOTAHQKHVANSVFAEI IKTLCRWFPPKNLNIK KENGTIIYTRPGGRSSSIMCaTCFNKNSIRGQTFHLLYVDFJU^IKKDALPAILGFMLQKDAKLIFISSV
Figure imgf000017_0001
GAFDTEIMGEGAASSNATLYRVVGDAALTQFDMC^VDTTAQEVQKCLGKQLFVYIDPAYTITNTEASGTGV GAVVTSTQTPTRSLILGi EHFFLRDLTGAAAYEIASCACrraiKAIAVLHTTIERVNAAVEGNSSQDSGVA IATVLNEICPLPIHFLHYTDKSSALQWPIYMLGGEKSSAFΕTFIYALNSGTLSASQTVVSNTIKISFDPV TYLVEQVRAIKCVPI^t^GGQSYSAKQKHMSDDLLVAVVriAHFMATDDRHMYKPISPQ . ■ >gi_1718281
MLQKDAKLIFISSVNSSDRSTSFLLNLRNAQEKMLNVVSYVCADHREDFHLQDALVSCPCYRLHIPTYIT IDES IKTTTNLFMEGAFDTELMGEGAASSNATLYRVVGDAALTQFDMCRVDTTAQQVQKCLGKQLFVYID PAYTNNTEASGTGVGAVVTSTQTPTRSLILGMEHFFLRDLTGAAAYEIASCACTMIKAIAVLHPTIERVN AAVEGNSSQDSGVAIATVIiNEICPLPIHFLHYTDKSSALQ PIYMLGGEKSSAFETFIYALNSGTLSASQ TVVSNTIKISFDPVTYLVEQVRAIKCVPLRDGGQSYSAKQKHMSDDIJ.VAVVMAHFi^aTDDRHMYKPISP
Q >gi_224S515
MLQKDAKLIFISSVNSSDRSTSFI,INi NAQEKMLNVVSYV(-ADHREDFHLQDALVSCPσιrRLHIPTYIT IDESIKTTTNLFMEGAFDTEI-MGEGAASSNATLYRVVGDAALTQFDMCRVDTTAQQVQKCLGKQLFVYID PAYTNNTEASGTGVGAVVTSTQTPTRSLILGMEHFFLRDLTGAAAYEIASCACTMIKAIAVLHPTIERVN AAVEGNSSQDSGVAIATVLNEICPLPIHFLHYTDKSSALQWPIYMLGGEKSSAFETFIYALNSGTLSASQ TVVSNTIKISFDPVTYLVEQVRAIKCVPLRDGGQSYSAKQKHMSDDLLVAVVMAHFMATDDRHMYKPISP
Q
>gi_2246552
MIJ_SRH ERI- AN QETAKDAGER^ffiLSAPTFTRHCP TA -AHPFIGVVHRINSYSSV ETYCT HHP
TPTSANPDVGTPRPSEDNVPAKPRIjLESLSTYLQMRCVREDAHVSTADQLVEYQAARKTHDSLHACSVYR ELQAFLVNLSSFIiNGCYVPGVH LEPFQQQLVMHTFFFLVSIKAPQKTHQLFGLFKQYFGLFETPNSVLQ TFKQKASVFLIPRRHGKT IVVAIISi iLIJ^VENINIGYVAHQKHVANSVFAEIIKTLCRWFPPKNLNIK KENGTI I TRPGGRSSSLMCATCFNKNS IRGQTFNLLYVDEANFIKKDALPAILGEMLQKDAKLIFISS V NSSDRSTSFLϋ^NLRNAQEKMLNVVSYVO DHREDFHLQDALVSCPCYRLHIPTYITIDESIKTTT^ GAFDTEIilGEGAASSNATLYRVVGDAALTQFDMCRVDTTAQQVQKCLGKQLFVYIDPAYTNNTEASGTGV GAWTSTQTPTRSLILGMEHFFLRDLTGAAAYEIASCACTMIKAIAVLHPTIERVNAAVEGNSSQDSGVA IATVI.5ΕICPLPIHFLHYTDKSSALQWPIYMLGGEKSSAFETFIYALNSGTLSASQTVVSNTIKISFDPV TYLVEQVRAIKC^PLRDGGQSYSAKQKHMSDDIiVAVVI^AHFMATDDRHMYKPIS PQ >gi_4494933 MLQKDAKLIFISSSNSSDKSTSFIjLNLKDAHEKMLNVVNYVCPDHKDDFNLQDTVVACPCYRLHIPAYIT IDETVRSTTNLFXEGAFSTELMGDAATSAQSMHKIVSDSSLSQLDLCRVKSTSQDIQGAMKPCLHVYIDP AYTJ TDASGTGIGAVIAVNHKVIKCIl.LGVEHFFLRDLTGTAAYQIASI.S*^^
AVEGNSSQDAGVAIATVL-ffiICSVPLSFLHHVDKNTLIRSPIYMLGPEKAI AFESFIYAI_5rSGTFSASQT SHTIKLSIT3]? AY IDQIKAIRCIPL:ro343HTYCAK^^ >gi_7330018 II QKDAKL FISSSNSSDKSTSFLLNLKDAHEK^lIΛNV YVCPDHxπ)DF-TLQDTV^
IDETVRSTTNLFLEGAFSTEIiMGDAATSAQSMHKIVSDSSLSQLDLCRVESTSQDIQGAMKPCLHVYIDP
AYTN1TTDASGTGIGAVIAVNHKVIKCILLGVEHFFLRDLTGTAAYQIASCAAALIRAIVTLHPQITHVNV
AVEGNSSQDAGVAIATVLNEICSVPLSFLHHADKNTLIRSPIYMLGPEKAKAFESFIYALNSGTFSASQT SH I SFDPVAYLIDQI-C IRCIP DGσHTY ^U Q TMSDDVL AA rAHYM T]roKF F S E
>gi_4019255
MIiIΛKAKKALMENLTEASSTQSETEWTVDTPTMITNIKKSERMAYSKIGVIPSINLYΞASLTSFCRLYRP I ALKQPLPQTGTLPvIΛPSEKPYISQKLSNYVKSLTLKHVϊrøDIEAEAEYYASVQTEKTFMECPIYLELRQ FIINLSSFLNGCYVKKSTHIEPFQLQLIl-HTFYFLISIKSPESTNKLFDIFKEYFGLGEi DSAMLQNFKQ I ASIFLIPRRHGKTWIVVAIIS4 LITSVENLHVGYVAHQKHVANSVFTEIINTLQKKrFPS-aTIDVKKENG TIIYKIPGKKPSTLMCaSCFNKNSIRGQTFNLLYIDEANFIKKDSLPAILGFHLQKDAKLIFISSVNSGD KATSFLFNLKNASE-ΦrLNIVNYICPDHKDDFSLQDSLISCPCYKLYIPTYITIDETIKNTTNLFLDGAFT TELMGDISVMSKNNIHKVIGETALMQFDLCRIDTTKPEITQCLNS IMYLYIDPAYTNNSEASGTGIGAI I ALi NNSSKCIIVGIEHYFLKDLTGTATYQIASCACΞLIRAALVLYPHIQAVHVAVEGNSSQDSAVAISTF LNECSPVKVNFMHYKDKTTAMQWPIYMLGSEKSQAFESFIYAINSGTISASQS I ISNTIKLTFDPISYLI EQIRAIRCΥPLRDGSHTYCAi ζRTVSDDVLVAVVMAHFFSTSNKHIFKQLNSI >gi_4019257
MLQKDAKLIFISSVNSGDKaTSFLFNLKNASEKMLNIV-^ICPDIIKDDFSLQDSLISCPCYKLYIPTYIT IDETIKOTTm.FLDGAFTTEl-MGDISWSia^
PAYTNNSEASGTGIGAIIALKNNSSKCIIVGIEHYFLKDLTGTATYQIASCACSLIRAALVLYPHIQAVH VAVEGNSSQDSAVAISTFLNECSPVKVNFMHYKDKTTAMQWPIYMLGSEKSQAFESFIYAINSGTISASQ S I ISNTIKLTFDPISYLIEQIRAIRC^PLRDGSHTYCAKKRTVSDDVLVAVVi^lTUffFSTSNKHIFKQLNS I
>gi_S 0355
MLLLKAKKAI IENLSEVSSTQAETD DMSTPTI ITNTSKSERTAYSKIGVI PSVNLYSSTLTSFCKLYHP LTI-^QTQPQTGTLRIJLPHEKPLILQDLS.TYVKLLTSQNVCHDTEANTEYNAAVQTQKTSMECPTYLELRQ FVILTTSSFLNGCYVKRSTHIEPFQLQLII^HTFYFLISIKSPESTNRLFDIFKEYFGLREMDPDMLQIFKQ ' KASIFLIPRRHGKT IVVAIISMLLTSVENIHVGYVAHQKHVANSVFTEIINTLQKWFPSRYIDIKKENG T IYKSPDKKPSTLMCATCFNKNS IRGQTFN]-LYIDEANFIKKDSLPAILGFMLQKDAKLIFISSVNSGD
RATSFLFNLKNASEKMIINIV-TYICPDHKDDFSLQDSLISCPCYKLYIPTYITIDETIKNTTNLFLDGAFT TEI^GDMSGISKSNIHKVISE4AITQFDLCRADTTKPEITQCLNSTMYIYIDPAYTNNSEASGTGIGAIL TFKNNSSKCIIVGMEHYFLKDLTGTATYQIASCACSLIRASLVLYPHIQCVHVAVEGNSSQDSAVAISTL INECSPIKVYFIHYKDKTTTMQ PIYMLGAEKSIAFESFIYAINSGTISASQSIISNTIKLSFDPISYLI EQIRSIRCΪP]_RDGSHTYαUCKRTVSDDVLVAVVMAYFFATSNKHIFKPLNST >gi_695201 MLQI03AKIIFISSVNSSDQTTSFLYNLKNAKEKMLNVVNYVCPQHREDFSLQESVVSCPCYRLHIPTYIA IDENIKDTTNLFMEGAFTTELMGDGAAATTQTNMHKVVGEPALVQFDLCRVDTGSPEAQRGLNPTLFLYV DPAYTN-TOEASGTGMGAVVSMKNSDRCVVVGVEHFFLKELTGASSLQIASCAAALIRSLATLHPFVREAH VAIEGNSSQDSAVAIATIjLHERSPLPVKFLHHADKATGVQWPMYILGAEKARAFΕTFIYALNSNTLSCGQ AIVSNTIiaSFDPVAYLIEQIRAIKCnrPLKDGTVSYCAKHKGGSDDTLVAVVMA-KFATSDRHVFKNHMK
QI >gi_4928934
MLLSSFRNHLQKIrøl.OrSVQAQNIDWPVETPVLISKDSKTNRIMPLIGVI^^
TKQPKFTPDIGYVRDLKKHDQYFLPKLQHHLSTLCEAYNHVDRQAQVEFNASILTLKAFNANGVLNELKQ FLINLSCFIiNGCYVSKSTCIELFQKQLILHTFYFLISIKTPEETIΛKMFTFFKHYVGLFDIDDNriLQCFKQ KSTVFLIPRRHGKT IVVAIIS πjIiASVENVHIGYVAHQKHVANAVFTEIITTLYQrøPSI^ TIIYTKPGRKPSTIϋMCATCFNKNSIRGQTFNILYVDEAiWIKKEALPAILGFMLQKDAKIIFISSVNSAD KSTSFLFNLRNAKEKMI-NVVNYVCPEHKEDFNLQSTLTSCPCYRLHIPTYITIDESIKNTTNLFIJD TEIiMGDISTFPTSSMFKVVEEQALFHFDICRVDTTQIDTVKIIDNVLYVYVDPAYTSNSEASGTGIGAVV PLKTKVKTIILGIEHFYLKNLTGTASQQIAYCVTSMIKAILTLHPHINHVNVAVEGNSSQDSAVAISTFI NEYCPVPVFFAHOTERSSVFQ PIYILGSEKSQAFEKFICAIiNTGTLSASQTIVSNTIKISFDPVAYLME QIRAIRCLPLKDGSYTYCAKQKTMSDDTLVAVVMANYMAISEKHTFKELCKT >gi_1632798 MLYASQRGRLTE LRNALQQDSTTQGα-GAETPSIMYTG SD AHP VGTIHASN YCP r RAYCRH GPRPVFVASDESLPMFGASPALHTPVQVQMCLLPELRDTLQRLLPPPNLEDSEALTEFKTSVSSARAILE DPNFLEMREFVTSLASFLS43QYKHKPARLFJ^QKQVVLHSFYFLISIKSLEITDTMFDIFQSAFGLEEMT I1EKLHIFKQKASVFLIPRRHGKT IVVAIISLILS.ILS.TVQIGYVAHQKHVASAVFTEIIDTLTKSFDSK RVEVNKETSTITFRHSGKISSTVMCATCFNKNSIRGQTFHLLFVDEANFIKKEALPAILGFMLQKDAKII FISSVNSADQATSFLYKLKDAQERLLNVVSYVCQEHRQDFDMQDSMVSCPCFRLHIPSYITMDSNIRATT NLFLDGAFSTELMGDTSSLSQGSLSRTVRDDAINQIiELCRVDTLNPRVAGRLASSLYVYVDPAYTNNTSA SGTGIAAVTΞDRADPNRVIVLGLEHFFLKDLTGDAALQIATCTATALVSSIVTLHPHl^EEVKVAVEGNSSQ DSAVAIASI IGESCPLPCAFVHTKDKTSSLQ PMY]^TNEKSKAFERLIYAVOTASLSASQVTVSNTIQL SFOPVLYLISQIRAIKPIPLPJDGTYTYTGKQRNLSDDVLVALV1S1AHFLATTQKHTFKKVH >gi_2337991
MFYVKVMPALQKACEELQNQ SAKSGK PVPETPLVAVETRRSER PHPYLGLLPGVAAYSSTLEDYCHL YNPYIDALTRCΩLGQTHRRVATQPVLSDQLCQQLKKLFSCPRNTSVKAKIiEraAAVRTHQALDNSQVFLE LKTFVLNLSAFLNKRYSDRSSHIELFQKQLl4 HTFFFLVSIKAPELCEKFCNIFKLYFNIDTMDQATLDI FKQKASVFLIPRRHGKT IVVAIISILLASVQDLRIGYVAHQKHVANAVFTEVINTLHTFFPGKYMDVKK ENGTIIFGLPISrKKPSTLLCaTCFNKNSIRGQTFQLLFVDEANFIKKDALPTILGFMLQKDAKIIFISSSN SSDQSTSFLYlttKGASERi I-WVSYVCSNHIffiDFSMQDGLISCPCΥSl.HVPSYISIDEQIKTTTNLFLDG
VFΌTELMGDSSCGTLSTFQIISESALSQFELCRIDTASPQVQAHLNSTVHMYIDPAFTNNLDASGTGISV
IGRLGAKTKVILGCEHFFLQKLTGTAALQIASCATSLLRSWIIHPMIKCAQITIEGNSSQDSAVAIANF IDECAPIPVTFYHQSDKTKGVLCPLYLLGQEKAVAFESFIYAM. LGLCKASQLIVSHTIKLSFDPVTYLL
EQVRAIKCQSIJRDGSHTYHAKQKNLSDDLLVSVVMSLYLSSANTLPFKPLHIERFF
>gi_2317977
ICLQKDAKIFFISSVNSGEKTTSFLYNLKDANEKMVNVVSYVCSEHMEDFNKQSAITACPCYRLYVPEFIT
INDNIKCTTNLL-JEGSFATELMG^MQSHTEVSGNSMIHESSLTRJ-DFYRODTAGQGAPTTENTLFVYIDP AYGNNVHASGTGIVAMSHCKHTKKCI ILGLEHFFLNNLTGTAAHNIASCATALLEGILFQHPWIQEIRCI IEGNSNQDSAVAIATFISHNIKLPTLFASYRDKTGMQ PIYMLSGDKTIJAFQNFISSI.^QGLLCASQTVV SlSroVLLSSDPISYLIEQIKNTKCIYHKNKTITFQSKTHTMSDDVLIACVMTCYVMTTNKISYISFSIK >gi_6625593 MFIASKKSYFEAVYRSTVSSHSEEFWKSDDPVYFTQYKKQCNRLPNAYLGTLHSASKYSEIJFRHYVATFS NS PLDFPQSVFNERNPCEYSVPYIiDSALQCSAKTLVGCSVSTTERNEYEVCKEATRCFKDAMSHKVLKVF LSNLSWFLKGHYKSKQAFLEPFQKQLILHSFMFVAS IKCPETTTKLFDΞFKFLLDMLYFDNTDLLTFLQK S PAFLI PRRHGKT I TAI I Si LLTSVDDLHIGYVAHQKHVSLAVFLEISNILLAWFPRKNIDIKKENGV ILYSHPGKKSSTLMCATCFNKNSIRGQTFNIiFVDEANFIKKEALPAILGFMLQKDAKIFFISSVNSGEK TTSFLYNLKDAtffiKMVNVVSYVCSEHMEDFNKQSAITACPCYRLYVPEFITINDNIKCTTNLLLEGSFAT ELMGNMQSHTEVSGNSMIHESSLTl^FYRCDTAG^GAPTTENTLFVYIDPAYGNNVHASGTGIVAMSHC KHTKKCI ILGLEHFFω«π^TGTAAHNIAS<-aTALLEGILFQHPWIQEIRCIIEGNSNQDSAVAIATFISH NIKLPTLFASYRDKTGMQWPIYMLSGDKTLAFQNFISSIjNQG]_LaVSQTVVSNTVLLSSDPISYLIEQIK NTKCIY-ΠOTKTITFQSKTHTMSDDVLIACVMTCYVMTTNKISYISFS IK 1 10 20 30 40 50 GO 70 80 90 100 110 120 130
1 + + + + 4 + + j + »-+ + + + 1 gi_10180719 HFGGLLGEETkRHFERLH TKIIDR GflSHRNERSIRDG----D4 VDflPF~LHFniPVPRRHQTVMPRIGILHHCCDSLGIYSHITTRHLYSSIflCSEFDELRRD S¥PRCΫP gi_7G73189 MFGGLLGEETKRHFERLHKTKHDRLGRSHRNERSIRDG DHVD8PF~LHFRIPVPRRHQTVHPflIGILHHCCDSLGIYSflITTRHLYSSIflCSEFDELRRD SYPRCYP gi_5GB9285 HFGGHLGESRKKHFERLLRDRHERLGRSRKHECLRRGG SLVDRPF— LHFRISVPRRHQTVHPRVGTLHDCCDGTGIYSRIRTRLLYRGIVSSEFGEVRRE SLSNGHI gi_1869837 HFGQQLRSDV QYLERLEKQRQQ VGV-DERSRGLTLG GDRLRVPF— LDFRTRTPKRHQTVVPGVGTLHDCCEHSPLFSRVRRRLLFNSLVPflQLRGROFG GD H gi_59501 HFGQQLRSDVQQYLERLEKQRQLKVGR-DERΞRGLTHG GDRLRVPF--LOFRTRTPKRHQTVWGVGTLHDCCEHSPLFSRVRRRLLFHSLVPRQLKGRDFG GD H
CO gi-.2G05992 HFGKRLSRETIQYFETLRKEVQSRSGR-KHRRRERQTG— -GEDDVKTRF— LHFRIPTPQRHQTVVPGVGTLHDCCETRQIFRSVRRRLLFRSLSKHRGGESKER LD P c gi_330792 HFGRVLGRETVQYFERLRREVQHRRGR-KHRRREHQHG— GEDDR TRF— LHFRIPTPQRHQTVVPGVGTLHDCCETRQIFRSVRRRLLFRSLSKMQSGERRER LD P
CD gi_971317 HFGGRVGEQSRRYFQRLLRERQRRRRE-RGRRPDGGGGRRGEDDRRVPF~LDFRVRRP RHQTVVPGVGTLHGYCELRPLFRRTflSRLLLTSHRRREflG LH T O gi_5869808 HSLIHFGRTLGEESVRYFERLKRRRDERFGTLESPTPCSTRQGSLGHRTQIPF— LHFRIDVTRRHQRVIPGIGTLHHCCEYIPLFSRTRRRRHFGRFLSSTGYHCTPH VVLKP R gi.,5708110 HLGKESVEIV RYRDRLRKRTHERGPDDVDGQEHSDSHFITTRSICDRHDSflRDTHHSPRSRFQFRIDVPQRHQRCIRPIGSFHHCCRISRRFSYHflSEIIYEHLRSYSTKYTDTDRflLHDLQVSPKRQL gi_1813970 MLRGDSRR IQERYRELQ RKSHPTSCIST-flFTHVRTLCR RYQMMHPELGLRHSCHERFLPLMRFCGRHRDYHSPEESQREL gi_2746296 MLRSCDIDRIQ RYQSIIHkHEQDVK-ISS-TFPHSRIFCQKRFIILTPELGFTHRYCRHV PLYLFCDRQRHV SK--I m gi_325496 MLRTCDITHIKHHYERIIHKGERDCSπST^KYPHSRIFYK RFIHLTPELGFRHSYHQQVKPLYTFCE QRHL KHRKPL gi_57335G4
CO HLRTCDITHIKHHYERIIHKGERHCSTIST-KYPHSRIFYKKRFIHLTPELGFRHSYHQQVKPLYTFCEKQRHL KHRKPL
I gi L13G808 HLLSRHRERLRRHLEETRKD--RGEr-RHEL-SRPTFTRHCPKTRRMRHPFIGVVHRIHSYSSVLETYCTRHHP RTPTSRHPDV GTPRPSE m gi_224G552 HLLSRHRERLRRHLQCTRKD— RGE-RHEL-SRPTFTRHCPKTRRHRHPFIGVVHRIHSYSSVLETYCTRHHP RTPTSRHPDV GTPRPSE m 'gi-4019255 MLLLKRKKRLMEHLTERSST^-QSETEHTV-DTPTHITHIKKSERMRYSKIGVIPSIHLYSRSLTSFCRLYRP LRLKQPLPQT GTLRLLP gi_G0355 HLLLKRKKRIIEHLSEVSSt-.-QflETDMDH-STPTIITHTSKSERTRYSKIGVIPSVHLYSSTLTSFCKLYHP LTLHQTQPQT GTLRLLP gi_4928934 MLLSSFRHHLQKHYEKYSVα—RQHIDMPV-ETPVLISKDSKTHRLRHPLlGVISRIHLYSPTLKYYCDEYST TKQPKFTPDI GYVRDLK c gi...2337991 MFYVKVHPRLQKflCEELQHQHSRKSGKHPVPETPLVRVETRRSERHPHPYLGLLPGVnRYSSTLEDYCHLYHP YIDHLTRCDL GQTHRRV gi_lG32798 MLYRSQRGRLTEHLRHRLQQDSTTQGCLGR-ETPSIMYTGRKSDRMRHPLVGTIHflSHLYCPHLRRYCRHYGPRPVFVRSDESLPHF GRSPRLH m gi_6B25593 MFIRSKKSYFERVYRSTVSS— HSEEFHKSDDPVYFTQYKKQCHRLPHRYLGTLHSflSKYSEHFRHYVRTFSH SPLDFPQSVF HERHPCE r gi_1718281 gi_2246515 gi_4494933 gi_733001B gi_4019257 gi_G95201 gi_2317977 gi_854039 gi_4996048
Consensus
TABLE 2
131 140 150 160 170 180 .190 200 f210 220 230 240 250 260
[ +_ + + :+ + + + j + + + + + -. I gi_10180719 RITHflQRFLSPllMi RVRHSIIFQEYDEMECRRHRHflYYSTMHSFISMRTSDRFKgLTVFISRFSKLLIflSFROVHKLDDHTVK— KRRRIDRPSYDKLHGTLELFQKHILMHRTYFVTSVLLGD-HRERfl gi_7G73189 RITHOQRFLSPH«MRVflHSIIFQEYDEHECflRHRHflYYST«HSFIStlRTSDflFKqLTVFISRFSKLLIRSFRDVHKLDDHTVK~KRRRIDflPSYDKLHGTLELFαKHIFDRCHLFCHFCFTMR-SRRflS gi_5G89285 SKRHRERLLRPTLTRVRHSITFHEYDDRQCRRHRHRYYSTMHTFGSHRTSDRFQQLflSFIDRFSKLLRRSFKDVHI^DRHHRP— RRRITRPSYDKPHGTLELFQKHILHHRTYFLTSVLLED-HRERfl gi_18G9837 — TRKLEFLRPELVRflVRRLRFRECflPEDflVPQRHRYYSVLHTFQRLHRSERFRQLVHFVRDFRQLLKTSFRRSSLRETTGPP-KKRflKVDVflTHGQTYGTLELFQKHILMHflTYFLflRVLLGD-HREQV gi_59501 — TRKLEFLRPELVRRVRRLRFKECflPRDVVPQRHRYYSVLHTFQRLHRSERFRQLVHFVRDFRQLLKTSFRRSSLTETTGPP-KKRRKVDVRTHGRTYGTLELFQKHILMHRTYFLflRVLLGD-HREQV
CO gi_2G05992 — SSVERYVDPKVKQRLKTISFVEYHDRERRSCRHRYYSIHHTFDSLRSSDRFHQVRHFVRRFSRLVDTSFHGflDLDGDGQQT-SKRIKVDVPTYG QRGTLELFQKHILHHRTYFIRRVILGD-HRDRI c gi_330792 — RSVERYVDPKVRQRLKTISFVEYSDDERRSCRHRYYSIMHTFDRLRSSDflFHQVRSFVRRFSRLVDTSFHGRDLDGDGQQR-SKRRRVDVPTYGKQRGTLELFQKHILMHRTYFinRVILGD-HRDRI
03 gl_971317 — GTGEflHVSRELflGVLSflLRFflRHPPflERRflHCHRYHSVMRflLESHRRSGflFRQVRRFVRRFSRLVGTSFSHLGGGDDRDPPRflKRRRVEPPS-GqTRGRLELFQKHILttPflTYFVRflTLLGE-HflERI CO gl_58G9808 — YSVHRHVSPELKKRVSSVαFYEYSPEEflRPHRHRYSGVHHTFRRFSLSDSFCQLSTFTQRFSYLVETSFESIEECGSHG KRRKVDVPIYGRYKGTLELFQKHILMHTTHFISSVLLGD-HRDRV gl_5708110 FTGRREDSILPRLRQKLRHLHFRRFRPSDSLIHDKflFDGIMHGYRGFVKSDEFSqLHHFIYRFHTLLKKSFSGQflSHDY KRRKLEKTTSEqRDGTLELFQKHILHHRTYFRSSICLGEGSTERS gi_1813970 LFHERLKSRLDKLTFRPCSEEQ-R RSYqK-LDRLTELYRDPqFQqiHHFHTDFKKHLDGGFSTRVEGD RKRIRLEPFQKHLLIHVIFFIRVTKIPV-LRHRV gi_274G29G RICDPLHCRLSKLKFTRIIEKHTE VqγqKHLELQTSFYRHPHFLqiEKFiqDFqRHICGDFENT--HK KERIKLEPFqKSILIHIIFFISVTKLPT-LRHHV m gi_32549G TILPSLSHKLQEHKFLPRSDKSFE SqΫTEFLESFKILYREPLFLqiOGFIKDFRKHIKGEFHDF— GD - TRKIQLEPFQKNILIHVIFFIRVTKLPfl-LflHRV
CO gi_57335G4 -r-TILPSLTRKLqEHKFLPRSDKSFE SqYTEFLESFKILYREPLFLqiDGFIKDFRK IKGEFHDF— GD- TRKiqLEPFDKHILIHVIFFIRVTKLPR-LRHRV
I gi_113680B DHVPRKPRLLESLSTYLQHRCVREDRHVSTRDqLVEYQRGRKTHDSLHRCSVYRELqRFLVHLSSFLHGCYVP = GVH LEPFQQqLVMHTFFFLVSIKRPQ-KTHqL mm gi_224G552 DHVPRKPRLLESLSTYLqHRCVREDRHVSTRDqLVEYqRRRKTHDSLHRCSVYRELqRFLVHLSSFLHGCYVP GVHHLEPFQqqLVHHTFFFLVSIKRPQ-KTHqL
H gi_4019255 SE KPYISQKLSHYVKSLTLKHV HDIERE—REYYRSVqTEKTFttECPI.YLELRQFIIHLSSFLHGCYVK KSTHIEPFQLDLILHTFYFLISIKSPE-STHKL gi_G0355 HE KPLILQDLSHYVKLLTSqHVCHDTERH— TEYHRRVqTdKTSHECPTYLELRQFVIHLSSFLHGCYVK RSTHIEPFqLQLILHTFYFLISIKSPE-STHRL
7) gi_4928934 KH DqYFLPKLqHHLSTLCERYHHVDRQRQ~VEFHRSILTLKflFNRHGVLHELKQFLINLSCFLNGCYVS KSTCIELFQKqLILHTFYFLISIKTPE-ETHKH
C gi_2337991 RT qPVLSDqLGqQLKKLFSCPRHTSVKRK— LEFERRVRTHQRLDHSqVFLELKTFVLHLSRFLHKRYSD RSSHIELFQKqLIHHTFFFLVSIKHPE-LCEKF m gi_lG32798 TPVQVQMCLLPELRDTLqRLLPPPHLEDSERL— TEFKtSVSSRRRILEDPHFLEHREFVTSLRSFLSGqYKH ' KPRRLERFqKqVVLHSFYFLISIKSLE-ITDTH r gi_GG25593 YSV— rPYLDSRLqCSRKTLVGCSVSTTER HEYEVCKERTRCFKDRHSHKVLKVFLSHLSMFLKGHYKS KqRFLEPFDKQLILHSFHFVRSIKCPE-TTTKL gJL_1718281 gi.,2246515 gi--4494933 gi_7330018 gi_4019257 gi_G95201 gi_2317977 gi_854039 gi_499G04Q
TABLE 2 CONTINUED
Figure imgf000022_0001
Figure imgf000023_0001
23
Figure imgf000024_0001
uicoi r-.HrJ ur-ro oxx'^rx uinin rHTOroHinroxr-TH -xx
Figure imgf000024_0002
651 GGO G70 880 G90 700 710 720 730 740 750 7G0 770 780 I — +- — +_ i — +-. +- +_ — +_. +- +_ +_. — -_+- gi_10180713 flP—TPHHVSFYHSKSHGTDVEYPYFLLQRQKTTHFDFFIBαFHSGRVLfiSQOLVSTTVSLrrOPVEYLTKQLTHISEVVTG PTCTRTFSGKKGG— ^HDDfVVflLTHHVYISflH-IPDHfiFHPIRV gi_7G73183 flP~TPHHVSFYHS SHGTDVEYPYFLLqRQ TTHFDFFIflQFHSGRVLflSqDLVSTTVSLTTDPVEYLTKQLTNISEVVTG PTCTRTFSG KGG---HDOtVVRLTHHVYISflH-IPDHflFflPIRV gi_5689285 flP—flPHDVCFYHSKPnGSHVEYPFFLLQRQKTHflFDFFIHRFHSGRVLflSqDLVSTTISLSTDPVEYLT QLTHLSEVVTG RTGTRTFSG KGG---YDDfVVRLYHflVYISflH-flSDRTFRPIRG gi_.18G9B37 flH-GPGPELLFYHCEPPGGHVLYPFFLLH QKTPflFEYFIKKFHSGGVHHSQELVSVTVRLCTDPVEYLSEQLNHLIETVSP HTDVRHYSGKRNGfi-^RODLHVfiyiHRIYLflflPTGIPPflFFPΪTR gi_53501 RDRGSGPELLFYHCEPPGSflVLYPFFLLHKQKTPflFEHFIKKFHSGGVHHSqEiVSRTVRLqTDPVEYLLEQLHHLTETVSP-^-HTDVRTYSGKRHGR- SDι.LHVRVIHflIYLflflQflGPPHTFnpiTR gi_2605992 VH— HPGTVLFYHCTPPGSSVHYPFFLLq QKTGflFDHFI RFHSGLVLRSqELISHTVRLQTDPVEYLLtQHKHLTEVITG TSETRVπGKRHGH-^SDDrlLVflLVHflVYHRSLPPTTHRFSSLST gi_330792 VP~HPGRVLFYHCTPHGSSVRYPFFLLqKQKTGHFDHFIKRFHSGSVLflSqELVSHTVRLβTDPVEYLLtQHKHLfEVVTG-'~TSETRVFTGKRHGR— SDDHLVflLVHRVYLSSLPPTSDRFSSLPR gi_97I317 VP--HPflETRFYHCRPPGSnVRYPFFLLQKQKTRRFDHFIRLFHSGRVVflSQDLRSLTVRLαTDPVEYLFEQLQHLTESTnG PGGRRRFSGKRRGR— RDDLHVflLVtiWFVGSLPPTDGRFCPLflP gi_58G9808 LS--SRPVLLFYHSIPPGC5VRYPFFLLQKQKTPRVDYFV RFHSGHIIflSQELVSLTVI LGVDPVEYLC QLDNLTEVIKGGHGHLDTKTYTGI GTTGTHSDDLHVRLIHSVYIGSSCIPDSVFHPI
CO gi_5708110 LGF5LTFRHSRQPGTHHnHPFYLLHKqKSRflFDLFVSLFH GRFHRSqELVSHTLVLSKDPCEYLVDQj-KNIT--VTHGqGPDSFRTFSGKQGRV--PDDHLVHRVHSTYLRLEGSPTflGrfHPIflP c gi_1813970 __ RVLFYHTPDQHHIEQ-PFYLHGRD RLRVEQFISRFHSGYIKRSQELVSYTIKLSHDPIEYLLEQIQHLHRVTLR EGTTRRYSRKRQHR-iSDDLIIRVIHflTYLCDDIHRIRFRVS
CO gi_27<1G296 HITFFHTLDQSQIflQ-PFYLLGREKRLfiVEYFISHFHSaYIKRSQELlSFTI πYDPIEYVIEQIKNLHQIHIH EHVT~YHRKKQ-T-CSDDLLISIIHflIYHCHEGKqTSFKEI CO gi_32549B QVTFFHTPDQHQIflq-PFYLLGKEK LHVEFFISHFHSGHIKRSqELlSFTI ITYDPVEYRLEQIRNIHQISVH NYH--YSB KQ-H-CSDDLIIRIIHRIΥVCSGNSSRSFREI gi_57335G4 QVTFFHTPnqHqiflQ-PFYLLGKEK LflVEFFISHFHSGHI RSqELlSFTI ITYDPVEYRLEqiRHIHQISVH NYrT~YSR Kq-R-CSDDLIIRIIMflIYVCSGHSSRSFREI gi_113G808 PIHFLHYTDKSSRLQMPIYHLGGE SSRFETFIYnLHSGTLSflSOTVVSHTIKISFDPVTYLVEQVRRI CVPLR— DGGQS-YSflKQK-H-HSDDLLVflVVHflHFHRTDDRHHYKPISPQ gi_224G552 PIHFLHYTDKSSRLQMPIYHLG 5EKSSnFETFIYflLHSGTLSflSqτVVSHTIKIBFDPVTYLVEQVRRIKCVPLR-^-DGGqS-YSRKqk:-H-l1SDDLLVflVVHflHFHRTIjDRHHYKPISPQ m gi_4019255 KVNFHHYKDKTTRHQWIYHLGSE SQRFESFIYRIHSGTISRSQSlISHTIKLTFDPISYLIEQIRRIRCYPLR-^-DGSHT-YCRK R-T-VSDDVLVflVVHHHFFSTSHKHIF qLHSI co gi_G0355 KVYFIllYKDKTTTHQ PIYHLGREKSIflFESFIYfllHSGTISflSqSlISHTIKLSFDPISYLIEqiRSIRCYPLR— -DGSHT-YCflKKR-T-VSDDVLVRVVHRYFFRTSHKHIFKPLHST gi_492B934 PVFFflHCNERSSVFQHPIYILGSEKSQflFEKFICflUITGTLSflSQTlVSHTIKISFDPVRYLHEQIRRIRCLPLK— DGSYT-YCR QK-T-HSDDTLVRVVHflNYHHISEKHTFKELCKT m gi_2337331 pVTFYHqSDKTKGVLCPLYLLGqEKRVRFESFIYflHHLGLCKflSqLIVSHTIKLSFDPVTYLLEQVRRIKCqSLR— DGSHT-YHRKQ -H-LSDDLLVSVVHSLYLSSRHTLPFKPLHIER 4 m gi_lG32798 PCRFVHTKDKTSSLQHPHYLLTHKS flFERLIYflVHTRSLSRSqVTVSHTIQLSFDPVLYLISqiRRI PIPLR---DGTYT-YTGKqR-H-LSDDVLVRLVHflHFLflTtqKHTFKKVH gi_GG25593 -PTLFHSYRDKT-GHqMPIYHLSGDKTLRFQHFISSLHQGLLCRSqTVVSJJTVLLSSDPISYLIEqiKHTKCIYH HKTIT-FQSKTH-T-MSDDVLIflCyHTCYVHTTH ISYISFSIK gi_1718281 -PIHFLHYTD SSflLqMPIYHLGGEKSSRFETFIYflLHSGTLSRSqTVVSHTIKISFDPVTYLVEqVRfllKCVPLR DGGOSYSRKQ H~HSDDLLVRVVHHHFHRTDDRHHYKPISPQ c gi_2246515 -PIHFLHYTDKSSRLQHPIYHLGGEKSSflFETFIYRLHSGTLSRSQTVVSHTIKISFDPVTYLVEQVRRIKCVPLR— DGGQSYSRKQKH— HSDDLLVflVVHflHFHRTDDRHHYKPISPQ gi_<1494933 -PLSFLHHVDKHTLIRSPIYHLGPE RKRFESFIYRLHSGTFSflSQTVVSHTI LSFDPVRYLIDQIKRIRCIPLK-^ — DGGHTYCH QKI HSDDVLVRRVHRHYHRTHD FVF SLE m gl_7330018 -PLSFLHHRDKHTl-IRSPIYHLGPE fiKRFESFIYflLHSGTFSRSQTVySHTI LSFDPVRYl-IBtilKniRCIPLK- -DGGHTYCflKqKT~HSDDVLVRflVHRHYHBTHD FVF SLE ro gi_4019257 -KVHFHHYKD TTRHQMPIYMLGSE SQRFESFIYRIHSGTISflSQStlSHTIKLTFDPISYLIEQIRRIRCYPLR -OGSHTYCR KRT—VSDDVLVRWHRHFFSTSH HIFKqLNSI gi_G95201 -PVKFLHHRDKflTGVQ PHYILGflEKflRflFETFIYflLHSHTLSCGqRIVSHTIKLSFDPVRYLIEQIRRI CYPLK- -DGTVSYCR HKG~GSDDTLVRVVHRHYFRTSDRHVFKHHHKQI gi_2317977 -PTLFRSYRDK-TGHQ PIYHLSGD TLnFQNπSSLHQGLLCHSqTVVSHTVLLSSDPISYLIEQI H^KCIYHK N TITFqSKTHT—HSDDVLIHCVHTCYVHTtHKISYISFSlK gi_854039 --αVTFFHTPDQHqiflQ-PFYLLa<EK LflVEFFISHFHSGlΗ flSqELISFTIKITYDPVEYflLEQIRHIHqiSV HHYITYSflKKqfi~CSDDLIIflIIHflIYVCSGHSSRSFREI gi_4996048 --αVTFFHTPDQUqiflα-PFYLLGKEK LflVEFFISHFHSGHIKRSqELISFTI πYDPVEYRLEqiRHIHqiSV NHYITYSRK qR— CSDDLIIRIIHRIYVCSGHSSRSFREI
Consensus .-P„F-h„ .l q-PiYΦLg. »;„fifit.F aa.Hsg-- .RSQ-v !SnT ! lsfDPhYl, „Q! rai.c„l tK.aK SDD.l!R Ha.2„t k
TABLE 2 CONTINUED
25
Figure imgf000026_0001
Table 3. Degenerate primers generated by CODEHOP
Block x7263xbliD
T L Y V Y I D P oligo : 5 ' -AACCTGTACGTGtayntngaycc-3 ' degen=64 temp=33.4 Extend clamp
T L Y V Y I D P A oligo : 5 ' -AACCTGTACGTGTACntngayccngc-3 ' degen=128 temp=36.0 Extend clamp
T L Y V' Y I D P A Y oligo:5'-AACCTGTACGTGTACATngayccngcnt-3' degen=128 temp=42.5 Extend clamp
Complement of Block x7263xbliD
Y I- D P A Y T N N T - ' atrnanctrggGCGGATGTGGTTGTXGT oligo : 5 ' -TGTTGTTGGTGTAGGCGggrtcnanrta-3 ; degen=64 temp=62.9
D P A Y T N N T R A anctrggncgnaTGTGGTTGTTGTGGGTCCG ligo : 5 ' -GCCTGGGTGTTGTTGGTGTangcnggrtcna-3 ' degen=128 temp=61.8
D P A Y T N T R A ctrggncgnawGTGGTTGTTGTGGGTCCG oligo :5 ' -GCCTGGGTGTTGTTGGTGwangcnggrtc-3 ' degen=S4 temp=δl.O
Block x7263xbliE
C I I F G M E H F F oligo : 5 ' -TGGATCATCTTCGGCATngarcaytwyt-3 ' degen=64 temp=55.7 Extend clamp
I .F G M E H F - F' L oligo : 5 ' -CATCTTCGGCATGGAGcaytwyt yyt-3 ' degen=S . temp=62.0
Complement of Block x7263xbliE
E H F F L R' D L T G ctygtrawrawGGACTTCCTGGACTGCCC ' oligo : 5 ' -CCCGTCAGGTCCTTCAGGwarwartgytc-3 ' degen=32 temp=61.7 ■ ■
H F F L R D L T G tygtrawrawrrACTTCCTGGACTGCCCG oligo : 5 ' -GCCCGTCAGGTCCTTCArrwar artgyt-3 ' degen=128 temp=60.8
H F F L R D L T G - gtrawrawrraCTTCCTGGACTGCCCG oligo : 5 ' -GCCCGTCAGGTCCTTCarrwarwartg-3 ' degen=64 temp=60.8
Block x7263xbliF
E V H I A V- E G N oligo : 5 ' -GGACGTGCACGTCGCCrtngarggnaa-3 ' degen=64 temp=S3.8-
Complement of Block x7263xbliF
E G N .S S Q D .S anctyccnttrwGGTTGGTCCTGAGGCGG oligo : 5 ' -GGCGGAGTCCTGGTTGGwrttnccytcna-3 ' degen=128 temp=62.7
E G N S S Q D S A V ctyccnttrwsGTTGGTCCTGAGGCGGC oligo : 5 ' -CGGCGGAGTCCTGGTTGswrttnccy-tc-3 ' degen=64 temp=63.9 . . ...

Claims

1. A method of designing a panel of degenerate primer pairs for screening for new members of multiple known virus families in a biological sample, wherein, each primer pair in the panel binds a sequence that is conserved across members of a said virus family and selectively directs amplification of sequence of said family by PCR, which method comprises
(a) providing a plurality of amino acid sequences from members of a first virus family,
(b) comparing the sequences to identify conserved regions,
(c) designing a first primer pair using a computer based method, wherein each primer in the pair binds a nucleotide sequence that encodes a conserved region identified in (b) and wherein the primer pair is designed to amplify by PCR the nucleotide sequence between the nucleotide. sequences that encode conserved regions in members of the first virus family, and , .
(d) repeating steps (a) to (c) for each virus family.
2. A method of designing a panel of degenerate primer pairs for screening for new members of multiple known virus families in a biological sample, wherein each primer pair in the panel binds a sequence that is conserved across members of a said virus family and selectively directs amplification of sequence of said family by PCR, which method comprises
(a) providing a plurality of nucleotide sequences from members of a first virus family,
(b) comparing the sequences to identify conserved regions, (c) desigmng a first primer pair using a computer based method, wherein each primer in the pair binds a conserved region identified in (b) and wherein the primer pair is designed to amplify by PCR the nucleotide sequence between the conserved regions in members of the first virus family, and
(d) repeating steps (a) to (c) for each virus family.
3. A method according to claim 1 or 2 which further comprises synthesising one or more of the primer pairs and deteπn ing optimal conditions for using the primer pairs in PCR.
4. A method according to iany one of the preceding claims which comprises testing the ability of one or more of the primer p«airs to amplify a nucleotide sequence that encodes an amino acid as defined in claim 1(a) or a nucleotide sequence as defined in claim 2(a).
5. A method according to claim 3 or 4 which comprises testing the primer pair(s). in a range of buffer conditions to determine the optimal buffer conditions for PCR.
6. A method according to any one of claims 3 to 5 which comprises testing the primer pair(s) at a range of different temperatures' to determine the optimal temperature for PCR.
7. A method according to any one of the preceding claims which comprises identifying one or more groups of primer pairs wherein the primer pairs in each group have similar optimal conditions of use in PCR such that they can be used optimally in the same reaction vessel.
8. A method according to claim 7 wherein each primer pair in a group generates' a PCR product of a different size to the other primer pair(s) in the group.
- 9. A method according to claim 7 or 8 wherein each primer pair in a group carries a different label from the other primer pair(s) in the group.
10. A method according to claim 9 wherein each primer pair in a group carries a 5 differently-coloured flourescent label.
11. A method according to any one of the preceding claims wherein the biological sample is a single-source sample from a single individual or is a pooled sample from more, than one individual of the same species.
10
12. A method according- to claim 11 wherein the biological sample is a human sample.
13. A method according to any one of the preceding cl<aims wherein at least 50% of •15 the primer pairs bind a sequence that is conserved across all of the genuses and/or . subfamilies.
.
14. A panel of primers designed according to any one of the preceding claims.
PCT/GB2002/002640 2001-06-07 2002-06-07 Designing degenerate pcr primers WO2002099129A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2002311446A AU2002311446A1 (en) 2001-06-07 2002-06-07 Designing degenerate pcr primers

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0113908A GB0113908D0 (en) 2001-06-07 2001-06-07 Designing degenerate PCR primers
GB0113908.8 2001-06-07

Publications (2)

Publication Number Publication Date
WO2002099129A2 true WO2002099129A2 (en) 2002-12-12
WO2002099129A3 WO2002099129A3 (en) 2003-09-18

Family

ID=9916128

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2002/002640 WO2002099129A2 (en) 2001-06-07 2002-06-07 Designing degenerate pcr primers

Country Status (3)

Country Link
AU (1) AU2002311446A1 (en)
GB (1) GB0113908D0 (en)
WO (1) WO2002099129A2 (en)

Cited By (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002099130A2 (en) * 2001-06-07 2002-12-12 University College London Virus detection using degenerate pcr primers
WO2006094238A2 (en) * 2005-03-03 2006-09-08 Isis Pharmaceuticals, Inc. Compositions for use in identification of adventitious viruses
US7956175B2 (en) 2003-09-11 2011-06-07 Ibis Biosciences, Inc. Compositions for use in identification of bacteria
US7964343B2 (en) 2003-05-13 2011-06-21 Ibis Biosciences, Inc. Method for rapid purification of nucleic acids for subsequent analysis by mass spectrometry by solution capture
US8017358B2 (en) 2001-03-02 2011-09-13 Ibis Biosciences, Inc. Method for rapid detection and identification of bioagents
US8026084B2 (en) 2005-07-21 2011-09-27 Ibis Biosciences, Inc. Methods for rapid identification and quantitation of nucleic acid variants
US8057993B2 (en) 2003-04-26 2011-11-15 Ibis Biosciences, Inc. Methods for identification of coronaviruses
US8073627B2 (en) 2001-06-26 2011-12-06 Ibis Biosciences, Inc. System for indentification of pathogens
US8084207B2 (en) 2005-03-03 2011-12-27 Ibis Bioscience, Inc. Compositions for use in identification of papillomavirus
US8097416B2 (en) 2003-09-11 2012-01-17 Ibis Biosciences, Inc. Methods for identification of sepsis-causing bacteria
US8148163B2 (en) 2008-09-16 2012-04-03 Ibis Biosciences, Inc. Sample processing units, systems, and related methods
US8158936B2 (en) 2009-02-12 2012-04-17 Ibis Biosciences, Inc. Ionization probe assemblies
US8158354B2 (en) 2003-05-13 2012-04-17 Ibis Biosciences, Inc. Methods for rapid purification of nucleic acids for subsequent analysis by mass spectrometry by solution capture
US8163895B2 (en) 2003-12-05 2012-04-24 Ibis Biosciences, Inc. Compositions for use in identification of orthopoxviruses
US8173957B2 (en) 2004-05-24 2012-05-08 Ibis Biosciences, Inc. Mass spectrometry with selective ion filtration by digital thresholding
US8187814B2 (en) 2004-02-18 2012-05-29 Ibis Biosciences, Inc. Methods for concurrent identification and quantification of an unknown bioagent
US8214154B2 (en) 2001-03-02 2012-07-03 Ibis Biosciences, Inc. Systems for rapid identification of pathogens in humans and animals
US8268565B2 (en) 2001-03-02 2012-09-18 Ibis Biosciences, Inc. Methods for identifying bioagents
US8298760B2 (en) 2001-06-26 2012-10-30 Ibis Bioscience, Inc. Secondary structure defining database and methods for determining identity and geographic origin of an unknown bioagent thereby
US8407010B2 (en) 2004-05-25 2013-03-26 Ibis Biosciences, Inc. Methods for rapid forensic analysis of mitochondrial DNA
US8534447B2 (en) 2008-09-16 2013-09-17 Ibis Biosciences, Inc. Microplate handling systems and related computer program products and methods
US8546082B2 (en) 2003-09-11 2013-10-01 Ibis Biosciences, Inc. Methods for identification of sepsis-causing bacteria
US8550694B2 (en) 2008-09-16 2013-10-08 Ibis Biosciences, Inc. Mixing cartridges, mixing stations, and related kits, systems, and methods
US8563250B2 (en) 2001-03-02 2013-10-22 Ibis Biosciences, Inc. Methods for identifying bioagents
US8822156B2 (en) 2002-12-06 2014-09-02 Ibis Biosciences, Inc. Methods for rapid identification of pathogens in humans and animals
US8871471B2 (en) 2007-02-23 2014-10-28 Ibis Biosciences, Inc. Methods for rapid forensic DNA analysis
US8950604B2 (en) 2009-07-17 2015-02-10 Ibis Biosciences, Inc. Lift and mount apparatus
US9149473B2 (en) 2006-09-14 2015-10-06 Ibis Biosciences, Inc. Targeted whole genome amplification method for identification of pathogens
US9194877B2 (en) 2009-07-17 2015-11-24 Ibis Biosciences, Inc. Systems for bioagent indentification
US9598724B2 (en) 2007-06-01 2017-03-21 Ibis Biosciences, Inc. Methods and compositions for multiple displacement amplification of nucleic acids
US9873906B2 (en) 2004-07-14 2018-01-23 Ibis Biosciences, Inc. Methods for repairing degraded DNA
US9890408B2 (en) 2009-10-15 2018-02-13 Ibis Biosciences, Inc. Multiple displacement amplification
US9976180B2 (en) 2012-09-14 2018-05-22 Population Bio, Inc. Methods for detecting a genetic variation in subjects with parkinsonism
US10059997B2 (en) 2010-08-02 2018-08-28 Population Bio, Inc. Compositions and methods for discovery of causative mutations in genetic disorders
US10210306B2 (en) 2006-05-03 2019-02-19 Population Bio, Inc. Evaluating genetic disorders
US10221454B2 (en) 2011-10-10 2019-03-05 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US10233495B2 (en) 2012-09-27 2019-03-19 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US10240205B2 (en) 2017-02-03 2019-03-26 Population Bio, Inc. Methods for assessing risk of developing a viral disease using a genetic test
US10407724B2 (en) 2012-02-09 2019-09-10 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US10522240B2 (en) 2006-05-03 2019-12-31 Population Bio, Inc. Evaluating genetic disorders
US10724096B2 (en) 2014-09-05 2020-07-28 Population Bio, Inc. Methods and compositions for inhibiting and treating neurological conditions
US10961585B2 (en) 2018-08-08 2021-03-30 Pml Screening, Llc Methods for assessing risk of developing a viral of disease using a genetic test
US11180807B2 (en) 2011-11-04 2021-11-23 Population Bio, Inc. Methods for detecting a genetic variation in attractin-like 1 (ATRNL1) gene in subject with Parkinson's disease

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5478724A (en) * 1991-08-16 1995-12-26 The Rockefeller University Lentivirus-specific nucleotide probes and methods of use
US5639871A (en) * 1988-09-09 1997-06-17 Roche Molecular Systems, Inc. Detection of human papillomavirus by the polymerase chain reaction
EP1013776A1 (en) * 1998-12-22 2000-06-28 Universiteit van Amsterdam A sensitive assay for the detection or quantitation of human cytomegalovirus nucleic acid
US6168917B1 (en) * 1996-10-02 2001-01-02 The United States Of America As Represented By The Department Of Health And Human Services Detection and identification of non-polio enteroviruses

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5639871A (en) * 1988-09-09 1997-06-17 Roche Molecular Systems, Inc. Detection of human papillomavirus by the polymerase chain reaction
US5478724A (en) * 1991-08-16 1995-12-26 The Rockefeller University Lentivirus-specific nucleotide probes and methods of use
US6168917B1 (en) * 1996-10-02 2001-01-02 The United States Of America As Represented By The Department Of Health And Human Services Detection and identification of non-polio enteroviruses
EP1013776A1 (en) * 1998-12-22 2000-06-28 Universiteit van Amsterdam A sensitive assay for the detection or quantitation of human cytomegalovirus nucleic acid

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
ALBA M MAR ET AL: "VIDA: A virus database system for the organization of animal virus genome open reading frames." NUCLEIC ACIDS RESEARCH, vol. 29, no. 1, 1 January 2001 (2001-01-01), pages 133-136, XP002249137 ISSN: 0305-1048 *
ROSE T M ET AL: "Consensus-degenerate hybrid oligonucleotide primers for amplification of distantly related sequences" NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, SURREY, GB, vol. 26, no. 7, 1998, pages 1628-1635, XP002141299 ISSN: 0305-1048 cited in the application *
STEPHENSEN C B ET AL: "Phylogenetic analysis of a highly conserved region of the polymerase gene from 11 coronaviruses and development of a consensus polymerase chain reaction assay" VIRUS RESEARCH, AMSTERDAM, NL, vol. 60, no. 2, April 1999 (1999-04), pages 181-189, XP002242737 ISSN: 0168-1702 *
THOMPSON J D HIGGENS D G GIBSON T J: "CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice" NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, SURREY, GB, vol. 22, no. 22, November 1994 (1994-11), pages 4673-4680, XP002956304 ISSN: 0305-1048 *
VOELTER C ET AL: "Screening human tumor samples with a broad-spectrum polymerase chain reaction method for the detection of polyomaviruses" VIROLOGY, ACADEMIC PRESS,ORLANDO, US, vol. 237, no. 2, 27 October 1997 (1997-10-27), pages 389-396, XP002242736 ISSN: 0042-6822 *

Cited By (79)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8815513B2 (en) 2001-03-02 2014-08-26 Ibis Biosciences, Inc. Method for rapid detection and identification of bioagents in epidemiological and forensic investigations
US9752184B2 (en) 2001-03-02 2017-09-05 Ibis Biosciences, Inc. Methods for rapid forensic analysis of mitochondrial DNA and characterization of mitochondrial DNA heteroplasmy
US8802372B2 (en) 2001-03-02 2014-08-12 Ibis Biosciences, Inc. Methods for rapid forensic analysis of mitochondrial DNA and characterization of mitochondrial DNA heteroplasmy
US8563250B2 (en) 2001-03-02 2013-10-22 Ibis Biosciences, Inc. Methods for identifying bioagents
US8265878B2 (en) 2001-03-02 2012-09-11 Ibis Bioscience, Inc. Method for rapid detection and identification of bioagents
US8214154B2 (en) 2001-03-02 2012-07-03 Ibis Biosciences, Inc. Systems for rapid identification of pathogens in humans and animals
US8017358B2 (en) 2001-03-02 2011-09-13 Ibis Biosciences, Inc. Method for rapid detection and identification of bioagents
US9416424B2 (en) 2001-03-02 2016-08-16 Ibis Biosciences, Inc. Methods for rapid identification of pathogens in humans and animals
US8268565B2 (en) 2001-03-02 2012-09-18 Ibis Biosciences, Inc. Methods for identifying bioagents
WO2002099130A3 (en) * 2001-06-07 2003-09-04 Univ London Virus detection using degenerate pcr primers
WO2002099130A2 (en) * 2001-06-07 2002-12-12 University College London Virus detection using degenerate pcr primers
US8380442B2 (en) 2001-06-26 2013-02-19 Ibis Bioscience, Inc. Secondary structure defining database and methods for determining identity and geographic origin of an unknown bioagent thereby
US8298760B2 (en) 2001-06-26 2012-10-30 Ibis Bioscience, Inc. Secondary structure defining database and methods for determining identity and geographic origin of an unknown bioagent thereby
US8921047B2 (en) 2001-06-26 2014-12-30 Ibis Biosciences, Inc. Secondary structure defining database and methods for determining identity and geographic origin of an unknown bioagent thereby
US8073627B2 (en) 2001-06-26 2011-12-06 Ibis Biosciences, Inc. System for indentification of pathogens
US8822156B2 (en) 2002-12-06 2014-09-02 Ibis Biosciences, Inc. Methods for rapid identification of pathogens in humans and animals
US9725771B2 (en) 2002-12-06 2017-08-08 Ibis Biosciences, Inc. Methods for rapid identification of pathogens in humans and animals
US8057993B2 (en) 2003-04-26 2011-11-15 Ibis Biosciences, Inc. Methods for identification of coronaviruses
US7964343B2 (en) 2003-05-13 2011-06-21 Ibis Biosciences, Inc. Method for rapid purification of nucleic acids for subsequent analysis by mass spectrometry by solution capture
US8158354B2 (en) 2003-05-13 2012-04-17 Ibis Biosciences, Inc. Methods for rapid purification of nucleic acids for subsequent analysis by mass spectrometry by solution capture
US8476415B2 (en) 2003-05-13 2013-07-02 Ibis Biosciences, Inc. Methods for rapid purification of nucleic acids for subsequent analysis by mass spectrometry by solution capture
US8013142B2 (en) 2003-09-11 2011-09-06 Ibis Biosciences, Inc. Compositions for use in identification of bacteria
US8546082B2 (en) 2003-09-11 2013-10-01 Ibis Biosciences, Inc. Methods for identification of sepsis-causing bacteria
US8097416B2 (en) 2003-09-11 2012-01-17 Ibis Biosciences, Inc. Methods for identification of sepsis-causing bacteria
US7956175B2 (en) 2003-09-11 2011-06-07 Ibis Biosciences, Inc. Compositions for use in identification of bacteria
US8163895B2 (en) 2003-12-05 2012-04-24 Ibis Biosciences, Inc. Compositions for use in identification of orthopoxviruses
US8187814B2 (en) 2004-02-18 2012-05-29 Ibis Biosciences, Inc. Methods for concurrent identification and quantification of an unknown bioagent
US9447462B2 (en) 2004-02-18 2016-09-20 Ibis Biosciences, Inc. Methods for concurrent identification and quantification of an unknown bioagent
US8173957B2 (en) 2004-05-24 2012-05-08 Ibis Biosciences, Inc. Mass spectrometry with selective ion filtration by digital thresholding
US9449802B2 (en) 2004-05-24 2016-09-20 Ibis Biosciences, Inc. Mass spectrometry with selective ion filtration by digital thresholding
US8987660B2 (en) 2004-05-24 2015-03-24 Ibis Biosciences, Inc. Mass spectrometry with selective ion filtration by digital thresholding
US8407010B2 (en) 2004-05-25 2013-03-26 Ibis Biosciences, Inc. Methods for rapid forensic analysis of mitochondrial DNA
US9873906B2 (en) 2004-07-14 2018-01-23 Ibis Biosciences, Inc. Methods for repairing degraded DNA
WO2006094238A2 (en) * 2005-03-03 2006-09-08 Isis Pharmaceuticals, Inc. Compositions for use in identification of adventitious viruses
WO2006094238A3 (en) * 2005-03-03 2007-02-08 Isis Pharmaceuticals Inc Compositions for use in identification of adventitious viruses
US8084207B2 (en) 2005-03-03 2011-12-27 Ibis Bioscience, Inc. Compositions for use in identification of papillomavirus
US8182992B2 (en) 2005-03-03 2012-05-22 Ibis Biosciences, Inc. Compositions for use in identification of adventitious viruses
US8551738B2 (en) 2005-07-21 2013-10-08 Ibis Biosciences, Inc. Systems and methods for rapid identification of nucleic acid variants
US8026084B2 (en) 2005-07-21 2011-09-27 Ibis Biosciences, Inc. Methods for rapid identification and quantitation of nucleic acid variants
US10529441B2 (en) 2006-05-03 2020-01-07 Population Bio, Inc. Evaluating genetic disorders
US10210306B2 (en) 2006-05-03 2019-02-19 Population Bio, Inc. Evaluating genetic disorders
US10522240B2 (en) 2006-05-03 2019-12-31 Population Bio, Inc. Evaluating genetic disorders
US9149473B2 (en) 2006-09-14 2015-10-06 Ibis Biosciences, Inc. Targeted whole genome amplification method for identification of pathogens
US8871471B2 (en) 2007-02-23 2014-10-28 Ibis Biosciences, Inc. Methods for rapid forensic DNA analysis
US9598724B2 (en) 2007-06-01 2017-03-21 Ibis Biosciences, Inc. Methods and compositions for multiple displacement amplification of nucleic acids
US9027730B2 (en) 2008-09-16 2015-05-12 Ibis Biosciences, Inc. Microplate handling systems and related computer program products and methods
US9023655B2 (en) 2008-09-16 2015-05-05 Ibis Biosciences, Inc. Sample processing units, systems, and related methods
US8148163B2 (en) 2008-09-16 2012-04-03 Ibis Biosciences, Inc. Sample processing units, systems, and related methods
US8550694B2 (en) 2008-09-16 2013-10-08 Ibis Biosciences, Inc. Mixing cartridges, mixing stations, and related kits, systems, and methods
US8609430B2 (en) 2008-09-16 2013-12-17 Ibis Biosciences, Inc. Sample processing units, systems, and related methods
US8252599B2 (en) 2008-09-16 2012-08-28 Ibis Biosciences, Inc. Sample processing units, systems, and related methods
US8534447B2 (en) 2008-09-16 2013-09-17 Ibis Biosciences, Inc. Microplate handling systems and related computer program products and methods
US9165740B2 (en) 2009-02-12 2015-10-20 Ibis Biosciences, Inc. Ionization probe assemblies
US8158936B2 (en) 2009-02-12 2012-04-17 Ibis Biosciences, Inc. Ionization probe assemblies
US8796617B2 (en) 2009-02-12 2014-08-05 Ibis Biosciences, Inc. Ionization probe assemblies
US8950604B2 (en) 2009-07-17 2015-02-10 Ibis Biosciences, Inc. Lift and mount apparatus
US9194877B2 (en) 2009-07-17 2015-11-24 Ibis Biosciences, Inc. Systems for bioagent indentification
US9890408B2 (en) 2009-10-15 2018-02-13 Ibis Biosciences, Inc. Multiple displacement amplification
US10059997B2 (en) 2010-08-02 2018-08-28 Population Bio, Inc. Compositions and methods for discovery of causative mutations in genetic disorders
US11788142B2 (en) 2010-08-02 2023-10-17 Population Bio, Inc. Compositions and methods for discovery of causative mutations in genetic disorders
US11339439B2 (en) 2011-10-10 2022-05-24 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US10221454B2 (en) 2011-10-10 2019-03-05 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US11180807B2 (en) 2011-11-04 2021-11-23 Population Bio, Inc. Methods for detecting a genetic variation in attractin-like 1 (ATRNL1) gene in subject with Parkinson's disease
US10407724B2 (en) 2012-02-09 2019-09-10 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US11174516B2 (en) 2012-02-09 2021-11-16 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US9976180B2 (en) 2012-09-14 2018-05-22 Population Bio, Inc. Methods for detecting a genetic variation in subjects with parkinsonism
US11008614B2 (en) 2012-09-14 2021-05-18 Population Bio, Inc. Methods for diagnosing, prognosing, and treating parkinsonism
US11618925B2 (en) 2012-09-27 2023-04-04 Population Bio, Inc. Methods and compositions for screening and treating developmental disorders
US10597721B2 (en) 2012-09-27 2020-03-24 Population Bio, Inc. Methods and compositions for screening and treating developmental disorders
US10233495B2 (en) 2012-09-27 2019-03-19 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US11549145B2 (en) 2014-09-05 2023-01-10 Population Bio, Inc. Methods and compositions for inhibiting and treating neurological conditions
US10724096B2 (en) 2014-09-05 2020-07-28 Population Bio, Inc. Methods and compositions for inhibiting and treating neurological conditions
US10544463B2 (en) 2017-02-03 2020-01-28 Pml Screening, Llc Methods for assessing risk of developing a viral disease using a genetic test
US10941448B1 (en) 2017-02-03 2021-03-09 The Universite Paris-Saclay Methods for assessing risk of developing a viral disease using a genetic test
US10563264B2 (en) 2017-02-03 2020-02-18 Pml Screening, Llc Methods for assessing risk of developing a viral disease using a genetic test
US10240205B2 (en) 2017-02-03 2019-03-26 Population Bio, Inc. Methods for assessing risk of developing a viral disease using a genetic test
US11913073B2 (en) 2017-02-03 2024-02-27 Pml Screening, Llc Methods for assessing risk of developing a viral disease using a genetic test
US10961585B2 (en) 2018-08-08 2021-03-30 Pml Screening, Llc Methods for assessing risk of developing a viral of disease using a genetic test
US11913074B2 (en) 2018-08-08 2024-02-27 Pml Screening, Llc Methods for assessing risk of developing a viral disease using a genetic test

Also Published As

Publication number Publication date
WO2002099129A3 (en) 2003-09-18
GB0113908D0 (en) 2001-08-01
AU2002311446A1 (en) 2002-12-16

Similar Documents

Publication Publication Date Title
WO2002099129A2 (en) Designing degenerate pcr primers
Meng et al. Virome analysis of tick-borne viruses in Heilongjiang Province, China
Boubourakas et al. Sensitive and rapid detection of peach latent mosaic viroid by the reverse transcription loop-mediated isothermal amplification
Sabanadzovic et al. Southern tomato virus: the link between the families Totiviridae and Partitiviridae
Tsai et al. Development of a loop-mediated isothermal amplification for rapid detection of orf virus
Moreno et al. Reference genes for reliable potyvirus quantitation in cassava and analysis of Cassava brown streak virus load in host varieties
WO2002099130A2 (en) Virus detection using degenerate pcr primers
CN105555972A (en) Genetic assays
Fauver et al. A reverse-transcription/RNase H based protocol for depletion of mosquito ribosomal RNA facilitates viral intrahost evolution analysis, transcriptomics and pathogen discovery
CA2925942C (en) Detection of rare microbiological nucleic acids
WO2009098789A1 (en) Method of detecting pathogenic virus in crustacean by lamp method and reagent kit for detection
Sahin et al. Novel and divergent bipartite mycoviruses associated with the ectomycorrhizal fungus Sarcosphaera coronaria
Aimone et al. An experimental strategy for preparing circular ssDNA virus genomes for next-generation sequencing
Zhang et al. Deep sequencing of banana bract mosaic virus from flowering ginger (Alpinia purpurata) and development of an immunocapture RT-LAMP detection assay
JP6436598B1 (en) Primer set for specifically amplifying nucleic acid derived from strawberry pathogenic virus and method for detecting strawberry pathogenic virus
JP6027023B2 (en) Compositions and methods for identifying and distinguishing viral components of multivalent transport fever vaccines
Leisova‐Svobodova et al. Detection of Garlic Viruses Using SYBR Green Real‐time Reverse Transcription‐Polymerase Chain Reaction
JP2016524906A (en) Molecular identification of allergic mites by PCR
WO2022194756A2 (en) Compositions and methods for detecting severe acute respiratory syndrome coronavirus 2 (sars-cov-2) variants having spike protein mutations
Lin et al. Development of an EvaGreen based real-time RT-PCR assay for rapid detection, quantitation and diagnosis of goose calicivirus
Song et al. RT-PCR-RFLP for genetic diversity analysis of the citrus tatter leaf virus strain of Apple stem grooving virus
KR102076341B1 (en) Composition for detecting severe fever with thrombocytopenia syndrome virus using LAMP and uses thereof
KR101617142B1 (en) Nucleic acid test based avian influenza virus detection kit with improved detection accuracy
JP3118572B2 (en) Simultaneous detection of citrus tarifavirus and citrus viroid by RT-PCR
Shaffer et al. First report of amazon lily mild mottle virus in peony in the United States

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP